Skip to content
View tinaxfwu's full-sized avatar

Block or report tinaxfwu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大模型提供更高质量、更丰富、更易”消化“的数据!

Python 2,633 166 Updated Sep 27, 2024

🔥🕷️ Crawl4AI: Open-source LLM Friendly Web Crawler & Scrapper

Python 11,689 819 Updated Oct 2, 2024

Apache DataFusion SQL Query Engine

Rust 6,045 1,143 Updated Oct 7, 2024

A native PyTorch Library for large model training

Python 2,332 170 Updated Oct 7, 2024

Apache DataFusion Ray

Rust 76 5 Updated Oct 6, 2024

SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.

Python 6,631 486 Updated Oct 6, 2024

Puzzles for learning Triton

Jupyter Notebook 1,015 66 Updated Sep 25, 2024
HTML 46 33 Updated Sep 24, 2024

The Magic Mask for Android

C++ 47,554 12,077 Updated Oct 6, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 271,776 45,826 Updated Aug 7, 2024

Zero Bubble Pipeline Parallelism

Python 263 13 Updated Sep 4, 2024

tiktoken is a fast BPE tokeniser for use with OpenAI's models.

Python 12,007 818 Updated Oct 3, 2024

An open-source RAG-based tool for chatting with your documents.

Python 13,473 1,012 Updated Oct 6, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 7,222 727 Updated Oct 4, 2024

😎 Awesome lists about all kinds of interesting topics

327,569 27,767 Updated Sep 9, 2024

Development repository for the Triton language and compiler

C++ 12,946 1,574 Updated Oct 7, 2024

GPU programming related news and material links

1,162 70 Updated Sep 23, 2024

Efficient Triton Kernels for LLM Training

Python 3,140 166 Updated Oct 5, 2024

An isomorphic Javascript client for Supabase. Query your Supabase database, subscribe to realtime events, upload and download files, browse typescript examples, invoke postgres functions via rpc, i…

TypeScript 3,174 253 Updated Oct 1, 2024

Run your GitHub Actions locally 🚀

Go 54,379 1,356 Updated Oct 7, 2024

The open source Firebase alternative. Supabase gives you a dedicated Postgres database to build your web, mobile, and AI applications.

TypeScript 72,351 6,944 Updated Oct 7, 2024

A cross platform CLI for Flyte. Written in Golang. Offers an intuitive interface to Flyte https://docs.flyte.org/projects/flytectl/en/latest/

Go 46 82 Updated May 23, 2024

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 7,809 4,232 Updated Oct 7, 2024

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Go 5,558 603 Updated Oct 7, 2024

Animation engine for explanatory math videos

Python 63,029 5,837 Updated Oct 4, 2024

The Multilayer Perceptron Language Model

Python 516 45 Updated Aug 9, 2024

The n-gram Language Model

C 1,317 93 Updated Aug 5, 2024

An Extensible Deep Learning Library

Python 1,816 245 Updated Oct 7, 2024

A Python framework for high performance GPU simulation and graphics

Python 4,163 232 Updated Oct 7, 2024

A lightweight library for portable low-level GPU computation using WebGPU.

C++ 3,691 176 Updated Oct 5, 2024
Next