Lists (1)
Sort Name ascending (A-Z)
Stars
Evaluate the accuracy of LLM generated outputs
Things you can do with the token embeddings of an LLM
High accuracy RAG for answering questions from scientific documents with citations
Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch
Decentralized deep learning in PyTorch. Built to train models on thousands of volunteers across the world.
Open deep learning compiler stack for cpu, gpu and specialized accelerators
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
Parameterize Python scripts/notebooks all from the command line and run on cloud GPUs
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
You like pytorch? You like micrograd? You love tinygrad! ❤️
Sparsity-aware deep learning inference runtime for CPUs
Bookmarks in graphics, algorithms, low level programming, math, languages
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
A Python tool to enforce dependencies, using modular architecture 🌎 Open source 🐍 Installable via pip 🔧 Able to be adopted incrementally - ⚡ Implemented with no runtime impact ♾️ Interoperable with…
Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
List of papers on hallucination detection in LLMs.
Deploy infinitely scalable serverless apps, apis, and sites in seconds to AWS.
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
Lumina-T2X is a unified framework for Text to Any Modality Generation
TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.
The modern replacement for Jupyter Notebooks
Differentiable convex optimization layers