-
Databricks
- https://chengli.netlify.app
- in/cli99
Stars
Fast Matrix Multiplications for Lookup Table-Quantized LLMs
Create web-based user interfaces with Python. The nice way.
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.
Machine Learning Engineering Open Book
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Advanced Python Mastery (course by @dabeaz)
Images to inference with no labeling (use foundation models to train supervised models).
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Source code for the C++ 20 Masterclass on udemy
Latency and Memory Analysis of Transformer Models for Training and Inference
Streamlit — A faster way to build and share data apps.
Interactive Data Visualization in the browser, from Python
Bonus materials, exercises, and example projects for our Python tutorials
A library to analyze PyTorch traces.
A library for helping developers craft prompts for Large Language Models
Prune a model while finetuning or training.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
Jupyter notebooks for the Natural Language Processing with Transformers book
Hypermodern Python Cookiecutter
Bash completion support for conda
a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.