Cheng Li cli99

Senior Machine Learning Engineer @ Databricks GenAI. UIUC PhD. I build efficient AI training and inference systems with GPUs.

93 followers · 9 following

Stars

gpu-mode / lectures

Material for gpu-mode lectures

Jupyter Notebook · 2,566 stars · 256 forks · Updated Sep 23, 2024

HanGuo97 / flute

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

Cuda · 164 stars · 5 forks · Updated Sep 15, 2024

zauberzeug / nicegui

Create web-based user interfaces with Python. The nice way.

Python · 8,917 stars · 540 forks · Updated Sep 28, 2024

facebookresearch / param

PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.

Python · 118 stars · 61 forks · Updated Sep 19, 2024

3b1b / manim

Animation engine for explanatory math videos

Python · 62,614 stars · 5,802 forks · Updated Sep 28, 2024

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python · 11,070 stars · 663 forks · Updated Sep 19, 2024

turboderp / exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python · 2,741 stars · 215 forks · Updated Sep 30, 2023

ModelTC / lightllm

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python · 2,317 stars · 191 forks · Updated Sep 27, 2024

dabeaz-course / python-mastery

Advanced Python Mastery (course by @dabeaz)

Python · 10,660 stars · 1,751 forks · Updated Aug 10, 2024

autodistill / autodistill

Images to inference with no labeling (use foundation models to train supervised models).

Python · 1,880 stars · 148 forks · Updated Sep 19, 2024

lm-sys / FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python · 36,550 stars · 4,505 forks · Updated Sep 25, 2024

rutura / The-C-20-Masterclass-Source-Code

Source code for the C++ 20 Masterclass on Udemy

C++ · 1,764 stars · 918 forks · Updated Sep 27, 2024

cli99 / llm-analysis

Latency and Memory Analysis of Transformer Models for Training and Inference

Python · 339 stars · 40 forks · Updated May 28, 2024

c3sr / tcu_scope

HTML · 44 stars · 11 forks · Updated Jun 27, 2019

pytorch / PiPPy

Pipeline Parallelism for PyTorch

Python · 714 stars · 86 forks · Updated Aug 21, 2024

streamlit / streamlit

Streamlit: a faster way to build and share data apps.

Python · 34,826 stars · 3,020 forks · Updated Sep 28, 2024

bokeh / bokeh

Interactive Data Visualization in the browser, from Python

TypeScript · 19,254 stars · 4,176 forks · Updated Sep 28, 2024

cli99 / flops-profiler

pytorch-profiler

Python · 48 stars · 8 forks · Updated Jun 1, 2023

realpython / materials

Bonus materials, exercises, and example projects for our Python tutorials

HTML · 4,778 stars · 5,300 forks · Updated Sep 28, 2024

facebookresearch / HolisticTraceAnalysis

A library to analyze PyTorch traces.

Python · 274 stars · 37 forks · Updated Sep 7, 2024

microsoft / prompt-engine

A library for helping developers craft prompts for Large Language Models

TypeScript · 2,557 stars · 106 forks · Updated Apr 25, 2023

mert-kurttutan / torchview

torchview: visualize pytorch models

Python · 799 stars · 36 forks · Updated May 1, 2024

clab / dynet

DyNet: The Dynamic Neural Network Toolkit

C++ · 3,419 stars · 704 forks · Updated Dec 1, 2023

huggingface / nn_pruning

Prune a model while finetuning or training.

Jupyter Notebook · 393 stars · 57 forks · Updated Jun 21, 2022

karpathy / minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python · 19,920 stars · 2,466 forks · Updated Aug 15, 2024

jessevig / bertviz

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python · 6,814 stars · 771 forks · Updated Aug 24, 2023

nlp-with-transformers / notebooks

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook · 3,834 stars · 1,189 forks · Updated Aug 21, 2024

cjolowicz / cookiecutter-hypermodern-python

Hypermodern Python Cookiecutter

Python · 1,800 stars · 232 forks · Updated May 18, 2024

tartansandal / conda-bash-completion

Bash completion support for conda

Python · 148 stars · 7 forks · Updated Oct 1, 2023

Tencent / TurboTransformers

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ · 1,474 stars · 196 forks · Updated Jun 12, 2023