Skip to content
View cli99's full-sized avatar
🐼
🐼

Organizations

@illinois-impact

Block or report cli99

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Material for gpu-mode lectures

Jupyter Notebook 2,566 256 Updated Sep 23, 2024

Fast Matrix Multiplications for Lookup Table-Quantized LLMs

Cuda 164 5 Updated Sep 15, 2024

Create web-based user interfaces with Python. The nice way.

Python 8,917 540 Updated Sep 28, 2024

PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for evaluation of training and inference platforms.

Python 118 61 Updated Sep 19, 2024

Animation engine for explanatory math videos

Python 62,614 5,802 Updated Sep 28, 2024

Machine Learning Engineering Open Book

Python 11,070 663 Updated Sep 19, 2024

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Python 2,741 215 Updated Sep 30, 2023

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,317 191 Updated Sep 27, 2024

Advanced Python Mastery (course by @dabeaz)

Python 10,660 1,751 Updated Aug 10, 2024

Images to inference with no labeling (use foundation models to train supervised models).

Python 1,880 148 Updated Sep 19, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,550 4,505 Updated Sep 25, 2024

Source code for the C++ 20 Masterclass on udemy

C++ 1,764 918 Updated Sep 27, 2024

Latency and Memory Analysis of Transformer Models for Training and Inference

Python 339 40 Updated May 28, 2024
HTML 44 11 Updated Jun 27, 2019

Pipeline Parallelism for PyTorch

Python 714 86 Updated Aug 21, 2024

Streamlit — A faster way to build and share data apps.

Python 34,826 3,020 Updated Sep 28, 2024

Interactive Data Visualization in the browser, from Python

TypeScript 19,254 4,176 Updated Sep 28, 2024

pytorch-profiler

Python 48 8 Updated Jun 1, 2023

Bonus materials, exercises, and example projects for our Python tutorials

HTML 4,778 5,300 Updated Sep 28, 2024

A library to analyze PyTorch traces.

Python 274 37 Updated Sep 7, 2024

A library for helping developers craft prompts for Large Language Models

TypeScript 2,557 106 Updated Apr 25, 2023

torchview: visualize pytorch models

Python 799 36 Updated May 1, 2024

DyNet: The Dynamic Neural Network Toolkit

C++ 3,419 704 Updated Dec 1, 2023

Prune a model while finetuning or training.

Jupyter Notebook 393 57 Updated Jun 21, 2022

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Python 19,920 2,466 Updated Aug 15, 2024

BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)

Python 6,814 771 Updated Aug 24, 2023

Jupyter notebooks for the Natural Language Processing with Transformers book

Jupyter Notebook 3,834 1,189 Updated Aug 21, 2024

Hypermodern Python Cookiecutter

Python 1,800 232 Updated May 18, 2024

Bash completion support for conda

Python 148 7 Updated Oct 1, 2023

a fast and user-friendly runtime for transformer inference (Bert, Albert, GPT2, Decoders, etc) on CPU and GPU.

C++ 1,474 196 Updated Jun 12, 2023
Next