Stars
Minimalistic large language model 3D-parallelism training
TensorRT Model Optimizer is a unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, etc. It compresses deep learning models for downstream d…
Simple samples for TensorRT programming
Tensors and Dynamic neural networks in Python with strong GPU acceleration
An Open Source Machine Learning Framework for Everyone
An action for ChatGPT's GPTs to get source code from a GitHub repository.
A high-performance, extensible Python AOT compiler.
A high-throughput and memory-efficient inference and serving engine for LLMs
An efficient, flexible and full-featured toolkit for fine-tuning LLMs (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
A curated list for Efficient Large Language Models
EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
OpenMMLab Model Compression Toolbox and Benchmark.
A Python implementation of the KITTI evaluation code for the 2D detection task.
Implementation of the mixup paper (ICLR 2018) in TensorFlow 2.0
A collection of various deep learning architectures, models, and tips
A Simple and Versatile Framework for Object Detection and Instance Recognition
Repo for counting stars and contributing. Press F to pay respect to glorious developers.