-
llm-awq Public
Forked from mit-han-lab/llm-awq[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
Python MIT License UpdatedJul 16, 2024 -
smoothquant Public
Forked from mit-han-lab/smoothquant[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Python MIT License UpdatedMar 25, 2024 -
ppq Public
Forked from OpenPPL/ppqPPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Python Apache License 2.0 UpdatedFeb 27, 2024 -
GPTQ-for-LLaMa Public
Forked from qwopqwop200/GPTQ-for-LLaMa4 bits quantization of LLaMA using GPTQ
Python Apache License 2.0 UpdatedJun 6, 2023 -
mmdeploy Public
Forked from open-mmlab/mmdeployOpenMMLab Model Deployment Framework
Python Apache License 2.0 UpdatedJun 6, 2023 -
vision_transformer Public
Forked from google-research/vision_transformerJupyter Notebook Apache License 2.0 UpdatedMay 30, 2023 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedJan 31, 2023 -
I-BERT Public
Forked from kssteven418/I-BERT[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
Python MIT License UpdatedJan 29, 2023 -
TPAT Public
Forked from Tencent/TPATTensorRT Plugin Autogen Tool
Python Apache License 2.0 UpdatedOct 28, 2022 -
tpu-mlir Public
Forked from sophgo/tpu-mlirMachine learning compiler based on MLIR for Sophgo TPU.
C++ Apache License 2.0 UpdatedAug 8, 2022 -
Tengine Public
Forked from OAID/TengineTengine is a lite, high performance, modular inference engine for embedded device
C++ Apache License 2.0 UpdatedAug 6, 2022 -
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedJul 15, 2022 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedJul 13, 2022 -
aimet Public
Forked from quic/aimetAIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Python Other UpdatedJul 11, 2022 -
MQBench Public
Forked from ModelTC/MQBenchModel Quantization Benchmark
Python Apache License 2.0 UpdatedJul 11, 2022 -
Open standard for machine learning interoperability
C++ Apache License 2.0 UpdatedJun 29, 2022 -
Deformable-DETR Public
Forked from fundamentalvision/Deformable-DETRDeformable DETR: Deformable Transformers for End-to-End Object Detection.
Python Apache License 2.0 UpdatedMay 22, 2022 -
NumCpp Public
Forked from dpilger26/NumCppC++ implementation of the Python Numpy library
C++ MIT License UpdatedMay 14, 2022 -
leetcode-master Public
Forked from youngyangyang04/leetcode-master《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
UpdatedFeb 11, 2022 -
chineseocr_lite Public
Forked from DayBreak-u/chineseocr_lite超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
C++ GNU General Public License v2.0 UpdatedFeb 9, 2022 -
insightface Public
Forked from deepinsight/insightfaceState-of-the-art 2D and 3D Face Analysis Project
Python MIT License UpdatedOct 26, 2021 -
nanodet Public
Forked from RangiLyu/nanodet⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥
Python Apache License 2.0 UpdatedOct 24, 2021 -
EasyQuant Public
Forked from deepglint/EasyQuantEasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activations.
Python Other UpdatedSep 8, 2021 -
BRECQ Public
Forked from yhhhli/BRECQPytorch implementation of BRECQ, ICLR 2021
Python MIT License UpdatedAug 1, 2021 -
PytorchToCaffe Public
Forked from xxradon/PytorchToCaffePytorch model to caffe model, supported pytorch 0.3, 0.3.1, 0.4, 0.4.1 ,1.0 , 1.0.1 , 1.2 ,1.3 .notice that only pytorch 1.1 have some bugs
Python MIT License UpdatedMay 22, 2021 -
mlir Public
Forked from tensorflow/mlir"Multi-Level Intermediate Representation" Compiler Infrastructure
UpdatedApr 22, 2021 -
Learn-Statistical-Learning-Method Public
Forked from hktxt/Learn-Statistical-Learning-MethodImplementation of Statistical Learning Method, Second Edition.《统计学习方法》第二版,算法实现。
Jupyter Notebook MIT License UpdatedFeb 9, 2021 -
cnn-quantization Public
Forked from submission2019/cnn-quantizationQuantization of Convolutional Neural networks.
Python UpdatedDec 8, 2020 -
pacnet Public
Forked from NVlabs/pacnetPixel-Adaptive Convolutional Neural Networks (CVPR '19)
Python Other UpdatedDec 4, 2020 -
wincnn Public
Forked from andravin/wincnnWinograd minimal convolution algorithm generator for convolutional neural networks.
Python Apache License 2.0 UpdatedOct 17, 2020