cekcoco

  • llm-awq Public

    Forked from mit-han-lab/llm-awq

    [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

    Python MIT License Updated Jul 16, 2024
  • [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

    Python MIT License Updated Mar 25, 2024
  • ppq Public

    Forked from OpenPPL/ppq

    PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

    Python Apache License 2.0 Updated Feb 27, 2024
  • 4 bits quantization of LLaMA using GPTQ

    Python Apache License 2.0 Updated Jun 6, 2023
  • mmdeploy Public

    Forked from open-mmlab/mmdeploy

    OpenMMLab Model Deployment Framework

    Python Apache License 2.0 Updated Jun 6, 2023
  • Jupyter Notebook Apache License 2.0 Updated May 30, 2023
  • triton Public

    Forked from triton-lang/triton

    Development repository for the Triton language and compiler

    C++ MIT License Updated Jan 31, 2023
  • I-BERT Public

    Forked from kssteven418/I-BERT

    [ICML'21 Oral] I-BERT: Integer-only BERT Quantization

    Python MIT License Updated Jan 29, 2023
  • TPAT Public

    Forked from Tencent/TPAT

    TensorRT Plugin Autogen Tool

    Python Apache License 2.0 Updated Oct 28, 2022
  • tpu-mlir Public

    Forked from sophgo/tpu-mlir

    Machine learning compiler based on MLIR for Sophgo TPU.

    C++ Apache License 2.0 Updated Aug 8, 2022
  • Tengine Public

    Forked from OAID/Tengine

    Tengine is a lightweight, high-performance, modular inference engine for embedded devices

    C++ Apache License 2.0 Updated Aug 6, 2022
  • tvm Public

    Forked from apache/tvm

    Open deep learning compiler stack for cpu, gpu and specialized accelerators

    Python Apache License 2.0 Updated Jul 15, 2022
  • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Python Apache License 2.0 Updated Jul 13, 2022
  • aimet Public

    Forked from quic/aimet

    AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

    Python Other Updated Jul 11, 2022
  • MQBench Public

    Forked from ModelTC/MQBench

    Model Quantization Benchmark

    Python Apache License 2.0 Updated Jul 11, 2022
  • onnx Public

    Forked from onnx/onnx

    Open standard for machine learning interoperability

    C++ Apache License 2.0 Updated Jun 29, 2022
  • Deformable DETR: Deformable Transformers for End-to-End Object Detection.

    Python Apache License 2.0 Updated May 22, 2022
  • NumCpp Public

    Forked from dpilger26/NumCpp

    C++ implementation of the Python Numpy library

    C++ MIT License Updated May 14, 2022
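  • 《代码随想录》 LeetCode problem-solving guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the difficult points, more than 50 mind maps, and solutions in C++, Java, Python, Go, JavaScript, and other languages, so that learning algorithms is no longer confusing! 🔥🔥 Take a look and you will wish you had found it sooner! 🚀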

    Updated Feb 11, 2022
  • Ultra-lightweight Chinese OCR with vertical text recognition and ncnn, mnn, and tnn inference support (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M

    C++ GNU General Public License v2.0 Updated Feb 9, 2022
  • State-of-the-art 2D and 3D Face Analysis Project

    Python MIT License Updated Oct 26, 2021
  • nanodet Public

    Forked from RangiLyu/nanodet

    ⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

    Python Apache License 2.0 Updated Oct 24, 2021
  • EasyQuant Public

    Forked from deepglint/EasyQuant

    EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activations.

    Python Other Updated Sep 8, 2021
  • BRECQ Public

    Forked from yhhhli/BRECQ

    Pytorch implementation of BRECQ, ICLR 2021

    Python MIT License Updated Aug 1, 2021
  • Converts PyTorch models to Caffe models. Supports PyTorch 0.3, 0.3.1, 0.4, 0.4.1, 1.0, 1.0.1, 1.2, and 1.3. Note that only PyTorch 1.1 has some bugs.

    Python MIT License Updated May 22, 2021
  • mlir Public

    Forked from tensorflow/mlir

    "Multi-Level Intermediate Representation" Compiler Infrastructure

    Updated Apr 22, 2021
  • Implementation of the algorithms in Statistical Learning Methods (《统计学习方法》), Second Edition.

    Jupyter Notebook MIT License Updated Feb 9, 2021
  • Quantization of Convolutional Neural networks.

    Python Updated Dec 8, 2020
  • pacnet Public

    Forked from NVlabs/pacnet

    Pixel-Adaptive Convolutional Neural Networks (CVPR '19)

    Python Other Updated Dec 4, 2020
  • wincnn Public

    Forked from andravin/wincnn

    Winograd minimal convolution algorithm generator for convolutional neural networks.

    Python Apache License 2.0 Updated Oct 17, 2020