Block or Report
Block or report wejoncy
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
QLLM Public
A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.
-
vllm-backup Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedDec 27, 2023 -
EAGLE Public
Forked from SafeAILab/EAGLEEAGLE: Lossless Acceleration of LLM Decoding by Feature Extrapolation
Python Apache License 2.0 UpdatedDec 14, 2023 -
XbitOps Public
[X] bit GEMV/DQ support for quantized LLM
-
onnxKapok Public
An AOT compiler for onnx model, for accelerating transformers on Mobile/Server/GPUs. One Line of code, 30% faster at most on ARM/INTEL CPU
-
onnxruntime-extensions Public
Forked from microsoft/onnxruntime-extensionsThe pre- and post processing library for ONNX Runtime
Python MIT License UpdatedFeb 27, 2023 -
kernl Public
Forked from ELS-RD/kernlKernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
Jupyter Notebook Apache License 2.0 UpdatedFeb 18, 2023 -
onnxruntime Public
Forked from microsoft/onnxruntimeONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++ MIT License UpdatedJan 6, 2023 -
XNNPACK Public
Forked from google/XNNPACKHigh-efficiency floating-point neural network inference operators for mobile, server, and Web
C Other UpdatedNov 12, 2022 -
Open standard for machine learning interoperability
C++ Apache License 2.0 UpdatedJun 14, 2022 -
-
awesome-tensor-compilers Public
Forked from merrymercy/awesome-tensor-compilersA list of awesome compiler projects and papers for tensor computation and deep learning.
UpdatedAug 30, 2021 -
winograd_study Public
a easy understand python implementation
-
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedDec 24, 2020 -
AiLearning Public
Forked from apachecn/ailearningAiLearning: 机器学习 - MachineLearning - ML、深度学习 - DeepLearning - DL、自然语言处理 NLP
Python GNU General Public License v3.0 UpdatedJul 3, 2019 -
-
deeplearningbook-chinese Public
Forked from exacity/deeplearningbook-chineseDeep Learning Book Chinese Translation
TeX UpdatedMar 2, 2017 -
neural-networks-and-deep-learning Public
Forked from mnielsen/neural-networks-and-deep-learningCode samples for my book "Neural Networks and Deep Learning"
Python UpdatedNov 12, 2016 -
mapreduce Public
Forked from cdmh/mapreduceC++ MapReduce Library for efficient multi-threading on single-machine
C++ UpdatedJul 3, 2016 -
string-splitting Public
Forked from tobbez/string-splittingString splitting benchmarks
C++ UpdatedMay 29, 2016 -
machine-learning-cheat-sheet Public
Forked from soulmachine/machine-learning-cheat-sheetClassical equations and diagrams in machine learning
TeX UpdatedApr 14, 2016 -