vokkko

1 follower · 20 following

Popular repositories

Awesome-Efficient-LLM Public

Forked from horseee/Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Python

auto-round Public

Forked from intel/auto-round

SOTA weight-only quantization algorithm for LLMs. This is the official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"

Python

EfficientDM Public

Forked from ThisisBillhe/EfficientDM

[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models"

Jupyter Notebook

Quest Public

Forked from mit-han-lab/Quest

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

CUDA