tonyw

wangxin tonyw

10 followers · 10 following

China
http://dev4dev.com

Achievements

Lists (1)

Sort

✨ Inspiration

1 repository

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

mlfoundations / dclm

DataComp for Language Models

HTML 1,116 97 Updated Sep 5, 2024

zhihu / TLLM_QMM

TLLM_QMM strips the implementation of quantized kernels of Nvidia's TensorRT-LLM, removing NVInfer dependency and exposes ease of use Pytorch module. We modified the dequantation and weight preproc…

C++ 10 2 Updated Jul 5, 2024

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,047 836 Updated Jul 1, 2024

mit-han-lab / llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,347 179 Updated Jul 16, 2024

mit-han-lab / lmquant

Python 102 6 Updated Jun 12, 2024

voidint / g

Golang Version Manager

Go 1,860 207 Updated Aug 20, 2024

zsbai / wechat-versions

Forked from tom-snow/wechat-windows-versions

保存微信历史版本

Shell 340 22 Updated Aug 26, 2024

cognitivecomputations / laserRMT

This is our own implementation of 'Layer Selective Rank Reduction'

Python 229 26 Updated May 26, 2024

alibaba / Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 659 93 Updated Sep 22, 2024

ali-vilab / AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 3,933 359 Updated Apr 8, 2024

SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,885 404 Updated Sep 6, 2024

modelscope / facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 8,890 835 Updated Aug 14, 2024

IEIT-Yuan / Yuan-2.0

Yuan 2.0 Large Language Model

Python 678 85 Updated Jul 11, 2024

geekan / MetaGPT

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 43,691 5,204 Updated Aug 21, 2024

Dao-AILab / flash-attention

Fast and memory-efficient exact attention

Python 13,474 1,235 Updated Sep 21, 2024

genggui001 / Megatron-DeepSpeed-Llama

Python 82 13 Updated Sep 9, 2023

LydiaXiaohongLi / Megatron-DeepSpeed

Forked from microsoft/Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 19 3 Updated Jul 20, 2023

GaiZhenbiao / ChuanhuChatGPT

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Python 15,156 2,284 Updated Sep 19, 2024

TigerResearch / TigerBot

TigerBot: A multi-language multi-task LLM

Python 2,235 194 Updated Jun 7, 2024

NVIDIA / TransformerEngine

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,823 304 Updated Sep 20, 2024