Skip to content
View tonyw's full-sized avatar

Block or report tonyw

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

DataComp for Language Models

HTML 1,116 97 Updated Sep 5, 2024

TLLM_QMM strips the implementation of quantized kernels of Nvidia's TensorRT-LLM, removing NVInfer dependency and exposes ease of use Pytorch module. We modified the dequantation and weight preproc…

C++ 10 2 Updated Jul 5, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,047 836 Updated Jul 1, 2024

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Python 2,347 179 Updated Jul 16, 2024
Python 102 6 Updated Jun 12, 2024

Golang Version Manager

Go 1,860 207 Updated Aug 20, 2024

保存微信历史版本

Shell 340 22 Updated Aug 26, 2024

This is our own implementation of 'Layer Selective Rank Reduction'

Python 229 26 Updated May 26, 2024

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 659 93 Updated Sep 22, 2024

Official implementations for paper: Anydoor: zero-shot object-level image customization

Python 3,933 359 Updated Apr 8, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,885 404 Updated Sep 6, 2024

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 8,890 835 Updated Aug 14, 2024

Yuan 2.0 Large Language Model

Python 678 85 Updated Jul 11, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 43,691 5,204 Updated Aug 21, 2024

Fast and memory-efficient exact attention

Python 13,474 1,235 Updated Sep 21, 2024

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 19 3 Updated Jul 20, 2023

GUI for ChatGPT API and many LLMs. Supports agents, file-based QA, GPT finetuning and query with web search. All with a neat UI.

Python 15,156 2,284 Updated Sep 19, 2024

TigerBot: A multi-language multi-task LLM

Python 2,235 194 Updated Jun 7, 2024

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 1,823 304 Updated Sep 20, 2024

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,699 705 Updated Jul 3, 2024

SoftVC VITS Singing Voice Conversion

Python 25,396 4,763 Updated Nov 11, 2023

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,387 4,162 Updated Aug 19, 2024

A Gradio web UI for Large Language Models.

Python 39,658 5,211 Updated Sep 16, 2024

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 34,827 4,053 Updated Sep 21, 2024

Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Python 1,609 74 Updated Oct 26, 2023

A playbook for systematically maximizing the performance of deep learning models.

26,533 2,205 Updated Jun 18, 2024

LLM training code for Databricks foundation models

Python 3,974 523 Updated Sep 22, 2024

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

Python 9,201 713 Updated Aug 5, 2024

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,195 1,861 Updated Apr 30, 2024
Next