yaowenxu

👋

Back to the Future.

Michael_Xu yaowenxu

81 followers · 25 following

Hangzhou, China
21:30 (UTC +08:00)
www.cnblogs.com/xuyaowen

Highlights

Developer Program Member
Pro

Stars

cteant / SPACE

Official implementation of SPACE

Python · 10 stars · Updated May 19, 2024

NJUNLP / MCSD

Multi-Candidate Speculative Decoding

Python · 29 stars · 5 forks · Updated Apr 22, 2024

meta-llama / llama-stack

Model components of the Llama Stack APIs

Python · 2,586 stars · 256 forks · Updated Sep 30, 2024

Equationliu / Kangaroo

Implementation of Kangaroo: Lossless Self-Speculative Decoding via Double Early Exiting

Python · 41 stars · 5 forks · Updated Jun 26, 2024

Infini-AI-Lab / MagicDec

Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

JavaScript · 60 stars · 4 forks · Updated Sep 28, 2024

meta-llama / llama-models

Utilities intended for use with Llama models.

Python · 4,237 stars · 750 forks · Updated Sep 25, 2024

triton-lang / triton

Development repository for the Triton language and compiler

C++ · 12,895 stars · 1,561 forks · Updated Sep 30, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python · 21,744 stars · 2,102 forks · Updated Aug 9, 2024

smart-lty / ParallelSpeculativeDecoding

The official code for paper "parallel speculative decoding with adaptive draft length."

Python · 17 stars · Updated Aug 23, 2024

lfsszd / CS-Drafting

Cascade Speculative Drafting

Python · 25 stars · 2 forks · Updated Apr 2, 2024

lucidrains / speculative-decoding

Explorations into some recent techniques surrounding speculative decoding

Python · 193 stars · 15 forks · Updated Oct 9, 2023

hemingkx / SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

371 stars · 14 forks · Updated Sep 26, 2024

bojone / papers.cool

Cool Papers - Immersive Paper Discovery

HTML · 361 stars · 5 forks · Updated Sep 11, 2024

hemingkx / Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python · 166 stars · 16 forks · Updated May 29, 2024

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python · 7,699 stars · 451 forks · Updated May 3, 2024

Infini-AI-Lab / TriForce

[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

Python · 209 stars · 12 forks · Updated Aug 31, 2024

aristocratos / btop

A monitor of resources

C++ · 19,922 stars · 617 forks · Updated Sep 24, 2024

XuehaiPan / nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Python · 4,670 stars · 146 forks · Updated Sep 11, 2024

zhongyang219 / TrafficMonitor

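A desktop floating-window program that shows the current network speed and CPU and memory usage. It also supports taskbar display and interchangeable skins.

C++ · 34,563 stars · 3,245 forks · Updated Mar 16, 2024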

Infini-AI-Lab / Sequoia

scalable and robust tree-based speculative decoding algorithm

Python · 304 stars · 31 forks · Updated Aug 13, 2024

dilab-zju / self-speculative-decoding

Code associated with the paper "Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding"

Jupyter Notebook · 131 stars · 8 forks · Updated May 24, 2024

lmmlzn / Awesome-LLMs-Datasets

Summarize existing representative LLMs text datasets.

838 stars · 83 forks · Updated Sep 4, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python · 1,669 stars · 112 forks · Updated Sep 30, 2024

github / gitignore

A collection of useful .gitignore templates

161,451 stars · 83,121 forks · Updated Sep 9, 2024

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) on multi-GPU Clusters

Python · 538 stars · 46 forks · Updated Sep 28, 2024

karpathy / nn-zero-to-hero

Neural Networks: Zero to Hero

Jupyter Notebook · 11,607 stars · 1,452 forks · Updated Aug 18, 2024

apple / ml-recurrent-drafter

Python · 62 stars · 2 forks · Updated Aug 30, 2024

punica-ai / punica

Serving multiple LoRA finetuned LLM as one

Python · 960 stars · 45 forks · Updated May 8, 2024

predibase / lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python · 2,126 stars · 139 forks · Updated Sep 30, 2024

apoorvumang / prompt-lookup-decoding

Jupyter Notebook · 450 stars · 22 forks · Updated Aug 23, 2024

Lists (8)

LLMs Inference Tools

Michael's Choices

Multi-LoRA Adapter

Multi-modal Serving

Research's Standby

Small Language Models

Speculative Decoding

System Tools
