ethxnp

Ethan Petersen ethxnp

Highlights

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,832 4,108 Updated Oct 4, 2024

A throughput-oriented high-performance serving framework for LLMs

Cuda 566 23 Updated Sep 21, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,409 396 Updated Oct 4, 2024

Model components of the Llama Stack APIs

Python 3,124 376 Updated Oct 4, 2024

A new local-first, privacy-focused and open-source home for your markdown notes

Svelte 791 15 Updated Sep 15, 2024

Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation

Python 415 23 Updated Sep 16, 2024

Python 250 33 Updated Aug 20, 2024

Go 166 9 Updated Oct 4, 2024

Claudette is Claude's friend

Jupyter Notebook 172 27 Updated Oct 3, 2024