Stars
A high-throughput and memory-efficient inference and serving engine for LLMs
A throughput-oriented high-performance serving framework for LLMs
SGLang is a fast serving framework for large language models and vision language models.
Model components of the Llama Stack APIs
A new local-first, privacy-focused and open-source home for your markdown notes
Official codes of VEnhancer: Generative Space-Time Enhancement for Video Generation