Stars
5 stars written in Python
A high-throughput and memory-efficient inference and serving engine for LLMs
SGLang is a fast serving framework for large language models and vision language models.
Model components of the Llama Stack APIs
Official code for VEnhancer: Generative Space-Time Enhancement for Video Generation