Stars
Use PEFT or Full-parameter to finetune 350+ LLMs or 90+ MLLMs. (Qwen2.5, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V-2.6, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Open Source framework for voice and multimodal conversational AI
A fast inference library for running LLMs locally on modern consumer-class GPUs
Free and Open Source, Distributed, RESTful Search Engine
Maix Speech AI lib, a fast and small speech lib running on embedded devices, including ASR, chat, TTS etc.
A 10000+ hours dataset for Chinese speech recognition
LvHang / aps
Forked from funcwj/apsA workspace for single/multi-channel speech recognition & enhancement & separation.
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
Production First and Production Ready End-to-End Speech Recognition Toolkit
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
FSA/FST algorithms, differentiable, with PyTorch compatibility.
kaldi-asr/kaldi is the official location of the Kaldi project.