Block or Report
Block or report test-dan-run
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
🤖 Build voice-based LLM agents. Modular + open source.
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
YOLOv10: Real-Time End-to-End Object Detection
User-friendly WebUI for LLMs (Formerly Ollama WebUI)
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
Official implementation of Half-Quadratic Quantization (HQQ)
Fast inference engine for Transformer models
🦀 Small exercises to get you used to reading and writing Rust code!
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
Official Implementation of EnCLAP (ICASSP 2024)
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Real-time stream processing for python
Zero-Shot Speech Editing and Text-to-Speech in the Wild
NOMAD is a fully unsupervised non-matching reference audio quality metric
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Code for paper "Large Language Models are Efficient Learners of Noise-Robust Speech Recognition"
Reference-aware automatic speech evaluation toolkit