Highlights
- Pro
Block or Report
Block or report manbaaaa
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
LLM based TTS model, providing inference/training/deployment full-stack ability.
Material for cuda-mode lectures
screen sharing for developers https://screego.net/
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
The official repository of the paper "(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts"
An extremely fast Python package installer and resolver, written in Rust.
The open source implementation of Gemini, the model that will "eclipse ChatGPT" by Google
An open-source package providing standardized tools for sound event analysis and data management.
A library built for easier audio self-supervised training, downstream tasks evaluation
Segment a given audio into utterances using a trained end-to-end ASR model.
[INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark
This repository contains the python implementation of a Sound Event Detection systems working in real time.
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
Mac app for crushing remote tech interviews with AI
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
Open source real-time translation app for Android that runs locally
paraformer的输出token和编码器alpha系数进行强制对齐