Block or Report
Block or report 201930240256
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
Multilingual Voice Understanding Model
手把手带你实战 Huggingface Transformers 课程视频同步更新在B站与YouTube
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Clarity Challenge toolkit - software for building Clarity Challenge systems
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
STGI: A speech intelligibility prediction algorithm based on spectro-temporal modulation glimpsing
Speech quality measure of SDR、SAR、STOI、ESTOI、PESQ via MATLAB
Python implementation of the Short Term Objective Intelligibility measure
Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deplo…
Official code for Conformer: Local Features Coupling Global Representations for Visual Recognition
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Robust Speech Recognition via Large-Scale Weak Supervision
🧑🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gan…
Homepage for STAT 157 at UC Berkeley