Stars
An unofficial PyTorch implementation of the audio LM VALL-E
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
XianruiWang / AudioDec
Forked from facebookresearch/AudioDecAn Open-source Streaming High-fidelity Neural Audio Codec
An Open-source Streaming High-fidelity Neural Audio Codec
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
AI powered speech denoising and enhancement
AcademiCodec: An Open Source Audio Codec Model for Academic Research
Unofficial implementation of fully cnvolutional time-domain audio separation network (ConvTasNet v3)
Compare AIRES BSS with TRINICON, ILRMA and AuxIVA (online and offline versions)
Deep Learning Book Chinese Translation