LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,266 137 Updated Sep 24, 2024

modelscope / FunCodec

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python 354 30 Updated Jan 25, 2024

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

Python 1,332 135 Updated Jun 21, 2024

yangdongchao / AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

Python 574 80 Updated Dec 27, 2023

yoonsanghyu / Conv-TasNet-v3-PyTorch

Unofficial implementation of fully cnvolutional time-domain audio separation network (ConvTasNet v3)

Python 5 1 Updated Mar 31, 2021

TUIlmenauAMS / Comparison-of-Blind-Source-Separation-techniques

Compare AIRES BSS with TRINICON, ILRMA and AuxIVA (online and offline versions)

MATLAB 67 33 Updated Aug 7, 2020

exacity / deeplearningbook-chinese

Deep Learning Book Chinese Translation

TeX 35,676 9,121 Updated Dec 3, 2019

WenDesi / lihang_book_algorithm

致力于将李航博士《统计学习方法》一书中所有算法实现一遍

Python 5,695 1,987 Updated Apr 29, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

manyeyarsltc ltcxjtu

Block or report ltcxjtu

Stars

enhuiz / vall-e

lifeiteng / vall-e

fishaudio / fish-speech

XianruiWang / AudioDec

facebookresearch / AudioDec

homebrewltd / ichigo