Disong Wang Wendison

🎯

Focusing

PhD at CUHK, focusing on voice conversion and speech synthesis.

78 followers · 6 following

https://wendison.github.io/

Stars

fishaudio / fish-speech

Brand new TTS solution

Python 5,028 399 Updated Jul 9, 2024

Yuan-ManX / ai-audio-datasets

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio a…

380 31 Updated Jul 10, 2024

cwang621 / blsp-emo

BLSP-Emo: Towards Empathetic Large Speech-Language Models

Python 26 2 Updated Jun 7, 2024

line / LibriTTS-P

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

94 1 Updated Jun 13, 2024

fixie-ai / ultravox

Python 640 29 Updated Jul 9, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

10,549 701 Updated Jul 4, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 27,440 2,988 Updated Jul 10, 2024

Plachtaa / FAcodec

Training code for FAcodec presented in NaturalSpeech3

Python 117 12 Updated Jul 7, 2024

huggingface / dataspeech

Python 224 23 Updated Jul 5, 2024

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,231 710 Updated Jun 24, 2024

huggingface / parler-tts

Inference and training library for high-quality TTS models.

Python 2,870 294 Updated Jul 9, 2024

aixplain / tts-qa

Python 60 2 Updated May 3, 2024

facebookresearch / audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,200 2,022 Updated Jun 19, 2024

LinkSoul-AI / LLaSM

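The first open-source, commercially usable dialogue model supporting bilingual Chinese-English speech-text multimodal conversation. Convenient speech input greatly improves the experience of using large models that take text as input, while avoiding the cumbersome pipeline of ASR-based solutions and the errors it may introduce.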

Python 493 52 Updated Sep 11, 2023

allenai / OLMo

Modeling, training, eval, and inference code for OLMo

Python 4,203 393 Updated Jul 10, 2024

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 14,984 1,430 Updated Jul 10, 2024

yangdongchao / UniAudio

The Open Source Code of UniAudio

Python 479 31 Updated May 3, 2024

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 28,960 3,350 Updated Jul 10, 2024

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 34,041 3,553 Updated Jun 11, 2024

lucidrains / voicebox-pytorch

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python 544 45 Updated Feb 16, 2024

luosiallen / latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,206 215 Updated Jun 14, 2024

wl-zhao / UniPC

[NeurIPS 2023] UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Jupyter Notebook 283 12 Updated Sep 22, 2023

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,537 1,018 Updated Jun 26, 2024

X-LANCE / UniCATS-CTX-txt2vec

[AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS

Python 55 8 Updated Feb 23, 2024

lucidrains / vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Python 2,164 180 Updated Jul 10, 2024

ZhangXInFD / SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Python 369 32 Updated Jun 9, 2024

huggingface / distil-whisper

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

Python 3,345 242 Updated Jul 9, 2024

collabora / WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,569 188 Updated Jun 18, 2024

w-okada / voice-changer

Realtime Voice Changer

Python 15,336 1,647 Updated Jul 10, 2024

declare-lab / tango

A family of diffusion models for text-to-audio generation.

Python 953 75 Updated Jul 3, 2024
