KunZhou9646

🙃

I am here!

Kun Zhou KunZhou9646

🙃

I am here!

PhD student in National University of Singapore (NUS).

110 followers · 51 following

Human Language Technology Lab, NUS
Singapore
https://kunzhou9646.github.io/
@KunZhou65685140

Achievements

Stars

baaivision / Emu3

Next-Token Prediction is All You Need

Python 805 23 Updated Sep 30, 2024

wdndev / llm_interview_note

主要记录大语言大模型（LLMs）算法（应用）工程师相关的知识及面试题

HTML 2,929 339 Updated Aug 19, 2024

FireRedTeam / FireRedTTS

An Open-Sourced LLM-empowered Foundation TTS System

Python 191 8 Updated Sep 25, 2024

X-LANCE / VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Python 301 21 Updated Sep 3, 2024

qiuk2 / AAR

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Python 54 5 Updated Aug 24, 2024

lucidrains / e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 250 23 Updated Sep 11, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5,237 538 Updated Sep 29, 2024

haoheliu / SemantiCodec-inference

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 131 8 Updated Aug 25, 2024

line / LibriTTS-P

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

111 2 Updated Jun 13, 2024

sony / soundctm

Pytorch implementation of SoundCTM

Python 68 6 Updated Oct 1, 2024

Text-to-Audio / AudioLCM

PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.

Python 1,117 177 Updated Jul 17, 2024

bytedance / Make-An-Audio-2

a text-conditional diffusion probabilistic model capable of generating high fidelity audio.

Python 119 14 Updated May 29, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 21,763 2,105 Updated Aug 9, 2024

HSU-ANT / beaqlejs

*BeaqleJS* provides a framework to create browser based listening tests and is purely based on open web standards like HTML5 and Javascript.

JavaScript 86 49 Updated Mar 9, 2019

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 31,186 3,387 Updated Sep 21, 2024

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python 513 43 Updated Oct 2, 2024

shivammehta25 / Lumina-T2X

Forked from Alpha-VLLM/Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 1 Updated May 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kun Zhou KunZhou9646

Achievements

Achievements

Block or report KunZhou9646

Stars

baaivision / Emu3

wdndev / llm_interview_note

FireRedTeam / FireRedTTS

X-LANCE / VoiceFlow-TTS

qiuk2 / AAR

lucidrains / e2-tts-pytorch

FunAudioLLM / CosyVoice

haoheliu / SemantiCodec-inference

line / LibriTTS-P

sony / soundctm

Text-to-Audio / AudioLCM

bytedance / Make-An-Audio-2

hpcaitech / Open-Sora

HSU-ANT / beaqlejs

2noise / ChatTTS

X-LANCE / SLAM-LLM

shivammehta25 / Lumina-T2X

CorentinJ / librispeech-alignments

Alpha-VLLM / Lumina-T2X

JinhuaLiang / WavCraft

Text-to-Audio / Make-An-Audio

XinhaoMei / WavCaps

collabora / WhisperSpeech

michen00 / unified_multilingual_dataset_of_emotional_human_utterances

p0p4k / pflowtts_pytorch

declare-lab / tango

cdjkim / audiocaps

huggingface / dataspeech

descriptinc / descript-audio-codec

huggingface / parler-tts