State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,298 3,183 Updated Aug 12, 2024

r9y9 / wavenet_vocoder

WaveNet vocoder

Python 2,314 499 Updated Jul 29, 2023

wsntxxn / AudioCaption

Audio captioning recipe

Python 41 4 Updated Jun 22, 2024

Labbeti / conette-audio-captioning

CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding

Python 11 Updated Aug 13, 2024

prompteus / audio-captioning

Audio captioning - DCASE challenge 2023 task 6a

Jupyter Notebook 20 2 Updated Jan 28, 2024

rom1504 / cc2dataset

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...

Python 303 23 Updated Dec 9, 2023

savoirfairelinux / num2words

Modules to convert numbers to words. 42 --> forty-two

Python 816 488 Updated Sep 20, 2024

JohnSnowLabs / spark-nlp

State of the Art Natural Language Processing

Scala 3,822 709 Updated Sep 28, 2024

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Python 1,306 243 Updated Sep 28, 2024

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Python 8,616 1,367 Updated Sep 25, 2024

facebookresearch / denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,648 301 Updated Mar 14, 2023

xiph / rnnoise

Recurrent neural network for audio noise reduction

C 4,013 888 Updated Aug 24, 2024

iawia002 / lux

👾 Fast and simple video download library and CLI tool written in Go

Go 27,216 2,950 Updated Sep 27, 2024

esbatmop / MNBVC

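MNBVC (Massive Never-ending BT Vast Chinese corpus): an ultra-large-scale Chinese corpus, benchmarked against the 40T of data used to train chatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, scripts, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.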

3,411 233 Updated Sep 14, 2024

brightmart / nlp_chinese_corpus

Large Scale Chinese Corpus for NLP

9,430 1,542 Updated May 23, 2024

dbry / WavPack

WavPack encode/decode library, command-line programs, and several plugins

C 362 66 Updated Sep 16, 2024

xiph / flac

Free Lossless Audio Codec

C 1,625 277 Updated Sep 27, 2024

BlinkDL / RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,454 847 Updated Sep 23, 2024

Yike Zhang zhangyike

Stars

keonlee9420 / Stepwise_Monotonic_Multihead_Attention

thuhcsi / tacotron

thu-coai / CDial-GPT

mozillazg / pypinyin-g2pW

uiuc-sst / g2ps

2noise / ChatTTS

OpenBGBenchmark / OpenBG

Alibaba-NLP / EcomGPT

HqWu-HITCS / Awesome-Chinese-LLM

CSTR-Edinburgh / merlin

Vaibhavs10 / open-tts-tracker

ming024 / FastSpeech2

NVIDIA / DeepLearningExamples