Skip to content
View zhangyike's full-sized avatar

Block or report zhangyike

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch Implementation of Stepwise Monotonic Multihead Attention similar to Enhancing Monotonicity for Robust Autoregressive Transformer TTS

Python 31 6 Updated May 16, 2021

PyTorch implementation of Tacotron and Tacotron2

Python 32 12 Updated Jul 19, 2022

A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models

Python 1,765 255 Updated Jun 12, 2023

基于 g2pW 提升 pypinyin 的准确性

Python 75 7 Updated Jun 24, 2023

Data and code for grapheme-to-phoneme transducers in lots of languages

HTML 129 18 Updated Apr 5, 2024

A generative speech model for daily dialogue.

Python 31,082 3,378 Updated Sep 21, 2024

Datasets for Evaluation on Domain Knowledge Graph

52 3 Updated Jun 11, 2023

An Instruction-tuned Large Language Model for E-commerce

Python 221 14 Updated Sep 26, 2023

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

15,177 1,407 Updated Sep 19, 2024

This is now the official location of the Merlin project.

Python 1,306 441 Updated Mar 3, 2020

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

Python 1,779 527 Updated Oct 27, 2023

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

Jupyter Notebook 13,298 3,183 Updated Aug 12, 2024

WaveNet vocoder

Python 2,314 499 Updated Jul 29, 2023

Audio captioning recipe

Python 41 4 Updated Jun 22, 2024

CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding

Python 11 Updated Aug 13, 2024

Audio captioning - DCASE challenge 2023 task 6a

Jupyter Notebook 20 2 Updated Jan 28, 2024

Easily convert common crawl to a dataset of caption and document. Image/text Audio/text Video/text, ...

Python 303 23 Updated Dec 9, 2023

Modules to convert numbers to words. 42 --> forty-two

Python 816 488 Updated Sep 20, 2024

State of the Art Natural Language Processing

Scala 3,822 709 Updated Sep 28, 2024

Command line utility for forced alignment using Kaldi

Python 1,306 243 Updated Sep 28, 2024

A PyTorch-based Speech Toolkit

Python 8,616 1,367 Updated Sep 25, 2024

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,648 301 Updated Mar 14, 2023

Recurrent neural network for audio noise reduction

C 4,013 888 Updated Aug 24, 2024

👾 Fast and simple video download library and CLI tool written in Go

Go 27,216 2,950 Updated Sep 27, 2024

MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。

3,411 233 Updated Sep 14, 2024

大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

9,430 1,542 Updated May 23, 2024

WavPack encode/decode library, command-line programs, and several plugins

C 362 66 Updated Sep 16, 2024

Free Lossless Audio Codec

C 1,625 277 Updated Sep 27, 2024

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…

Python 12,454 847 Updated Sep 23, 2024
Next