Skip to content
View young01ai's full-sized avatar

Block or report young01ai

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Speech-to-text, text-to-speech, speaker recognition, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 …

C++ 3,243 380 Updated Oct 3, 2024

端到端语音唤醒工具箱,从模型训练到模型推理。

Python 69 10 Updated Sep 4, 2024

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI …

JavaScript 5,977 734 Updated Jul 17, 2024

An open source chat bot architecture for voice/vision (and multimodal) assistants, local and remote to run; if u run achatbot by yourself, u can learn more, fork to contribute

Python 10 1 Updated Oct 4, 2024

Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit

Python 688 116 Updated Sep 25, 2024

Anim-400K: A dataset designed from the ground up for automated dubbing of video

97 1 Updated Jun 21, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,182 659 Updated Sep 30, 2024

Perceptual Quality Estimator for speech and audio

C++ 683 124 Updated Aug 2, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,477 201 Updated Aug 1, 2024

【地球拼音】輸入方案

135 23 Updated Jul 23, 2024
Jupyter Notebook 4 1 Updated Apr 16, 2024

GUI for a Vocal Remover that uses Deep Neural Networks.

Python 17,706 1,324 Updated May 23, 2024

model_repo

101 7 Updated Dec 19, 2022

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 771 88 Updated Aug 7, 2024

TTS frontend gRPC service

Python 3 3 Updated Apr 18, 2023

🍦 ChatTTS-Forge is a project developed around TTS generation model, implementing an API Server and a Gradio-based WebUI.

Python 692 85 Updated Oct 4, 2024

A generative speech model for daily dialogue.

Python 31,179 3,386 Updated Sep 21, 2024

Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context

Python 172 10 Updated Sep 10, 2024