- Zagreb
Stars
MARS5 speech model (TTS) from CAMB.AI
This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Inference and training library for high-quality TTS models.
A complete computer science study plan to become a software engineer.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…
A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.
Instant voice cloning by MIT and MyShell.
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer
A simple and easy-to-use library to enjoy videogames programming
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …
a reactive Clojure dialect for web development that uses a compiler to manage the frontend/backend boundary
Neural Networks: Zero to Hero
GPU-accelerated force graph layout and rendering
Godot Engine – Multi-platform 2D and 3D game engine
Resumes generated using the GitHub informations
Controllable and fast Text-to-Speech for over 7000 languages!