Studying man made horrors so they're no longer beyond my comprehension.

A programming language for game development

Clojure 361 13 Updated Oct 2, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,471 200 Updated Aug 1, 2024

This repo contains the official PyTorch implementation of "Audio Super Resolution in the Spectral Domain" (ICASSP 2023)

Python 199 26 Updated Jul 14, 2024

Generate high-definition short videos with one click using large AI models (LLMs).

Python 16,331 2,586 Updated Jul 26, 2024

Inference and training library for high-quality TTS models.

Python 4,289 430 Updated Sep 23, 2024

A complete computer science study plan to become a software engineer.

305,070 76,554 Updated Sep 13, 2024

Create Music in Seconds with SunoAPI. 👇

Python 1,429 205 Updated Aug 12, 2024

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,516 739 Updated Jun 24, 2024
TSQL 5 Updated Sep 6, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,447 3,839 Updated Oct 2, 2024

GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis

Python 964 214 Updated Aug 28, 2023

Preprocess Audio for training

Python 232 45 Updated Sep 17, 2024

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processin…

633 42 Updated Aug 9, 2024

PyTorch implementation of BigVSAN

Python 196 16 Updated Mar 23, 2024

A reactive notebook for Python — run reproducible experiments, execute as a script, deploy as an app, and version with git.

Python 6,593 225 Updated Oct 2, 2024

Instant voice cloning by MIT and MyShell.

Python 28,936 2,823 Updated Aug 21, 2024

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 37,697 3,964 Updated Jul 28, 2024

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

Jupyter Notebook 128 26 Updated Apr 9, 2021

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Python 990 205 Updated Apr 25, 2022

VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer

Python 309 42 Updated Jul 17, 2024

A simple and easy-to-use library to enjoy videogames programming

C 21,906 2,216 Updated Oct 2, 2024

A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, …

Python 318 41 Updated Sep 24, 2022

A reactive Clojure dialect for web development that uses a compiler to manage the frontend/backend boundary

Clojure 1,797 46 Updated Oct 2, 2024
TypeScript 1 Updated Jan 28, 2023

Neural Networks: Zero to Hero

Jupyter Notebook 11,625 1,455 Updated Aug 18, 2024

GPU-accelerated force graph layout and rendering

TypeScript 823 53 Updated Sep 27, 2024

Godot Engine – Multi-platform 2D and 3D game engine

C++ 89,697 20,803 Updated Oct 2, 2024
Clojure 3 Updated Feb 25, 2021

Resumes generated using GitHub information

JavaScript 61,907 1,356 Updated Feb 15, 2023

Controllable and fast Text-to-Speech for over 7000 languages!

Python 1,401 158 Updated Oct 2, 2024