Skip to content
View rishikksh20's full-sized avatar
🖐️
Happy to help you !!!
🖐️
Happy to help you !!!

Organizations

@EpicGames @coala
Block or Report

Block or report rishikksh20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

188 results for sponsorable starred repositories
Clear filter

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 71 2 Updated Jun 21, 2024
HTML 37 Updated Jun 11, 2024

LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…

Go 5,410 332 Updated Jun 3, 2024

FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3

Python 128 8 Updated Apr 20, 2024

Audio Codec Speech processing Universal PERformance Benchmark

Python 183 21 Updated Jun 19, 2024

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Python 987 96 Updated May 10, 2024

Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale

26 Updated Aug 4, 2023

Text-to-Audio/Music Generation

Python 2,150 172 Updated Jun 27, 2024

TTS Text Analyzer

32 3 Updated Jul 20, 2023

The code for the bark-voicecloning model. Training and inference.

Python 610 104 Updated Sep 13, 2023

Joplin - the secure note taking and to-do app with synchronisation capabilities for Windows, macOS, Linux, Android and iOS.

TypeScript 44,234 4,787 Updated Jul 9, 2024

Google's SoundStorm: Efficient Parallel Audio Generation

Python 116 12 Updated Aug 8, 2023
Python 39 9 Updated May 15, 2023
Python 69 3 Updated May 19, 2023

The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions

Python 48 1 Updated Apr 1, 2021

🐙 Guides, papers, lecture, notebooks and resources for prompt engineering

MDX 46,104 4,441 Updated Jul 9, 2024

Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-176B/bloom/gpt/bart/T5/MetaICL)

Python 532 64 Updated May 9, 2024

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 1,930 316 Updated Nov 14, 2023

AudioLDM: Generate speech, sound effects, music and beyond, with text.

Python 2,334 222 Updated Jun 2, 2024

A timeline of the latest AI models for audio generation, starting in 2023!

1,873 66 Updated Jan 4, 2024

PyTorch implementation of normalizing flow models

Python 639 100 Updated Jul 9, 2024

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Python 859 151 Updated Apr 4, 2024

Reverse engineered ChatGPT API

Python 28,005 4,497 Updated Aug 2, 2023
Jupyter Notebook 77 8 Updated May 21, 2023

Trainer for audio-diffusion-pytorch

Python 125 22 Updated Jan 13, 2023

Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"

Python 10 Updated Nov 25, 2021

Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

Python 8,002 710 Updated Dec 10, 2023

Stable Diffusion web UI

Python 7,846 888 Updated May 20, 2024
Next