- Dubpro.ai
- New Delhi, India
- https://www.dubpro.ai/
- @ai_rishikesh
Starred repositories
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with better semantics in the latent space.
LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a chain of LLMs to find the answer. The user can see the progres…
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
Audio Codec Speech processing Universal PERformance Benchmark
Versatile audio super resolution (any -> 48kHz) with AudioSR.
Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale
The code for the bark-voicecloning model. Training and inference.
Joplin - the secure note taking and to-do app with synchronisation capabilities for Windows, macOS, Linux, Android and iOS.
Google's SoundStorm: Efficient Parallel Audio Generation
The YouTube Text-To-Speech dataset consists of waveform audio extracted from YouTube videos alongside their English transcriptions
🐙 Guides, papers, lectures, notebooks and resources for prompt engineering
Implementation of ChatGPT RLHF (Reinforcement Learning from Human Feedback) on any generation model in Hugging Face's transformers (bloomz-176B/bloom/gpt/bart/T5/MetaICL)
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech), Reproduced Demo: https://lifeiteng.github.io/valle/index.html
AudioLDM: Generate speech, sound effects, music and beyond, with text.
A timeline of the latest AI models for audio generation, starting in 2023!
PyTorch implementation of normalizing flow models
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition
Trainer for audio-diffusion-pytorch
Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.