Skip to content
View rishikksh20's full-sized avatar
🖐️
Happy to help you !!!
🖐️
Happy to help you !!!

Organizations

@EpicGames @coala
Block or Report

Block or report rishikksh20

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results
Python 5 2 Updated Jun 22, 2024
Python 14 2 Updated Jun 27, 2024

This repository contains the SpeechBrain Benchmarks

Python 66 31 Updated Jun 26, 2024

VALL-E 2 reproduction

Jupyter Notebook 20 1 Updated Jun 25, 2024

Compressed using encodec librilight datasets

Jupyter Notebook 7 Updated Jun 22, 2024

The official repository of Dynamic-SUPERB.

Python 136 84 Updated Jun 29, 2024

AudioSR-Upsampling (any -> 48kHz)

Python 34 1 Updated Feb 13, 2024

Spherical residual vector quantization (SRVQ)

Python 24 Updated Jun 8, 2024

Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer

HTML 24 Updated Jun 18, 2024

Have a natural voice conversation with an LLM

Python 38 11 Updated Jun 23, 2024

OmniTokenizer: one model and one weight for image-video joint tokenization.

Python 166 3 Updated Jun 26, 2024

Official implementation of "Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling"

Python 627 34 Updated Jun 27, 2024

A Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS

18 Updated Jun 13, 2024

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

92 1 Updated Jun 13, 2024

MARS5 speech model (TTS) from CAMB.AI

Python 1,497 107 Updated Jun 28, 2024

Expressive Anechoic Recordings of Speech (EARS)

Python 83 5 Updated Jun 25, 2024

PitchVC: Pitch Conditioned Any-to-Many Voice Conversion

Python 19 4 Updated Jun 6, 2024
Python 43 3 Updated Jun 27, 2024

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 872 32 Updated Jun 28, 2024

PyTorch implementation of Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities.

Python 65 Updated Jun 26, 2024

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 55 2 Updated Jun 21, 2024

Pytorch implementation of SoundCTM

Python 67 6 Updated Jun 7, 2024

An official implementation for SSAMBA: Self-Supervised Audio Mamba

Python 69 2 Updated Jun 3, 2024

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Python 683 49 Updated Jun 27, 2024

official code for Diff-Instruct algorithm for one-step diffusion distillation

Python 37 2 Updated Apr 6, 2024

ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec

Python 124 3 Updated Jun 4, 2024

Schedule-Free Optimization in PyTorch

Python 1,534 49 Updated May 30, 2024

High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec

Jupyter Notebook 58 6 Updated May 23, 2024
Python 612 23 Updated Jun 26, 2024
Next