kobenaxie

6 followers · 8 following

USTC
Hefei, China

Stars

kyutai-labs / moshi

Python 5,986 446 Updated Oct 4, 2024

zhenye234 / xcodec

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Python 89 3 Updated Oct 1, 2024

gpt-omni / mini-omni

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,745 254 Updated Sep 25, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,124 67 Updated Aug 13, 2024

frankenliu / LOAE

Python 8 Updated Sep 25, 2024

lhotse-speech / lhotse

Tools for handling speech data in machine learning projects.

Python 936 214 Updated Oct 4, 2024

karpathy / LLM101n

LLM101n: Let's build a Storyteller

29,147 1,598 Updated Aug 1, 2024

niedev / RTranslator

Open source real-time translation app for Android that runs locally

C++ 6,593 498 Updated Sep 27, 2024

hubertsiuzdak / snac

Multi-Scale Neural Audio Codec (SNAC) compresses audio into discrete codes at a low bitrate

Python 372 21 Updated Sep 11, 2024

X-LANCE / SLAM-LLM

Speech, Language, Audio, Music Processing with Large Language Model

Python 513 43 Updated Oct 2, 2024

XuezheMax / megalodon

Reference implementation of Megalodon 7B model

Cuda 502 52 Updated Apr 18, 2024

KdaiP / StableTTS

Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3

Python 347 39 Updated Sep 13, 2024

jasonppy / VoiceCraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

Jupyter Notebook 7,520 740 Updated Jun 24, 2024

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 21,763 2,105 Updated Aug 9, 2024

facebookresearch / fairseq2

FAIR Sequence Modeling Toolkit 2

Python 682 78 Updated Oct 4, 2024

nikvaessen / w2v2-batch-size

Code for paper "The effect of batch size on contrastive self-supervised speech representation learning"

Python 8 1 Updated Aug 29, 2024

willisma / SiT

Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"

Python 600 28 Updated Mar 12, 2024

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,078 539 Updated May 31, 2024

karpathy / minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,079 838 Updated Jul 1, 2024

ml-explore / mlx-data

Efficient framework-agnostic data loading

C++ 359 38 Updated Sep 7, 2024

IBM / unitxt

🦄 Unitxt: a python library for getting data fired up and set for training and evaluation

Python 153 40 Updated Oct 4, 2024

nnaisense / bayesian-flow-networks

This is the official code release for Bayesian Flow Networks.

Python 244 27 Updated Jul 18, 2024

fishaudio / fish-speech

Brand new TTS solution

Python 12,848 961 Updated Oct 3, 2024

juicedata / juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3.

Go 10,647 930 Updated Sep 30, 2024

resemble-ai / resemble-enhance

AI powered speech denoising and enhancement

Python 1,324 135 Updated Jun 21, 2024

QwenLM / Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,420 105 Updated Jul 5, 2024

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

Python 7,857 1,112 Updated Oct 1, 2024

mosaicml / streaming

A Data Streaming Library for Efficient Neural Network Training

Python 1,087 137 Updated Oct 2, 2024

wladradchenko / wunjo.wladradchenko.ru

Wunjo CE: Face Swap, Lip Sync, Control Remove Objects & Text & Background, Restyling, Audio Separator, Clone Voice, Video Generation. Open Source, Local & Free.

Python 828 95 Updated Sep 19, 2024

bytedance / SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Python 996 78 Updated Sep 24, 2024
