  • Hong Kong
  • 20:42 (UTC +08:00)

Highlights

  • Pro

Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models

Python 179 13 Updated Apr 25, 2024

A VAE modified from Descript Audio Codec, replacing the RVQ with a VAE

Python 41 5 Updated Apr 2, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,760 283 Updated Apr 30, 2024

A family of diffusion models for text-to-audio generation.

Python 947 75 Updated May 2, 2024

PyTorch Implementation of [AudioLCM]: efficient and high-quality text-to-audio generation with a latent consistency model.

Python 241 36 Updated Jun 19, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 29,232 3,468 Updated Jun 29, 2024

Multi-level network clustering based on the Map Equation

C++ 423 88 Updated Jun 20, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 476 28 Updated Jun 25, 2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 686 82 Updated Feb 19, 2024

A generative speech model for daily dialogue.

Python 26,438 2,874 Updated Jun 29, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 9,642 613 Updated May 2, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,573 401 Updated Jun 28, 2024

Mamba SSM architecture

Python 11,392 924 Updated Jun 24, 2024

Instant voice cloning by MyShell.

Python 26,999 2,613 Updated Jun 24, 2024

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Python 3,989 262 Updated Jun 21, 2024

PyTorch implementation of normalizing flow models

Python 634 98 Updated Mar 1, 2024
Python 276 10 Updated Jun 21, 2024

PyTorch Implementation of DSB for Score Based Generative Modeling. Experiments managed using Hydra.

Python 123 12 Updated Nov 23, 2021

AI powered speech denoising and enhancement

Python 1,070 100 Updated Jun 21, 2024

RWKV in nanoGPT style

Python 160 12 Updated Jun 9, 2024

Google Drive CLI Client

Rust 1,311 71 Updated Mar 15, 2024

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,210 85 Updated Apr 18, 2024

An unofficial PyTorch implementation of VALL-E

Python 57 3 Updated Jun 29, 2024

LLM powered development for Neovim

Lua 634 40 Updated Jun 18, 2024

Implementation of Voicebox, new SOTA Text-to-speech network from MetaAI, in Pytorch

Python 537 44 Updated Feb 16, 2024

Text-to-Audio/Music Generation

Python 2,141 170 Updated Jun 27, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available at https://plachtaa.github.io

Python 7,401 745 Updated Feb 11, 2024

Inference Llama 2 in one file of pure C

C 16,704 1,940 Updated Jun 26, 2024

Awesome-LLM: a curated list of Large Language Models

15,840 1,268 Updated Jun 28, 2024

Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E

Python 132 17 Updated Jul 28, 2023