Yong Ren y-ren16

12 followers · 20 following

University of Chinese Academy of Sciences

Achievements

Highlights

Pro

Stars

MassimilianoPasquini97 / raycast_ollama

Raycast extension for Ollama

TypeScript 189 13 Updated Aug 6, 2024

kyutai-labs / moshi

Python 5,921 445 Updated Oct 2, 2024

moyangkuo / AudioMarkBench

Dataset/code for AudioMarkBench: Benchmarking Robustness of Audio Watermarking

Python 10 1 Updated Aug 23, 2024

dair-ai / ML-YouTube-Courses

📺 Discover the latest machine learning / AI courses on YouTube.

15,817 1,893 Updated Jan 22, 2024

microsoft / LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,429 669 Updated Aug 14, 2024

TimbreWatermarking / TimbreWatermarking

Python 33 4 Updated Dec 11, 2023

huggingface / peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 16,000 1,565 Updated Oct 2, 2024

facebookresearch / audioseal

Localized watermarking for AI-generated speech audios, with SOTA on robustness and very fast detector

Python 420 51 Updated Aug 28, 2024

comydream / Discop

Provably Secure Steganography in Practice Based on “Distribution Copies”

Python 26 2 Updated Apr 18, 2024

Georgefwt / AquaLoRA

This repository contains the implementation for the paper "AquaLoRA: Toward White-box Protection for Customized Stable Diffusion Models via Watermark LoRA", accepted by ICML 2024.

Python 26 Updated Sep 2, 2024

NVIDIA / BigVGAN

Official PyTorch implementation of BigVGAN (ICLR 2023)

Python 853 97 Updated Sep 5, 2024

IDSIA / kohonen-vae

Official repository for the paper "Topological Neural Discrete Representation Learning à la Kohonen" (ICML 2023 Workshop on Sampling and Optimization in Discrete Space)

Python 8 1 Updated Apr 27, 2023

facebookresearch / quantized_identifiability

Repository for the code associated with the paper "On the Identifiability of Quantized Factors" by Vitória Barin-Pacela, Kartik Ahuja, Simon Lacoste-Julien, Pascal Vincent, Conference on Causal Lea…

Jupyter Notebook 7 1 Updated May 7, 2024

DaiDaiLoh / QG-VAE

According to the paper "Quantised Global Autoencoder: A Holistic Approach to Representing Visual Data"

Jupyter Notebook 4 Updated Aug 16, 2024

lucidrains / e2-tts-pytorch

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch

Python 244 22 Updated Sep 11, 2024

Bai-YT / ConsistencyTTA

ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation

Python 27 Updated Jun 15, 2024

lllyasviel / Paints-UNDO

Understand Human Behavior to Align True Needs

Python 3,313 292 Updated Jul 20, 2024

open-mmlab / FoleyCrafter

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI Foley master: add vivid, synchronized sound effects to your silent videos 😝

Python 419 38 Updated Jul 26, 2024

FunAudioLLM / CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 5,212 535 Updated Sep 29, 2024

google / style-aligned

Official code for "Style Aligned Image Generation via Shared Attention"

Python 1,204 88 Updated Dec 29, 2023

guyyariv / TempoTokens

This repo contains the official PyTorch implementation of: Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation

Python 101 11 Updated Apr 23, 2024

speechbrain / benchmarks

This repository contains the SpeechBrain Benchmarks

Python 92 35 Updated Sep 19, 2024

ShihaoZhaoZSH / Uni-ControlNet

[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Python 588 42 Updated Jul 17, 2024

haoheliu / SemantiCodec-inference

Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.

Python 130 8 Updated Aug 25, 2024

luosiallen / latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,302 224 Updated Jun 14, 2024

luosiallen / Diff-Foley

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

Python 150 18 Updated May 29, 2024

2noise / ChatTTS

A generative speech model for daily dialogue.

Python 31,173 3,390 Updated Sep 21, 2024

mhamilton723 / DenseAV

Official code for the CVPR 2024 Paper: Separating the "Chirp" from the "Chat": Self-supervised Visual Grounding of Sound and Language

Jupyter Notebook 54 9 Updated Jun 12, 2024

lllyasviel / ControlNet-v1-1-nightly

Nightly release of ControlNet 1.1

Python 4,682 371 Updated Aug 8, 2024

lllyasviel / ControlNet

Let us control diffusion models!

Python 29,930 2,702 Updated Feb 25, 2024