Skip to content
View leffff's full-sized avatar
🍥
🍥

Highlights

  • Pro

Block or report leffff

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

[NeurIPS 2024] Boosting the performance of consistency models with PCM!

Python 342 11 Updated Sep 20, 2024

[CVPR2024] Official PyTorch implementation of "Contrastive Denoising Score(CDS) for Text-guided Latent Diffusion Image Editing"

Python 87 3 Updated Apr 5, 2024
Jupyter Notebook 94 11 Updated Feb 12, 2024

My take on E(n) Equivariant Graph Neural Networks

Python 6 Updated Sep 20, 2024
Jupyter Notebook 152 15 Updated Feb 2, 2024

Exploration into the Scaling Value Iteration Networks paper, from Schmidhuber's group

Python 36 Updated Sep 23, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 33,520 3,846 Updated Oct 2, 2024

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

Python 390 22 Updated Jun 5, 2024

KandinskyVideo — multilingual end-to-end text2video latent diffusion model

Python 169 19 Updated May 28, 2024

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 7,853 732 Updated Oct 5, 2024

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 2,141 128 Updated Sep 24, 2024

Official PyTorch implementation for the paper Minimizing Trajectory Curvature of ODE-based Generative Models, ICML 2023

Python 76 6 Updated May 22, 2024

The codebase of our paper "Improving the Training of Rectified Flows"

Python 67 3 Updated Jul 11, 2024

Light and Optimal Schrödinger Bridge Matching (ICML 2024) official PyTorch implementation=

Python 33 4 Updated Aug 8, 2024

PyTorch Implementation of Diffusion Schrodinger Bridge Matching

Python 115 5 Updated May 28, 2023

Text-to-Music Generation with Rectified Flow Transformers

Python 1,533 116 Updated Sep 6, 2024

PyTorch implementation of Real-ESRGAN model

Python 478 122 Updated Apr 15, 2024

Model for watermark classification implemented with PyTorch

Jupyter Notebook 86 22 Updated Sep 19, 2024

Code for "Aligning Optimization Trajectories with Diffusion Models for Constrained Design Generation" @ NeurIPS 2023

Python 8 5 Updated Oct 12, 2023

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Python 513 38 Updated Apr 23, 2024

[IJCAI-24] Spatial-Temporal-Decoupled Masked Pre-training for Spatiotemporal Forecasting

Python 98 7 Updated Sep 30, 2024

Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)

Python 1,558 135 Updated Jan 23, 2024

Scaling Diffusion Transformers with Mixture of Experts

Python 187 8 Updated Sep 9, 2024

[CVPR2024] Official implementation of High-fidelity Person-centric Subject-to-Image Synthesis.

Python 37 1 Updated Aug 23, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 610 22 Updated Oct 1, 2024

Official Github Repo for Neurips 2024 Paper Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment

Python 28 1 Updated Oct 3, 2024

Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]

Python 27 Updated Sep 21, 2024

VoiceLDM: Text-to-Speech with Environmental Context

Python 159 8 Updated Aug 9, 2024

FMBoost: Boosting Latent Diffusion with Flow Matching (ECCV 2024 Oral)

Python 152 1 Updated Oct 2, 2024

Zero-shot Image-to-Image Translation [SIGGRAPH 2023]

Python 1,057 79 Updated Sep 5, 2024
Next