Starred repositories
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.
[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
[CVPR24] Official Implementation of 'A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing'
Code for the ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"
Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"
This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023
[NeurIPS 2024 Spotlight] The official implementation of the research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
COLMAP - Structure-from-Motion and Multi-View Stereo
[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.
[NeurIPS 2024 D&B Track] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions
[ICLR 2024] The official implementation of the paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
Official PyTorch implementation of our CVPR 2023 paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation"
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
Implementation of MagViT2 Tokenizer in PyTorch
[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation