yangbinb

yangbinb

6 followers · 12 following

Starred repositories

639 results for source starred repositories

Clear filter

aim-uofa / VLModel

Repo of HawkLlama.

Python 10 Updated Aug 10, 2024

aim-uofa / MovieDreamer

244 8 Updated Aug 10, 2024

gnobitab / RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 854 52 Updated Jul 20, 2024

NiuTrans / Vision-LLM-Alignment

This repository contains the code for SFT, RLHF, and DPO, designed for vision-based LLMs, including the LLaVA models and the LLaMA-3.2-vision models.

Python 55 3 Updated Oct 3, 2024

wzzheng / TPVFormer

[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.

Python 1,166 105 Updated Sep 7, 2024

imlixinyang / Director3D

Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).

Python 280 14 Updated Sep 26, 2024

test-time-training / ttt-lm-pytorch

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 996 56 Updated Jul 14, 2024

gnobitab / InstaFlow

⚡ InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Python 1,149 36 Updated Jun 7, 2024

BadToBest / EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 2,574 307 Updated Aug 15, 2024

STEM-Inv / STEM-Inv

[CVPR24] Official Implementation of 'A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing'

Python 114 10 Updated Jun 18, 2024

scofield7419 / Video-of-Thought

Codes for ICML 2024 paper: "Video-of-Thought: Step-by-Step Video Reasoning from Perception to Cognition"

35 2 Updated Jun 24, 2024

YuqiYang213 / MLoRE

Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"

Python 50 2 Updated May 9, 2024

ysy31415 / direct_a_video

Python 56 4 Updated May 25, 2024

aimagelab / multimodal-garment-designer

This is the official repository for the paper "Multimodal Garment Designer: Human-Centric Latent Diffusion Models for Fashion Image Editing". ICCV 2023

Python 408 47 Updated Mar 28, 2024

KwaiVGI / LivePortrait

Bring portraits to life!

Python 12,176 1,278 Updated Oct 7, 2024

jianzongwu / MotionBooth

[NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation"

Python 93 7 Updated Oct 8, 2024

invictus717 / InteractiveVideo

InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

Python 125 8 Updated Feb 7, 2024

Tencent / MimicMotion

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 1,706 138 Updated Sep 23, 2024

colmap / colmap

COLMAP - Structure-from-Motion and Multi-View Stereo

C++ 7,530 1,504 Updated Oct 8, 2024

MyNiuuu / MOFA-Video

[ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.

Python 598 32 Updated Aug 6, 2024

ShareGPT4Omni / ShareGPT4Video

[NeurIPS 2024 D&B Track] An official implementation of ShareGPT4Video: Improving Video Understanding and Generation with Better Captions

Python 1,238 44 Updated Aug 7, 2024

RERV / VDT

[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

Jupyter Notebook 206 12 Updated May 5, 2024

ali-vilab / Ranni

Python 210 15 Updated Apr 10, 2024

muzishen / IMAGDressing

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Python 981 84 Updated Aug 28, 2024

google-research / magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 944 42 Updated Jan 17, 2024

CrossmodalGroup / MaskedVectorQuantization

Official Pytorch Implementation of Our CVPR2023 Paper: "Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation"

Python 49 3 Updated Jul 21, 2023

TencentARC / Open-MAGVIT2

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 638 25 Updated Sep 27, 2024

lucidrains / magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Python 547 35 Updated Jul 23, 2024

ShareGPT4Omni / ShareGPT4V

[ECCV 2024] ShareGPT4V: Improving Large Multi-modal Models with Better Captions

Python 129 4 Updated Jul 1, 2024

FoundationVision / LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,221 49 Updated Aug 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly