
world modeling challenge for humanoid robots

Python 317 19 Updated Aug 23, 2024

Official implementation of HumanVid, NeurIPS D&B Track 2024

Python 213 3 Updated Sep 28, 2024

This is the official reproduction of FancyVideo.

Python 563 72 Updated Sep 12, 2024

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 52,343 5,520 Updated Oct 4, 2024

Hiera: A fast, powerful, and simple hierarchical vision transformer.

Python 871 39 Updated Mar 2, 2024

A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis

Python 507 26 Updated Mar 10, 2023
Python 433 43 Updated Jun 30, 2022

Ultra-low Bitrate Image Semantic Compression Driven by Large Multimodal Model

Jupyter Notebook 19 Updated Apr 24, 2024

[LMM + codec] A new paradigm of visual signal compression!

Python 25 Updated Jun 14, 2024
Python 85 11 Updated Jul 4, 2024

Code for "Hierarchical World Models as Visual Whole-Body Humanoid Controllers"

Python 143 7 Updated Jul 2, 2024

Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"

Python 107 3 Updated Apr 22, 2024

[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

Python 2,278 109 Updated Jul 19, 2024

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 520 20 Updated Sep 26, 2024
Jupyter Notebook 152 15 Updated Feb 2, 2024

Kolors Team

Python 3,664 242 Updated Sep 4, 2024

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.

Jupyter Notebook 5,085 331 Updated Jun 28, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 4,037 302 Updated Jul 16, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,460 197 Updated Sep 8, 2024

PyTorch implementation of RCG https://arxiv.org/abs/2312.03701

Python 794 38 Updated Sep 27, 2024
Python 1,747 54 Updated Jun 28, 2024
Jupyter Notebook 963 119 Updated Sep 18, 2024

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Python 263 17 Updated May 1, 2024

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Python 943 42 Updated Jan 17, 2024

Reference implementation for DPO (Direct Preference Optimization)

Python 2,059 167 Updated Aug 11, 2024

[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition

Python 577 35 Updated Jul 22, 2024

A quick guide (especially) for trending instruction finetuning datasets

2,482 160 Updated Nov 28, 2023

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,464 233 Updated Sep 26, 2024