Skip to content
View ZhendongWang6's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro
Block or Report

Block or report ZhendongWang6

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Official implementation of EG4D: Explicit Generation of 4D Object without Score Distillation

16 2 Updated May 29, 2024

Lumina-T2X is a unified framework for Text to Any Modality Generation

Python 1,822 75 Updated Jun 29, 2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

2,824 105 Updated Jun 26, 2024

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Python 417 19 Updated Jun 10, 2024

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Jupyter Notebook 1,587 90 Updated Jun 6, 2024

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Python 165 16 Updated Dec 28, 2023

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,772 285 Updated Apr 30, 2024

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,430 146 Updated Jun 20, 2024

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Python 1,249 74 Updated Jul 1, 2024

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

Python 252 12 Updated Jun 25, 2024
Python 411 12 Updated Jan 31, 2024

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Python 1,310 141 Updated Apr 30, 2024

Grok open release

Python 49,127 8,313 Updated May 29, 2024

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Python 665 40 Updated Jun 30, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,576 389 Updated May 29, 2024

[WIP] Layer Diffusion for WebUI (via Forge)

Python 3,633 323 Updated Jun 12, 2024

A collection of resources on controllable generation with text-to-image diffusion models.

734 20 Updated Jun 10, 2024

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

339 7 Updated Feb 20, 2024

Fast Diffusion Models with Transformers

Python 602 83 Updated Oct 7, 2023

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,590 498 Updated May 31, 2024

FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Python 621 34 Updated Dec 9, 2023

Huggingface-compatible SDXL Unet implementation that is readily hackable

Jupyter Notebook 364 29 Updated Aug 9, 2023
Python 2,454 296 Updated May 19, 2024
Python 1,682 51 Updated Jun 28, 2024

The code of "Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting"

Python 617 121 Updated Jan 20, 2022

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Python 3,422 322 Updated Jun 16, 2024

Better Aligning Text-to-Image Models with Human Preference. ICCV 2023

Python 255 8 Updated Jul 14, 2023

[NeurIPS2023] This is the official code of the paper "GlyphControl: Glyph Conditional Control for Visual Text Generation"

Python 189 12 Updated Feb 12, 2024

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Python 3,925 415 Updated Nov 29, 2023

Consistency Distilled Diff VAE

Python 2,096 76 Updated Nov 7, 2023
Next