Stars
[ICLR 2023 Spotlight] Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Open-Sora: Democratizing Efficient Video Production for All
[ECCV 2024] OMG: Occlusion-friendly Personalized Multi-concept Generation In Diffusion Models
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …
[AAAI 2024] Descanning: From Scanned to the Original Images with a Color Correction Diffusion Model
leoxiaobin / CvT
Forked from microsoft/CvTThis is an official implementation of CvT: Introducing Convolutions to Vision Transformers.
Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.
Official implementation for "Break-A-Scene: Extracting Multiple Concepts from a Single Image" [SIGGRAPH Asia 2023]
Foundational Models for State-of-the-Art Speech and Text Translation
Unofficial implementation of the paper "The Chosen One: Consistent Characters in Text-to-Image Diffusion Models"
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
A collection of state-of-the-art video frame interpolation (VFI) methods.
Source code for AAAI 2020 paper "Channel Attention Is All You Need for Video Frame Interpolation"
FILM: Frame Interpolation for Large Motion, In ECCV 2022.