![google logo](https://raw.githubusercontent.com/github/explore/80688e429a7d4ef2fca1e82350fe8e3517d3494d/topics/google/google.png)
Block or Report
Block or report LancasterLi
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseLanguage
Sort by: Recently starred
Starred repositories
This is the pytorch implement of the paper "RSMamba: Remote Sensing Image Classification with State Space Model"
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Stable Diffusion web UI
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
GLIDE: a diffusion-based text-conditional image synthesis model
Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)
[CVPR 2022] Official PyTorch Implementation for DiffusionCLIP: Text-guided Image Manipulation Using Diffusion Models
Diffusion Model-Based Image Editing: A Survey (arXiv)
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
Official Pytorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
WebUI extension for ControlNet
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
Official Pytorch Implementation of Paper "A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties"
[CVPR 2024] The official implementation for "MS-DETR: Efficient DETR Training with Mixed Supervision"
Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".
This is the official code release for our work, Denoising Vision Transformers.
Official implementation of 'CLIP-DINOiser: Teaching CLIP a few DINO tricks' paper.
ICLR 2023 DeCap: Decoding CLIP Latents for Zero-shot Captioning
Copy-paste augmentation in detectron2 pipeline
[CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
[CVPR2024] Official Pytorch Implementation of SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation.
Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)