Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 281 13 Updated Sep 23, 2024

🔨 Useful research tools for the AI field

2,323 344 Updated Jun 10, 2024

A curated list of image inpainting and video inpainting papers and resources

Python 1,851 253 Updated Aug 12, 2024

Project and dataset webpage:

Python 227 64 Updated Oct 12, 2023

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 585 38 Updated Sep 8, 2024

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. An open-source multimodal dialogue model approaching GPT-4o performance

Python 5,653 440 Updated Sep 19, 2024

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,105 241 Updated May 31, 2024

Multi-object image datasets with ground-truth segmentation masks and generative factors.

Python 255 24 Updated Dec 17, 2021

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,720 174 Updated Aug 1, 2024

[RSS 2024] Learning Manipulation by Predicting Interaction

Python 82 Updated Aug 18, 2024

A curated list of papers, code, and resources pertaining to generative image composition or object insertion.

Python 76 6 Updated Jul 7, 2024

[ICLR 2024 Poster] SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos

Python 14 2 Updated Mar 14, 2024

Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)

Python 29 1 Updated Sep 9, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,463 197 Updated Sep 8, 2024

Compose multimodal datasets 🎹

Python 182 8 Updated Sep 26, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 14,879 1,379 Updated Sep 5, 2024

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 1,835 160 Updated Jul 17, 2024
Python 562 27 Updated Feb 15, 2024
Jupyter Notebook 964 119 Updated Sep 18, 2024

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 5,732 1,138 Updated Jul 30, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,344 71 Updated Aug 21, 2024

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Python 11,292 1,009 Updated Oct 6, 2024

Download YouTube videos faster using a large number of VMs

Python 8 Updated Nov 15, 2022

Easily create large video dataset from video urls

Python 533 65 Updated Jul 30, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,879 4,111 Updated Oct 6, 2024

Official repository of Learning to Act from Actionless Videos through Dense Correspondences.

Python 162 15 Updated Apr 25, 2024

The official codebase for running the experiments described in the AVDC paper.

Python 11 5 Updated Oct 2, 2024

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,282 71 Updated Sep 20, 2024

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Python 263 17 Updated May 1, 2024

[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

Jupyter Notebook 206 11 Updated May 5, 2024