k15201363625

k15201363625

11 followers · 94 following

Achievements

Stars

hustvl / EVF-SAM

Official code of "EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model"

Python 281 13 Updated Sep 23, 2024

bighuang624 / AI-research-tools

🔨AI 方向好用的科研工具

2,323 344 Updated Jun 10, 2024

zengyh1900 / Awesome-Image-Inpainting

A curated list of image inpainting and video inpainting papers and resources

Python 1,851 253 Updated Aug 12, 2024

ddshan / hand_object_detector

Project and dataset webpage:

Python 227 64 Updated Oct 12, 2023

open-mmlab / PowerPaint

[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…

Python 585 38 Updated Sep 8, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 5,653 440 Updated Sep 19, 2024

MooreThreads / Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,105 241 Updated May 31, 2024

google-deepmind / multi_object_datasets

Multi-object image datasets with ground-truth segmentation masks and generative factors.

Python 255 24 Updated Dec 17, 2021

PixArt-alpha / PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,720 174 Updated Aug 1, 2024

OpenDriveLab / MPI

[RSS 2024] Learning Manipulation by Predicting Interaction

Python 82 Updated Aug 18, 2024

bcmi / Awesome-Generative-Image-Composition

A curated list of papers, code, and resources pertaining to generative image composition or object insertion.

Python 76 6 Updated Jul 7, 2024

WenliangGuo / SCHEMA

[ICLR 2024 Poster] SCHEMA: State CHangEs MAtter for Procedure Planning in Instructional Videos

Python 14 2 Updated Mar 14, 2024

facebookresearch / VidOSC

Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)

Python 29 1 Updated Sep 9, 2024

Doubiiu / DynamiCrafter

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,463 197 Updated Sep 8, 2024

remyxai / VQASynth

Compose multimodal datasets 🎹

Python 182 8 Updated Sep 26, 2024

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,879 1,379 Updated Sep 5, 2024

dmlc / decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

C++ 1,835 160 Updated Jul 17, 2024

allenai / unified-io-2

Python 562 27 Updated Feb 15, 2024

wilson1yan / VideoGPT

Jupyter Notebook 964 119 Updated Sep 18, 2024

CompVis / taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Jupyter Notebook 5,732 1,138 Updated Jul 30, 2024

yunlong10 / Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,344 71 Updated Aug 21, 2024

PKU-YuanGroup / Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,292 1,009 Updated Oct 6, 2024

RyanMarten / distributed_gcp_youtube_download

Download YouTube videos faster using a large number of VMs

Python 8 Updated Nov 15, 2022

iejMac / video2dataset

Easily create large video dataset from video urls

Python 533 65 Updated Jul 30, 2024

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 27,879 4,111 Updated Oct 6, 2024

flow-diffusion / AVDC

Official repository of Learning to Act from Actionless Videos through Dense Correspondences.

Python 162 15 Updated Apr 25, 2024

flow-diffusion / AVDC_experiments

The official codebase for running the experiments described in the AVDC paper.

Python 11 5 Updated Oct 2, 2024

TencentARC / MotionCtrl

Official Code for MotionCtrl [SIGGRAPH 2024]

Python 1,282 71 Updated Sep 20, 2024

songweige / TATS

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Python 263 17 Updated May 1, 2024

RERV / VDT

[ICLR2024] The official implementation of paper "VDT: General-purpose Video Diffusion Transformers via Mask Modeling", by Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding.

Jupyter Notebook 206 11 Updated May 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

k15201363625

Achievements

Achievements

Block or report k15201363625

Stars

hustvl / EVF-SAM

bighuang624 / AI-research-tools

zengyh1900 / Awesome-Image-Inpainting

ddshan / hand_object_detector

open-mmlab / PowerPaint

OpenGVLab / InternVL

MooreThreads / Moore-AnimateAnyone

google-deepmind / multi_object_datasets

PixArt-alpha / PixArt-alpha

OpenDriveLab / MPI

bcmi / Awesome-Generative-Image-Composition

WenliangGuo / SCHEMA

facebookresearch / VidOSC

Doubiiu / DynamiCrafter

remyxai / VQASynth

IDEA-Research / Grounded-Segment-Anything

dmlc / decord

allenai / unified-io-2

wilson1yan / VideoGPT

CompVis / taming-transformers

yunlong10 / Awesome-LLMs-for-Video-Understanding

PKU-YuanGroup / Open-Sora-Plan

RyanMarten / distributed_gcp_youtube_download

iejMac / video2dataset

vllm-project / vllm

flow-diffusion / AVDC

flow-diffusion / AVDC_experiments

TencentARC / MotionCtrl

songweige / TATS

RERV / VDT