-
Amazon
- Santa Clara, CA
-
04:35
(UTC -07:00) - sy-zhang.github.io
- @zhangsongyang
- in/songyang-zhang
Stars
A native PyTorch Library for large model training
Perceptual video quality assessment based on multi-method fusion.
Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"
A list for Text-to-Video, Image-to-Video works
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)
Official Code for MotionCtrl [SIGGRAPH 2024]
Generative Models by Stability AI
Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024)
GPT-4V(ision) as A Social Media Analysis Engine
[ECCV 2024 Oral] MotionDirector: Motion Customization of Text-to-Video Diffusion Models.
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale
A collection of resources on controllable generation with text-to-image diffusion models.
Using Low-rank adaptation to quickly fine-tune diffusion models.
A helper library to connect into Amazon SageMaker with AWS Systems Manager and SSH (Secure Shell)
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
Retrieve author and publication information from Google Scholar in a friendly, Pythonic way without having to worry about CAPTCHAs!
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Official implementation of "TM2T: Stochastic and Tokenized Modeling for the Reciprocal Generation of 3D Human Motions and Texts (ECCV2022)"
Official implementation for "Generating Diverse and Natural 3D Human Motions from Texts (CVPR2022)."
[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model
[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
A large-scale text-to-image prompt gallery dataset based on Stable Diffusion
A curated list of text-guided generative models resources
AI绘画资料合集(包含国内外可使用平台、使用教程、参数教程、部署教程、业界新闻等等) Stable diffusion、AnimateDiff、Stable Cascade 、Stable SDXL Turbo
Official implementations for "Action2Motion: Conditioned Generation of 3D Human Motions (ACM MultiMedia 2020)"