Lists (6)
Sort Name ascending (A-Z)
Stars
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).
Replication of the paper "Text Is All You Need: Learning Language Representations for Sequential Recommendation" on KDD'23.
OpenP5: An Open-Source Platform for Developing, Training, and Evaluating LLM-based Recommender Systems
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
A collection of awesome video generation studies.
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Implementation of MagViT2 Tokenizer in Pytorch
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
Open-source and strong foundation image recognition models.
A curated list of awesome resources about multimodal recommender systems.
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
Karras et al. (2022) diffusion models for PyTorch
Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper
Open-Sora: Democratizing Efficient Video Production for All
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
MiniSora: A community aims to explore the implementation path and future development direction of Sora.