Stars
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Train transformer language models with reinforcement learning.
A Collection of Variational Autoencoders (VAE) in PyTorch.
PyTorch implementations of Generative Adversarial Networks.
Graph Neural Network Library for PyTorch
Fast and memory-efficient exact attention
Ongoing research on training Gaussian splatting at scale with a distributed system
Understand Human Behavior to Align True Needs
Image Restoration with Mean-Reverting Stochastic Differential Equations, ICML 2023. Winning solution of the NTIRE 2023 Image Shadow Removal Challenge.
[CVPR 2024] Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
PyTorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Cambrian-1 is a family of multimodal LLMs with a vision-centric design.
Image-to-Image Translation in PyTorch
DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention
Taming Transformers for High-Resolution Image Synthesis
Video+code lecture on building nanoGPT from scratch
Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)
A generative speech model for daily dialogue.
A Framework of Small-scale Large Multimodal Models
GPT4V-level open-source multi-modal model based on Llama3-8B
Llama 3 implementation, one matrix multiplication at a time
Recipes to train reward models for RLHF.