Skip to content
View mausset's full-sized avatar

Block or report mausset

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Triton implementation of FlashAttention2 that adds Custom Masks.

Python 65 5 Updated Aug 14, 2024

A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.

C++ 5,094 615 Updated Oct 4, 2024

Schedule-Free Optimization in PyTorch

Python 1,835 64 Updated Sep 24, 2024

PyTorch code and models for V-JEPA self-supervised learning from video.

Python 2,631 251 Updated Aug 9, 2024

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,623 396 Updated Oct 2, 2024

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…

Python 2,794 354 Updated May 8, 2024