Stars
A Triton implementation of FlashAttention-2 with support for custom attention masks.
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
Schedule-Free Optimization in PyTorch
PyTorch code and models for V-JEPA self-supervised learning from video.
A concise but complete full-attention transformer with a set of promising experimental features from various papers.
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture, first outlined in the CVPR paper "Self-supervised learning from images with a joint-embedding predictive architecture."
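
For the schedule-free optimization entry above, a minimal usage sketch, assuming the entry refers to the schedulefree PyTorch package and its AdamWScheduleFree optimizer; the toy model, loss, and hyperparameters are placeholders:

```python
import torch
import schedulefree

# Toy model; the optimizer is a drop-in replacement for AdamW,
# with no learning-rate schedule required.
model = torch.nn.Linear(10, 1)
optimizer = schedulefree.AdamWScheduleFree(model.parameters(), lr=1e-3)

# Schedule-free optimizers maintain an averaged iterate, so both the
# model and the optimizer must be switched between train and eval modes.
model.train()
optimizer.train()
for _ in range(100):
    optimizer.zero_grad()
    inputs = torch.randn(32, 10)
    loss = model(inputs).pow(2).mean()
    loss.backward()
    optimizer.step()

model.eval()
optimizer.eval()
```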
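
And for the full-attention transformer entry, a minimal sketch of building a decoder-only model, assuming the entry refers to the x-transformers package and its TransformerWrapper/Decoder interface; the vocabulary size and layer settings are arbitrary:

```python
import torch
from x_transformers import TransformerWrapper, Decoder

# A decoder-only language model: token and position embeddings, the
# attention stack, and the output projection are wrapped together.
model = TransformerWrapper(
    num_tokens=20000,
    max_seq_len=1024,
    attn_layers=Decoder(
        dim=512,
        depth=6,
        heads=8,
    ),
)

tokens = torch.randint(0, 20000, (1, 1024))
logits = model(tokens)  # shape: (1, 1024, 20000)
```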