Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
-
Updated
Oct 1, 2024 - Python
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
[TMLR 2024] Efficient Large Language Models: A Survey
Infrastructures™ for Machine Learning Training/Inference in Production.
Dive into machine learning system, start from reinventing the wheel.
Learn how to design Machine Learning systems and prepare for an interview.
Curated collection of papers in machine learning systems
Oort: Efficient Federated Learning via Guided Participant Selection
Course Material for the UG Course COMP4901Y
Machine Learning Compiler Road Map
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).
CSCE 585 - Machine Learning Systems
A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup
A C++ implementation of the scalar-valued autograd engine micrograd
A curated list of resources to deep dive into the intersection of applied machine learning and threat detection.
This is the course project for CSCE585: ML Systems. Students will build their machine learning systems based on the provided infrastructure --- Athena.
[Long Term Support] [SIGCOMM 2023] Lightning: A Reconfigurable Photonic-Electronic SmartNIC for Fast and Energy-Efficient Inference
Assignments for Data Intensive Systems for Machine Learning Coursework
[ICML 2022] Rethinking Image-Scaling Attacks: The Interplay Between Vulnerabilities in Machine Learning Systems
Add a description, image, and links to the machine-learning-systems topic page so that developers can more easily learn about it.
To associate your repository with the machine-learning-systems topic, visit your repo's landing page and select "manage topics."