11 results found Sort:

🚀 Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Created 2023-12-20
1,035 commits to main branch, last one 6 hours ago
[TMLR 2024] Efficient Large Language Models: A Survey
Created 2023-04-18
654 commits to main branch, last one 16 days ago
Learn how to design and implement effective Machine Learning systems from start to finish.
Created 2022-03-08
17 commits to main branch, last one 2 months ago
Curated collection of papers in machine learning systems
Created 2023-12-30
31 commits to main branch, last one about a month ago
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
Created 2025-01-05
10 commits to master branch, last one 24 days ago
25
126
apache-2.0
5
Oort: Efficient Federated Learning via Guided Participant Selection
Created 2021-03-13
380 commits to master branch, last one 3 years ago
a curated list of high-quality papers on resource-efficient LLMs 🌱
Created 2023-12-28
70 commits to main branch, last one 28 days ago
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).
Created 2021-09-27
223 commits to develop branch, last one 4 days ago
Course Material for the UG Course COMP4901Y
Created 2024-01-20
80 commits to main branch, last one 8 months ago
Machine Learning Compiler Road Map
Created 2022-09-28
49 commits to main branch, last one about a year ago
Triton implement of bi-directional (non-causal) linear attention
Created 2024-11-20
31 commits to main branch, last one 17 days ago