12 results found Sort:

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
Created 2023-12-20
1,401 commits to main branch, last one 5 hours ago
[TMLR 2024] Efficient Large Language Models: A Survey
Created 2023-04-18
659 commits to main branch, last one 20 days ago
50
1.1k
apache-2.0
20
Distributed RL System for LLM Reasoning
Created 2025-02-24
131 commits to main branch, last one 14 days ago
Curated collection of papers in machine learning systems
Created 2023-12-30
38 commits to main branch, last one 18 days ago
Learn how to design and implement effective Machine Learning systems from start to finish.
Created 2022-03-08
17 commits to main branch, last one 5 months ago
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
Created 2025-01-05
10 commits to master branch, last one 3 months ago
26
126
apache-2.0
4
Oort: Efficient Federated Learning via Guided Participant Selection
Created 2021-03-13
380 commits to master branch, last one 3 years ago
a curated list of high-quality papers on resource-efficient LLMs 🌱
Created 2023-12-28
79 commits to main branch, last one about a month ago
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).
Created 2021-09-27
240 commits to develop branch, last one 13 days ago
Triton implement of bi-directional (non-causal) linear attention
Created 2024-11-20
32 commits to main branch, last one 2 months ago
Efficient Diffusion Models: A Survey
Created 2024-05-31
192 commits to main branch, last one 13 days ago
Machine Learning Compiler Road Map
Created 2022-09-28
49 commits to main branch, last one about a year ago