12 results found
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
Created
2023-12-20
1,401 commits to main branch, last one 5 hours ago
[TMLR 2024] Efficient Large Language Models: A Survey
Created
2023-04-18
659 commits to main branch, last one 20 days ago
Distributed RL System for LLM Reasoning
Created
2025-02-24
131 commits to main branch, last one 14 days ago
Curated collection of papers in machine learning systems
Created
2023-12-30
38 commits to main branch, last one 18 days ago
Learn how to design and implement effective Machine Learning systems from start to finish.
Created
2022-03-08
17 commits to main branch, last one 5 months ago
A collection of noteworthy MLSys bloggers (algorithms and systems)
Created
2025-01-05
10 commits to master branch, last one 3 months ago
Oort: Efficient Federated Learning via Guided Participant Selection
Created
2021-03-13
380 commits to master branch, last one 3 years ago
A curated list of high-quality papers on resource-efficient LLMs 🌱
Created
2023-12-28
79 commits to main branch, last one about a month ago
My personal paper-reading notes (covering cloud computing, resource management, systems, machine learning, deep learning, and other interesting topics).
Created
2021-09-27
240 commits to develop branch, last one 13 days ago
Triton implementation of bidirectional (non-causal) linear attention
Created
2024-11-20
32 commits to main branch, last one 2 months ago
Efficient Diffusion Models: A Survey
Created
2024-05-31
192 commits to main branch, last one 13 days ago
Machine Learning Compiler Road Map
Created
2022-09-28
49 commits to main branch, last one about a year ago