11 results found Sort:
- Filter by Primary Language:
- Python (4)
- HTML (1)
- Jupyter Notebook (1)
- +
🚀 Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Created
2023-12-20
1,035 commits to main branch, last one 6 hours ago
[TMLR 2024] Efficient Large Language Models: A Survey
Created
2023-04-18
654 commits to main branch, last one 16 days ago
Learn how to design and implement effective Machine Learning systems from start to finish.
Created
2022-03-08
17 commits to main branch, last one 2 months ago
Curated collection of papers in machine learning systems
Created
2023-12-30
31 commits to main branch, last one about a month ago
The repository has collected a batch of noteworthy MLSys bloggers (Algorithms/Systems)
Created
2025-01-05
10 commits to master branch, last one 24 days ago
Oort: Efficient Federated Learning via Guided Participant Selection
Created
2021-03-13
380 commits to master branch, last one 3 years ago
a curated list of high-quality papers on resource-efficient LLMs 🌱
Created
2023-12-28
70 commits to main branch, last one 28 days ago
Here are my personal paper reading notes (including cloud computing, resource management, systems, machine learning, deep learning, and other interesting stuffs).
Created
2021-09-27
223 commits to develop branch, last one 4 days ago
Course Material for the UG Course COMP4901Y
Created
2024-01-20
80 commits to main branch, last one 8 months ago
Machine Learning Compiler Road Map
Created
2022-09-28
49 commits to main branch, last one about a year ago
Triton implement of bi-directional (non-causal) linear attention
Created
2024-11-20
31 commits to main branch, last one 17 days ago