11 results found Sort:

266
2.4k
apache-2.0
32
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Created 2020-07-21
3,756 commits to master branch, last one a day ago
85
1.6k
other
30
bpftune uses BPF to auto-tune Linux systems
Created 2023-05-09
670 commits to main branch, last one 17 days ago
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
Created 2020-06-17
255 commits to master branch, last one 12 days ago
28
243
apache-2.0
17
Machine Learning Framework for Operating Systems - Brings ML to Linux kernel
Created 2021-11-10
28 commits to main branch, last one 3 years ago
158
236
mit
56
Stretching GPU performance for GEMMs and tensor contractions.
Created 2015-11-05
5,579 commits to develop branch, last one 3 days ago
36
178
other
18
CLTune: An automatic OpenCL & CUDA kernel tuner
Created 2015-01-11
307 commits to master branch, last one 2 years ago
8
115
apache-2.0
6
Alchemy Cat —— 🔥Config System for SOTA
Created 2019-12-07
398 commits to master branch, last one about a month ago
14
89
bsd-3-clause
14
Phoebe
Created 2021-01-05
216 commits to main branch, last one 3 years ago
29
74
unknown
8
Benchmark scripts for TVM
Created 2020-11-19
4 commits to main branch, last one 3 years ago
3
71
apache-2.0
2
ebpf profiler for jvm
Created 2020-02-24
156 commits to master branch, last one 3 years ago