7 results found Sort:

149
2.1k
apache-2.0
49
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Created 2020-12-11
1,826 commits to main branch, last one 7 months ago
A curated list for Efficient Large Language Models
Created 2023-05-22
543 commits to main branch, last one 4 days ago
109
940
apache-2.0
10
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
Created 2023-05-17
165 commits to main branch, last one 3 months ago
25
213
unknown
3
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
Created 2024-04-10
14 commits to main branch, last one 9 months ago
A research library for pytorch-based neural network pruning, compression, and more.
Created 2020-02-14
12 commits to main branch, last one 2 years ago
12
44
apache-2.0
2
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models
Created 2023-12-18
6 commits to main branch, last one about a year ago
Model optimizer used in Adlik.
Created 2020-02-28
15 commits to master branch, last one about a year ago