7 results found Sort:

148
2.1k
apache-2.0
49
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Created 2020-12-11
1,826 commits to main branch, last one 5 months ago
A curated list for Efficient Large Language Models
Created 2023-05-22
529 commits to main branch, last one 12 days ago
106
907
apache-2.0
10
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
Created 2023-05-17
165 commits to main branch, last one 2 months ago
26
205
unknown
3
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
Created 2024-04-10
14 commits to main branch, last one 8 months ago
A research library for pytorch-based neural network pruning, compression, and more.
Created 2020-02-14
12 commits to main branch, last one 2 years ago
Model optimizer used in Adlik.
Created 2020-02-28
15 commits to master branch, last one about a year ago
11
37
apache-2.0
3
[AAAI 2024] Fluctuation-based Adaptive Structured Pruning for Large Language Models
Created 2023-12-18
6 commits to main branch, last one 11 months ago