28 results found

251
2.2k
apache-2.0
34
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Created 2020-07-21
3,579 commits to master branch, last one 15 hours ago
144
2.0k
apache-2.0
48
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Created 2020-12-11
1,826 commits to main branch, last one 2 months ago
345
1.6k
apache-2.0
92
PaddleSlim is an open-source library for deep model compression and architecture search.
Created 2019-12-16
1,243 commits to develop branch, last one 6 months ago
319
1.5k
apache-2.0
119
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Created 2018-10-31
832 commits to master branch, last one 2 days ago
228
916
apache-2.0
30
Neural Network Compression Framework for enhanced OpenVINO™ inference
Created 2020-05-13
2,210 commits to develop branch, last one a day ago
Network Slimming (PyTorch) (ICCV 2017)
Created 2018-07-05
15 commits to master branch, last one 3 years ago
100
827
bsd-3-clause
37
PyTorch native quantization and sparsity for training and inference
Created 2023-11-03
586 commits to main branch, last one 20 hours ago
120
665
gpl-3.0
10
More readable and flexible YOLOv5 with more backbones (GCN, ResNet, ShuffleNet, MobileNet, EfficientNet, HRNet, Swin Transformer, etc.), additional modules (CBAM, DCN, and so on), and TensorRT
Created 2021-03-26
206 commits to v2 branch, last one about a month ago
40
497
apache-2.0
12
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Created 2024-06-20
2,043 commits to main branch, last one 10 hours ago
37
366
unknown
5
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Created 2023-06-12
41 commits to main branch, last one 3 months ago
36
342
apache-2.0
8
An innovative library for efficient LLM inference via low-bit quantization
This repository has been archived
Created 2023-11-20
345 commits to main branch, last one 27 days ago
Always sparse. Never dense. But never say never. A Sparse Training repository for the Adaptive Sparse Connectivity concept and its algorithmic instantiation, i.e. Sparse Evolutionary Training, to boos...
Created 2018-03-02
26 commits to master branch, last one 3 years ago
[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference
Created 2020-07-26
38 commits to master branch, last one 2 years ago
Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626
Created 2017-11-03
88 commits to master branch, last one 3 years ago
A research library for PyTorch-based neural network pruning, compression, and more.
Created 2020-02-14
12 commits to main branch, last one about a year ago
Zero-label image classification via OpenCLIP knowledge distillation
Created 2023-06-27
17 commits to master branch, last one about a year ago
5
95
bsd-3-clause
5
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Created 2023-05-27
146 commits to main branch, last one 10 months ago
11
88
apache-2.0
7
Soft Threshold Weight Reparameterization for Learnable Sparsity
Created 2020-04-11
53 commits to master branch, last one 3 years ago
This repository has no description...
Created 2024-08-28
20 commits to main branch, last one 3 days ago
7
49
mit
3
Official PyTorch implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
Created 2023-10-07
35 commits to main branch, last one 3 months ago
[ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, Decebal Constantin Mocanu, Mykola Pechenizkiy
Created 2021-06-10
70 commits to main branch, last one 10 months ago
Code and data accompanying the article "A Survey and an Extensive Evaluation of Popular Audio Declipping Methods" and other closely related articles
Created 2020-06-04
20 commits to master branch, last one 11 months ago
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning
Created 2022-12-11
20 commits to master branch, last one about a year ago
Model Compression/Inference Made Easy
Created 2023-07-21
583 commits to main branch, last one about a month ago
[ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun Wang*, Santosh Balachandra*, Haoyu Ma*, Zehao Wang, Zhangyang ...
Created 2021-12-06
5 commits to main branch, last one 2 years ago
Fast operator-overloading Jacobian & Hessian sparsity detection.
Created 2024-03-28
143 commits to main branch, last one 4 days ago
[ICCV 2023] Official PyTorch code for Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
Created 2023-08-07
4 commits to main branch, last one 6 months ago