28 results found

257
2.2k
apache-2.0
33
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Created 2020-07-21
3,619 commits to master branch, last one a day ago
148
2.1k
apache-2.0
49
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Created 2020-12-11
1,826 commits to main branch, last one 4 months ago
177
1.6k
bsd-3-clause
41
PyTorch native quantization and sparsity for training and inference
Created 2023-11-03
776 commits to main branch, last one 13 hours ago
345
1.6k
apache-2.0
91
PaddleSlim is an open-source library for deep model compression and architecture search.
Created 2019-12-16
1,245 commits to develop branch, last one a day ago
323
1.5k
apache-2.0
120
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Created 2018-10-31
833 commits to master branch, last one 17 days ago
234
943
apache-2.0
31
Neural Network Compression Framework for enhanced OpenVINO™ inference
Created 2020-05-13
2,293 commits to develop branch, last one 21 hours ago
Network Slimming (Pytorch) (ICCV 2017)
Created 2018-07-05
15 commits to master branch, last one 4 years ago
58
690
apache-2.0
12
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Created 2024-06-20
2,098 commits to main branch, last one 8 hours ago
120
667
gpl-3.0
10
More readable and flexible yolov5 with more backbones (gcn, resnet, shufflenet, mobilenet, efficientnet, hrnet, swin-transformer, etc.) and (cbam, dcn, and so on), and tensorrt
Created 2021-03-26
206 commits to v2 branch, last one 3 months ago
42
391
unknown
5
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Created 2023-06-12
41 commits to main branch, last one 5 months ago
38
348
apache-2.0
8
An innovative library for efficient LLM inference via low-bit quantization
This repository has been archived
Created 2023-11-20
345 commits to main branch, last one 2 months ago
Always sparse. Never dense. But never say never. A Sparse Training repository for the Adaptive Sparse Connectivity concept and its algorithmic instantiation, i.e. Sparse Evolutionary Training, to boos...
Created 2018-03-02
26 commits to master branch, last one 3 years ago
[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference
Created 2020-07-26
38 commits to master branch, last one 3 years ago
Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626
Created 2017-11-03
88 commits to master branch, last one 3 years ago
A research library for pytorch-based neural network pruning, compression, and more.
Created 2020-02-14
12 commits to main branch, last one 2 years ago
Zero-label image classification via OpenCLIP knowledge distillation
Created 2023-06-27
17 commits to master branch, last one about a year ago
5
99
bsd-3-clause
5
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Created 2023-05-27
146 commits to main branch, last one about a year ago
This repository has no description...
Created 2024-08-28
20 commits to main branch, last one about a month ago
11
88
apache-2.0
7
Soft Threshold Weight Reparameterization for Learnable Sparsity
Created 2020-04-11
53 commits to master branch, last one 3 years ago
8
51
mit
3
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
Created 2023-10-07
35 commits to main branch, last one 4 months ago
[ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, Decebal Constantin Mocanu, Mykola Pechenizkiy
Created 2021-06-10
70 commits to main branch, last one about a year ago
Code and data accompanying the article "A Survey and an Extensive Evaluation of Popular Audio Declipping Methods", and other closely related work
Created 2020-06-04
20 commits to master branch, last one about a year ago
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning
Created 2022-12-11
20 commits to master branch, last one about a year ago
Model Compression/Inference Made Easy
Created 2023-07-21
583 commits to main branch, last one 2 months ago
[ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun Wang*, Santosh Balachandra*, Haoyu Ma*, Zehao Wang, Zhangyang ...
Created 2021-12-06
5 commits to main branch, last one 2 years ago
Fast operator-overloading Jacobian & Hessian sparsity detection.
Created 2024-03-28
160 commits to main branch, last one about a month ago
[ICCV2023 Official PyTorch code] for Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
Created 2023-08-07
4 commits to main branch, last one 8 months ago