28 results found Sort:
- Filter by Primary Language:
- Python (21)
- Jupyter Notebook (2)
- C++ (1)
- Julia (1)
- MATLAB (1)
- Shell (1)
- +
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Created
2020-07-21
3,579 commits to master branch, last one 15 hours ago
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Created
2020-12-11
1,826 commits to main branch, last one 2 months ago
PaddleSlim is an open-source library for deep model compression and architecture search.
Created
2019-12-16
1,243 commits to develop branch, last one 6 months ago
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Created
2018-10-31
832 commits to master branch, last one 2 days ago
Neural Network Compression Framework for enhanced OpenVINO™ inference
Created
2020-05-13
2,210 commits to develop branch, last one a day ago
Network Slimming (Pytorch) (ICCV 2017)
Created
2018-07-05
15 commits to master branch, last one 3 years ago
PyTorch native quantization and sparsity for training and inference
Created
2023-11-03
586 commits to main branch, last one 20 hours ago
More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam,dcn and so on), and tensorrt
Created
2021-03-26
206 commits to v2 branch, last one about a month ago
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Created
2024-06-20
2,043 commits to main branch, last one 10 hours ago
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Created
2023-06-12
41 commits to main branch, last one 3 months ago
An innovative library for efficient LLM inference via low-bit quantization
This repository has been archived
(exclude archived)
Created
2023-11-20
345 commits to main branch, last one 27 days ago
Sparse Optimisation Research Code
Created
2016-04-21
1,116 commits to master branch, last one 5 months ago
Always sparse. Never dense. But never say never. A Sparse Training repository for the Adaptive Sparse Connectivity concept and its algorithmic instantiation, i.e. Sparse Evolutionary Training, to boos...
sparsity
scalability
deep-learning
randomization
classification
neuroevolution
sparse-training
complex-networks
generative-models
deep-learning-papers
deep-neural-networks
multi-layer-perceptron
scalable-deep-learning
sparse-neural-networks
evolutionary-algorithms
deep-learning-algorithms
artificial-neural-networks
adaptive-sparse-connectivity
restricted-boltzmann-machine
sparse-evolutionary-training
Created
2018-03-02
26 commits to master branch, last one 3 years ago
[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference
Created
2020-07-26
38 commits to master branch, last one 2 years ago
Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626
Created
2017-11-03
88 commits to master branch, last one 3 years ago
A research library for pytorch-based neural network pruning, compression, and more.
Created
2020-02-14
12 commits to main branch, last one about a year ago
Zero-label image classification via OpenCLIP knowledge distillation
Created
2023-06-27
17 commits to master branch, last one about a year ago
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Created
2023-05-27
146 commits to main branch, last one 10 months ago
Soft Threshold Weight Reparameterization for Learnable Sparsity
Created
2020-04-11
53 commits to master branch, last one 3 years ago
This repository has no description...
Created
2024-08-28
20 commits to main branch, last one 3 days ago
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
Created
2023-10-07
35 commits to main branch, last one 3 months ago
[ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, Decebal Constantin Mocanu, Mykola Pechenizkiy
Created
2021-06-10
70 commits to main branch, last one 10 months ago
Codes and data coming with article "A Survey and an Extensive Evaluation of Popular Audio Declipping Methods", and others closely related
Created
2020-06-04
20 commits to master branch, last one 11 months ago
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning
Created
2022-12-11
20 commits to master branch, last one about a year ago
Model Compression/Inference Made Easy
Created
2023-07-21
583 commits to main branch, last one about a month ago
[ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun Wang*, Santosh Balachandra*, Haoyu Ma*, Zehao Wang, Zhangyang ...
Created
2021-12-06
5 commits to main branch, last one 2 years ago
Fast operator-overloading Jacobian & Hessian sparsity detection.
Created
2024-03-28
143 commits to main branch, last one 4 days ago
[ICCV2023 Official PyTorch code] for Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
Created
2023-08-07
4 commits to main branch, last one 6 months ago