28 results found Sort:
- Filter by Primary Language:
- Python (21)
- Jupyter Notebook (2)
- C++ (1)
- Julia (1)
- MATLAB (1)
- Shell (1)
- +
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Created
2020-07-21
3,619 commits to master branch, last one a day ago
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Created
2020-12-11
1,826 commits to main branch, last one 4 months ago
PyTorch native quantization and sparsity for training and inference
Created
2023-11-03
776 commits to main branch, last one 13 hours ago
PaddleSlim is an open-source library for deep model compression and architecture search.
Created
2019-12-16
1,245 commits to develop branch, last one a day ago
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Created
2018-10-31
833 commits to master branch, last one 17 days ago
Neural Network Compression Framework for enhanced OpenVINO™ inference
Created
2020-05-13
2,293 commits to develop branch, last one 21 hours ago
Network Slimming (Pytorch) (ICCV 2017)
Created
2018-07-05
15 commits to master branch, last one 4 years ago
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
Created
2024-06-20
2,098 commits to main branch, last one 8 hours ago
More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and (cbam,dcn and so on), and tensorrt
Created
2021-03-26
206 commits to v2 branch, last one 3 months ago
[NeurIPS'23] H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models.
Created
2023-06-12
41 commits to main branch, last one 5 months ago
An innovative library for efficient LLM inference via low-bit quantization
This repository has been archived
(exclude archived)
Created
2023-11-20
345 commits to main branch, last one 2 months ago
Sparse Optimisation Research Code
Created
2016-04-21
1,116 commits to master branch, last one 6 months ago
Always sparse. Never dense. But never say never. A Sparse Training repository for the Adaptive Sparse Connectivity concept and its algorithmic instantiation, i.e. Sparse Evolutionary Training, to boos...
sparsity
scalability
deep-learning
randomization
classification
neuroevolution
sparse-training
complex-networks
generative-models
deep-learning-papers
deep-neural-networks
multi-layer-perceptron
scalable-deep-learning
sparse-neural-networks
evolutionary-algorithms
deep-learning-algorithms
artificial-neural-networks
adaptive-sparse-connectivity
restricted-boltzmann-machine
sparse-evolutionary-training
Created
2018-03-02
26 commits to master branch, last one 3 years ago
[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference
Created
2020-07-26
38 commits to master branch, last one 3 years ago
Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626
Created
2017-11-03
88 commits to master branch, last one 3 years ago
A research library for pytorch-based neural network pruning, compression, and more.
Created
2020-02-14
12 commits to main branch, last one 2 years ago
Zero-label image classification via OpenCLIP knowledge distillation
Created
2023-06-27
17 commits to master branch, last one about a year ago
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
Created
2023-05-27
146 commits to main branch, last one about a year ago
This repository has no description...
Created
2024-08-28
20 commits to main branch, last one about a month ago
Soft Threshold Weight Reparameterization for Learnable Sparsity
Created
2020-04-11
53 commits to master branch, last one 3 years ago
Official Pytorch Implementation of "Outlier Weighed Layerwise Sparsity (OWL): A Missing Secret Sauce for Pruning LLMs to High Sparsity"
Created
2023-10-07
35 commits to main branch, last one 4 months ago
[ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, Decebal Constantin Mocanu, Mykola Pechenizkiy
Created
2021-06-10
70 commits to main branch, last one about a year ago
Codes and data coming with article "A Survey and an Extensive Evaluation of Popular Audio Declipping Methods", and others closely related
Created
2020-06-04
20 commits to master branch, last one about a year ago
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning
Created
2022-12-11
20 commits to master branch, last one about a year ago
Model Compression/Inference Made Easy
Created
2023-07-21
583 commits to main branch, last one 2 months ago
[ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun Wang*, Santosh Balachandra*, Haoyu Ma*, Zehao Wang, Zhangyang ...
Created
2021-12-06
5 commits to main branch, last one 2 years ago
Fast operator-overloading Jacobian & Hessian sparsity detection.
Created
2024-03-28
160 commits to main branch, last one about a month ago
[ICCV2023 Official PyTorch code] for Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
Created
2023-08-07
4 commits to main branch, last one 8 months ago