92 results found

Hung-yi Lee Deep Learning Tutorial (recommended by Prof. Hung-yi Lee 👍). PDF download: https://github.com/datawhalechina/leedl-tutorial/releases
Created 2019-07-02
520 commits to master branch, last one 5 days ago
798
4.3k
apache-2.0
132
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
This repository has been archived
Created 2018-04-24
643 commits to master branch, last one about a year ago
169
2.9k
other
55
Sparsity-aware deep learning inference runtime for CPUs
Created 2020-12-14
1,050 commits to main branch, last one 25 days ago
301
2.4k
mit
32
[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
Created 2019-12-15
1,324 commits to master branch, last one 18 days ago
327
2.2k
unknown
87
A curated list of neural network pruning resources.
Created 2019-05-30
61 commits to master branch, last one about a month ago
478
2.2k
mit
40
micronet, a model compression and deployment library. Compression: 1. quantization: quantization-aware training (QAT), High-Bit (>2b) (DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...
Created 2019-12-04
295 commits to master branch, last one 2 years ago
243
2.0k
apache-2.0
34
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Created 2020-07-21
3,432 commits to master branch, last one a day ago
141
2.0k
apache-2.0
47
Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models
Created 2020-12-11
1,803 commits to main branch, last one a day ago
354
2.0k
other
47
AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.
Created 2020-04-21
1,919 commits to develop branch, last one a day ago
348
1.5k
apache-2.0
92
PaddleSlim is an open-source library for deep model compression and architecture search.
Created 2019-12-16
1,243 commits to develop branch, last one 2 months ago
320
1.5k
apache-2.0
119
A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.
Created 2018-10-31
830 commits to master branch, last one 29 days ago
218
1.4k
apache-2.0
20
OpenMMLab Model Compression Toolbox and Benchmark.
Created 2021-12-22
229 commits to main branch, last one 11 months ago
66
1.1k
apache-2.0
10
Config driven, easy backup cli for restic.
Created 2019-06-20
525 commits to master branch, last one 14 days ago
Efficient computing methods developed by Huawei Noah's Ark Lab
Created 2019-09-04
149 commits to master branch, last one about a month ago
204
868
unknown
22
PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference
Created 2017-06-23
6 commits to master branch, last one 4 years ago
210
834
apache-2.0
30
Neural Network Compression Framework for enhanced OpenVINO™ inference
Created 2020-05-13
2,025 commits to develop branch, last one 20 hours ago
mobilev2-yolov5s pruning and distillation, with support for ncnn and TensorRT deployment. Ultra-light but better performance!
Created 2020-09-07
17 commits to master branch, last one 2 years ago
Embedded and mobile deep learning research resources
Created 2017-06-06
60 commits to master branch, last one about a year ago
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
Created 2021-11-02
781 commits to main branch, last one a day ago
73
694
apache-2.0
13
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
Created 2023-05-17
136 commits to main branch, last one 18 days ago
57
576
mit
16
A PyTorch Knowledge Distillation library for benchmarking and extending works in the domains of Knowledge Distillation, Pruning, and Quantization.
Created 2020-05-10
298 commits to master branch, last one about a year ago
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration (CVPR 2019 Oral)
Created 2019-03-26
19 commits to master branch, last one 10 months ago
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Created 2023-10-16
71 commits to main branch, last one 2 months ago
Awesome machine learning model compression research papers, tools, and learning material.
Created 2018-12-06
38 commits to master branch, last one 23 days ago
YOLO ModelCompression MultidatasetTraining
Created 2019-12-24
438 commits to master branch, last one 2 years ago
Pruning and other network surgery for trained Keras models.
Created 2017-08-22
80 commits to master branch, last one 3 years ago
23
362
apache-2.0
26
Neural network model repository for highly sparse and sparse-quantized models with matching sparsification recipes
Created 2020-12-11
298 commits to main branch, last one about a month ago
31
354
apache-2.0
5
A PyTorch-based model pruning toolkit for pre-trained language models
Created 2021-07-20
42 commits to main branch, last one 9 months ago
94
339
apache-2.0
38
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
Created 2022-05-25
719 commits to main branch, last one 20 hours ago
39
321
apache-2.0
12
A model compression and acceleration toolbox based on PyTorch.
Created 2022-07-21
134 commits to main branch, last one 10 months ago