15 results found Sort:

257
2.2k
apache-2.0
33
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Created 2020-07-21
3,628 commits to master branch, last one 9 hours ago
478
2.2k
mit
41
micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...
Created 2019-12-04
295 commits to master branch, last one 3 years ago
237
951
apache-2.0
31
Neural Network Compression Framework for enhanced OpenVINO™ inference
Created 2020-05-13
2,308 commits to develop branch, last one a day ago
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
Created 2021-11-02
818 commits to main branch, last one a day ago
YOLO ModelCompression MultidatasetTraining
Created 2019-12-24
438 commits to master branch, last one 2 years ago
40
328
apache-2.0
12
A model compression and acceleration toolbox based on pytorch.
Created 2022-07-21
134 commits to main branch, last one about a year ago
Tutorial notebooks for hls4ml
Created 2020-06-02
101 commits to main branch, last one 2 months ago
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture
Created 2023-10-19
18 commits to main branch, last one about a year ago
针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库
Created 2021-06-04
49 commits to main branch, last one about a year ago
This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.
Created 2020-04-29
143 commits to master branch, last one about a year ago
Quantization Aware Training
Created 2023-05-24
1 commits to master branch, last one about a year ago
Notes on quantization in neural networks
Created 2023-11-24
15 commits to main branch, last one 11 months ago
Train neural networks with joint quantization and pruning on both weights and activations using any pytorch modules
Created 2021-06-19
161 commits to main branch, last one 2 years ago
Quantization-aware training with spiking neural networks
Created 2022-01-03
8 commits to main branch, last one 2 years ago