15 results found Sort:
- Filter by Primary Language:
- Python (12)
- Jupyter Notebook (3)
- +
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Created
2020-07-21
3,628 commits to master branch, last one 9 hours ago
micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...
bnn
twn
onnx
dorefa
pruning
pytorch
tensorrt
xnor-net
quantization
network-slimming
group-convolution
model-compression
network-in-network
tensorrt-int8-python
convolutional-networks
neuromorphic-computing
integer-arithmetic-only
batch-normalization-fuse
post-training-quantization
quantization-aware-training
Created
2019-12-04
295 commits to master branch, last one 3 years ago
Neural Network Compression Framework for enhanced OpenVINO™ inference
Created
2020-05-13
2,308 commits to develop branch, last one a day ago
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
Created
2021-11-02
818 commits to main branch, last one a day ago
YOLO ModelCompression MultidatasetTraining
Created
2019-12-24
438 commits to master branch, last one 2 years ago
A model compression and acceleration toolbox based on pytorch.
Created
2022-07-21
134 commits to main branch, last one about a year ago
Tutorial notebooks for hls4ml
Created
2020-06-02
101 commits to main branch, last one 2 months ago
0️⃣1️⃣🤗 BitNet-Transformers: Huggingface Transformers Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch with Llama(2) Architecture
Created
2023-10-19
18 commits to main branch, last one about a year ago
针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库
Created
2021-06-04
49 commits to main branch, last one about a year ago
This repository contains notebooks that show the usage of TensorFlow Lite for quantizing deep neural networks.
Created
2020-04-29
143 commits to master branch, last one about a year ago
FrostNet: Towards Quantization-Aware Network Architecture Search
Created
2020-06-17
36 commits to master branch, last one 3 years ago
Quantization Aware Training
Created
2023-05-24
1 commits to master branch, last one about a year ago
Notes on quantization in neural networks
Created
2023-11-24
15 commits to main branch, last one 11 months ago
Train neural networks with joint quantization and pruning on both weights and activations using any pytorch modules
Created
2021-06-19
161 commits to main branch, last one 2 years ago
Quantization-aware training with spiking neural networks
Created
2022-01-03
8 commits to main branch, last one 2 years ago