Statistics for topic model-compression
RepositoryStats tracks 633,100 GitHub repositories; of these, 108 are tagged with the model-compression topic. The most common primary language for repositories using this topic is Python (77).
Stargazers over time for topic model-compression
Most starred repositories for topic model-compression
Trending repositories for topic model-compression
List of papers related to neural network quantization in recent AI conferences and journals.
micronet, a model compression and deploy lib. compression: 1. quantization: quantization-aware-training (QAT), High-Bit (>2b) (DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Ari...
The official implementation of the paper "Towards Efficient Mixture of Experts: A Holistic Study of Compression Techniques (TMLR)".
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (pape...
😎 A curated list of tensor decomposition resources for model compression.
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
📚 Collection of token reduction for model compression resources.
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
The official implementation of the paper "MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression".
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.