Trending repositories for topic model-compression
Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
Awesome Knowledge-Distillation. A categorized collection of knowledge-distillation papers (2014-2021).
List of papers related to neural network quantization in recent AI conferences and journals.
PyTorch implementation of various Knowledge Distillation (KD) methods (a minimal sketch of the classic KD loss appears after this list).
Efficient computing methods developed by Huawei Noah's Ark Lab
A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. Welcome to PR the works (pape...
An open-source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression, and hyperparameter tuning.
A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
A collection of computer vision projects & tools.
[CVPR 2024 Highlight] Logit Standardization in Knowledge Distillation
The official implementation of the paper "MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression".
Awesome machine learning model compression research papers, quantization, tools, and learning material.
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hype...
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
The official implementation of the paper "Demystifying the Compression of Mixture-of-Experts Through a Unified Framework".
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML 2023)
A list of papers, docs, and code about efficient AIGC. This repo aims to provide information for efficient AIGC research, covering both language and vision, and we are continuously improving the project. Welcom...
Vocabulary Trimming (VT) is a model compression technique that reduces a multilingual LM's vocabulary to a target language by deleting irrelevant tokens from its vocabulary (a minimal sketch of the idea appears after this list). This repository contains a...
Official Code for "SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression"
[ICML 2023] UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers.
The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction
Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.
PyTorch Lightning implementation of the paper "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding". This repository allows reproducing the main fin...
This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.
Resources for our survey paper "A Comprehensive Survey on AI Integration at the Edge: Techniques, Applications, and Challenges"
[ICML 2023] The official implementation of our paper "BiBench: Benchmarking and Analyzing Network Binarization".
A gathering of research papers, corresponding code (where available), reading notes, and other related materials about hot 🔥 fields in computer vision based on deep learning.
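Several of the entries above are knowledge-distillation toolkits and paper collections. As a reference point only, here is a minimal sketch of the classic soft-target KD loss (temperature-scaled KL divergence combined with cross-entropy), assuming PyTorch; the function name, temperature, and weighting below are illustrative and this is not the code of any repository listed here.

```python
# Minimal sketch of the classic soft-target knowledge-distillation loss
# (temperature-scaled KL divergence plus cross-entropy). Illustrative only;
# not taken from any repository in this list.
import torch
import torch.nn.functional as F

def kd_loss(student_logits, teacher_logits, targets, temperature=4.0, alpha=0.5):
    """Blend a distillation term against the teacher with the usual supervised loss."""
    # Soften both distributions with the temperature, then match them with KL divergence.
    soft_student = F.log_softmax(student_logits / temperature, dim=-1)
    soft_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    distill = F.kl_div(soft_student, soft_teacher, reduction="batchmean") * temperature ** 2

    # Standard cross-entropy on the ground-truth labels.
    ce = F.cross_entropy(student_logits, targets)
    return alpha * distill + (1.0 - alpha) * ce
```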
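The Vocabulary Trimming entry above describes dropping tokens a target language never uses. Below is a minimal sketch of that idea, assuming a Hugging Face Transformers multilingual model; trim_vocabulary and its arguments are hypothetical names for illustration, not the API of that repository, and a complete implementation would also rebuild the tokenizer and any tied output head.

```python
# Minimal sketch of vocabulary trimming for a multilingual masked LM.
# Names (trim_vocabulary, target_corpus) are illustrative only.
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

def trim_vocabulary(model_name: str, target_corpus: list[str]):
    """Keep only the tokens that occur when tokenizing a target-language corpus."""
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForMaskedLM.from_pretrained(model_name)

    # Collect the token ids actually used by the target language, plus special tokens.
    keep_ids = set(tokenizer.all_special_ids)
    for text in target_corpus:
        keep_ids.update(tokenizer(text, add_special_tokens=False)["input_ids"])
    keep_ids = sorted(keep_ids)

    # Slice the input embedding matrix down to the kept rows. In multilingual LMs the
    # embedding matrix is a large share of the parameters, which is where the
    # compression comes from.
    old_embeddings = model.get_input_embeddings().weight.data
    new_embeddings = torch.nn.Embedding(len(keep_ids), old_embeddings.size(1))
    new_embeddings.weight.data = old_embeddings[keep_ids].clone()
    model.set_input_embeddings(new_embeddings)
    model.config.vocab_size = len(keep_ids)

    # A full implementation would also remap the tokenizer and any tied output head.
    return model, keep_ids
```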