11 results found Sort:

332
2.4k
unknown
88
A curated list of neural network pruning resources.
Created 2019-05-30
61 commits to master branch, last one 11 months ago
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (pape...
Created 2018-10-18
301 commits to master branch, last one 19 days ago
A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hype...
Created 2018-12-28
187 commits to master branch, last one 4 years ago
Papers for deep neural network compression and acceleration
Created 2018-03-13
7 commits to master branch, last one 3 years ago
📚 Collection of awesome generation acceleration resources.
Created 2024-07-14
226 commits to main branch, last one 13 days ago
4
88
mit
1
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
Created 2024-11-22
17 commits to master branch, last one 21 days ago
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
Created 2023-01-19
125 commits to main branch, last one 2 months ago
16
71
bsd-3-clause
8
MUSCO: MUlti-Stage COmpression of neural networks
Created 2019-05-08
59 commits to master branch, last one 4 years ago
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Created 2024-10-11
17 commits to master branch, last one about a month ago
📚 Collection of token reduction for model compression resources.
Created 2024-12-04
146 commits to main branch, last one about a month ago
A list of papers, docs, codes about diffusion distillation.This repo collects various distillation methods for the Diffusion model. Welcome to PR the works (papers, repositories) missed by the repo.
Created 2023-12-10
1 commits to main branch, last one about a year ago