11 results found

330
2.4k
unknown
88
A curated list of neural network pruning resources.
Created 2019-05-30
61 commits to master branch, last one 10 months ago
A list of papers, docs, and code about model quantization. This repo aims to provide the info for model quantization research; we are continuously improving the project. Welcome to PR the works (pape...
Created 2018-10-18
299 commits to master branch, last one 3 months ago
A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hype...
Created 2018-12-28
187 commits to master branch, last one 4 years ago
Papers for deep neural network compression and acceleration
Created 2018-03-13
7 commits to master branch, last one 3 years ago
📚 Collection of awesome generation acceleration resources.
Created 2024-07-14
224 commits to main branch, last one a day ago
2
75
mit
1
CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
Created 2024-11-22
14 commits to master branch, last one 25 days ago
16
71
bsd-3-clause
9
MUSCO: MUlti-Stage COmpression of neural networks
Created 2019-05-08
59 commits to master branch, last one 4 years ago
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
Created 2023-01-19
125 commits to main branch, last one about a month ago
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Created 2024-10-11
17 commits to master branch, last one 27 days ago
A list of papers, docs, codes about diffusion distillation. This repo collects various distillation methods for the Diffusion model. Welcome to PR the works (papers, repositories) missed by the repo.
Created 2023-12-10
1 commit to main branch, last one about a year ago
📚 Collection of token reduction for model compression resources.
Created 2024-12-04
146 commits to main branch, last one a day ago