11 results found Sort:
- Filter by Primary Language:
- Python (2)
- Jupyter Notebook (1)
- +
A curated list of neural network pruning resources.
Created
2019-05-30
61 commits to master branch, last one 10 months ago
A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are continuously improving the project. Welcome to PR the works (pape...
Created
2018-10-18
299 commits to master branch, last one 3 months ago
A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures, 3.) Model Compression, Quantization and Acceleration, 4.) Hype...
Created
2018-12-28
187 commits to master branch, last one 4 years ago
Papers for deep neural network compression and acceleration
Created
2018-03-13
7 commits to master branch, last one 3 years ago
📚 Collection of awesome generation acceleration resources.
Created
2024-07-14
224 commits to main branch, last one a day ago
CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
Created
2024-11-22
14 commits to master branch, last one 25 days ago
MUSCO: MUlti-Stage COmpression of neural networks
Created
2019-05-08
59 commits to master branch, last one 4 years ago
Resources of our survey paper "Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies"
Created
2023-01-19
125 commits to main branch, last one about a month ago
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Created
2024-10-11
17 commits to master branch, last one 27 days ago
A list of papers, docs, codes about diffusion distillation.This repo collects various distillation methods for the Diffusion model. Welcome to PR the works (papers, repositories) missed by the repo.
Created
2023-12-10
1 commits to main branch, last one about a year ago
📚 Collection of token reduction for model compression resources.
Created
2024-12-04
146 commits to main branch, last one a day ago