30 results found Sort:

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.
Created 2019-11-16
153 commits to master branch, last one about a month ago
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
Created 2023-12-06
53 commits to main branch, last one 9 months ago
EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]
Created 2022-06-02
23 commits to main branch, last one about a year ago
185
958
bsd-3-clause
24
Code for paper " AdderNet: Do We Really Need Multiplications in Deep Learning?"
Created 2020-02-25
31 commits to master branch, last one 3 years ago
43
886
apache-2.0
14
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
Created 2023-12-01
125 commits to master branch, last one 9 months ago
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
Created 2023-06-12
50 commits to main branch, last one about a year ago
[NeurIPS 2024 Spotlight]"LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang
Created 2023-11-26
73 commits to main branch, last one 3 months ago
List of papers related to neural network quantization in recent AI conferences and journals.
Created 2022-01-01
35 commits to main branch, last one 21 days ago
30
340
unknown
10
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Created 2024-01-31
12 commits to main branch, last one 9 months ago
Explorations into some recent techniques surrounding speculative decoding
Created 2023-08-27
74 commits to main branch, last one 3 months ago
30
239
unknown
7
[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference
Created 2020-07-26
38 commits to master branch, last one 3 years ago
12
233
apache-2.0
8
On-device LLM Inference Powered by X-Bit Quantization
Created 2024-04-09
61 commits to main branch, last one 6 days ago
19
229
unknown
9
(CVPR 2021, Oral) Dynamic Slimmable Network
Created 2021-03-23
12 commits to main branch, last one 3 years ago
20
222
apache-2.0
7
[ECCV2022] Efficient Long-Range Attention Network for Image Super-resolution
Created 2022-03-12
8 commits to main branch, last one 2 years ago
📚 Collection of awesome generation acceleration resources.
Created 2024-07-14
230 commits to main branch, last one a day ago
12
196
apache-2.0
3
[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising
Created 2024-05-31
64 commits to main branch, last one about a month ago
16
179
apache-2.0
10
[ECCV 2022] Official implementation of the paper "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"
Created 2022-04-25
47 commits to main branch, last one 2 years ago
21
107
mit
5
Official code repository for Sketch-of-Thought (SoT)
Created 2025-03-03
21 commits to main branch, last one 15 days ago
[NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching
Created 2024-06-03
22 commits to main branch, last one 9 months ago
[NeurIPS'23] Speculative Decoding with Big Little Decoder
Created 2023-02-10
11,217 commits to main branch, last one about a year ago
11
89
apache-2.0
6
Soft Threshold Weight Reparameterization for Learnable Sparsity
Created 2020-04-11
53 commits to master branch, last one 3 years ago
[ICLR 2022] Code for Graph-less Neural Networks: Teaching Old MLPs New Tricks via Distillation (GLNN)
Created 2021-10-27
33 commits to main branch, last one 5 months ago
6
67
unknown
7
[Official Implementation] Acoustic Autoregressive Modeling 🔥
Created 2024-08-16
28 commits to main branch, last one 7 months ago
[NeurIPS'24] Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy
Created 2024-10-11
17 commits to master branch, last one 2 months ago
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
Created 2023-06-26
19 commits to main branch, last one 6 months ago
Implementation of AAAI 21 paper: Nested Named Entity Recognition with Partially Observed TreeCRFs
Created 2020-12-10
10 commits to main branch, last one 3 years ago
9
50
apache-2.0
5
Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.
This repository has been archived (exclude archived)
Created 2021-10-07
16 commits to main branch, last one 3 years ago
Code for Learning to Zoom and Unzoom (CVPR 2023)
Created 2023-03-16
7 commits to main branch, last one about a year ago
Code for WF-IoT paper 'TinyML Benchmark: Executing Fully Connected Neural Networks on Commodity Microcontrollers'
Created 2021-05-08
40 commits to main branch, last one 2 years ago
Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML'24)
Created 2024-05-31
10 commits to main branch, last one 8 months ago