3 results found

801
4.4k
apache-2.0
132
Neural Network Distiller by Intel AI Lab: a Python package for neural network compression research. https://intellabs.github.io/distiller
This repository has been archived.
Created 2018-04-24
643 commits to master branch, last one about a year ago
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
Created 2024-02-26
26 commits to main branch, last one 24 days ago
A curated list of early exiting (LLM, CV, NLP, etc)
Created 2023-08-01
8 commits to main branch, last one 3 months ago