2 results found Sort:

73
579
apache-2.0
13
⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.
Created 2021-03-11
38 commits to master branch, last one 3 years ago
3
68
unknown
0
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
Created 2024-12-11
65 commits to main branch, last one 3 months ago