7 results found

694
11.0k
mit
70
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Created 2021-06-18
46 commits to main branch, last one 4 days ago
Constrained optimization toolkit for PyTorch
Created 2020-02-19
224 commits to master branch, last one 2 years ago
13
181
apache-2.0
11
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
Created 2024-06-06
9 commits to main branch, last one 5 months ago
6
140
apache-2.0
3
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
Created 2024-10-22
13 commits to main branch, last one about a month ago
Repository to track the progress in model compression and acceleration
Created 2019-06-05
45 commits to master branch, last one 4 years ago
16
71
bsd-3-clause
9
MUSCO: MUlti-Stage COmpression of neural networks
Created 2019-05-08
59 commits to master branch, last one 3 years ago
This repository contains the code to train Flan-T5 with Alpaca instructions and low-rank adaptation.
Created 2023-03-29
16 commits to main branch, last one about a year ago