7 results found
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Created: 2021-06-18
46 commits to main branch, last one 4 days ago
Constrained optimization toolkit for PyTorch
Created: 2020-02-19
224 commits to master branch, last one 2 years ago
Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.
Created: 2024-06-06
9 commits to main branch, last one 5 months ago
ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
Created: 2024-10-22
13 commits to main branch, last one about a month ago
Repository to track the progress in model compression and acceleration
Created: 2019-06-05
45 commits to master branch, last one 4 years ago
MUSCO: MUlti-Stage COmpression of neural networks
Created: 2019-05-08
59 commits to master branch, last one 3 years ago
This repository contains the code to train Flan-T5 with Alpaca instructions and low-rank adaptation.
Created: 2023-03-29
16 commits to main branch, last one about a year ago