Trending repositories for topic lora
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Finetune Llama 3.3, Mistral, Phi, Qwen 2.5 & Gemma LLMs 2-5x faster with 70% less memory
《开源大模型食用指南》 (A Beginner's Guide to Open-Source LLMs): tutorials tailored for Chinese newcomers on quickly fine-tuning (full-parameter/LoRA) and deploying open-source LLMs and multimodal large models (MLLMs), both domestic and international, in a Linux environment
🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga...
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, Llama3.2, GLM4, Internlm2.5, Yi1.5, Mistral, Baichuan2, DeepSeek, ...) or 100+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, InternVL...
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models" (a minimal sketch of the LoRA update appears after this list)
OneTrainer is a one-stop solution for all your stable diffusion training needs.
SD-Trainer: LoRA & Dreambooth training scripts and GUI built on kohya-ss's trainer, for diffusion models.
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
A toolbox for deep learning model deployment using C++: YoloX | YoloV7 | YoloV8 | Gan | OCR | MobileVit | Scrfd | MobileSAM | StableDiffusion
The cryptography-based networking stack for building unstoppable networks with LoRa, Packet Radio, WiFi and everything in between.
ESP32/ESP8285-based High-Performance Radio Link for RC applications
Firefly: an LLM training toolkit supporting Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
End-to-end generative AI industry projects on LLMs, with deployment (Awesome LLM Projects)
📚 Text Classification with LoRA (Low-Rank Adaptation) of Language Models - Efficiently fine-tune large language models for text classification tasks using the Stanford Sentiment Treebank (SST-2) data...
A microframework on top of PyTorch with first-class citizen APIs for foundation model adaptation
Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.
LXMF client for Android, Linux and macOS allowing you to communicate with people or LXMF-compatible systems over Reticulum networks using LoRa, Packet Radio, WiFi, I2P, or anything else Reticulum supp...
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models (a sketch of the low-rank split appears after this list)
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning (a minimal usage example appears after this list).
Consistency Distillation with Target Timestep Selection and Decoupled Guidance
A generalized framework for subspace tuning methods in parameter-efficient fine-tuning.
LLaVA-NeXT-Image-Llama3-Lora, modified from https://github.com/arielnlee/LLaVA-1.6-ft
Python library for transmitting and receiving data using LoRa and FSK modems
A toolbox for deep learning model deployment using C++ YoloX | YoloV7 | YoloV8 | Gan | OCR | MobileVit | Scrfd | MobileSAM | StableDiffusion
Arduino LoRa EBYTE E220 LLCC68 device library, complete and tested with Arduino, esp8266, esp32, STM32 and Raspberry Pi Pico (rp2040 boards).
Emotion text classification using Llama3-8b with LoRA and FlashAttention. Based on LLaMA-Factory.
Realtime web UI to run against a Meshtastic regional or private mesh network.
A simple mesh network communications app powered by the Reticulum Network Stack.
BELLE: Be Everyone's Large Language model Engine (an open-source Chinese conversational LLM)
Codebase for "CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation"
Monitor your Meshtastic network with the Meshtastic Prometheus exporter
An add-on module for PortaPack that adds extra features for more fun.
A suite of tools for easy image tagging. Focused on LoRA training dataset creation and preparation
[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation (a sketch of the DoRA reparameterization appears after this list)
Open Source Application for Advanced LLM Engineering: interact, train, fine-tune, and evaluate large language models on your own computer.
A library for easily merging multiple LLM experts and efficiently training the merged LLM.
Phi-3.5 for Mac: Locally-run Vision and Language Models for Apple Silicon
SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework.
🚀 Easy, open-source LLM finetuning with one-line commands, seamless cloud integration, and popular optimization frameworks. ✨
This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.
Official PyTorch Implementation for the "Recovering the Pre-Fine-Tuning Weights of Generative Models" paper (ICML 2024).
OneDiff: An out-of-the-box acceleration library for diffusion models.
Using Low-rank adaptation to quickly fine-tune diffusion models.
The official code for "Aurora: Activating Chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"
Scripts for LLM pre-training and fine-tuning (with/without LoRA, DeepSpeed)
Heltec Wireless Tracker (MakerFocus) ESP32/OLED/GNSS/LoRa, SKU: ZC-193-915
Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis" (a sketch of the NOLA idea appears after this list)
This repository collects reading notes on top-conference papers relevant to LLM algorithm engineers (multimodal, PEFT, few-shot QA, RAG, LMM interpretability, Agents, CoT)
[3DV 2025] iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views
✏️ A zero-cost, hands-on LLM fine-tuning project: ⚡️ train a legal-domain LLM step by step on Colab, based on microsoft/phi-1_5 and chatglm3, covering both LoRA and full-parameter fine-tuning
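
For reference, the LoRA technique behind loralib (above) freezes the pretrained weight W and learns a low-rank update, so the adapted layer computes W x + (alpha/r) * B A x with A and B of rank r. A minimal PyTorch sketch of that update, with illustrative names and hyperparameters (this is not loralib's actual API):

    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        """Frozen linear layer plus a trainable low-rank update (illustrative)."""
        def __init__(self, in_features, out_features, r=8, alpha=16):
            super().__init__()
            self.base = nn.Linear(in_features, out_features)
            for p in self.base.parameters():
                p.requires_grad_(False)  # pretrained weights stay frozen
            # B starts at zero, so training begins exactly at the base model.
            self.lora_A = nn.Parameter(torch.randn(r, in_features) * 0.01)
            self.lora_B = nn.Parameter(torch.zeros(out_features, r))
            self.scaling = alpha / r

        def forward(self, x):
            return self.base(x) + (x @ self.lora_A.T @ self.lora_B.T) * self.scaling

Only lora_A and lora_B receive gradients, so a rank-8 adapter on a 4096x4096 layer stores about 65K parameters instead of roughly 16.8M.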
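
The 🤗 PEFT library (above) packages adapters like the one sketched here behind a config object. A minimal usage sketch; the checkpoint name is a placeholder and the hyperparameters are illustrative:

    from transformers import AutoModelForCausalLM
    from peft import LoraConfig, TaskType, get_peft_model

    base = AutoModelForCausalLM.from_pretrained("gpt2")  # placeholder checkpoint
    config = LoraConfig(
        task_type=TaskType.CAUSAL_LM,
        r=8,              # rank of the low-rank update
        lora_alpha=16,    # scaling numerator (alpha / r)
        lora_dropout=0.05,
    )
    model = get_peft_model(base, config)
    model.print_trainable_parameters()  # reports trainable vs. total parameters

The wrapped model then trains with an ordinary training loop or Trainer, with only the adapter weights updated.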
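
DoRA (above) decomposes each pretrained weight into a magnitude vector and a direction, applies the LoRA update to the direction only, and trains the magnitude separately: W' = m * (W0 + B A) / ||W0 + B A||_c, where ||.||_c is the column-wise L2 norm. A minimal sketch of that reparameterization, not the official implementation:

    import torch

    def dora_weight(W0, A, B, m):
        """DoRA: magnitude m times the unit direction of the LoRA-updated weight.
        W0: (out, in) frozen weight; B @ A: (out, in) low-rank update;
        m: (1, in) trainable magnitude, initialized to the column norms of W0."""
        V = W0 + B @ A
        col_norm = V.norm(p=2, dim=0, keepdim=True)  # (1, in)
        return m * (V / col_norm)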
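
NOLA (above) compresses adapters below LoRA's rank-one floor by writing each low-rank factor as a linear combination of frozen random basis matrices, so only the combination coefficients are stored and trained. A rough sketch of the idea, with illustrative shapes and basis count:

    import torch
    import torch.nn as nn

    class NOLAFactor(nn.Module):
        """One LoRA factor built from k frozen random bases; only alpha trains."""
        def __init__(self, rows, cols, k=64, seed=0):
            super().__init__()
            g = torch.Generator().manual_seed(seed)  # bases regenerable from the seed
            self.register_buffer("basis", torch.randn(k, rows, cols, generator=g))
            self.alpha = nn.Parameter(torch.zeros(k))  # trainable coefficients

        def factor(self):
            # sum_i alpha_i * basis_i
            return torch.einsum("k,kij->ij", self.alpha, self.basis)

Both the A and B factors of an adapter are built this way, so a checkpoint only needs the coefficient vectors plus the random seed.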
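
SVDQuant (above) makes 4-bit weight quantization viable by peeling off a small high-precision low-rank branch that absorbs weight outliers, leaving a residual with a tamer range: W ≈ L1 L2 + Q(W - L1 L2). A toy sketch of the SVD split only; the method's outlier migration from activations into weights and the actual 4-bit quantizer are omitted:

    import torch

    def svd_split(W, rank=32):
        """Split W into a rank-`rank` branch (kept in 16-bit) and a residual
        (the part a 4-bit quantizer would handle)."""
        U, S, Vh = torch.linalg.svd(W, full_matrices=False)
        L1 = U[:, :rank] * S[:rank]  # (out, rank), singular values folded in
        L2 = Vh[:rank, :]            # (rank, in)
        residual = W - L1 @ L2       # reduced dynamic range, friendlier to 4 bits
        return L1, L2, residual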