27 results found Sort:

428
4.7k
apache-2.0
38
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Created 2023-06-15
1,012 commits to main branch, last one 2 days ago
250
2.7k
apache-2.0
24
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention)
Created 2023-07-30
952 commits to main branch, last one 4 days ago
120
1.4k
apache-2.0
18
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Created 2023-05-15
111 commits to main branch, last one 5 months ago
Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed
Created 2021-03-26
118 commits to main branch, last one 2 years ago
Large Language Models for All, 🦙 Cult and More, Stay in touch !
Created 2023-03-30
27 commits to main branch, last one about a year ago
58
411
apache-2.0
10
Collaborative Training of Large Language Models in an Efficient Way
Created 2023-04-02
980 commits to main branch, last one 2 months ago
20
381
unknown
4
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train...
Created 2023-10-24
133 commits to master branch, last one about a month ago
GLake: optimizing GPU memory management and IO transmission.
Created 2023-06-06
53 commits to master branch, last one 3 months ago
36
287
unknown
8
Implementation of Chinese ChatGPT
Created 2023-02-17
371 commits to master branch, last one about a year ago
Best practices & guides on how to write distributed pytorch training code
Created 2024-07-31
230 commits to main branch, last one 18 hours ago
19
203
apache-2.0
4
一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测,低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。
Created 2023-07-20
268 commits to main branch, last one 11 months ago
llama2 finetuning with deepspeed and lora
Created 2023-07-21
8 commits to main branch, last one about a year ago
A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatG...
Created 2023-04-18
17 commits to main branch, last one about a year ago
6
124
bsd-2-clause
4
Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload
Created 2021-08-17
97 commits to master branch, last one 2 years ago
5
120
apache-2.0
3
Super-Efficient RLHF Training of LLMs with Parameter Reallocation
Created 2024-06-18
1,071 commits to main branch, last one a day ago
DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)
Created 2023-08-11
16 commits to main branch, last one about a year ago
8
90
apache-2.0
1
Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.
Created 2023-06-24
27 commits to master branch, last one 9 months ago
16
70
apache-2.0
13
llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource...
Created 2024-02-28
116 commits to main branch, last one 6 months ago
Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)
Created 2023-11-14
21 commits to master branch, last one 9 months ago
8
66
apache-2.0
5
Application of the L2HMC algorithm to simulations in lattice QCD.
Created 2019-03-21
6,366 commits to main branch, last one 11 months ago
✏️0成本LLM微调上手项目,⚡️一步一步使用colab训练法律LLM,基于microsoft/phi-1_5、chatglm3,包含lora微调,全参微调
Created 2023-11-07
31 commits to master branch, last one 11 months ago
A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT...
Created 2023-04-18
25 commits to main branch, last one about a year ago
A toy large model for recommender system based on LLaMA2/SASRec/Meta's generative recommenders. Besides, note and experiments of official implementation for Meta's generative recommenders.
Created 2024-03-29
36 commits to main branch, last one 6 months ago
All about large language models
Created 2023-05-09
106 commits to main branch, last one 5 months ago
10
46
unknown
3
Training & Implementation of chatbots leveraging GPT-like architecture with the aitextgen package to enable dynamic conversations.
This repository has been archived (exclude archived)
Created 2021-10-15
369 commits to main branch, last one 2 years ago
3
37
unknown
1
一套代码指令微调大模型
Created 2023-06-09
28 commits to main branch, last one about a year ago