Search Results - RepositoryStats

lmdeploy InternLM

528

6.1k

apache-2.0

48

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

llm llama llama2 llama3 internlm codellama deepspeed turbomind cuda-kernels llm-inference fastertransformer

Created 2023-06-15

1,237 commits to main branch, last one a day ago

safe-rlhf PKU-Alignment

119

1.4k

apache-2.0

16

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Created 2023-05-15

111 commits to main branch, last one 10 months ago

KnowLM zjunlp

132

1.3k

mit

11

An Open-sourced Knowledgable Large Language Model Framework.

Created 2023-04-01

540 commits to main branch, last one 3 months ago

glake antgroup

40

455

apache-2.0

7

GLake: optimizing GPU memory management and IO transmission.

gpu llm onnx memory pytorch deepspeed

Created 2023-06-06

54 commits to master branch, last one 4 months ago

LLaMA-Cult-and-More shm007g

24

446

mit

33

Large Language Models for All, 🦙 Cult and More, Stay in touch !

gpt llm ggml gpt4 gptq llama alpaca vicuna chatgpt loralib pytorch deepspeed tensorflow transformers

Created 2023-03-30

27 commits to main branch, last one about a year ago

MPP-LLaVA Coobiw

23

437

unknown

6

Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Train...

mllm qwen deepspeed fine-tuning pretraining model-parallel pipeline-parallelism video-language-model video-large-language-models multimodal-large-language-models

Created 2023-10-24

135 commits to master branch, last one about a month ago

finetune-gpt2xl Xirider

74

437

mit

5

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and finetune GPT-NEO (2.7 B) on a single GPU with Huggingface Transformers using DeepSpeed

gpt2 gpt3 gpt-neo deepspeed finetuning huggingface gpt-neo-fine-tuning huggingface-transformers

Created 2021-03-26

118 commits to main branch, last one 2 years ago

CoLLiE OpenMOSS

58

415

apache-2.0

11

Collaborative Training of Large Language Models in an Efficient Way

nlp pytorch deepspeed deep-learning

Created 2023-04-02

980 commits to main branch, last one 7 months ago

distributed-training-guide LambdaLabsML

28

391

mit

6

Best practices & guides on how to write distributed pytorch training code

gpu mpi cuda fsdp nccl slurm cluster pytorch sharding deepspeed kuberentes lambdalabs gpu-cluster distributed-training

Created 2024-07-31

271 commits to main branch, last one about a month ago

RLHF sunzeyeah

36

287

unknown

7

Implementation of Chinese ChatGPT

glm nlp pangu chatgpt pytorch deepspeed deep-learning

Created 2023-02-17

371 commits to master branch, last one about a year ago

ReaLHF openpsi-project

17

277

apache-2.0

5

Super-Efficient RLHF Training of LLMs with Parameter Reallocation

llm deepspeed megatron-lm llm-training transformers llm-framework distributed-systems distributed-computing large-language-models reinforcement-learning large-scale-machine-learning reinforcement-learning-from-human-feedback

Created 2024-06-18

1,078 commits to main branch, last one 3 months ago

llms_tool stanleylsx

20

214

apache-2.0

4

一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。

moss qwen bloom llama aquila falcon llama2 xverse aquila2 chatglm mistral pytorch baichuan chatglm2 chatglm3 internlm baichuan2 deepspeed

Created 2023-07-20

268 commits to main branch, last one about a year ago

llama2-lora-fine-tuning git-cloner

14

174

mit

3

llama2 finetuning with deepspeed and lora

lora llama2 deepspeed finetuning

Created 2023-07-21

8 commits to main branch, last one about a year ago

LearnDeepSpeed bobo0810

2

158

mit

1

DeepSpeed教程 & 示例注释 & 学习笔记（大模型高效训练）

examples deepspeed large-language-models

Created 2023-08-11

16 commits to main branch, last one about a year ago

ChatGLM-LoRA-RLHF-PyTorch jackaduma

10

134

mit

6

A full pipeline to finetune ChatGLM LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the ChatGLM architecture. Basically ChatG...

gpt llm ppo lora peft rlhf llama chatglm chatgpt pytorch finetune deepspeed chatglm-6b reward-models

Created 2023-04-18

17 commits to main branch, last one about a year ago

revlib HomebrewML

6

127

bsd-2-clause

4

Simple and efficient RevNet-Library for PyTorch with XLA and DeepSpeed support and parameter offload

tpu xla revnet pytorch deepspeed momentumnet deep-learning

Created 2021-08-17

97 commits to master branch, last one 2 years ago

gdGPT CoinCheung

8

95

apache-2.0

1

Train llm (bloom, llama, baichuan2-7b, chatglm3-6b) with deepspeed pipeline mode. Faster than zero/zero++/fsdp.

llm nlp bloom llama2 pytorch pipeline deepspeed chatglm3-6b baichuan2-7b mixtral-8x7b full-finetune flash-attention model-parallization

Created 2023-06-24

27 commits to master branch, last one about a year ago

llm-inference OpenCSGs

16

80

apache-2.0

12

llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deployment, such as UI, RESTful API, auto-scaling, computing resource...

ray vllm deepspeed llama-cpp transformer llm-inference

Created 2024-02-28

116 commits to main branch, last one 11 months ago

LLM-Pretrain-SFT xyjigsaw

16

77

apache-2.0

4

Scripts of LLM pre-training and fine-tuning (w/wo LoRA, DeepSpeed)

lora llama mistral baichuan2 deepspeed large-language-models

Created 2023-11-14

21 commits to master branch, last one about a year ago

train_law_llm billvsme

11

72

unknown

2

✏️0成本LLM微调上手项目，⚡️一步一步使用colab训练法律LLM，基于microsoft/phi-1_5、chatglm3，包含lora微调，全参微调

ai law llm lora llama2 python deepspeed

Created 2023-11-07

31 commits to master branch, last one about a year ago

l2hmc-qcd saforem2

9

68

apache-2.0

5

Application of the L2HMC algorithm to simulations in lattice QCD.

hmc mcmc hydra horovod lattice pytorch deepspeed tensorflow lattice-qcd monte-carlo gauge-theory deep-learning machine-learning hamiltonian-monte-carlo

Created 2019-03-21

6,366 commits to main branch, last one about a year ago

Alpaca-LoRA-RLHF-PyTorch jackaduma

6

58

mit

4

A full pipeline to finetune Alpaca LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Alpaca architecture. Basically ChatGPT...

gpt llm ppo lora peft rlhf llama alpaca chatgpt pytorch finetune deepspeed reward-models

Created 2023-04-18

25 commits to main branch, last one about a year ago

Toy-RecLM glb400

4

54

mit

1

A toy large model for recommender system based on LLaMA2/SASRec/Meta's generative recommenders. Besides, note and experiments of official implementation for Meta's generative recommenders.

llama2 sasrec deepspeed recommender-system large-language-models actions-speak-louder-than-words

Created 2024-03-29

36 commits to main branch, last one 11 months ago