Search Results - RepositoryStats

2 results found Sort:

mit

Implements pre-training, supervised fine-tuning (SFT), and reinforcement learning from human feedback (RLHF), to train and fine-tune the LLaMA2 model to follow human instructions, similar to InstructG...

ppo rhlf llam2 qlora instructgpt 4bit-fine-tune

Created 2023-07-08

35 commits to main branch, last one 9 months ago

LLM-Survey anas-zafar

unknown

The official GitHub page for the survey paper "Large language models: a comprehensive survey of its applications, challenges, limitations, and future prospects"

llms rhlf chatgpt generative-ai large-language-models vision-language-model natural-language-processing pre-trained-language-models

Created 2023-06-11

53 commits to main branch, last one about a month ago