Search Results - RepositoryStats

4 results found Sort:

Filter by Primary Language:
Python (2)
Jupyter Notebook (1)
Pascal (1)
+

212

mit

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT...

gpt llm ppo lora peft rlhf llama vicuna chatgpt pytorch finetune vicuna-7b reward-models

Created 2023-04-22

21 commits to main branch, last one about a year ago

LLM4RL ZJLAB-AMMI

unknown

A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM

llm ppo vicuna-7b vicuna-13b interaction reinforcement-learning

Created 2023-06-05

27 commits to main branch, last one 7 months ago

llm_notebooks danielsobrado

unknown

Concepts and examples on using and training LLMs

llms open unslo llama3 pytorch llamacpp langchain vicuna-7b open-llama llama-index transformers huggingface-transformers

Created 2023-04-30

27 commits to main branch, last one 10 months ago

AI-Playground-DesktopClient FMXExpress

mit

Language Model playground to access StableLM, ChatGPT, and more.

ai gpt gpt-4 linux macos delphi openai chatgpt desktop windows llama-7b stablelm replicate vicuna-7b desktop-app object-pascal language-model

Created 2023-04-21

12 commits to main branch, last one about a year ago