4 results found

A full pipeline to fine-tune the Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the Vicuna architecture. Basically ChatGPT...
Created 2023-04-22
21 commits to main branch, last one about a year ago
An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Created 2023-06-05
27 commits to main branch, last one 4 months ago
Concepts and examples on using and training LLMs
Created 2023-04-30
27 commits to main branch, last one 6 months ago
Language Model playground to access StableLM, ChatGPT, and more.
Created 2023-04-21
12 commits to main branch, last one about a year ago