4 results found Sort:

A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT...
Created 2023-04-22
21 commits to main branch, last one about a year ago
11
52
unknown
2
A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Created 2023-06-05
25 commits to main branch, last one 29 days ago
Concepts and examples on using and training LLMs
Created 2023-04-30
27 commits to main branch, last one about a month ago
Language Model playground to access StableLM, ChatGPT, and more.
Created 2023-04-21
12 commits to main branch, last one 12 months ago