4 results found Sort:
- Filter by Primary Language:
- Python (2)
- Jupyter Notebook (1)
- Pascal (1)
- +
A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the Vicuna architecture. Basically ChatGPT...
Created
2023-04-22
21 commits to main branch, last one about a year ago
A RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Created
2023-06-05
25 commits to main branch, last one 29 days ago
Concepts and examples on using and training LLMs
Created
2023-04-30
27 commits to main branch, last one about a month ago
Language Model playground to access StableLM, ChatGPT, and more.
Created
2023-04-21
12 commits to main branch, last one 12 months ago