4 results found
- Filter by Primary Language:
- Python (2)
- Jupyter Notebook (1)
- Pascal (1)
A full pipeline to fine-tune the Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the Vicuna architecture. Basically ChatGPT...
Created 2023-04-22
21 commits to main branch, last one about a year ago
An RL approach to enable cost-effective, intelligent interactions between a local agent and a remote LLM
Created 2023-06-05
27 commits to main branch, last one 4 months ago
Concepts and examples on using and training LLMs
Created 2023-04-30
27 commits to main branch, last one 6 months ago
Language Model playground to access StableLM, ChatGPT, and more.
Created 2023-04-21
12 commits to main branch, last one about a year ago