5 results found Sort:
- Filter by Primary Language:
- Python (4)
- Jupyter Notebook (1)
- +
LLaMA-TRL: Fine-tuning LLaMA with PPO and LoRA
Created
2023-03-30
66 commits to main branch, last one about a year ago
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first approach
Created
2023-11-16
52 commits to main branch, last one about a year ago
基于DPO算法微调语言大模型,简单好上手。
Created
2024-03-27
16 commits to master branch, last one 7 months ago
Various training, inference and validation code and results related to Open LLM's that were pretrained (full or partially) on the Dutch language.
Created
2023-07-02
29 commits to main branch, last one 9 months ago