2 results found Sort:
A brief and partial summary of RLHF algorithms.
Created
2024-11-15
14 commits to main branch, last one 28 days ago
[EMNLP 2022] Continual Training of Language Models for Few-Shot Learning
Created
2022-10-11
15 commits to main branch, last one about a year ago