2 results found

A brief and partial summary of RLHF algorithms.
Created 2024-11-15
14 commits to main branch, last one 28 days ago
1
44
unknown
7
[EMNLP 2022] Continual Training of Language Models for Few-Shot Learning
Created 2022-10-11
15 commits to main branch, last one about a year ago