2 results found

ZYN: Zero-Shot Reward Models with Yes-No Questions
Created 2023-03-03
21 commits to main branch, last one 10 months ago
1
25
unknown
2
Implements reinforcement learning training for LLMs such as GPT-2, LLaMA, BLOOM, and others
Created 2023-04-19
123 commits to main branch, last one 9 months ago