2 results found Sort:

ZYN: Zero-Shot Reward Models with Yes-No Questions
Created 2023-03-03
21 commits to main branch, last one about a year ago
2
26
unknown
2
realize the reinforcement learning training for gpt2 llama bloom and so on llm model
Created 2023-04-19
123 commits to main branch, last one about a year ago