1 result found Sort:

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
Created 2024-01-19
140 commits to main branch, last one 2 months ago