1 result found Sort:

6
107
apache-2.0
5
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Created 2024-10-15
26 commits to main branch, last one 13 hours ago