3 results found Sort:

260
4.2k
apache-2.0
40
Solve Visual Understanding with Reinforced VLMs
Created 2025-02-06
165 commits to main branch, last one a day ago
Explore the Multimodal “Aha Moment” on 2B Model
Created 2025-02-24
148 commits to main branch, last one 24 hours ago
13
224
apache-2.0
5
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Created 2024-10-15
30 commits to main branch, last one 9 days ago