3 results found
Solve Visual Understanding with Reinforced VLMs
Created 2025-02-06
165 commits to main branch, last one a day ago
Explore the Multimodal “Aha Moment” on 2B Model
Created 2025-02-24
148 commits to main branch, last one 24 hours ago
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Created 2024-10-15
30 commits to main branch, last one 9 days ago