3 results found Sort:

12
213
apache-2.0
5
🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Created 2024-10-15
29 commits to main branch, last one 13 days ago
[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection
Created 2024-10-17
5 commits to main branch, last one 4 months ago
A curated list of resources for activation engineering
Created 2025-01-20
39 commits to main branch, last one 2 days ago