Search Results - RepositoryStats

12

213

apache-2.0

5

🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.

dpo llm ppo grpo rlhf r1-zero alignment online-rl reasoning llm-aligment distributed-rl dueling-bandits llm-exploration online-alignment thompson-sampling distributed-training

Created 2024-10-15

29 commits to main branch, last one 13 days ago

4

40

unknown

1

[NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection

best-of-n acceleration llm-aligment inference-scaling

Created 2024-10-17

5 commits to main branch, last one 4 months ago

1

38

mit

1

A curated list of resources for activation engineering

llm concept control ai-safety concept-rep transparent llm-aligment interpretability large-language-models activation-engineering concept-activation-vector

Created 2025-01-20

39 commits to main branch, last one 2 days ago