Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
Created 2025-02-10
53 commits to main branch, last one 28 days ago