3 results found Sort:
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.
Created
2024-09-15
162 commits to main branch, last one a day ago
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Created
2023-07-30
1,009 commits to main branch, last one 2 days ago
Reasoning in Large Language Models: Papers and Resources, including Chain-of-Thought and OpenAI o1 🍓
Created
2022-11-05
173 commits to main branch, last one 4 days ago