9 results found Sort:
- Filter by Primary Language:
- Python (6)
- Jupyter Notebook (2)
- JavaScript (1)
- +
Latest Advances on System-2 Reasoning
Created
2025-02-09
46 commits to main branch, last one 2 days ago
Explore the Multimodal “Aha Moment” on 2B Model
Created
2025-02-24
147 commits to main branch, last one a day ago
Collect every awesome work about r1!
Created
2025-01-30
81 commits to main branch, last one 6 hours ago
Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".
Created
2025-02-10
53 commits to main branch, last one 27 days ago
Doge Family of Small Language Model
Created
2025-02-01
268 commits to main branch, last one a day ago
Model Context Protocol server for DeepSeek's advanced language models
Created
2025-01-21
21 commits to main branch, last one about a month ago
SOTA RL fine-tuning solution for advanced math reasoning of LLM
Created
2025-02-09
36 commits to main branch, last one 8 days ago
Notebooks to demo the use of Azure AI Python SDK / LangChain with DeepSeek R1 reasoning model in Azure AI Foundry.
Created
2025-02-02
30 commits to main branch, last one about a month ago
使用langchain进行任务规划,构建子任务的会话场景资源,通过MCTS任务执行器,来让每个子任务通过在上下文中资源,通过自身反思探索来获取自身对问题的最优答案;这种方式依赖模型的对齐偏好,我们在每种偏好上设计了一个工程框架,来完成自我对不同答案的奖励进行采样策略
Created
2023-09-16
226 commits to main branch, last one 9 days ago