5 results found Sort:

521
5.4k
apache-2.0
45
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
Created 2025-02-08
174 commits to master branch, last one 4 days ago
Simple extension on vLLM to help you speed up reasoning model without training.
Created 2025-01-09
66 commits to main branch, last one about a month ago
MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
Created 2025-04-01
8 commits to main branch, last one 6 days ago
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
Created 2025-02-28
17 commits to main branch, last one 14 days ago
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
Created 2025-03-12
9 commits to main branch, last one 29 days ago