dvlab-research / Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Date Created 2024-06-24 (7 months ago)

Commits 22 (last one 9 days ago)

Stargazers 335 (4 this week)

Watchers 2 (0 this week)

Forks 13

License unknown

Ranking

RepositoryStats indexes 608,712 repositories. Of these, dvlab-research/Step-DPO is ranked #125,519 (79th percentile) for total stargazers and #492,614 for total watchers. GitHub reports the primary language for this repository as Python; among repositories using this language, it is ranked #21,595/122,751.
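
The percentile quoted for the stargazer ranking can be checked with a little arithmetic. The sketch below assumes the percentile is the share of indexed repositories ranked below this one, rounded down to a whole number. That definition is an assumption, not RepositoryStats' documented method, and the function name `percentile_from_rank` is hypothetical.

```python
def percentile_from_rank(rank: int, total: int) -> int:
    """Return the whole-number percentage of repositories ranked below `rank`.

    Assumed definition (not confirmed by RepositoryStats):
    floor(100 * (total - rank) / total), computed with integer arithmetic.
    """
    if not 1 <= rank <= total:
        raise ValueError("rank must be between 1 and total")
    return (total - rank) * 100 // total


# Figures taken from the ranking paragraph: #125,519 of 608,712 repositories.
stargazer_percentile = percentile_from_rank(125_519, 608_712)
print(stargazer_percentile)
```

Under this assumption the result is 79, which agrees with the 79th percentile reported for total stargazers.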

dvlab-research/Step-DPO is also tagged with popular topics. Its rankings within these topics are: llm (#943/3103), math (#139/495)

Other Information

dvlab-research/Step-DPO has GitHub issues enabled. There are 18 open issues and 8 closed issues.

All Topics

dpo llm math reasoning

Star History

GitHub stargazers over time

Watcher History

GitHub watchers over time; collection started in '23

Recent Commit History

22 commits on the default branch (main) since Jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The primary language is Python, but there are also others...

updated: 2025-01-25 @ 06:04pm, id: 819228692 / R_kgDOMNRwFA