deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without a custom rubric or reference answer, using absolute or relative grading, and much more. It also contains a list of the available tools, methods, repos, and code for hallucination detection, LLM evaluation, grading, and more.
RepositoryStats indexes 595,856 repositories. Of these, deshwalmahesh/PHUDGE is ranked #475,677 (20th percentile) for total stargazers and #544,643 for total watchers. GitHub reports the primary language for this repository as Jupyter Notebook; among repositories using this language, it is ranked #12,574/17,543.
deshwalmahesh/PHUDGE has GitHub issues enabled; there is 1 open issue and 0 closed issues.
Homepage URL: https://arxiv.org/abs/2405.08029
Star History
GitHub stargazers over time
Watcher History
GitHub watchers over time (collection started in '23)
Recent Commit History
30 commits on the default branch (main) since Jan '22
Yearly Commits
Commits to the default branch (main) per year
Issue History
Languages
The primary language is Jupyter Notebook, but there are also others...
updated: 2024-11-27 @ 12:05pm, id: 799117410 / R_kgDOL6GQYg