deshwalmahesh / PHUDGE

Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the available tool, methods, repo, code etc to detect hallucination, LLM evaluation, grading and much more.

Date Created 2024-05-11 (about a month ago)
Commits 29 (last one 29 days ago)
Stargazers 44 (1 this week)
Watchers 1 (0 this week)
Forks 6
License unknown
Ranking

RepositoryStats indexes 534,551 repositories, of these deshwalmahesh/PHUDGE is ranked #459,196 (14th percentile) for total stargazers, and #497,194 for total watchers. Github reports the primary language for this repository as Jupyter Notebook, for repositories using this language it is ranked #11,882/14,929.

deshwalmahesh/PHUDGE is also tagged with popular topics, for these it's ranked: pytorch (#4,943/5588),  ai (#2,518/3134),  nlp (#2,008/2260),  llm (#1,599/2043),  ml (#462/534)

Other Information

deshwalmahesh/PHUDGE has Github issues enabled, there is 1 open issue and 0 closed issues.

Homepage URL: https://arxiv.org/abs/2405.08029

Star History

Github stargazers over time

Watcher History

Github watchers over time, collection started in '23

Recent Commit History

29 commits on the default branch (main) since jan '22

Yearly Commits

Commits to the default branch (main) per year

Issue History

Languages

The primary language is Jupyter Notebook but there's also others...

updated: 2024-06-25 @ 12:53am, id: 799117410 / R_kgDOL6GQYg