4 results found

Evaluate your LLM's response with Prometheus and GPT4 💯
Created 2024-04-18
205 commits to main branch, last one 25 days ago
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
Created 2024-05-19
40 commits to main branch, last one 9 days ago
This is the repo for the survey of Bias and Fairness in IR with LLMs.
Created 2024-03-18
49 commits to main branch, last one 3 months ago