4 results found
Evaluate your LLM's response with Prometheus and GPT4 💯
Created 2024-04-18
209 commits to main branch, last one about a month ago
Deliver safe & effective language models
Created 2022-11-18
5,706 commits to main branch, last one about a month ago
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
Created 2024-05-19
41 commits to main branch, last one about a month ago
This is the repository for the survey of bias and fairness in information retrieval (IR) with LLMs.
Created 2024-03-18
55 commits to main branch, last one 16 days ago