3 results found Sort:
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
Created
2024-05-19
40 commits to main branch, last one about a month ago
CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.
Created
2024-06-11
23 commits to main branch, last one 3 months ago
Latxa: An Open Language Model and Evaluation Suite for Basque
Created
2024-02-19
55 commits to main branch, last one 8 months ago