3 results found Sort:

[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
Created 2024-05-19
40 commits to main branch, last one about a month ago
4
40
other
10
CiteME is a benchmark designed to test the abilities of language models in finding papers that are cited in scientific texts.
Created 2024-06-11
23 commits to main branch, last one 3 months ago
Latxa: An Open Language Model and Evaluation Suite for Basque
Created 2024-02-19
55 commits to main branch, last one 8 months ago