1 result found Sort:
ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.
Created
2023-06-06
23 commits to main branch, last one about a year ago