5 results found Sort:

Python SDK for running evaluations on LLM generated responses
Created 2023-11-22
566 commits to main branch, last one 2 days ago
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
Created 2023-11-19
40 commits to main branch, last one 9 months ago
6
74
apache-2.0
2
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
Created 2023-07-24
1,067 commits to main branch, last one about a month ago
2
32
unknown
3
[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models
Created 2024-02-23
19 commits to master branch, last one 3 months ago