5 results found Sort:

Python SDK for running evaluations on LLM generated responses
Created 2023-11-22
686 commits to main branch, last one 17 hours ago
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
Created 2023-11-19
40 commits to main branch, last one 11 months ago
6
75
apache-2.0
2
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
Created 2023-07-24
1,067 commits to main branch, last one 3 months ago
2
35
unknown
3
[ACL'24] A Knowledge-grounded Interactive Evaluation Framework for Large Language Models
Created 2024-02-23
19 commits to master branch, last one 5 months ago