3 results found Sort:
Python SDK for running evaluations on LLM generated responses
Created
2023-11-22
455 commits to main branch, last one a day ago
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
Created
2023-11-19
40 commits to main branch, last one 4 months ago
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
Created
2023-07-24
925 commits to main branch, last one 6 days ago