12 results found Sort:

405
5.0k
mit
21
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command ...
Created 2023-04-28
3,165 commits to main branch, last one 12 hours ago
320
4.3k
other
31
AI Observability & Evaluation
Created 2022-11-09
4,036 commits to main branch, last one 13 hours ago
193
2.2k
apache-2.0
21
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform ro...
Created 2022-11-07
770 commits to main branch, last one 4 months ago
93
2.1k
apache-2.0
15
ETL, Analytics, Versioning for Unstructured Data
Created 2024-06-25
419 commits to main branch, last one a day ago
Python SDK for running evaluations on LLM generated responses
Created 2023-11-22
639 commits to main branch, last one 3 days ago
Generate ideal question-answers for testing RAG
This repository has been archived (exclude archived)
Created 2023-07-04
60 commits to master branch, last one a day ago
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
Created 2023-11-19
40 commits to main branch, last one 10 months ago
6
75
apache-2.0
2
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
Created 2023-07-24
1,067 commits to main branch, last one 3 months ago
2
57
unknown
3
Бенчмарк сравнивает русские аналоги ChatGPT: Saiga, YandexGPT, Gigachat
Created 2023-08-23
112 commits to master branch, last one about a year ago
12
29
apache-2.0
1
🎯 Your free LLM evaluation toolkit helps you assess the accuracy of facts, how well it understands context, its tone, and more. This helps you see how good your LLM applications are.
Created 2024-02-17
280 commits to main branch, last one 3 months ago
An open source library for asynchronous querying of LLM endpoints
Created 2024-04-03
529 commits to main branch, last one 16 days ago