12 results found
- Filter by Primary Language:
- Python (9)
- Jupyter Notebook (2)
- TypeScript (1)
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command ...
Created
2023-04-28
3,165 commits to main branch, last one 12 hours ago
AI Observability & Evaluation
Created
2022-11-09
4,036 commits to main branch, last one 13 hours ago
🐢 Open-Source Evaluation & Testing for AI & LLM systems
Created
2022-03-06
10,170 commits to main branch, last one 2 days ago
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform ro...
Created
2022-11-07
770 commits to main branch, last one 4 months ago
ETL, Analytics, Versioning for Unstructured Data
Created
2024-06-25
419 commits to main branch, last one a day ago
Python SDK for running evaluations on LLM generated responses
Created
2023-11-22
639 commits to main branch, last one 3 days ago
Generate ideal question-answers for testing RAG
This repository has been archived
Created
2023-07-04
60 commits to master branch, last one a day ago
A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.
Created
2023-11-19
40 commits to main branch, last one 10 months ago
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)
Created
2023-07-24
1,067 commits to main branch, last one 3 months ago
A benchmark comparing Russian analogues of ChatGPT: Saiga, YandexGPT, Gigachat
Created
2023-08-23
112 commits to master branch, last one about a year ago
🎯 Your free LLM evaluation toolkit helps you assess the accuracy of facts, how well it understands context, its tone, and more. This helps you see how good your LLM applications are.
Created
2024-02-17
280 commits to main branch, last one 3 months ago
An open source library for asynchronous querying of LLM endpoints
Created
2024-04-03
529 commits to main branch, last one 16 days ago