5 results found Sort:

190
2.2k
apache-2.0
20
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform ro...
Created 2022-11-07
770 commits to main branch, last one 2 months ago
17
177
apache-2.0
11
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA
Created 2023-11-06
153 commits to main branch, last one 4 days ago
Official implementation for the paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
Created 2024-07-08
7 commits to main branch, last one about a month ago
3
49
apache-2.0
4
An Easy-to-use Hallucination Detection Framework for LLMs.
Created 2023-12-31
56 commits to main branch, last one 5 months ago
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the availab...
Created 2024-05-11
30 commits to main branch, last one 2 months ago