10 results found

197
2.2k
apache-2.0
20
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform ro...
Created 2022-11-07
770 commits to main branch, last one 6 months ago
17
158
apache-2.0
10
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
Created 2023-11-06
167 commits to main branch, last one 2 months ago
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
Created 2024-07-08
7 commits to main branch, last one 5 months ago
3
56
apache-2.0
5
An Easy-to-use Hallucination Detection Framework for LLMs.
Created 2023-12-31
56 commits to main branch, last one 9 months ago
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the availab...
Created 2024-05-11
30 commits to main branch, last one 6 months ago
The VerifAI initiative to build an open-source, easy-to-deploy generative question-answering engine that can reference and verify answers for correctness (using a posteriori model)
Created 2023-05-23
320 commits to main branch, last one 4 days ago
3
29
apache-2.0
3
[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2
Created 2024-04-09
19 commits to main branch, last one about a month ago
VideoHallucer, the first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)
Created 2024-04-07
53 commits to main branch, last one 7 months ago