14 results found Sort:

199
2.3k
apache-2.0
19
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform ro...
Created 2022-11-07
770 commits to main branch, last one 9 months ago
LettuceDetect is a hallucination detection framework for RAG applications.
Created 2025-02-05
55 commits to main branch, last one 20 days ago
17
162
apache-2.0
10
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
Created 2023-11-06
167 commits to main branch, last one 5 months ago
up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources
Created 2024-03-15
55 commits to master branch, last one 19 days ago
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
Created 2024-07-08
7 commits to main branch, last one 8 months ago
VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for correctness (using posteriori model)
Created 2023-05-23
325 commits to main branch, last one about a month ago
4
58
apache-2.0
4
An Easy-to-use Hallucination Detection Framework for LLMs.
Created 2023-12-31
56 commits to main branch, last one about a year ago
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the availab...
Created 2024-05-11
30 commits to main branch, last one 9 months ago
[ICLR 2025] Code and Data Repo for Paper "Latent Space Chain-of-Embedding Enables Output-free LLM Self-Evaluation"
Created 2024-12-09
7 commits to master branch, last one 4 months ago
4
43
apache-2.0
2
[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2 & [ICLR 2025] Mask-DPO
Created 2024-04-09
24 commits to main branch, last one 23 days ago
🔎Official code for our paper: "VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation".
Created 2024-11-07
8 commits to main branch, last one about a month ago
1
30
apache-2.0
2
Try out HallOumi, a state-of-the-art claim verification model in a simple UI!
Created 2025-03-19
26 commits to main branch, last one 24 days ago
VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)
Created 2024-04-07
54 commits to main branch, last one 25 days ago