10 results found Sort:
- Filter by Primary Language:
- Python (7)
- Jupyter Notebook (2)
- +
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform ro...
Created
2022-11-07
770 commits to main branch, last one 6 months ago
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
Created
2023-11-06
167 commits to main branch, last one 2 months ago
Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
Created
2024-07-08
7 commits to main branch, last one 5 months ago
up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources
llm
mlm
lvlm
mllm
hallucination
hallucination-survey
large-language-models
hallucination-research
vision-language-models
hallucination-benchmark
hallucination-detection
hallucination-evaluation
hallucination-mitigation
multimodal-language-model
large-vision-language-models
multimodal-large-language-models
Created
2024-03-15
52 commits to master branch, last one 4 days ago
An Easy-to-use Hallucination Detection Framework for LLMs.
Created
2023-12-31
56 commits to main branch, last one 9 months ago
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the availab...
Created
2024-05-11
30 commits to main branch, last one 6 months ago
VerifAI initiative to build open-source easy-to-deploy generative question-answering engine that can reference and verify answers for correctness (using posteriori model)
Created
2023-05-23
320 commits to main branch, last one 4 days ago
[ACL 2024] ANAH & [NeurIPS 2024] ANAH-v2
Created
2024-04-09
19 commits to main branch, last one about a month ago
VideoHallucer, The first comprehensive benchmark for hallucination detection in large video-language models (LVLMs)
Created
2024-04-07
53 commits to main branch, last one 7 months ago
[ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.
Created
2024-04-20
40 commits to main branch, last one 4 months ago