5 results found Sort:
- Filter by Primary Language:
- Python (3)
- JavaScript (1)
- Jupyter Notebook (1)
- +
UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform ro...
Created
2022-11-07
770 commits to main branch, last one 2 months ago
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA
Created
2023-11-06
153 commits to main branch, last one 4 days ago
Official implementation for the paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"
Created
2024-07-08
7 commits to main branch, last one about a month ago
An Easy-to-use Hallucination Detection Framework for LLMs.
Created
2023-12-31
56 commits to main branch, last one 5 months ago
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the availab...
Created
2024-05-11
30 commits to main branch, last one 2 months ago