Search Results - RepositoryStats

OpenFactVerification Libr-AI

48

1.1k

mit

5

Loki: Open-source solution designed to automate the process of verifying factuality

ai factuality hallucination

Created 2024-03-25

48 commits to main branch, last one 5 months ago

Awesome-LLM-Uncertainty-Reliability-Robustness jxzhangjhu

48

721

mit

25

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

llms gpt-3 gpt-4 safety chatgpt prompting robustness calibration reliability awesome-list hallucination chain-of-thought prompt-engineering in-context-learning large-language-models uncertainty-estimation uncertainty-quantification

Created 2023-03-20

156 commits to main branch, last one 11 days ago

Woodpecker BradyFU

31

631

unknown

16

✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models

llm mllm hallucination multimodality hallucinations large-language-models multimodal-large-language-models

Created 2023-09-26

107 commits to main branch, last one 2 months ago

RefChecker amazon-science

38

351

apache-2.0

10

RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

llms factuality hallucination

Created 2023-12-04

102 commits to main branch, last one 4 months ago

LRV-Instruction FuxiaoLiu

13

272

bsd-3-clause

12

[ICLR'24] Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning

gpt vqa iclr gpt-4 llama llava vicuna vision chatgpt iclr2024 evaluation multimodal hallucination object-detection foundation-models evaluation-metrics prompt-engineering vision-and-language

Created 2023-06-15

366 commits to main branch, last one 12 months ago

HallusionBench tianyi-lab

8

270

bsd-3-clause

5

[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

llm lmm vlms gpt-4 llava gpt-4v benchmark benchmarks hallucination large-language-models large-vision-language-models

Created 2023-10-22

136 commits to main branch, last one 3 months ago

ICSFSurvey IAAR-Shanghai

4

160

unknown

4

Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.

decoding reasoning self-refine self-correct hallucination self-feedback attention-head self-correction chain-of-thought self-consistency self-improvement data-augmentation preference-learning internal-consistency large-language-model large-language-models knowledge-distillation

Created 2024-06-01

36 commits to master branch, last one 3 months ago

UHGEval IAAR-Shanghai

17

160

apache-2.0

10

[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.

llm qwen ceval gpt-3 gpt-4 openai chatgpt dataset benchmark evaluation openai-api huggingface hallucination hallucinations large-language-models hallucination-detection hallucination-evaluation huggingface-transformers

Created 2023-11-06

167 commits to main branch, last one 3 months ago

awesome-Large-MultiModal-Hallucination xieyuquanxx

14

149

unknown

5

😎 curated list of awesome LMM hallucinations papers, methods & resources.

lmm multimodal multi-modal hallucination

This repository has been archived (exclude archived)

Created 2023-10-11

57 commits to main branch, last one 11 months ago

TruthX ictnlp

6

144

gpl-3.0

4

Code for ACL 2024 paper "TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space"

llm llms gpt-4 llama llama2 llama3 safety chatglm chatgpt mistral baichuan truthfulness hallucination llm-inference explainable-ai hallucinations language-model representation

Created 2024-02-27

18 commits to main branch, last one 11 months ago

KnowledgeCircuits zjunlp

9

130

mit

8

[NeurIPS 2024] Knowledge Circuits in Pretrained Transformers

circuit transformer hallucination model-editing interpretability knowledge-edting knowledge-circuit knowledge-editing large-language-models artificial-intelligence natural-language-processing

Created 2024-01-16

48 commits to main branch, last one 19 days ago

Awesome-LVLM-Hallucination NishilBalar

5

99

unknown

3

up-to-date curated list of state-of-the-art Large vision language models hallucinations research work, papers & resources

llm mlm lvlm mllm hallucination hallucination-survey large-language-models hallucination-research vision-language-models hallucination-benchmark hallucination-detection hallucination-evaluation hallucination-mitigation multimodal-language-model large-vision-language-models multimodal-large-language-models

Created 2024-03-15

54 commits to master branch, last one 16 days ago

Reliable-LLM AmourWaltz

4

98

unknown

1

This repository has no description...

reliable knowledge uncertainty hallucination

Created 2023-12-14

32 commits to main branch, last one 6 months ago

FactCHD zjunlp

3

86

mit

4

[IJCAI 2024] FactCHD: Benchmarking Fact-Conflicting Hallucination Detection

dataset factchd factual benchmark knowledge hallucination large-language-models natural-language-processing

Created 2023-04-05

22 commits to main branch, last one 10 months ago

LLaVA-Align yfzhang114

2

75

apache-2.0

1

This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual Debias Decoding strategy.

debiasing hallucination large-vision-language-models

Created 2024-01-23

8 commits to main branch, last one 17 days ago

ICD HillZhang1999

6

63

mit

2

Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations"

hallucination decoding-algorithm large-language-models

Created 2023-12-23

10 commits to main branch, last one about a year ago

PHUDGE deshwalmahesh

7

49

unknown

1

Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute, relative and much more. It contains a list of all the availab...

ai ml llm nlp sota judge phi-3 pytorch evaluation finetuning hallucination custom-dataset llm-evaluation feedback-collection hallucination-detection

Created 2024-05-11

30 commits to main branch, last one 8 months ago

LTI_Neural_Navigator anlp-team

3

42

mit

1

"Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases" by Jiarui Li and Ye Yuan and Zehua Zhang

llm nlp rag llama datasets web-crawler hallucination system-design machine-learning dataset-generation large-language-models

Created 2024-02-17

205 commits to main branch, last one 11 months ago

Deco zjunlp

2

40

mit

4

[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation

doco mllm decoding iclr2025 hallucination large-language-models artificial-intelligence hallucination-mitigation natural-language-processing multimodal-large-language-models

Created 2024-09-29

17 commits to main branch, last one 3 months ago

OLAPH dmis-lab

4

38

unknown

2

OLAPH: Improving Factuality in Biomedical Long-form Question Answering

factuality hallucination question-answering biomedical-research

Created 2024-05-12

84 commits to main branch, last one 6 months ago

3D-GRAND sled-group

2

36

unknown

7

[CVPR 2025] 3D-GRAND: Towards Better Grounding and Less Hallucination for 3D-LLMs

3d llm dataset hallucination

Created 2024-06-12

11 commits to main branch, last one 9 months ago

Re-Align taco-group

0

33

apache-2.0

0

A novel alignment framework that leverages image retrieval to mitigate hallucinations in Vision Language Models.

dpo llm ppo vlm mllm rlhf safety alignment hallucination post-training large-language-models vision-language-model hallucination-mitigation multimodal-large-language-models

Created 2025-02-18

12 commits to main branch, last one 19 days ago

EasyDetect zjunlp

2

30

apache-2.0

4

[ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.

aigc knowlm easydetect generation multimodal generative-ai hallucination model-editing knowledge-graph knowledge-editing large-language-models artificial-intelligence hallucination-detection natural-language-processing multimodal-large-language-models

Created 2024-04-20

41 commits to main branch, last one 13 days ago

Principles-of-AI-LLMs dobriban

0

28

cc0-1.0

2

Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring 2025). LLM architectures, training paradigms (pre- and post-training, alignment), test-time computation, reasoning, safety a...

ai llms rlhf safety aisafety circuits alignment education inference robustness fine-tuning jailbreaking transformers hallucination interpretability test-time-computation

Created 2024-12-18

78 commits to main branch, last one 4 days ago