FactCHD: Benchmarking Fact-Conflicting Hallucination Detection

Published 18 Oct 2023 in cs.CL, cs.AI, cs.CV, cs.IR, and cs.LG | arXiv:2310.12086v3

Abstract: Despite their impressive generative capabilities, LLMs are hindered by fact-conflicting hallucinations in real-world applications. The accurate identification of hallucinations in texts generated by LLMs, especially in complex inferential scenarios, is a relatively unexplored area. To address this gap, we present FactCHD, a dedicated benchmark designed for the detection of fact-conflicting hallucinations from LLMs. FactCHD features a diverse dataset that spans various factuality patterns, including vanilla, multi-hop, comparison, and set operation. A distinctive element of FactCHD is its integration of fact-based evidence chains, significantly enhancing the depth of evaluating the detectors' explanations. Experiments on different LLMs expose the shortcomings of current approaches in detecting factual errors accurately. Furthermore, we introduce Truth-Triangulator that synthesizes reflective considerations by tool-enhanced ChatGPT and LoRA-tuning based on Llama2, aiming to yield more credible detection through the amalgamation of predictive results and evidence. The benchmark dataset is available at https://github.com/zjunlp/FactCHD.


Summary

  • The paper presents FactCHD, a benchmark for detecting and explaining fact-conflicting outputs from large language models.
  • It draws on diverse data sources to simulate realistic query-response scenarios across factuality patterns, including multi-hop and comparative reasoning.
  • Empirical evaluations with models such as GPT-3.5-turbo and Llama2-chat expose the weaknesses of current detectors and motivate the proposed Truth-Triangulator framework and specialized tuning approaches.

FactCHD: Benchmarking Fact-Conflicting Hallucination Detection

The paper presents FactCHD, a dedicated benchmark for detecting fact-conflicting hallucinations in outputs generated by LLMs. While LLMs have demonstrated significant generative capabilities, their tendency to produce factually inaccurate or hallucinatory text poses a barrier to deployment in critical domains such as finance, healthcare, and law. This work tackles the relatively unexplored problem of hallucination detection by establishing a comprehensive framework and dataset for evaluating LLMs' ability to recognize and explain factual inconsistencies.

Core Contributions

  1. Introduction of FactCHD: FactCHD is a benchmark designed to detect hallucinations stemming from conflicting facts. Unlike traditional fact-verification tasks, it simulates a realistic "Query-Response" scenario in which explicit claims or evidence may be absent. The dataset incorporates diverse factuality patterns, including vanilla, multi-hop, comparison, and set operation, each presenting unique challenges in reasoning and fact comprehension.
  2. Diverse Data Collection: The dataset spans multiple domains, drawing on varied sources such as knowledge graphs (KGs) and text corpora to reflect real-world application scenarios. Notably, FactCHD organizes these data within a categorical framework that treats vanilla, multi-hop reasoning, comparative analysis, and set operations as its fundamental factuality patterns.
  3. Golden Evidence Chains: FactCHD introduces golden chains of evidence to evaluate the capacity of hallucination detectors not only to identify non-factual statements but also to provide coherent, accurate explanations for their judgments. This emphasis on explanation is a key distinguishing feature of the dataset (a schematic instance illustrating the task format and an evidence chain is sketched after this list).
  4. Evaluation of LLMs and Approaches: The paper provides empirical evaluations using models such as GPT-3.5-turbo, Llama2-chat, and Alpaca across several learning paradigms: zero-shot prompting, in-context learning, and specifically tuned detection models. Results show significant variability in performance, highlighting the effectiveness of specialized tuning and knowledge-augmented approaches.
  5. Truth-Triangulator Framework: To enhance the reliability of hallucination detection, the authors propose the Truth-Triangulator, a framework inspired by triangulation theory. It cross-references multiple evidence sources, employing a Truth Seeker (a tool-enhanced ChatGPT-style detector) and a Truth Guardian (a LoRA-tuned Llama2 detector) to independently assess the factual accuracy of responses before a Fact Verdict Manager reconciles their judgments into a consensus (a simplified control-flow sketch follows the example instance below).
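
To make the task format concrete, here is a minimal sketch of what a FactCHD-style instance might look like. The field names (pattern, query, response, label, evidence_chain) are illustrative assumptions rather than the dataset's released schema; the linked GitHub repository documents the actual format.

```python
# A hypothetical FactCHD-style instance. The task: given a query-response
# pair, judge whether the response is FACTUAL or NON-FACTUAL and justify
# the verdict with a chain of evidence. Field names are illustrative.
example_instance = {
    # one of: "vanilla", "multi-hop", "comparison", "set operation"
    "pattern": "multi-hop",
    "query": "Which country is the director of Parasite from?",
    "response": "Bong Joon-ho, who directed Parasite, is from Japan.",
    "label": "NON-FACTUAL",
    # the golden evidence chain that grounds the verdict
    "evidence_chain": [
        "Parasite (2019) was directed by Bong Joon-ho.",
        "Bong Joon-ho was born in Daegu, South Korea.",
        "Hence the claim that he is from Japan conflicts with the facts.",
    ],
}
```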

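Below is a simplified sketch of the Truth-Triangulator control flow, assuming the Truth Seeker is a tool-enhanced ChatGPT-style detector and the Truth Guardian is a LoRA-tuned Llama2 detector, as the paper describes. The function names, the zero-shot prompt wording, and the conflict-resolution rule are illustrative assumptions, not the authors' exact implementation.

```python
from dataclasses import dataclass

# Illustrative zero-shot prompt in the spirit of the paper's
# Query-Response setting (wording is an assumption, not the paper's).
ZERO_SHOT_PROMPT = (
    "Given the QUERY and RESPONSE below, decide whether the RESPONSE is "
    "FACTUAL or NON-FACTUAL, and justify the verdict with a chain of "
    "evidence.\nQUERY: {query}\nRESPONSE: {response}"
)

@dataclass
class Judgment:
    verdict: str   # "FACTUAL" or "NON-FACTUAL"
    evidence: str  # rationale / evidence chain supporting the verdict

def truth_seeker(query: str, response: str) -> Judgment:
    """Placeholder: a tool-enhanced detector (e.g., ChatGPT with search
    or retrieval tools) prompted with ZERO_SHOT_PROMPT."""
    raise NotImplementedError

def truth_guardian(query: str, response: str) -> Judgment:
    """Placeholder: a specialized detector (e.g., Llama2 fine-tuned
    with LoRA on FactCHD-style training data)."""
    raise NotImplementedError

def fact_verdict_manager(seeker: Judgment, guardian: Judgment) -> Judgment:
    """Triangulate the two judgments. If they agree, merge their
    evidence; on conflict, this sketch defers to the tool-enhanced
    seeker, whose verdict is grounded in retrieved evidence rather
    than parametric memory. (Resolution rule is illustrative only.)"""
    if seeker.verdict == guardian.verdict:
        return Judgment(seeker.verdict,
                        seeker.evidence + "\n" + guardian.evidence)
    return seeker

def truth_triangulator(query: str, response: str) -> Judgment:
    return fact_verdict_manager(
        truth_seeker(query, response),
        truth_guardian(query, response),
    )
```
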
Implications and Future Directions

The introduction of FactCHD has several implications for the deployment of LLMs in sensitive or high-stakes environments. By providing a structured means to assess and improve hallucination detection, it supports the development of more reliable AI systems. Additionally, by combining evidence-based explanations with detection, FactCHD encourages transparency in model decision-making processes.

From a theoretical perspective, the benchmark offers a paradigm for understanding complex factual relationships within generated text, paving the way for more sophisticated LLM designs that could inherently manage fact verification. Practically, the approach embodied in FactCHD could facilitate the creation of AI tools better attuned to diverse, nuanced applications, ultimately enhancing trustworthiness and user confidence in AI systems.

For ongoing research and future developments, FactCHD invites further work on scalable knowledge integration, on leveraging advances in retrieval-augmented generation, and on methods that counteract the inherent limitations of LLMs in real-world fact-checking. Such efforts will be crucial in refining hallucination detection frameworks to meet the rigors of real-world deployment.
