Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection

Li, Moxin; Wang, Wenjie; Feng, Fuli; Zhu, Fengbin; Wang, Qifan; Chua, Tat-Seng

Computer Science > Computation and Language

arXiv:2403.09972 (cs)

[Submitted on 15 Mar 2024 (v1), last revised 27 Sep 2024 (this version, v3)]

Title:Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection

Authors:Moxin Li, Wenjie Wang, Fuli Feng, Fengbin Zhu, Qifan Wang, Tat-Seng Chua

View PDF

Abstract:Self-detection for Large Language Models (LLMs) seeks to evaluate the trustworthiness of the LLM's output by leveraging its own capabilities, thereby alleviating the issue of output hallucination. However, existing self-detection approaches only retrospectively evaluate answers generated by LLM, typically leading to the over-trust in incorrectly generated answers. To tackle this limitation, we propose a novel self-detection paradigm that considers the comprehensive answer space beyond LLM-generated answers. It thoroughly compares the trustworthiness of multiple candidate answers to mitigate the over-trust in LLM-generated incorrect answers. Building upon this paradigm, we introduce a two-step framework, which firstly instructs LLM to reflect and provide justifications for each candidate answer, and then aggregates the justifications for comprehensive target answer evaluation. This framework can be seamlessly integrated with existing approaches for superior self-detection. Extensive experiments on six datasets spanning three tasks demonstrate the effectiveness of the proposed framework.

Comments:	EMNLP findings 2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2403.09972 [cs.CL]
	(or arXiv:2403.09972v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.09972

Submission history

From: Moxin Li [view email]
[v1] Fri, 15 Mar 2024 02:38:26 UTC (7,374 KB)
[v2] Tue, 4 Jun 2024 05:42:12 UTC (7,424 KB)
[v3] Fri, 27 Sep 2024 08:22:21 UTC (7,425 KB)

Computer Science > Computation and Language

Title:Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators