Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up

Yuan, Jiahao; Du, Dehui; Zhang, Hao; Di, Zixiang; Naseem, Usman

Computer Science > Computation and Language

arXiv:2410.12323 (cs)

[Submitted on 16 Oct 2024]

Title:Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up

Authors:Jiahao Yuan, Dehui Du, Hao Zhang, Zixiang Di, Usman Naseem

View PDF HTML (experimental)

Abstract:Large language models (LLMs) have shown remarkable performance in reasoning tasks but face limitations in mathematical and complex logical reasoning. Existing methods to improve LLMs' logical capabilities either involve traceable or verifiable logical sequences that generate more reliable responses by constructing logical structures yet increase computational costs, or introduces rigid logic template rules, reducing flexibility. In this paper, we propose Reversal of Thought (RoT), a novel framework aimed at enhancing the logical reasoning abilities of LLMs. RoT utilizes a Preference-Guided Reverse Reasoning warm-up strategy, which integrates logical symbols for pseudocode planning through meta-cognitive mechanisms and pairwise preference self-evaluation to generate task-specific prompts solely through demonstrations, aligning with LLMs' cognitive preferences shaped by Reinforcement Learning with Human Feedback (RLHF). Through reverse reasoning, we ultilize a Cognitive Preference Manager to assess knowledge boundaries and further expand LLMs' reasoning capabilities by aggregating solution logic for known tasks and stylistic templates for unknown tasks. Experiments across various tasks demonstrate that RoT surpasses existing baselines in both reasoning accuracy and efficiency.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2410.12323 [cs.CL]
	(or arXiv:2410.12323v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.12323

Submission history

From: Jiahao Yuan [view email]
[v1] Wed, 16 Oct 2024 07:44:28 UTC (5,511 KB)

Computer Science > Computation and Language

Title:Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Reversal of Thought: Enhancing Large Language Models with Preference-Guided Reverse Reasoning Warm-up

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators