Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint

Chen, Zhipeng; Zhou, Kun; Zhao, Wayne Xin; Wan, Junchen; Zhang, Fuzheng; Zhang, Di; Wen, Ji-Rong

Computer Science > Computation and Language

arXiv:2401.06081 (cs)

[Submitted on 11 Jan 2024 (v1), last revised 17 Jun 2024 (this version, v2)]

Title:Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint

Authors:Zhipeng Chen, Kun Zhou, Wayne Xin Zhao, Junchen Wan, Fuzheng Zhang, Di Zhang, Ji-Rong Wen

View PDF HTML (experimental)

Abstract:Reinforcement learning (RL) has been widely used in training large language models (LLMs) for preventing unexpected outputs, eg reducing harmfulness and errors. However, existing RL methods mostly adopt the instance-level reward, which is unable to provide fine-grained supervision for complex reasoning tasks, and can not focus on the few key tokens that lead to the incorrectness. To address it, we propose a new RL method named RLMEC that incorporates a generative model as the reward model, which is trained by the erroneous solution rewriting task under the minimum editing constraint, and can produce token-level rewards for RL training. Based on the generative reward model, we design the token-level RL objective for training and an imitation-based regularization for stabilizing RL process. And the both objectives focus on the learning of the key tokens for the erroneous solution, reducing the effect of other unimportant tokens. The experiment results on mathematical tasks and question-answering tasks have demonstrated the effectiveness of our approach. Our code and data are available at this https URL.

Comments:	18 pages, Findings of ACL2024
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2401.06081 [cs.CL]
	(or arXiv:2401.06081v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.06081

Submission history

From: Zhipeng Chen [view email]
[v1] Thu, 11 Jan 2024 17:58:41 UTC (7,455 KB)
[v2] Mon, 17 Jun 2024 05:52:06 UTC (7,618 KB)

Computer Science > Computation and Language

Title:Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators