CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models

Gong, Zi; Yu, Hang; Liao, Cong; Liu, Bingchang; Chen, Chaoyu; Li, Jianguo

arXiv:2410.06741 (cs)

[Submitted on 9 Oct 2024 (v1), last revised 28 Oct 2024 (this version, v2)]

Abstract: Multi-task learning (MTL) benefits the fine-tuning of large language models (LLMs) by providing a single model with improved performance and generalization ability across tasks, presenting a resource-efficient alternative to developing separate models for each task. Yet existing MTL strategies for LLMs often fall short, either because they are computationally intensive or because they fail to ensure simultaneous task convergence. This paper presents CoBa, a new MTL approach designed to manage task convergence balance effectively with minimal computational overhead. Using Relative Convergence Scores (RCS), Absolute Convergence Scores (ACS), and a Divergence Factor (DF), CoBa dynamically adjusts task weights during training, ensuring that the validation losses of all tasks progress towards convergence at an even pace while mitigating the issue of individual task divergence. Experiments on three disparate datasets show that this approach not only fosters equilibrium in task convergence but also enhances the LLMs' performance by up to 13% relative to the second-best baselines. Code is open-sourced at this https URL.
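
The abstract does not give the formulas for RCS, ACS, or DF, so the following is only a minimal sketch of the general idea of convergence-based task weighting, not the paper's method. It assumes that a task's convergence speed can be approximated by the least-squares slope of its recent validation losses, and that a softmax over those slopes gives faster-converging tasks smaller weights. The names `loss_slope`, `balance_weights`, `window`, and `temperature` are hypothetical, and this simplified sketch does not address task divergence, which the abstract says CoBa handles.

```python
import math


def loss_slope(losses):
    """Least-squares slope of a loss history against its step index."""
    n = len(losses)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2
    mean_y = sum(losses) / n
    cov = sum((i - mean_x) * (y - mean_y) for i, y in enumerate(losses))
    var = sum((i - mean_x) ** 2 for i in range(n))
    return cov / var


def balance_weights(val_histories, window=5, temperature=1.0):
    """Illustrative task weights from validation-loss slopes (an assumption,
    not the paper's RCS/ACS/DF). Weights sum to the number of tasks."""
    slopes = []
    for history in val_histories:
        recent = history[-window:]
        if not recent:
            slopes.append(0.0)
            continue
        # Normalize by the first loss in the window so that tasks with
        # different loss scales are comparable.
        base = recent[0] if recent[0] != 0 else 1.0
        slopes.append(loss_slope([v / base for v in recent]))
    # Softmax over slopes: a steeper decline (more negative slope) yields a
    # smaller weight. Subtract the max for numerical stability.
    scaled = [s / temperature for s in slopes]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    num_tasks = len(val_histories)
    return [num_tasks * e / total for e in exps]


if __name__ == "__main__":
    histories = [
        [1.0, 0.8, 0.6, 0.4, 0.2],       # fast-converging task
        [1.0, 0.98, 0.96, 0.94, 0.92],   # slow-converging task
    ]
    print(balance_weights(histories))
```

In this sketch the slow-converging task receives the larger weight, which nudges the two validation losses toward a similar pace; a training loop would multiply each task's loss by its weight before summing.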

Comments:	15 pages, main conference of EMNLP 2024
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2410.06741 [cs.CL]
	(or arXiv:2410.06741v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.06741

Submission history

From: Zi Gong
[v1] Wed, 9 Oct 2024 10:20:32 UTC (7,917 KB)
[v2] Mon, 28 Oct 2024 15:05:54 UTC (7,917 KB)
