IterSelectTune: An Iterative Training Framework for Efficient Instruction-Tuning Data Selection

Song, Jielin; Liu, Siyu; Zhu, Bin; Rao, Yanghui

arXiv:2410.13464 (cs)

[Submitted on 17 Oct 2024]

Abstract: As large language models (LLMs) continue to advance, instruction tuning has become critical for improving their ability to generate accurate and contextually appropriate responses. Although numerous instruction-tuning datasets have been developed to enhance LLM performance, selecting high-quality instruction data from large source datasets typically demands significant human effort. In this work, we introduce IterSelectTune, an efficient, cost-effective iterative training policy for selecting high-quality instruction data with no human involvement and limited reliance on GPT-4. By fine-tuning on approximately 20% of the source data, our method consistently outperforms models fine-tuned on the full dataset across multiple benchmarks and public test datasets. These results highlight the effectiveness of our approach in enhancing LLM performance while reducing the computational resources required for instruction tuning.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2410.13464 [cs.CL]
	(or arXiv:2410.13464v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.13464

Submission history

From: Jielin Song
[v1] Thu, 17 Oct 2024 11:48:57 UTC (2,457 KB)
