TEGEE: Task dEfinition Guided Expert Ensembling for Generalizable and Few-shot Learning

Qu, Xingwei; Liang, Yiming; Wang, Yucheng; Zheng, Tianyu; Yue, Tommy; Bu, Xingyuan; Ma, Lei; Huang, Stephen W.; Zhang, Jiajun; Shi, Yinan; Lin, Chenghua; Fu, Jie; Zhang, Ge

arXiv:2403.04233 (cs)

[Submitted on 7 Mar 2024 (v1), last revised 14 Dec 2024 (this version, v3)]

Abstract: Large Language Models (LLMs) can perform in-context learning (ICL), acquiring new tasks directly from the examples provided in demonstrations. This process is thought to operate through an implicit task selection mechanism that extracts and processes task definitions from these demonstrations. However, critical questions remain: Which is more essential, task extraction or definition? And how can these capabilities be further improved? To address these questions, we propose TEGEE (Task Definition Guided Expert Ensembling), a method that explicitly extracts task definitions and generates responses based on specific tasks. Our framework uses two 3B models, each assigned a distinct role: one focuses on task definition extraction, while the other handles learning from demonstrations. This modular approach supports the hypothesis that extracting task definitions is more vital than processing the task itself. Empirical evaluations show that TEGEE performs comparably to the larger LLaMA2-13B model. By leveraging a modular design, our approach extends traditional ICL from few-shot to many-shot learning, supporting an unlimited number of demonstrations and enhancing continual learning capabilities.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.04233 [cs.CL]
	(or arXiv:2403.04233v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.04233

Submission history

From: Yiming Liang
[v1] Thu, 7 Mar 2024 05:26:41 UTC (1,204 KB)
[v2] Sun, 16 Jun 2024 06:44:50 UTC (1,204 KB)
[v3] Sat, 14 Dec 2024 14:39:57 UTC (1,212 KB)
