TAT-LLM: A Specialized Language Model for Discrete Reasoning over Tabular and Textual Data

Zhu, Fengbin; Liu, Ziyang; Feng, Fuli; Wang, Chao; Li, Moxin; Chua, Tat-Seng


arXiv:2401.13223 (cs)

[Submitted on 24 Jan 2024 (v1), last revised 28 Sep 2024 (this version, v3)]



Abstract: In this work, we address question answering (QA) over hybrid tabular and textual data, which is very common on the Web (e.g., SEC filings) and often requires discrete reasoning capabilities. Recently, large language models (LLMs) like GPT-4 have demonstrated strong multi-step reasoning capabilities. We therefore consider harnessing the power of LLMs to solve our task. We abstract a Step-wise Pipeline for tabular and textual QA, which consists of three key steps: Extractor, Reasoner and Executor. We first design an instruction to instantiate the pipeline and validate that GPT-4 outperforms all existing methods. However, using an online LLM like GPT-4 poses various challenges in terms of cost, latency, and data security risk, which motivates us to specialize smaller LLMs for this task. We develop the TAT-LLM language model by fine-tuning LLaMA 2 on training data generated automatically from existing expert-annotated datasets following the Step-wise Pipeline. The experimental results verify that our TAT-LLM model outperforms all baseline models, including the previous best fine-tuned models and very large-scale LLMs like GPT-4, on the FinQA, TAT-QA and TAT-DQA benchmarks.
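
To make the three steps named in the abstract concrete, here is a minimal sketch of an Extractor, Reasoner and Executor chain on a toy table. It is not the authors' implementation: the abstract only names the steps, so the function names, the `TABLE` data, and the rule-based logic are all hypothetical stand-ins for what the paper does with an instructed or fine-tuned LLM.

```python
import ast
import operator

# Hypothetical toy table; in the paper the input is a hybrid of a table and text.
TABLE = {"Revenue": {"2019": 120.0, "2018": 100.0}}

def extractor(question, table):
    """Step 1 (assumed behaviour): select the table cells the question refers to."""
    row = next(name for name in table if name.lower() in question.lower())
    years = sorted((y for y in table[row] if y in question), reverse=True)
    return {f"{row}_{y}": table[row][y] for y in years}

def reasoner(evidence):
    """Step 2 (assumed behaviour): write an arithmetic expression over the evidence.
    Hard-coded here to a relative-change formula purely for illustration."""
    new, old = list(evidence.values())
    return f"({new} - {old}) / {old}"

# Only the four basic binary operators are supported in this sketch.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}

def executor(expression):
    """Step 3: evaluate the expression deterministically, without eval()."""
    def walk(node):
        if isinstance(node, ast.Expression):
            return walk(node.body)
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.left), walk(node.right))
        raise ValueError("unsupported expression")
    return walk(ast.parse(expression, mode="eval"))

def answer(question, table):
    """Run the three steps in order and return every intermediate result."""
    evidence = extractor(question, table)
    expression = reasoner(evidence)
    return evidence, expression, executor(expression)
```

Calling `answer("What was the percentage change in revenue from 2018 to 2019?", TABLE)` returns the extracted cells, the generated expression, and its numeric value. Keeping the intermediate outputs separate is the point of the sketch: each step can be inspected on its own.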

Comments:	Accepted by ICAIF 24
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2401.13223 [cs.CL]
	(or arXiv:2401.13223v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2401.13223

Submission history

From: Fengbin Zhu
[v1] Wed, 24 Jan 2024 04:28:50 UTC (886 KB)
[v2] Thu, 22 Feb 2024 13:36:56 UTC (392 KB)
[v3] Sat, 28 Sep 2024 01:40:33 UTC (392 KB)
