MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

Yue, Xiang; Qu, Xingwei; Zhang, Ge; Fu, Yao; Huang, Wenhao; Sun, Huan; Su, Yu; Chen, Wenhu

Computer Science > Computation and Language

arXiv:2309.05653 (cs)

[Submitted on 11 Sep 2023 (v1), last revised 3 Oct 2023 (this version, v3)]

Title:MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

Authors:Xiang Yue, Xingwei Qu, Ge Zhang, Yao Fu, Wenhao Huang, Huan Sun, Yu Su, Wenhu Chen

View PDF

Abstract:We introduce MAmmoTH, a series of open-source large language models (LLMs) specifically tailored for general math problem-solving. The MAmmoTH models are trained on MathInstruct, our meticulously curated instruction tuning dataset. MathInstruct is compiled from 13 math datasets with intermediate rationales, six of which have rationales newly curated by us. It presents a unique hybrid of chain-of-thought (CoT) and program-of-thought (PoT) rationales, and also ensures extensive coverage of diverse fields in math. The hybrid of CoT and PoT not only unleashes the potential of tool use but also allows different thought processes for different math problems. As a result, the MAmmoTH series substantially outperform existing open-source models on nine mathematical reasoning datasets across all scales with an average accuracy gain between 16% and 32%. Remarkably, our MAmmoTH-7B model reaches 33% on MATH (a competition-level dataset), which exceeds the best open-source 7B model (WizardMath) by 23%, and the MAmmoTH-34B model achieves 44% accuracy on MATH, even surpassing GPT-4's CoT result. Our work underscores the importance of diverse problem coverage and the use of hybrid rationales in developing superior math generalist models.

Comments:	Work in progress; Xiang Yue and Wenhu Chen contributed equally to this paper
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2309.05653 [cs.CL]
	(or arXiv:2309.05653v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.05653

Submission history

From: Xiang Yue [view email]
[v1] Mon, 11 Sep 2023 17:47:22 UTC (608 KB)
[v2] Sun, 1 Oct 2023 15:25:41 UTC (717 KB)
[v3] Tue, 3 Oct 2023 02:48:42 UTC (717 KB)

Computer Science > Computation and Language

Title:MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators