MuPT: A Generative Symbolic Music Pretrained Transformer

Qu, Xingwei; Bai, Yuelin; Ma, Yinghao; Zhou, Ziya; Lo, Ka Man; Liu, Jiaheng; Yuan, Ruibin; Min, Lejun; Liu, Xueling; Zhang, Tianyu; Du, Xinrun; Guo, Shuyue; Liang, Yiming; Li, Yizhi; Wu, Shangda; Zhou, Junting; Zheng, Tianyu; Ma, Ziyang; Han, Fengze; Xue, Wei; Xia, Gus; Benetos, Emmanouil; Yue, Xiang; Lin, Chenghua; Tan, Xu; Huang, Stephen W.; Fu, Jie; Zhang, Ge

Computer Science > Sound

arXiv:2404.06393 (cs)

[Submitted on 9 Apr 2024 (v1), last revised 5 Nov 2024 (this version, v4)]

Title:MuPT: A Generative Symbolic Music Pretrained Transformer

Abstract:In this paper, we explore the application of Large Language Models (LLMs) to the pre-training of music. While the prevalent use of MIDI in music modeling is well-established, our findings suggest that LLMs are inherently more compatible with ABC Notation, which aligns more closely with their design and strengths, thereby enhancing the model's performance in musical composition. To address the challenges associated with misaligned measures from different tracks during generation, we propose the development of a Synchronized Multi-Track ABC Notation (SMT-ABC Notation), which aims to preserve coherence across multiple musical tracks. Our contributions include a series of models capable of handling up to 8192 tokens, covering 90% of the symbolic music data in our training set. Furthermore, we explore the implications of the Symbolic Music Scaling Law (SMS Law) on model performance. The results indicate a promising direction for future research in music generation, offering extensive resources for community-led research through our open-source contributions.

Subjects:	Sound (cs.SD); Artificial Intelligence (cs.AI); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2404.06393 [cs.SD]
	(or arXiv:2404.06393v4 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2404.06393

Submission history

From: Tianyu Zheng [view email]
[v1] Tue, 9 Apr 2024 15:35:52 UTC (1,547 KB)
[v2] Wed, 10 Apr 2024 15:09:52 UTC (1,547 KB)
[v3] Tue, 10 Sep 2024 12:58:22 UTC (1,558 KB)
[v4] Tue, 5 Nov 2024 15:40:25 UTC (1,558 KB)

Computer Science > Sound

Title:MuPT: A Generative Symbolic Music Pretrained Transformer

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:MuPT: A Generative Symbolic Music Pretrained Transformer

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators