ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries

Rahman, Raian; Hasan, Rizvi; Farhad, Abdullah Al; Laskar, Md Tahmid Rahman; Ashmafee, Md. Hamjajul; Kamal, Abu Raihan Mostofa

doi:10.21428/594757db.0b1f96f6

Computer Science > Computation and Language

arXiv:2304.13620 (cs)

[Submitted on 26 Apr 2023 (v1), last revised 11 Jun 2023 (this version, v3)]

Title:ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries

Authors:Raian Rahman, Rizvi Hasan, Abdullah Al Farhad, Md Tahmid Rahman Laskar, Md. Hamjajul Ashmafee, Abu Raihan Mostofa Kamal

View PDF

Abstract:Automatic chart to text summarization is an effective tool for the visually impaired people along with providing precise insights of tabular data in natural language to the user. A large and well-structured dataset is always a key part for data driven models. In this paper, we propose ChartSumm: a large-scale benchmark dataset consisting of a total of 84,363 charts along with their metadata and descriptions covering a wide range of topics and chart types to generate short and long summaries. Extensive experiments with strong baseline models show that even though these models generate fluent and informative summaries by achieving decent scores in various automatic evaluation metrics, they often face issues like suffering from hallucination, missing out important data points, in addition to incorrect explanation of complex trends in the charts. We also investigated the potential of expanding ChartSumm to other languages using automated translation tools. These make our dataset a challenging benchmark for future research.

Comments:	Accepted as a long paper at the Canadian AI 2023
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2304.13620 [cs.CL]
	(or arXiv:2304.13620v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2304.13620
Related DOI:	https://doi.org/10.21428/594757db.0b1f96f6

Submission history

From: Raian Rahman [view email]
[v1] Wed, 26 Apr 2023 15:25:24 UTC (6,961 KB)
[v2] Sat, 29 Apr 2023 17:22:08 UTC (6,961 KB)
[v3] Sun, 11 Jun 2023 04:07:27 UTC (6,961 KB)

Computer Science > Computation and Language

Title:ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators