uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization

Nagar, Aishik; Liu, Yutong; Liu, Andy T.; Schlegel, Viktor; Dwivedi, Vijay Prakash; Kaliya-Perumal, Arun-Kumar; Kalanchiam, Guna Pratheep; Tang, Yili; Tan, Robby T.

Computer Science > Computation and Language

arXiv:2408.12095 (cs)

[Submitted on 22 Aug 2024 (v1), last revised 26 Aug 2024 (this version, v2)]

Title:uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization

Authors:Aishik Nagar, Yutong Liu, Andy T. Liu, Viktor Schlegel, Vijay Prakash Dwivedi, Arun-Kumar Kaliya-Perumal, Guna Pratheep Kalanchiam, Yili Tang, Robby T. Tan

View PDF HTML (experimental)

Abstract:Medical abstractive summarization faces the challenge of balancing faithfulness and informativeness. Current methods often sacrifice key information for faithfulness or introduce confabulations when prioritizing informativeness. While recent advancements in techniques like in-context learning (ICL) and fine-tuning have improved medical summarization, they often overlook crucial aspects such as faithfulness and informativeness without considering advanced methods like model reasoning and self-improvement. Moreover, the field lacks a unified benchmark, hindering systematic evaluation due to varied metrics and datasets. This paper addresses these gaps by presenting a comprehensive benchmark of six advanced abstractive summarization methods across three diverse datasets using five standardized metrics. Building on these findings, we propose uMedSum, a modular hybrid summarization framework that introduces novel approaches for sequential confabulation removal followed by key missing information addition, ensuring both faithfulness and informativeness. Our work improves upon previous GPT-4-based state-of-the-art (SOTA) medical summarization methods, significantly outperforming them in both quantitative metrics and qualitative domain expert evaluations. Notably, we achieve an average relative performance improvement of 11.8% in reference-free metrics over the previous SOTA. Doctors prefer uMedSum's summaries 6 times more than previous SOTA in difficult cases where there are chances of confabulations or missing information. These results highlight uMedSum's effectiveness and generalizability across various datasets and metrics, marking a significant advancement in medical summarization.

Comments:	12 pages
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2408.12095 [cs.CL]
	(or arXiv:2408.12095v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2408.12095

Submission history

From: Aishik Nagar [view email]
[v1] Thu, 22 Aug 2024 03:08:49 UTC (634 KB)
[v2] Mon, 26 Aug 2024 02:26:31 UTC (634 KB)

Computer Science > Computation and Language

Title:uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:uMedSum: A Unified Framework for Advancing Medical Abstractive Summarization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators