AbdomenAtlas: A Large-Scale, Detailed-Annotated, & Multi-Center Dataset for Efficient Transfer Learning and Open Algorithmic Benchmarking

Li, Wenxuan; Qu, Chongyu; Chen, Xiaoxi; Bassi, Pedro R. A. S.; Shi, Yijia; Lai, Yuxiang; Yu, Qian; Xue, Huimin; Chen, Yixiong; Lin, Xiaorui; Tang, Yutong; Cao, Yining; Han, Haoqi; Zhang, Zheyuan; Liu, Jiawei; Zhang, Tiezheng; Ma, Yujiu; Wang, Jincheng; Zhang, Guang; Yuille, Alan; Zhou, Zongwei

Abstract:We introduce the largest abdominal CT dataset (termed AbdomenAtlas) of 20,460 three-dimensional CT volumes sourced from 112 hospitals across diverse populations, geographies, and facilities. AbdomenAtlas provides 673K high-quality masks of anatomical structures in the abdominal region annotated by a team of 10 radiologists with the help of AI algorithms. We start by having expert radiologists manually annotate 22 anatomical structures in 5,246 CT volumes. Following this, a semi-automatic annotation procedure is performed for the remaining CT volumes, where radiologists revise the annotations predicted by AI, and in turn, AI improves its predictions by learning from revised annotations. Such a large-scale, detailed-annotated, and multi-center dataset is needed for two reasons. Firstly, AbdomenAtlas provides important resources for AI development at scale, branded as large pre-trained models, which can alleviate the annotation workload of expert radiologists to transfer to broader clinical applications. Secondly, AbdomenAtlas establishes a large-scale benchmark for evaluating AI algorithms -- the more data we use to test the algorithms, the better we can guarantee reliable performance in complex clinical scenarios. An ISBI & MICCAI challenge named BodyMaps: Towards 3D Atlas of Human Body was launched using a subset of our AbdomenAtlas, aiming to stimulate AI innovation and to benchmark segmentation accuracy, inference efficiency, and domain generalizability. We hope our AbdomenAtlas can set the stage for larger-scale clinical trials and offer exceptional opportunities to practitioners in the medical imaging community. Codes, models, and datasets are available at this https URL

Comments:	Published in Medical Image Analysis
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2407.16697 [cs.CV]
	(or arXiv:2407.16697v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2407.16697

Computer Science > Computer Vision and Pattern Recognition

Title:AbdomenAtlas: A Large-Scale, Detailed-Annotated, & Multi-Center Dataset for Efficient Transfer Learning and Open Algorithmic Benchmarking

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators