Research Article

SG-Fusion: A Swin-Transformer and graph convolution-based multi-modal deep neural network for glioma prognosis

Published: 01 November 2024

Abstract

Integrating morphological attributes extracted from histopathological images with genomic data holds significant promise for advancing tumor diagnosis, prognosis, and grading. Histopathological images, acquired through microscopic examination of tissue slices, provide valuable insights into cellular structures and pathological features, while genomic data characterize tumor gene expression and function. Fusing these two distinct data types is crucial for a more comprehensive understanding of tumor characteristics and progression. Many earlier studies relied on single-modal approaches to tumor diagnosis, which cannot fully exploit the information available across data sources. To address this limitation, researchers have turned to multi-modal methods that leverage both histopathological images and genomic data concurrently; these methods better capture the multifaceted nature of tumors and improve diagnostic accuracy. Nonetheless, existing multi-modal methods tend to oversimplify both the modality-specific feature extraction and the fusion process. In this study, we present a dual-branch neural network, SG-Fusion. For the histopathological modality, we use the Swin-Transformer architecture to capture both local and global features and incorporate contrastive learning to encourage the model to discern commonalities and differences in the representation space. For the genomic modality, we develop a graph convolutional network based on gene functional and expression-level similarities. In addition, the model integrates a cross-attention module to enhance information interaction between the modalities and employs divergence-based regularization to improve generalization. Validation on glioma datasets from The Cancer Genome Atlas (TCGA) demonstrates that SG-Fusion outperforms both single-modal methods and existing multi-modal approaches in survival analysis and tumor grading.
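To make the fusion step concrete, below is a minimal PyTorch sketch of a cross-attention module of the kind the abstract describes, together with one plausible form of a divergence-based regularizer (symmetric KL between the two branches' predictions). All names, dimensions, and design choices here are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class CrossAttentionFusion(nn.Module):
    """Toy cross-attention fusion of image and gene features (illustrative only)."""

    def __init__(self, dim: int = 256, num_heads: int = 4):
        super().__init__()
        # Each modality queries the other: image tokens attend to gene
        # tokens, and gene tokens attend to image tokens.
        self.img_to_gene = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.gene_to_img = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.fuse = nn.Linear(2 * dim, dim)

    def forward(self, img_feat: torch.Tensor, gene_feat: torch.Tensor) -> torch.Tensor:
        # img_feat:  (B, N_img, dim)  patch embeddings from the image branch
        # gene_feat: (B, N_gene, dim) node embeddings from the genomic branch
        img_attn, _ = self.img_to_gene(img_feat, gene_feat, gene_feat)
        gene_attn, _ = self.gene_to_img(gene_feat, img_feat, img_feat)
        # Mean-pool each attended sequence, concatenate, and project to a
        # joint embedding for the survival / grading heads.
        fused = torch.cat([img_attn.mean(dim=1), gene_attn.mean(dim=1)], dim=-1)
        return self.fuse(fused)


def divergence_regularizer(img_logits: torch.Tensor, gene_logits: torch.Tensor) -> torch.Tensor:
    """Symmetric KL between the two branches' class distributions.

    One plausible form of divergence-based regularization; the paper's
    exact formulation may differ.
    """
    log_p, log_q = img_logits.log_softmax(-1), gene_logits.log_softmax(-1)
    return 0.5 * (F.kl_div(log_p, log_q.exp(), reduction="batchmean")
                  + F.kl_div(log_q, log_p.exp(), reduction="batchmean"))


# Shapes only; random tensors stand in for the two branches' outputs.
fused = CrossAttentionFusion()(torch.randn(2, 64, 256), torch.randn(2, 100, 256))
print(fused.shape)  # torch.Size([2, 256])
```

In this sketch each modality's tokens attend to the other's, so morphological patches can be reweighted by genomic context and vice versa before pooling, while the regularizer nudges the two branches toward consistent predictions, one common way to encourage generalization.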

Highlights

Developed a multi-modal, multi-task framework for glioma diagnosis.
Integrated Swin-Transformer v2 with contrastive learning to enhance image features.
Implemented a novel gene selection method to reduce data redundancy (the genomic graph branch is sketched after this list).
Enhanced feature quality with cross-modal attention and divergence regularization.
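The genomic branch is a graph convolutional network built from gene functional and expression-level similarities. As a rough sketch of how such a graph might be constructed and convolved, the snippet below thresholds absolute Pearson correlation to obtain an adjacency matrix and applies the symmetrically normalized propagation rule of Kipf and Welling. The threshold value, the omission of functional similarity, and all function names are assumptions, not the paper's method.

```python
import numpy as np
import torch
import torch.nn as nn


def expression_similarity_adjacency(expr: np.ndarray, threshold: float = 0.7) -> torch.Tensor:
    """Normalized gene-gene adjacency from expression similarity.

    expr: (n_samples, n_genes) expression matrix. Genes whose absolute
    Pearson correlation exceeds `threshold` are connected; the paper also
    uses functional similarity, which this sketch omits.
    """
    corr = np.corrcoef(expr.T)                        # (n_genes, n_genes)
    adj = (np.abs(corr) > threshold).astype(np.float32)
    np.fill_diagonal(adj, 1.0)                        # add self-loops
    d_inv_sqrt = np.diag(1.0 / np.sqrt(adj.sum(axis=1)))
    return torch.from_numpy(d_inv_sqrt @ adj @ d_inv_sqrt)  # D^-1/2 A D^-1/2


class GCNLayer(nn.Module):
    """One graph-convolution layer, H' = ReLU(A_hat H W), after Kipf & Welling."""

    def __init__(self, in_dim: int, out_dim: int):
        super().__init__()
        self.linear = nn.Linear(in_dim, out_dim)

    def forward(self, h: torch.Tensor, a_hat: torch.Tensor) -> torch.Tensor:
        # h: (n_genes, in_dim) node features; a_hat: normalized adjacency.
        return torch.relu(a_hat @ self.linear(h))
```

Stacking two or three such layers over the selected-gene graph would yield the gene embeddings that enter the cross-attention fusion sketched above.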



Published In

Artificial Intelligence in Medicine, Volume 157, Issue C, November 2024, 404 pages

Publisher

Elsevier Science Publishers Ltd., United Kingdom

Author Tags

1. Histopathological images
2. Genomic data
3. Tumor diagnosis
4. Multimodal learning
5. Contrastive learning
