Learning disentangled representations for explainable chest X-ray classification using Dirichlet VAEs

Harkness, Rachael; Frangi, Alejandro F; Zucker, Kieran; Ravikumar, Nishant

arXiv:2302.02979 (cs)

[Submitted on 6 Feb 2023]

Abstract: This study explores the use of the Dirichlet Variational Autoencoder (DirVAE) for learning disentangled latent representations of chest X-ray (CXR) images. Our working hypothesis is that distributional sparsity, as facilitated by the Dirichlet prior, will encourage disentangled feature learning for the complex task of multi-label classification of CXR images. The DirVAE is trained using CXR images from the CheXpert database, and the predictive capacity of multi-modal latent representations learned by DirVAE models is investigated through implementation of an auxiliary multi-label classification task, with a view to enforcing separation of latent factors according to class-specific features. The predictive performance and explainability of the latent space learned using the DirVAE were quantitatively and qualitatively assessed, respectively, and compared with a standard Gaussian prior-VAE (GVAE). We introduce a new approach for explainable multi-label classification in which we conduct gradient-guided latent traversals for each class of interest. Study findings indicate that the DirVAE is able to disentangle latent factors into class-specific visual features, a property not afforded by the GVAE, and achieves a marginal increase in predictive performance relative to the GVAE. We generate visual examples to show that our explainability method, when applied to the trained DirVAE, is able to highlight regions in CXR images that are clinically relevant to the class(es) of interest and, additionally, can identify cases where classification relies on spurious feature correlations.
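The idea of a gradient-guided latent traversal can be illustrated with a minimal sketch. This is not the authors' implementation: the linear multi-label classifier head (`W`, `b`), the step size, the clip-and-renormalise step that keeps codes on the simplex, and all function names are assumptions made for illustration. In a full pipeline each code along the path would be passed through the trained decoder to visualise how the image changes; the decoder is omitted here.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def class_gradient(z, W, b, k):
    """Gradient of sigmoid(W[k] @ z + b[k]) with respect to the latent code z."""
    p = sigmoid(W[k] @ z + b[k])
    return p * (1.0 - p) * W[k]

def traverse(z0, W, b, k, n_steps=5, step_size=0.1):
    """Step a latent code along the class-k gradient, returning every code visited.

    Assumption: after each step the code is clipped to be positive and
    renormalised so that it stays on the probability simplex, mimicking
    the support of a Dirichlet-distributed latent.
    """
    z = z0.copy()
    path = [z.copy()]
    for _ in range(n_steps):
        g = class_gradient(z, W, b, k)
        z = z + step_size * g / (np.linalg.norm(g) + 1e-12)  # unit-length step
        z = np.clip(z, 1e-8, None)
        z = z / z.sum()
        path.append(z.copy())
    return np.stack(path)

# Hypothetical toy setup: 8 latent dimensions, 3 classes, random classifier head.
rng = np.random.default_rng(0)
latent_dim, n_classes = 8, 3
W = rng.normal(size=(n_classes, latent_dim))
b = np.zeros(n_classes)
z0 = rng.dirichlet(np.full(latent_dim, 0.5))  # sparse-ish starting code

path = traverse(z0, W, b, k=1)
probs = sigmoid(path @ W[1] + b[1])  # class-1 probability along the path
```

Here `path` holds the starting code plus one code per step, and `probs` tracks how the chosen class probability responds. Comparing decoded images at the start and end of such a path is one way to localise the image regions a class depends on.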

Comments:	13 pages, 8 figures, to be published in SPIE Medical Imaging 2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2302.02979 [cs.CV]
	(or arXiv:2302.02979v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2302.02979

Submission history

From: Rachael Harkness
[v1] Mon, 6 Feb 2023 18:10:08 UTC (19,653 KB)
