Distilling BlackBox to Interpretable models for Efficient Transfer Learning

Ghosh, Shantanu; Yu, Ke; Batmanghelich, Kayhan

Computer Science > Computer Vision and Pattern Recognition

arXiv:2305.17303 (cs)

[Submitted on 26 May 2023 (v1), last revised 7 Jul 2023 (this version, v7)]

Title:Distilling BlackBox to Interpretable models for Efficient Transfer Learning

Authors:Shantanu Ghosh, Ke Yu, Kayhan Batmanghelich

View PDF

Abstract:Building generalizable AI models is one of the primary challenges in the healthcare domain. While radiologists rely on generalizable descriptive rules of abnormality, Neural Network (NN) models suffer even with a slight shift in input distribution (e.g., scanner type). Fine-tuning a model to transfer knowledge from one domain to another requires a significant amount of labeled data in the target domain. In this paper, we develop an interpretable model that can be efficiently fine-tuned to an unseen target domain with minimal computational cost. We assume the interpretable component of NN to be approximately domain-invariant. However, interpretable models typically underperform compared to their Blackbox (BB) variants. We start with a BB in the source domain and distill it into a \emph{mixture} of shallow interpretable models using human-understandable concepts. As each interpretable model covers a subset of data, a mixture of interpretable models achieves comparable performance as BB. Further, we use the pseudo-labeling technique from semi-supervised learning (SSL) to learn the concept classifier in the target domain, followed by fine-tuning the interpretable models in the target domain. We evaluate our model using a real-life large-scale chest-X-ray (CXR) classification dataset. The code is available at: \url{this https URL}.

Comments:	26th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2023, Early accept
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2305.17303 [cs.CV]
	(or arXiv:2305.17303v7 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2305.17303

Submission history

From: Shantanu Ghosh [view email]
[v1] Fri, 26 May 2023 23:23:48 UTC (6,469 KB)
[v2] Wed, 31 May 2023 22:14:57 UTC (6,470 KB)
[v3] Fri, 2 Jun 2023 19:05:36 UTC (6,470 KB)
[v4] Thu, 8 Jun 2023 06:00:55 UTC (6,456 KB)
[v5] Sat, 10 Jun 2023 06:34:11 UTC (6,457 KB)
[v6] Fri, 16 Jun 2023 20:23:43 UTC (6,457 KB)
[v7] Fri, 7 Jul 2023 21:30:01 UTC (6,470 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Distilling BlackBox to Interpretable models for Efficient Transfer Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Distilling BlackBox to Interpretable models for Efficient Transfer Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators