Explainers in the Wild: Making Surrogate Explainers Robust to Distortions through Perception

Hepburn, Alexander; Santos-Rodriguez, Raul

Computer Science > Computer Vision and Pattern Recognition

arXiv:2102.10951 (cs)

[Submitted on 22 Feb 2021 (v1), last revised 16 Jun 2021 (this version, v2)]

Title:Explainers in the Wild: Making Surrogate Explainers Robust to Distortions through Perception

Authors:Alexander Hepburn, Raul Santos-Rodriguez

View PDF

Abstract:Explaining the decisions of models is becoming pervasive in the image processing domain, whether it is by using post-hoc methods or by creating inherently interpretable models. While the widespread use of surrogate explainers is a welcome addition to inspect and understand black-box models, assessing the robustness and reliability of the explanations is key for their success. Additionally, whilst existing work in the explainability field proposes various strategies to address this problem, the challenges of working with data in the wild is often overlooked. For instance, in image classification, distortions to images can not only affect the predictions assigned by the model, but also the explanation. Given a clean and a distorted version of an image, even if the prediction probabilities are similar, the explanation may still be different. In this paper we propose a methodology to evaluate the effect of distortions in explanations by embedding perceptual distances that tailor the neighbourhoods used to training surrogate explainers. We also show that by operating in this way, we can make the explanations more robust to distortions. We generate explanations for images in the Imagenet-C dataset and demonstrate how using a perceptual distances in the surrogate explainer creates more coherent explanations for the distorted and reference images.

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2102.10951 [cs.CV]
	(or arXiv:2102.10951v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2102.10951
Journal reference:	2021 IEEE International Conference on Image Processing (ICIP), Anchorage, Alaska, USA

Submission history

From: Alexander Hepburn [view email]
[v1] Mon, 22 Feb 2021 12:38:53 UTC (1,468 KB)
[v2] Wed, 16 Jun 2021 10:39:04 UTC (1,468 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Explainers in the Wild: Making Surrogate Explainers Robust to Distortions through Perception

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Explainers in the Wild: Making Surrogate Explainers Robust to Distortions through Perception

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators