Benchmarking XAI Explanations with Human-Aligned Evaluations

Kazmierczak, Rémi; Azzolin, Steve; Berthier, Eloïse; Hedström, Anna; Delhomme, Patricia; Bousquet, Nicolas; Frehse, Goran; Mancini, Massimiliano; Caramiaux, Baptiste; Passerini, Andrea; Franchi, Gianni

Computer Science > Computer Vision and Pattern Recognition

arXiv:2411.02470 (cs)

[Submitted on 4 Nov 2024]

Title:Benchmarking XAI Explanations with Human-Aligned Evaluations

Authors:Rémi Kazmierczak, Steve Azzolin, Eloïse Berthier, Anna Hedström, Patricia Delhomme, Nicolas Bousquet, Goran Frehse, Massimiliano Mancini, Baptiste Caramiaux, Andrea Passerini, Gianni Franchi

View PDF HTML (experimental)

Abstract:In this paper, we introduce PASTA (Perceptual Assessment System for explanaTion of Artificial intelligence), a novel framework for a human-centric evaluation of XAI techniques in computer vision. Our first key contribution is a human evaluation of XAI explanations on four diverse datasets (COCO, Pascal Parts, Cats Dogs Cars, and MonumAI) which constitutes the first large-scale benchmark dataset for XAI, with annotations at both the image and concept levels. This dataset allows for robust evaluation and comparison across various XAI methods. Our second major contribution is a data-based metric for assessing the interpretability of explanations. It mimics human preferences, based on a database of human evaluations of explanations in the PASTA-dataset. With its dataset and metric, the PASTA framework provides consistent and reliable comparisons between XAI techniques, in a way that is scalable but still aligned with human evaluations. Additionally, our benchmark allows for comparisons between explanations across different modalities, an aspect previously unaddressed. Our findings indicate that humans tend to prefer saliency maps over other explanation types. Moreover, we provide evidence that human assessments show a low correlation with existing XAI metrics that are numerically simulated by probing the model.

Comments:	this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2411.02470 [cs.CV]
	(or arXiv:2411.02470v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2411.02470

Submission history

From: Gianni Franchi [view email]
[v1] Mon, 4 Nov 2024 15:18:20 UTC (2,539 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Benchmarking XAI Explanations with Human-Aligned Evaluations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Benchmarking XAI Explanations with Human-Aligned Evaluations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators