Model-Agnostic Counterfactual Explanations for Consequential Decisions

Karimi, Amir-Hossein; Barthe, Gilles; Balle, Borja; Valera, Isabel

Computer Science > Machine Learning

arXiv:1905.11190v2 (cs)

[Submitted on 27 May 2019 (v1), revised 28 May 2019 (this version, v2), latest version 28 Feb 2020 (v5)]

Title:Model-Agnostic Counterfactual Explanations for Consequential Decisions

Authors:Amir-Hossein Karimi, Gilles Barthe, Borja Balle, Isabel Valera

View PDF

Abstract:Predictive models are being increasingly used to support consequential decision making at the individual level in contexts such as pretrial bail and loan approval. As a result, there is increasing social and legal pressure to provide explanations that help the affected individuals not only to understand why a prediction was output, but also how to act to obtain a desired outcome. To this end, several works have proposed methods to generate counterfactual explanations. However, they are often restricted to a particular subset of models (e.g., decision trees or linear models), and cannot directly handle the mixed (numerical and nominal) nature of the features describing each individual. In this paper, we propose a model-agnostic algorithm to generate counterfactual explanations that builds on the standard theory and tools from formal verification. Specifically, our algorithm solves a sequence of satisfiability problems, where a wide variety of predictive models and distances in mixed feature spaces, as well as natural notions of plausibility and diversity, are represented as logic formulas. Our experiments on real-world data demonstrate that our approach can flexibly handle widely deployed predictive models, while providing meaningfully closer counterfactuals than existing approaches.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Logic in Computer Science (cs.LO); Machine Learning (stat.ML)
Cite as:	arXiv:1905.11190 [cs.LG]
	(or arXiv:1905.11190v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.11190

Submission history

From: Borja Balle [view email]
[v1] Mon, 27 May 2019 13:22:39 UTC (341 KB)
[v2] Tue, 28 May 2019 08:00:19 UTC (341 KB)
[v3] Tue, 8 Oct 2019 10:21:41 UTC (2,980 KB)
[v4] Fri, 14 Feb 2020 16:49:52 UTC (1,993 KB)
[v5] Fri, 28 Feb 2020 16:24:45 UTC (1,993 KB)

Computer Science > Machine Learning

Title:Model-Agnostic Counterfactual Explanations for Consequential Decisions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Model-Agnostic Counterfactual Explanations for Consequential Decisions

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators