On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations

Albini, Emanuele; Sharma, Shubham; Mishra, Saumitra; Dervovic, Danial; Magazzeni, Daniele

doi:10.1145/3600211.3604676

Computer Science > Artificial Intelligence

arXiv:2307.06941 (cs)

[Submitted on 13 Jul 2023]

Title:On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations

Authors:Emanuele Albini, Shubham Sharma, Saumitra Mishra, Danial Dervovic, Daniele Magazzeni

View PDF

Abstract:Explainable Artificial Intelligence (XAI) has received widespread interest in recent years, and two of the most popular types of explanations are feature attributions, and counterfactual explanations. These classes of approaches have been largely studied independently and the few attempts at reconciling them have been primarily empirical. This work establishes a clear theoretical connection between game-theoretic feature attributions, focusing on but not limited to SHAP, and counterfactuals explanations. After motivating operative changes to Shapley values based feature attributions and counterfactual explanations, we prove that, under conditions, they are in fact equivalent. We then extend the equivalency result to game-theoretic solution concepts beyond Shapley values. Moreover, through the analysis of the conditions of such equivalence, we shed light on the limitations of naively using counterfactual explanations to provide feature importances. Experiments on three datasets quantitatively show the difference in explanations at every stage of the connection between the two approaches and corroborate the theoretical findings.

Comments:	Accepted at AIES 2023
Subjects:	Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Computer Science and Game Theory (cs.GT); Human-Computer Interaction (cs.HC); Machine Learning (cs.LG)
ACM classes:	I.2; I.5; H.5; F.2
Cite as:	arXiv:2307.06941 [cs.AI]
	(or arXiv:2307.06941v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2307.06941
Journal reference:	AIES '23: Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society
Related DOI:	https://doi.org/10.1145/3600211.3604676

Submission history

From: Emanuele Albini [view email]
[v1] Thu, 13 Jul 2023 17:57:21 UTC (2,675 KB)

Computer Science > Artificial Intelligence

Title:On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:On the Connection between Game-Theoretic Feature Attributions and Counterfactual Explanations

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators