T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients

Ortigossa, Evandro S.; Dias, Fábio F.; Barr, Brian; Silva, Claudio T.; Nonato, Luis Gustavo

Computer Science > Machine Learning

arXiv:2404.16495 (cs)

[Submitted on 25 Apr 2024 (v1), last revised 6 Aug 2024 (this version, v2)]

Title:T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients

Authors:Evandro S. Ortigossa, Fábio F. Dias, Brian Barr, Claudio T. Silva, Luis Gustavo Nonato

View PDF HTML (experimental)

Abstract:The development of machine learning applications has increased significantly in recent years, motivated by the remarkable ability of learning-powered systems to discover and generalize intricate patterns hidden in massive datasets. Modern learning models, while powerful, often have a level of complexity that renders them opaque black boxes, resulting in a notable lack of transparency that hinders our ability to decipher their reasoning. Opacity challenges the interpretability and practical application of machine learning, especially in critical domains where understanding the underlying reasons is essential for informed decision-making. Explainable Artificial Intelligence (XAI) rises to address that challenge, unraveling the complexity of black boxes by providing elucidating explanations. Among the various XAI approaches, feature attribution/importance stands out for its capacity to delineate the significance of input features in the prediction process. However, most existing attribution methods have limitations, such as instability, when divergent explanations may result from similar or even the same instance. This work introduces T-Explainer, a novel local additive attribution explainer based on Taylor expansion. It has desirable properties, such as local accuracy and consistency, making T-Explainer stable over multiple runs. We demonstrate T-Explainer's effectiveness in quantitative benchmark experiments against well-known attribution methods. Additionally, we provide several tools to evaluate and visualize explanations, turning T-Explainer into a comprehensive XAI framework.

Comments:	16 pages -- 2 figures and 20 tables -- Under review. This work has been submitted to the IEEE for possible publication
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2404.16495 [cs.LG]
	(or arXiv:2404.16495v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2404.16495

Submission history

From: Evandro S. Ortigossa [view email]
[v1] Thu, 25 Apr 2024 10:40:49 UTC (197 KB)
[v2] Tue, 6 Aug 2024 15:03:50 UTC (102 KB)

Computer Science > Machine Learning

Title:T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:T-Explainer: A Model-Agnostic Explainability Framework Based on Gradients

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators