13 Model Interpretability
Interpretability
Instructor: Saravanan Thirumuruganathan
Caveat Emptor
• Assumption: You know how to interpret simple ML models
• The ideas in this lecture are quite simple and were published more than 5 years ago
• They are therefore NOT the state of the art, which is much more complex
Explainable AI
https://hcixaitutorial.github.io/
Why Explainable AI?
• GDPR: Article 22 empowers individuals with the right to demand an explanation of how an
automated system made a decision that affects them.
• Algorithmic Accountability Act 2019: Requires companies to assess the risks that automated
decision systems pose to privacy or security, as well as the risks of inaccurate, unfair,
biased, or discriminatory decisions impacting consumers
• California Consumer Privacy Act: Requires companies to rethink their approach to capturing,
storing, and sharing personal data to align with the new requirements by January 1, 2020.
• Washington Bill 1655: Establishes guidelines for the use of automated decision systems to
protect consumers, improve transparency, and create more market predictability.
• Massachusetts Bill H.2701: Establishes a commission on automated decision-making,
transparency, fairness, and individual rights.
• Illinois House Bill 3415: States that predictive data analytics used to determine creditworthiness or hiring
decisions may not include information that correlates with the applicant's race or ZIP code
Lecue et al. XAI Tutorial, AAAI 2020
XAI Taxonomy
Guidotti et al. (2018). A survey of methods for explaining black box models. ACM Computing Surveys (CSUR).
Post-hoc Global Explanation: Knowledge Distillation
https://hcixaitutorial.github.io/
Global Surrogate Model / Distillation
• Train a complex model M1 on training data D_T
• For a dataset D_X, get the predictions from M1
• Choose a simple, interpretable model M2
• Train M2 on D_X using the predictions of M1 as labels (not the ground truth!)
• Verify that the accuracy of M2 on D_X is not much worse than that of M1 (see the sketch below)
Dhurandhar et al. Improving Simple Models with Confidence Profiles. NeurIPS 2018
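A minimal sketch of the recipe above in Python with scikit-learn; the random forest as M1, the shallow decision tree as M2, and the synthetic data are illustrative assumptions, not choices from the lecture:

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

# D_T: training data for the complex model M1 (synthetic, for illustration)
X_train, y_train = make_classification(n_samples=2000, n_features=10, random_state=0)

# Train the complex (black-box) model M1 on D_T
m1 = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_train, y_train)

# On a dataset D_X, collect M1's predictions (NOT the ground truth)
X_dx = X_train  # for simplicity, D_X is the training data itself
y_m1 = m1.predict(X_dx)

# Train a simple, interpretable model M2 to mimic M1
m2 = DecisionTreeClassifier(max_depth=4, random_state=0).fit(X_dx, y_m1)

# Fidelity check: how often does M2 agree with M1 on D_X?
fidelity = accuracy_score(y_m1, m2.predict(X_dx))
print(f"Surrogate fidelity to M1: {fidelity:.3f}")
```

The shallow tree M2 can then be read directly (e.g., plotted) as a global approximation of M1's decision logic.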
Global Explanation via Permutation Feature Importance
• Goal: measure the contribution of each feature to the model's predictions
• Idea: permute the values of one feature across the dataset and measure the drop in model performance; a large drop means the model depends on that feature (see the sketch below)
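A minimal sketch of this idea, assuming a fitted classifier `model` and held-out data `X_val`, `y_val` (all hypothetical names); scikit-learn also ships a ready-made version as `sklearn.inspection.permutation_importance`:

```python
import numpy as np
from sklearn.metrics import accuracy_score

def permutation_importance(model, X_val, y_val, n_repeats=5, seed=0):
    rng = np.random.default_rng(seed)
    baseline = accuracy_score(y_val, model.predict(X_val))
    importances = np.zeros(X_val.shape[1])
    for j in range(X_val.shape[1]):
        drops = []
        for _ in range(n_repeats):
            X_perm = X_val.copy()
            # Permute column j, breaking its association with the target
            X_perm[:, j] = rng.permutation(X_perm[:, j])
            drops.append(baseline - accuracy_score(y_val, model.predict(X_perm)))
        # Average drop in accuracy: a large drop means an important feature
        importances[j] = np.mean(drops)
    return importances
```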
LIME
Ribeiro et al. "Why Should I Trust You?": Explaining the Predictions of Any Classifier. KDD 2016
Local Approximation via LIME
Gurumoorthy et al. "Efficient Data Representation by Selecting Prototypes with Importance Weights". ICDM 2019
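LIME explains a single prediction by sampling perturbations around the instance, querying the black-box model on them, weighting the samples by proximity, and fitting a simple weighted linear model whose coefficients act as the local explanation. Below is a minimal sketch of that idea for tabular data; the perturbation scale, kernel width, and Ridge regularizer are illustrative assumptions (the `lime` package implements the full method):

```python
import numpy as np
from sklearn.linear_model import Ridge

def lime_explain(predict_proba, x, n_samples=1000, kernel_width=0.75, seed=0):
    rng = np.random.default_rng(seed)
    # 1. Sample perturbations in a neighborhood of the instance x
    Z = x + rng.normal(scale=0.5, size=(n_samples, x.shape[0]))
    # 2. Query the black-box model on the perturbed points
    y = predict_proba(Z)[:, 1]  # probability of the positive class
    # 3. Weight each sample by its proximity to x (exponential kernel)
    dist = np.linalg.norm(Z - x, axis=1)
    weights = np.exp(-(dist ** 2) / kernel_width ** 2)
    # 4. Fit a weighted, regularized linear model in the neighborhood
    local_model = Ridge(alpha=1.0).fit(Z, y, sample_weight=weights)
    # The coefficients are the local, per-feature explanation
    return local_model.coef_
```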
Inspecting Counterfactuals: Contrastive Features
Dhurandhar et al. Explanations Based on the Missing: Towards Contrastive Explanations with Pertinent Negatives. NeurIPS 2018
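A contrastive (counterfactual) explanation asks: what minimal change to the input would flip the model's decision? Below is a minimal greedy-search sketch of that idea; the step size, iteration budget, and coordinate-wise search are illustrative assumptions and far simpler than the optimization used in the paper above:

```python
import numpy as np

def find_counterfactual(predict_proba, x, target_class, step=0.05, max_iter=300):
    cf = x.copy().astype(float)
    for _ in range(max_iter):
        probs = predict_proba(cf.reshape(1, -1))[0]
        if probs.argmax() == target_class:
            return cf  # prediction has flipped: cf is a counterfactual
        # Greedy step: take the single-feature nudge that most increases
        # the target class probability
        best_cand, best_gain = None, 0.0
        for j in range(len(cf)):
            for delta in (-step, step):
                cand = cf.copy()
                cand[j] += delta
                gain = predict_proba(cand.reshape(1, -1))[0, target_class] - probs[target_class]
                if gain > best_gain:
                    best_cand, best_gain = cand, gain
        if best_cand is None:
            return None  # no single-feature move helps: search is stuck
        cf = best_cand
    return None  # no counterfactual found within the budget
```

The difference between `x` and the returned `cf` highlights the contrastive features: the smallest changes found that would alter the model's decision.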