Copyright Notice
These slides are distributed under the Creative Commons License.
DeepLearning.AI makes these slides available for educational purposes. You may not
use or distribute these slides for commercial purposes. You may make copies of these
slides and use or distribute them for educational purposes as long as you
cite DeepLearning.AI as the source of the slides.
For the rest of the details of the license, see
https://creativecommons.org/licenses/by-sa/2.0/legalcode
Interpretability
Welcome
Explainable AI
Responsible AI
● The development of AI is creating new opportunities to improve people's lives
● It also raises new questions about the best way to build the following into AI systems:
○ Fairness: ensure we are working towards systems that are fair and inclusive to all users
○ Explainability: understanding how and why ML models make certain predictions
○ Privacy: training models using sensitive data needs privacy-preserving safeguards
○ Security: identifying potential threats can help keep AI systems safe and secure
● Explainability helps ensure fairness
Explainable Artificial Intelligence (XAI)
The field of XAI allows ML systems to be more transparent, providing
explanations of their decisions at some level of detail.
These explanations are important:
● To ensure algorithmic fairness
● To identify potential bias and problems in the training data
● To ensure that algorithms/models work as expected
Need for Explainability in AI
1. Models with high sensitivity, including natural language networks, can generate
wildly wrong results
2. Attacks
3. Fairness
4. Reputation and Branding
5. Legal and regulatory concerns
6. Customers and other stakeholders may question or challenge model decisions
Deep Neural Networks (DNNs) can be fooled
DNNs can be fooled into misclassifying inputs with no resemblance to the true category.
Deep Neural Networks (DNNs) can be fooled
(Figure: a classic adversarial example. An image classified as “Panda” with 57.7% confidence, plus a small perturbation ε (itself classified as “Nematode” with 8.2% confidence), is classified as “Gibbon” with 99.3% confidence.)
Interpretability
Model Interpretation Methods
What is interpretability?
“(Models) are interpretable if their operations
can be understood by a human, either through
introspection or through a produced explanation.”
“Explanation and justification in machine learning: A survey”
- O. Biran, C. Cotton
What are the requirements?
You should be able to query the model to understand:
● Why did the model behave in a certain way?
● How can we trust the predictions made by the model?
● What information can the model provide to avoid prediction errors?
Categorizing Model Interpretation Methods
Model interpretation methods can be categorized along three dimensions:
● Intrinsic or Post-Hoc?
● Model Specific or Model Agnostic?
● Local or Global?
Intrinsic or Post-Hoc?
● Intrinsic interpretability: the model itself is intrinsically interpretable
● Examples: linear models, tree-based models, lattice models, etc.
Intrinsic or Post-Hoc?
● Post-hoc methods treat models as black boxes
● Agnostic to model architecture
● Extract relationships between feature inputs and model predictions
● Applied after training
Types of results produced by Interpretation Methods
● Feature summary statistics
● Feature summary visualization
● Model internals
● Data points
Model Specific or Model Agnostic
Model Specific
● Limited to specific model classes
● Example: interpretation of regression weights in linear models
● Intrinsically interpretable model techniques are model specific
● Tools designed for particular model architectures
Model Agnostic
● Applied to any model after it is trained
● Do not have access to the internals of the model
● Work by analyzing pairs of feature inputs and model outputs
Interpretability of ML Models
Interpretation methods can be model agnostic or model specific, and can provide local or global explanations.
Local or Global?
● Local: the interpretation method explains an individual prediction.
● Feature attribution is the identification of relevant features as an
explanation for a model's prediction.
Local or Global?
● Global: the interpretation method explains the entire model behaviour
● Example: a feature attribution summary for the entire test data set
Interpretability
Intrinsically Interpretable Models
Intrinsically Interpretable Models
● How the model works is self-evident
● Many classic models are highly interpretable
● Neural networks look like “black boxes”
● Newer architectures focus on designing for interpretability
Monotonicity improves interpretability
(Figure: three example response curves, two monotonic and one not monotonic.)
Interpretable Models
| Algorithm           | Linear | Monotonic | Feature Interaction | Task        |
|---------------------|--------|-----------|---------------------|-------------|
| Linear regression   | Yes    | Yes       | No                  | regr        |
| Logistic regression | No     | Yes       | No                  | class       |
| Decision trees      | No     | Some      | Yes                 | class, regr |
| RuleFit             | Yes*   | No        | Yes                 | class, regr |
| K-nearest neighbors | No     | No        | No                  | class, regr |
| TF Lattice          | Yes*   | Yes       | Yes                 | class, regr |
Model Architecture Influence on Interpretability
(Figure: interpretability vs. accuracy trade-off. Roughly in order of decreasing interpretability and increasing accuracy: Linear Regression, Decision Trees, TF Lattice, K-nearest neighbours, Random Forests, SVMs, Neural Networks.)
Classics: Linear Regression
Interpretation from Weights
Linear models have an easy-to-understand interpretation based on their weights:
● Numerical features: an increase of one unit in a feature changes the
prediction by the value of the corresponding weight.
● Binary features: changing the feature from 0 to 1 changes the
prediction by the value of the feature's weight.
● Categorical features: with one-hot encoding, each category affects only its own weight.
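As a quick illustration (a minimal sketch with made-up feature names, not from the slides), the weights of a fitted linear model can be read directly as per-unit effects on the prediction:

```python
# Minimal sketch: interpreting linear regression weights with scikit-learn.
# Feature names and the data-generating process are illustrative.
import numpy as np
from sklearn.linear_model import LinearRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))  # columns: temp, humidity, is_holiday
y = 3.0 * X[:, 0] - 1.5 * X[:, 1] + 0.5 * X[:, 2] + rng.normal(scale=0.1, size=200)

model = LinearRegression().fit(X, y)
for name, weight in zip(["temp", "humidity", "is_holiday"], model.coef_):
    # Numerical feature: +1 unit shifts the prediction by `weight`.
    # Binary feature: flipping 0 -> 1 shifts the prediction by `weight`.
    print(f"{name}: {weight:+.2f}")
print("intercept:", round(model.intercept_, 2))
```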
Feature Importance
● Relevance of a given feature for generating model results
● The calculation is model dependent
● Example: the t-statistic of each weight in a linear regression model
More advanced models: TensorFlow Lattice
● Overlaps a grid onto the feature
space and learns values for the
output at the vertices of the
grid
● Linearly interpolates from the
lattice values surrounding a
point
More advanced models: TensorFlow Lattice
● Enables you to inject domain
knowledge into the learning
process through common-sense
or policy-driven shape
constraints
● Set constraints such as
monotonicity, convexity, and how
features interact
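A minimal sketch of what such a constraint looks like in code, assuming the tensorflow_lattice package and two features already scaled to [0, 1] (the feature ranges and lattice sizes here are illustrative, not from the slides):

```python
# Minimal sketch: a calibrated lattice model whose output is constrained to be
# non-decreasing in both inputs (monotonicity shape constraints).
import numpy as np
import tensorflow as tf
import tensorflow_lattice as tfl

inputs = [tf.keras.layers.Input(shape=(1,)) for _ in range(2)]

# Piecewise-linear calibrators map each raw feature into the lattice's input range.
calibrated = [
    tfl.layers.PWLCalibration(
        input_keypoints=np.linspace(0.0, 1.0, num=5),
        output_min=0.0,
        output_max=4.0,                 # lattice_sizes=5 expects inputs in [0, 4]
        monotonicity='increasing',      # constraint on the calibrator
    )(inp)
    for inp in inputs
]

lattice_output = tfl.layers.Lattice(
    lattice_sizes=[5, 5],
    monotonicities=['increasing', 'increasing'],  # constraint on the lattice
    output_min=0.0,
    output_max=1.0,
)(calibrated)

model = tf.keras.Model(inputs=inputs, outputs=lattice_output)
model.compile(optimizer='adam', loss='mse')
# model.fit([x1, x2], y, ...) would then train as usual.
```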
TensorFlow Lattice: Accuracy
Accuracy
● TensorFlow Lattice achieves
accuracies comparable to
neural networks
● TensorFlow Lattice provides
greater interpretability
TensorFlow Lattice: Issues
Dimensionality
● The number of parameters of a lattice layer increases exponentially
with the number of input features: with 2 vertices per dimension, d features
already require 2^d lattice parameters (about a million at d = 20)
● Very rough rule: fewer than 20 features is usually fine without ensembling
Understanding Model Predictions
Model Agnostic Methods
Model Agnostic Methods
These methods separate explanations from the machine learning model.
Desired characteristics:
● Model flexibility
● Explanation flexibility
● Representation flexibility
Model Agnostic Methods
● Partial Dependence Plots
● Individual Conditional Expectation
● Accumulated Local Effects
● Permutation Feature Importance
● Global Surrogate
● Local Surrogate (LIME)
● Shapley Values
● SHAP
Understanding Model Predictions
Partial Dependence Plots
Partial Dependence Plots (PDP)
A partial dependence plot shows:
● The marginal effect one or two features have on the model result
● Whether the relationship between the targets and the feature is
linear, monotonic, or more complex
Partial Dependence Plots
The partial function $\hat{f}_{x_S}$ is estimated by calculating averages over the training data:

$$\hat{f}_{x_S}(x_S) = \frac{1}{n} \sum_{i=1}^{n} \hat{f}\left(x_S, x_C^{(i)}\right)$$

where $x_C^{(i)}$ are the actual values of the remaining features for instance $i$.
Partial Dependence Plots: Examples
PDP plots for a linear regression
model trained on a bike rentals
dataset to predict the number of
bikes rented
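A minimal sketch of producing such plots with scikit-learn (version 1.0 or later); a synthetic regression problem stands in for the bike-rentals dataset:

```python
# Minimal sketch: partial dependence plots with scikit-learn.
import matplotlib.pyplot as plt
from sklearn.datasets import make_regression
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.inspection import PartialDependenceDisplay

X, y = make_regression(n_samples=500, n_features=4, random_state=0)
model = GradientBoostingRegressor(random_state=0).fit(X, y)

# Marginal effect of features 0 and 1 (and their 2-way interaction) on the prediction.
PartialDependenceDisplay.from_estimator(model, X, features=[0, 1, (0, 1)])
plt.show()
```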
PDP for Categorical Features
(Figure: PDP for the categorical feature Season (Spring, Summer, Fall, Winter) on the predicted number of bike rentals.)
Advantages of PDP
● Computation is intuitive
● If the feature whose PDP is calculated is uncorrelated with the other features, the PDP
perfectly represents how the feature influences the prediction on average
● Easy to implement
Disadvantages of PDP
● Realistic maximum number of features in PDP is 2
● PDP assumes that feature values have no interactions
Understanding Model Predictions
Permutation Feature Importance
Permutation Feature Importance
Permutation feature importance measures the increase in prediction error after
permuting a feature's values.
Feature is important if:
● Shuffling its values increases model error
Feature is unimportant if:
● Shuffling its values leaves model error unchanged
Permutation Feature Importance
● Estimate the original model error
● For each feature:
○ Permute the feature's values in the data to break its association with
the true outcome
○ Estimate the error based on the predictions on the permuted data
○ Calculate the permutation feature importance from the increase in error
● Sort features by descending feature importance
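A minimal sketch of this procedure using scikit-learn's permutation_importance (the dataset and model here are illustrative):

```python
# Minimal sketch: permutation feature importance on held-out, labeled data.
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.inspection import permutation_importance
from sklearn.model_selection import train_test_split

X, y = load_breast_cancer(return_X_y=True, as_frame=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
model = RandomForestClassifier(random_state=0).fit(X_train, y_train)

# Shuffle each feature n_repeats times and measure the drop in test score.
result = permutation_importance(model, X_test, y_test, n_repeats=10, random_state=0)

# Sort features by descending mean importance.
for i in result.importances_mean.argsort()[::-1][:5]:
    print(f"{X.columns[i]}: {result.importances_mean[i]:.3f} ± {result.importances_std[i]:.3f}")
```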
Advantages of Permutation Feature Importance
● Nice interpretation: shows the increase in model error when the
feature's information is destroyed
● Provides global insight into the model's behaviour
● Does not require retraining the model
Disadvantages of Permutation Feature Importance
● It is unclear whether training or test data should be used
● Can be biased since it can create unlikely feature combinations in case
of strongly correlated features
● You need access to the labeled data
Understanding Model Predictions
Shapley Values
Shapley Value
● The Shapley value is a method from game theory for assigning payouts to players
depending on their contribution to the total payout
● Applying this to ML, we define:
○ A feature is a "player" in the game
○ The prediction is the "payout"
○ The Shapley value tells us how the "payout" can be fairly distributed
among the features, i.e., each feature's contribution to the prediction
Shapley Value: Example
Suppose you trained an ML model to predict apartment prices. You need to explain why the
model predicts €300,000 for a certain apartment (50 m², 2nd floor).
The average prediction over all apartments is €310,000.
Shapley Value
| Term in Game Theory | Relation to ML | Relation to House Prices Example |
|---------------------|----------------|----------------------------------|
| Game | Prediction task for a single instance of the dataset | Prediction of the house price for a single instance |
| Gain | Actual prediction for the instance - average prediction for all instances | Prediction for the house (€300,000) - average prediction (€310,000) = -€10,000 |
| Players | Feature values that contribute to the prediction | 'park=nearby', 'cat=banned', 'area=50m2', 'floor=2nd' |
Shapley Value
Goal: explain the difference between the actual prediction (€300,000) and the average prediction
(€310,000): a difference of -€10,000.
One possible explanation:

| Feature     | Contribution |
|-------------|--------------|
| park-nearby | +€30,000     |
| size-50     | +€10,000     |
| floor-2nd   | €0           |
| cat-banned  | -€50,000     |
| Total       | -€10,000     |

The contributions sum to -€10,000, which equals the final prediction minus the average prediction (€300,000 - €310,000).
Advantages of Shapley Values
● Based on a solid theoretical foundation
● Satisfies the Efficiency, Symmetry, Dummy, and Additivity properties
● The value is fairly distributed among all features
● Enables contrastive explanations
Disadvantages of Shapley Values
● Computationally expensive
● Can be easily misinterpreted
● Always uses all the features, so not good for explanations of only a few
features.
● No prediction model. Can’t be used for “what if” hypothesis testing.
● Does not work well when features are correlated
Understanding Model Predictions
SHAP (SHapley Additive exPlanations)
SHAP
● SHAP (SHapley Additive exPlanations) is a framework for Shapley Values which
assigns each feature an importance value for a particular prediction
● Includes extensions for:
○ TreeExplainer: high-speed exact algorithm for tree ensembles
○ DeepExplainer: high-speed approximation algorithm for SHAP values
in deep learning models
○ GradientExplainer: combines ideas from Integrated Gradients, SHAP,
and SmoothGrad into a single expected value equation
○ KernelExplainer: uses a specially-weighted local linear regression to
estimate SHAP values for any model
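A minimal sketch of using the shap package with a tree-based regressor (the dataset and model here are illustrative, not from the slides):

```python
# Minimal sketch: SHAP values for a tree ensemble via TreeExplainer.
import shap
from sklearn.datasets import load_diabetes
from sklearn.ensemble import RandomForestRegressor

X, y = load_diabetes(return_X_y=True, as_frame=True)
model = RandomForestRegressor(n_estimators=50, random_state=0).fit(X, y)

explainer = shap.TreeExplainer(model)            # exact, fast algorithm for tree ensembles
shap_values = explainer.shap_values(X.iloc[:200])  # per-prediction feature attributions

shap.summary_plot(shap_values, X.iloc[:200])     # global summary of the attributions
```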
SHAP Explanation Force Plots
● Shapley Values can be visualized as forces
● Prediction starts from the baseline (Average of all predictions)
● Each feature value is a force that increases (red) or decreases (blue) the
prediction
SHAP Summary Plot
SHAP Dependence Plot with Interaction
Understanding Model Predictions
Testing Concept Activation Vectors
Testing Concept Activation Vectors (TCAV)
Concept Activation Vectors (CAVs)
● Represent a neural network's internal state in terms of human-friendly concepts
● Defined using sets of examples that illustrate the concept
Example Concepts
Understanding Model Predictions
LIME
Local Interpretable Model-agnostic Explanations (LIME)
● Implements local surrogate models - interpretable models that are used
to explain individual predictions
● Using data points close to the individual prediction, LIME trains an
interpretable model to approximate the predictions of the real model
● The new interpretable model is then used to interpret the real result
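A minimal sketch using the lime package on tabular data (the dataset and model are illustrative):

```python
# Minimal sketch: explaining one prediction with a local surrogate via LIME.
from lime.lime_tabular import LimeTabularExplainer
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier

data = load_breast_cancer()
model = RandomForestClassifier(random_state=0).fit(data.data, data.target)

explainer = LimeTabularExplainer(
    data.data,
    feature_names=data.feature_names,
    class_names=data.target_names,
    mode="classification",
)

# Fit an interpretable (sparse linear) surrogate around one instance.
exp = explainer.explain_instance(data.data[0], model.predict_proba, num_features=5)
print(exp.as_list())  # (feature, weight) pairs for the local explanation
```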
Understanding Model Predictions
AI Explanations
Google Cloud AI Explanations for AI Platform
● Explain why an individual data point received that prediction
● Debug odd behavior from a model
● Refine a model or data collection process
● Verify that the model's behavior is acceptable
● Present the gist of the model
AI Explanations: Feature Attributions
Tabular Data Example
AI Explanations: Feature Attributions
Image Data Examples
AI Explanations: Feature Attribution Methods
AI Explanations: Integrated Gradients
A gradient-based method to efficiently compute feature attributions with the same axiomatic properties as Shapley values.
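A minimal sketch of the underlying integrated-gradients idea in plain TensorFlow (not the managed AI Explanations service); model, baseline, image, and target_class are placeholders supplied by the caller:

```python
# Minimal sketch: integrated gradients for an image classifier.
# Average the gradients along a straight path from a baseline to the input,
# then scale by (input - baseline).
import tensorflow as tf

def integrated_gradients(model, baseline, image, target_class, steps=50):
    # Interpolate between the baseline and the input: shape (steps + 1, H, W, C).
    alphas = tf.reshape(tf.linspace(0.0, 1.0, steps + 1), (-1, 1, 1, 1))
    interpolated = baseline + alphas * (image - baseline)

    with tf.GradientTape() as tape:
        tape.watch(interpolated)
        probs = model(interpolated)[:, target_class]
    grads = tape.gradient(probs, interpolated)

    # Riemann approximation of the path integral (trapezoidal rule).
    avg_grads = tf.reduce_mean((grads[:-1] + grads[1:]) / 2.0, axis=0)
    return (image - baseline) * avg_grads  # per-pixel attributions
```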
AI Explanations: XRAI (eXplanation with Ranked Area Integrals)
XRAI assesses overlapping regions of the image to create a saliency map
● Highlights relevant regions of the image rather than pixels
● Aggregates the pixel-level attribution within each segment and ranks
the segments
AI Explanations: XRAI (eXplanation with Ranked Area Integrals)