
CS282BR: Topics in Machine Learning

Interpretability and Explainability

Hima Lakkaraju

Assistant Professor
Harvard Business School + Computer Science
Background

 Strong understanding: linear algebra, probability, algorithms, machine learning (CS181 or equivalent), programming in Python, NumPy, sklearn

 Familiarity with statistics, optimization

2
Motivation

Machine Learning is EVERYWHERE!!

[ Weller 2017 ]
Motivation: Why Model Understanding?

Input → Predictive Model → Prediction = Siberian Husky

Model Understanding: "This model is relying on incorrect features to make this prediction!! Let me fix the model."

Model understanding facilitates debugging.
Motivation: Why Model Understanding?

Defendant Details (Race, Crimes, Gender) → Predictive Model → Prediction = Risky to Release

Model Understanding: "This prediction is biased. Race and gender are being used to make the prediction!!"

Model understanding facilitates bias detection.

[ Larson et al. 2016 ]


Motivation: Why Model Understanding?
Loan Applicant Details → Predictive Model → Prediction = Denied Loan

Model Understanding: "Increase salary by 50K + pay credit card bills on time for next 3 months to get a loan."

Loan Applicant: "I have some means for recourse. Let me go and work on my promotion and pay my bills on time."

Model understanding helps provide recourse to individuals who are adversely affected by model predictions.
Motivation: Why Model Understanding?

Patient Data: 25, Female, Cold; 32, Male, No; 31, Male, Cough; … → Predictive Model → Predictions: Healthy, Sick, Sick, …, Healthy, Healthy, Sick

Model Understanding:
If gender = female, if ID_num > 200, then sick
If gender = male, if cold = true and cough = true, then sick

"This model is using irrelevant features when predicting on the female subpopulation. I should not trust its predictions for that group."

Model understanding helps assess if and when to trust model predictions when making decisions.

7
Motivation: Why Model Understanding?

Patient Data: 25, Female, Cold; 32, Male, No; 31, Male, Cough; … → Predictive Model → Predictions: Healthy, Sick, Sick, …, Healthy, Healthy, Sick

Model Understanding:
If gender = female, if ID_num > 200, then sick
If gender = male, if cold = true and cough = true, then sick

"This model is using irrelevant features when predicting on the female subpopulation. This cannot be approved!"

8
Motivation: Why Model Understanding?

Utility: Debugging; Bias detection; Recourse; If and when to trust model predictions; Vet models to assess suitability for deployment

Stakeholders: End users (e.g., loan applicants); Decision makers (e.g., doctors, judges); Regulatory agencies (e.g., FDA, European Commission); Researchers and engineers
Achieving Model Understanding

Take 1: Build inherently interpretable predictive models

[ Letham and Rudin 2015; Lakkaraju et al. 2016 ]
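As a minimal sketch of Take 1 (not from the slides; the dataset and depth limit below are illustrative assumptions), one could train a shallow decision tree in sklearn and read its learned rules directly, since the fitted model itself is the explanation:

# Minimal sketch of an inherently interpretable model: a shallow decision tree.
# Dataset choice and depth limit are illustrative assumptions.
from sklearn.datasets import load_breast_cancer
from sklearn.tree import DecisionTreeClassifier, export_text

X, y = load_breast_cancer(return_X_y=True, as_frame=True)

# Keep the tree shallow so the learned rules stay human-readable.
model = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X, y)

# The fitted model *is* the explanation: print its decision rules.
print(export_text(model, feature_names=list(X.columns)))

Rule lists in the style of Letham and Rudin (2015) follow the same spirit: the whole model can be read and audited directly.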


Achieving Model Understanding
Take 2: Explain pre-built models in a post-hoc manner

Explainer

[ Ribeiro et al. 2016, 2018; Lakkaraju et al. 2019 ]
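To make Take 2 concrete, here is a hedged sketch in the spirit of LIME (Ribeiro et al. 2016), though not their exact algorithm: perturb one input, query the black box, and fit a proximity-weighted linear surrogate whose coefficients serve as the local explanation. The black-box model, noise scale, and proximity kernel are illustrative assumptions.

# Sketch of a post-hoc, local surrogate explanation (LIME-like, simplified).
# The black-box model, noise scale, and proximity kernel are assumptions.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import Ridge

X, y = load_breast_cancer(return_X_y=True)
black_box = RandomForestClassifier(n_estimators=100, random_state=0).fit(X, y)

x = X[0]                                   # the instance to explain
rng = np.random.default_rng(0)

# 1. Perturb the instance locally with per-feature Gaussian noise.
samples = x + rng.normal(scale=0.1 * X.std(axis=0), size=(500, X.shape[1]))

# 2. Query the black box on the perturbed points.
preds = black_box.predict_proba(samples)[:, 1]

# 3. Weight samples by proximity to x (closer points matter more).
weights = np.exp(-np.linalg.norm(samples - x, axis=1) ** 2 / X.shape[1])

# 4. Fit an interpretable surrogate; its coefficients are the local explanation.
surrogate = Ridge(alpha=1.0).fit(samples, preds, sample_weight=weights)
top = np.argsort(np.abs(surrogate.coef_))[::-1][:5]
print("Locally most influential features:", top, surrogate.coef_[top])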


Inherently Interpretable Models vs.
Post hoc Explanations

Example: In certain settings, accuracy-interpretability trade-offs may exist.

[ Cireşan et al. 2012, Caruana et al. 2006, Frosst et al. 2017, Stewart 2020 ]
Inherently Interpretable Models vs.
Post hoc Explanations

 Can build interpretable + accurate models

 Complex models might achieve higher accuracy
Inherently Interpretable Models vs.
Post hoc Explanations
Sometimes, you don’t have enough data to build your model
from scratch.

And, all you have is a (proprietary) black box!

[ Ribeiro et al. 2016 ]


Inherently Interpretable Models vs.
Post hoc Explanations

If you can build an interpretable model which is also adequately accurate for your setting, DO IT!

Otherwise, post hoc explanations come to the rescue!

Let’s get into some details!
Next Up!

 Define and evaluate interpretability
 (somewhat!)

 Taxonomy of interpretability evaluation

 Taxonomy of interpretability based on applications/tasks

 Taxonomy of interpretability based on methods

[Doshi-Velez & Kim, 2017]
17
Motivation for Interpretability

 ML systems are being deployed in complex, high-stakes settings

 Accuracy alone is no longer enough

 Auxiliary criteria are important:
 Safety
 Nondiscrimination
 Right to explanation

18
Motivation for Interpretability

 Auxiliary criteria are often hard to quantify (completely)
 E.g., impossible to enumerate all scenarios violating the safety of an autonomous car

 Fallback option: interpretability
 If the system can explain its reasoning, we can verify if that reasoning is sound w.r.t. auxiliary criteria

19
Prior Work: Defining and Measuring
Interpretability
 Little consensus on what interpretability is and how to evaluate it

 Interpretability evaluation typically falls into:
 Evaluate in the context of an application
 Evaluate via a quantifiable proxy

20
Prior Work: Defining and Measuring
Interpretability
 Evaluate in the context of an application
 If a system is useful in a practical application or a simplified version of it, it must be interpretable

 Evaluate via a quantifiable proxy
 Claim some model class is interpretable and present algorithms to optimize within that class
 E.g., rule lists

You will know it when you see it!

21
Lack of Rigor?

 Yes and No
 Previous notions are reasonable
 Important to formalize these notions!!!

 However,
 Are all models in all “interpretable” model classes equally interpretable?
 Model sparsity allows for comparison
 How to compare a linear model with a decision tree?
 Do all applications have the same interpretability needs?


22
What is Interpretability?

 Defn: the ability to explain, or to present in understandable terms, to a human

 No clear answers in psychology to:
 What constitutes an explanation?
 What makes some explanations better than others?
 When are explanations sought?

23
When and Why Interpretability?

 Not all ML systems require interpretability
 E.g., ad servers, postal code sorting
 No human intervention

 No explanation needed because:
 No consequences for unacceptable results
 Problem is well studied and validated in real-world applications → trust the system’s decision

When do we need explanation then?

24
When and Why Interpretability?

 Incompleteness in problem formalization
 Hinders optimization and evaluation

 Incompleteness ≠ Uncertainty
 Uncertainty can be quantified
 E.g., trying to learn from a small dataset (uncertainty)

25
Incompleteness: Illustrative Examples

 Scientific Knowledge
 E.g., understanding the characteristics of a large dataset
 Goal is abstract

 Safety
 End-to-end system is never completely testable
 Not possible to check all possible inputs

 Ethics
 Guard against certain kinds of discrimination which are too abstract to be encoded
 No idea about the nature of discrimination beforehand

26
Taxonomy of Interpretability Evaluation

The claim of the research should match the type of the evaluation!

27
Application-grounded evaluation

 Real humans (domain experts), real tasks

 Domain experts experiment with the exact application task

 Domain experts experiment with a simpler or partial task
 Shortens experiment time
 Increases number of potential subjects

 Typical in HCI and visualization communities


28
Human-grounded evaluation

 Real humans, simplified tasks
 Can be completed with lay humans
 Larger pool, less expensive

 Potential experiments
 Pairwise comparisons
 Simulate the model output
 What changes should be made to the input to change the output? (see the sketch below)

29
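The last experiment type above can be made concrete with a small, hypothetical sketch: given a trained model and one input, search for a single-feature change that flips the prediction. The model, dataset, and one-standard-deviation step size are illustrative assumptions, not a prescribed protocol.

# Sketch: search for a single-feature change that flips the model's prediction.
# Model, data, and the one-standard-deviation step are illustrative assumptions.
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression

X, y = load_breast_cancer(return_X_y=True)
model = LogisticRegression(max_iter=5000).fit(X, y)

x = X[0].copy()
original = model.predict([x])[0]

# Nudge each feature up or down by one standard deviation and report
# any single-feature change that flips the prediction.
for j in range(X.shape[1]):
    for delta in (X[:, j].std(), -X[:, j].std()):
        candidate = x.copy()
        candidate[j] += delta
        if model.predict([candidate])[0] != original:
            print(f"Nudging feature {j} by {delta:+.2f} flips the prediction")
            break  # move on to the next feature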
Functionally-grounded evaluation

 No humans, just proxies
 Appropriate when a class of models has already been validated
 E.g., decision trees
 When a method is not yet mature
 When human subject experiments are unethical
 What proxies to use?

 Potential experiments
 Complexity (of a decision tree) compared to other models of the same (or similar) class
 How many levels? How many rules? (see the sketch below)

30
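A brief sketch of such proxies (one plausible choice, not a prescribed metric): measure a decision tree's depth and leaf count, and a sparse linear model's number of nonzero coefficients. The dataset, depth limit, and regularization strength are illustrative assumptions.

# Sketch: simple complexity proxies for already-validated model classes.
# Data, depth limit, and regularization strength are illustrative assumptions.
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier

X, y = load_breast_cancer(return_X_y=True)

# Proxy 1: depth and number of leaves of a decision tree (one rule per leaf).
tree = DecisionTreeClassifier(max_depth=4, random_state=0).fit(X, y)
print("tree depth:", tree.get_depth(), "| rules (leaves):", tree.get_n_leaves())

# Proxy 2: sparsity of a linear model (number of nonzero coefficients).
sparse_lr = LogisticRegression(penalty="l1", C=0.05, solver="liblinear").fit(X, y)
print("nonzero coefficients:", int(np.count_nonzero(sparse_lr.coef_)))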
Open Problems: Design Issues

 What proxies are best for what real-world applications?

 What factors to consider when designing simpler tasks in place of real-world tasks?

31
Taxonomy based on applications/tasks

 Global vs. Local
 High-level patterns vs. specific decisions

 Degree of Incompleteness
 What part of the problem is incomplete? How incomplete is it?
 Incomplete inputs, constraints, or costs?

 Time Constraints
 How much time can the user spend to understand the explanation?

32
Taxonomy based on applications/tasks

 Nature of User Expertise
 How experienced is the end user?
 Experience affects how users process information
 E.g., domain experts can handle detailed, complex explanations compared to opaque, smaller ones

 Note: These taxonomies are constructed based on intuition and are not data or evidence driven. They must be treated as hypotheses.

33
Taxonomy based on methods

 Basic units of explanation:
 Raw features? E.g., pixel values
 Semantically meaningful? E.g., objects in an image
 Prototypes?

 Number of basic units of explanation:
 How many does the explanation contain?
 How do various types of basic units interact?
 E.g., prototype vs. feature

34
Taxonomy based on methods

 Level of compositionality:
 Are the basic units organized in a structured way?
 How do the basic units compose to form higher-order units?

 Interactions between basic units:
 Combined in linear or non-linear ways?
 Are some combinations easier to understand?

 Uncertainty:
 What kind of uncertainty is captured by the methods?
 How easy is it for humans to process uncertainty?
35
Questions??
Relevant Conferences to Explore

 ICML
 NeurIPS
 ICLR
 UAI
 AISTATS
 KDD
 AAAI
 FAccT
 AIES
 CHI
 CSCW
 HCOMP

37
Breakout Groups

 Say hi to your neighbors! Introduce yourselves!

 What topics are you most excited about learning as part of this course?

 Are you convinced that model interpretability/explainability is important?

 Do you think we can really interpret/explain models (correctly)?

 What is your take on inherently interpretable models vs. post hoc explanations? Would you favor one over the other? Why?

38
