LRP PPT

The document discusses Layer-Wise Relevance Propagation (LRP) as an explainable machine learning technique that effectively propagates predictions backward through models, providing insights into feature relevance. It outlines various LRP rules, their applications, and the properties of good explanation techniques, emphasizing LRP's ability to produce understandable and faithful explanations. While LRP shows promise and can be efficiently implemented, the document also notes limitations, including a lack of empirical evidence and the need for human assessment of explanation quality.


Contents

● Introduction
● Layer-Wise Relevance Propagation (LRP)
● Which LRP Rule for Which Layer?
● Conclusion
● Discussion
Introduction
Background & Motivation

● The rise of large datasets is a main driver of the success of machine learning techniques in both
industrial and scientific applications
● Large datasets can be plagued by spurious correlations, which lead to “Clever Hans” predictors

(Figure: all models classify correctly, but only (i) generalizes)
Explainable Machine Learning

● Feature selection is one solution: only present the model with “good” input features
○ This can be difficult to apply in practice
○ Consider image recognition, where individual pixels do not have fixed roles
● Explainable machine learning takes the opposite approach: first train the model, then examine which
features the model actually learned
○ We do not care about feature selection during training
○ “Bad” features can be removed later, and the model can be retrained on cleaned data
● Taylor Decomposition is a foundational explainable ML technique related to LRP
Taylor Decomposition

● Produce explanations by performing a Taylor expansion of the prediction 𝑓(𝑥) at some nearby
reference point
● First-order terms (the elements of the sum) quantify the relevance of each input feature, forming
the explanation

(Figure: Taylor expansion of 𝑓(𝑥) around a reference point)
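The expansion described above can be written as follows (standard first-order Taylor expansion, with 𝑥̃ the reference point; the first-order terms give the relevance scores 𝑅𝑖):

```latex
f(x) = f(\widetilde{x})
     + \sum_i \underbrace{(x_i - \widetilde{x}_i)\,[\nabla f(\widetilde{x})]_i}_{R_i}
     + \dots
```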
Problems with Taylor Decomposition

● Unstable when applied to DNNs


○ Shattered gradients: while 𝑓(𝑥) is generally accurate, the gradient is often very noisy, containing little
meaningful information
○ Adversarial examples: small perturbations of the input can cause dramatic changes to the function value
● It can be difficult to choose a meaningful reference point
○ For example, the origin (𝑥̃ = 0) may be far away from any real input
Alternative Explanation Techniques

● Integrate a large number of local gradient estimates


● Replace the gradient with a coarser estimate of effect, such as model response to patch-like
perturbations
● Optimization techniques involving a local surrogate model or the explanation itself
● Common Problem: these techniques are all computationally expensive, involving multiple
network evaluations
Four Properties of Good Explanation Techniques

● Conservation: if we find explainable evidence in the output, it must show up somewhere in the
input features (no loss of evidence)
● Positivity: either a feature is relevant (positive) or irrelevant (zero)
● Continuity: if two inputs are almost the same, and the prediction is almost the same, then the
explanation should be almost the same
● Selectivity: models must agree with explanation; removing evidence from input should reduce
confidence in the output

LRP satisfies all of these properties; the previous techniques do not


Layer-Wise Relevance Propagation (LRP)
LRP Explained

● LRP is an explanation technique which propagates the prediction backwards using purposely
designed local propagation rules
● LRP is subject to the conservation property
○ What has been received by a neuron must be redistributed to the lower layer in equal amount
○ (It’s also subject to the other properties, but this one is explicitly mentioned)
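The propagation rule that the next slide refers to is shown as an image in the original deck; its standard form is:

```latex
R_j = \sum_k \frac{z_{jk}}{\sum_j z_{jk}}\, R_k
```

Because each 𝑅𝑘 is divided among the lower-layer neurons by the normalized contributions 𝑧𝑗𝑘 / Σ𝑗 𝑧𝑗𝑘, summing over all 𝑗 recovers Σ𝑘 𝑅𝑘, which is exactly the conservation property.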
LRP Explained (ii)

Referring to the previous equation:

● 𝒛𝒋𝒌: the quantity which models the extent to which neuron j has contributed to making neuron k
relevant
● Σ𝒋 𝒛𝒋𝒌: the denominator, which serves to enforce the conservation property
LRP Rules for Deep Rectifier Networks

● Question: How do we determine 𝑧𝑗𝑘 (the contribution of neuron 𝑗 to 𝑅𝑘)?


● Answer: With LRP rules!
● We’ll talk about three in the following slides:
○ Basic (LRP-0)
○ Epsilon (LRP-𝟄)
○ Gamma (LRP-𝞬)
● Note that we’re working in the context of ReLU activations:
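In this setting a neuron computes the following (standard formulation from the LRP literature, using the convention 𝑎₀ = 1 so that 𝑤₀ₖ plays the role of the bias):

```latex
a_k = \max\!\Big(0,\ \sum_{0,j} a_j w_{jk}\Big)
```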
LRP Rules: LRP-0

● Redistribute relevance in proportion to the contributions of each input to the neuron activation
● Note 𝑧𝑗𝑘 = 𝑎𝑗𝑤𝑗𝑘
● Properties:
○ If 𝑎𝑗 = 0 or 𝑤𝑗𝑘 = 0, then 𝑅𝑗 = 0, which allows for compatibility with concepts such as zero
weight, deactivation, or absent connections
○ Uniform application produces an explanation equivalent to (𝐺𝑟𝑎𝑑𝑖𝑒𝑛𝑡 ✕ 𝐼𝑛𝑝𝑢𝑡), which is
undesirable since gradients are noisy
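For reference, LRP-0 is obtained by substituting 𝑧𝑗𝑘 = 𝑎𝑗𝑤𝑗𝑘 into the generic propagation rule (the slide shows this equation as an image):

```latex
R_j = \sum_k \frac{a_j w_{jk}}{\sum_{0,j} a_j w_{jk}}\, R_k
```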
LRP Rules: LRP-𝟄

● This rule aims to solve the problem of gradient noise by introducing a small positive term, 𝟄, to the
denominator
● 𝟄 diminishes relevance scores, aiming to absorb some relevance when contributions to neuron 𝑘
are weak, contradictory, etc.
● As 𝟄 becomes larger, only the most salient explanation factors are preserved
● Result: sparser explanations in terms of input features, and less noise
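The rule, shown as an image in the slides, is standardly written with ε added to the denominator:

```latex
R_j = \sum_k \frac{a_j w_{jk}}{\epsilon + \sum_{0,j} a_j w_{jk}}\, R_k
```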
LRP Rules: LRP-𝞬

● This rule aims to reduce noise and improve stability by favoring the effect of positive contributions
over negative ones with the introduction of a 𝞬 parameter applied to 𝑤𝑗𝑘.
● As 𝞬 increases, negative contributions disappear
● Limits how large positive and negative contributions can grow during propagation, improving
stability
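The rule's equation (an image in the slides) is standardly written with 𝑤⁺ = max(0, 𝑤) denoting the positive part of the weights:

```latex
R_j = \sum_k \frac{a_j\,(w_{jk} + \gamma w_{jk}^{+})}{\sum_{0,j} a_j\,(w_{jk} + \gamma w_{jk}^{+})}\, R_k
```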
Bonus LRP Rule: LRP-𝞪𝞫

● Like LRP-𝞬, this rule aims to treat positive and negative contributions asymmetrically
● Applies two parameters, 𝞪 and 𝞫, to positive and negative contributions, respectively
● Subject to conservation constraint 𝞪 = 𝞫 + 1
● Using LRP-𝞬 where 𝞬 = ∞ causes LRP-𝞬 to become equivalent to LRP-𝞪𝞫 where 𝞪 = 1 and 𝞫
= 0 (among other rules not covered in this paper)
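The LRP-𝞪𝞫 rule (shown as an image in the slides) is standardly written with the positive and negative parts of 𝑎𝑗𝑤𝑗𝑘 treated separately:

```latex
R_j = \sum_k \left(
  \alpha\,\frac{(a_j w_{jk})^{+}}{\sum_{0,j} (a_j w_{jk})^{+}}
  \;-\;
  \beta\,\frac{(a_j w_{jk})^{-}}{\sum_{0,j} (a_j w_{jk})^{-}}
\right) R_k
```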
(Figure: the LRP rule applied to the weights)
Implementing LRP Efficiently

● Consider the generic LRP rule (pictured right)


● For any layer 𝑗, 𝑅𝑗 can be computed in four steps:

● Note the third step is equivalent to a gradient computation, where 𝒂 is the vector of lower-layer
activations:
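The four steps (shown as equations in the slide images) can be written as follows, with ρ denoting the rule's weight transform (identity for LRP-0/𝟄):

```latex
z_k = \epsilon + \textstyle\sum_{0,j} a_j\,\rho(w_{jk})
  \quad \text{(step 1: forward pass)}\\
s_k = R_k / z_k
  \quad \text{(step 2: element-wise division)}\\
c_j = \textstyle\sum_k \rho(w_{jk})\, s_k
  \quad \text{(step 3: backward pass)}\\
R_j = a_j\, c_j
  \quad \text{(step 4: element-wise product)}
```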
Implementing LRP Efficiently (in code)
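The slide's code appears only as an image in the original deck. Below is a minimal NumPy sketch of the four-step procedure for a single dense layer; the function name `lrp_dense` and the `rho` parameter are illustrative choices of ours, not names from the slides:

```python
import numpy as np

def lrp_dense(a, W, R_upper, rho=lambda w: w, eps=1e-9):
    """Propagate relevance through one dense layer using the four steps.

    a       : lower-layer activations, shape (j,)
    W       : weight matrix, shape (j, k)
    R_upper : relevance of the upper layer, shape (k,)
    rho     : weight transform selecting the rule (identity -> LRP-0/eps)
    """
    Wr = rho(W)            # apply the rule's weight transform
    z = eps + a @ Wr       # step 1: forward pass
    s = R_upper / z        # step 2: element-wise division
    c = s @ Wr.T           # step 3: backward pass (a gradient computation)
    return a * c           # step 4: element-wise product

# Example: LRP-gamma via the weight transform rho(w) = w + gamma * max(0, w)
a = np.array([1.0, 2.0])
W = np.array([[1.0, 0.5],
              [0.2, 1.0]])
R_upper = np.maximum(0, a @ W)   # take the ReLU outputs as upper relevance
R_lower = lrp_dense(a, W, R_upper,
                    rho=lambda w: w + 0.25 * np.clip(w, 0, None))
```

In deep-learning frameworks, step 3 is usually implemented with automatic differentiation rather than an explicit transpose, which is what makes the procedure applicable to convolutional and other layer types with little extra code.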
LRP as a Deep Taylor Decomposition

● Deep Taylor Decomposition views LRP as a succession of Taylor expansions performed at each
neuron
● Treat the relevance score 𝑅𝑘 as a function of lower-level activations (𝑎𝑗 )𝑗 denoted by the vector 𝒂,
and then perform a first-order Taylor expansion of 𝑅𝑘(𝒂) at some reference point in the space of
activations:
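The expansion referenced above (an equation image in the slides) has the standard first-order form:

```latex
R_k(\boldsymbol{a}) = R_k(\widetilde{\boldsymbol{a}})
  + \sum_j (a_j - \widetilde{a}_j)\,[\nabla R_k(\widetilde{\boldsymbol{a}})]_j
  + \dots
```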
LRP as a Deep Taylor Decomposition (ii)

● DTD requires a closed-form expression for the terms of the previous equation
● Substitute the true relevance function with a model that is easier to analyze:

● Modulation term 𝑐𝑘 is a constant set in such a way that R̂𝑘(𝒂) = 𝑅𝑘(𝒂) at the current data
point
● Then the Taylor expansion becomes:
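The slides show the relevance model and its expansion as images; following the DTD literature, the model and the resulting expansion can be written as below. Because the model is linear on the ReLU-active domain, the expansion there is exact (no higher-order terms):

```latex
\widehat{R}_k(\boldsymbol{a}) = \max\!\Big(0,\ \sum_{0,j} a_j w_{jk}\Big)\cdot c_k
\\[4pt]
\widehat{R}_k(\boldsymbol{a}) = \widehat{R}_k(\widetilde{\boldsymbol{a}})
  + \sum_{0,j} (a_j - \widetilde{a}_j)\, w_{jk}\, c_k
```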
LRP as a Deep Taylor Decomposition (iii)

● Relation to LRP-0/𝟄/𝞬: these rules can be recovered within the DTD framework by changing the
reference point 𝒂̃:
○ LRP-0: 𝒂̃ = 0
○ LRP-𝟄: 𝒂̃ = 𝟄 · (𝑎𝑘 + 𝟄)⁻¹ · 𝒂
○ LRP-𝞬: (reference point shown only as an image in the slides)
LRP as a Deep Taylor Decomposition (iv)
LRP as Deep Taylor Decomposition (v)

● LRP-0: 𝒂̃ = 0
● LRP-𝟄: 𝒂̃ = 𝟄 · (𝑎𝑘 + 𝟄)⁻¹ · 𝒂
● LRP-𝞬: 𝒂̃ = (expression shown only as an image in the slides)
Which LRP Rule for Which Layer?
Properties of Explanations

● LRP is a general framework for propagation, leaving flexibility for different rules at each layer, and
for the parameters ε and γ
○ Optimal selection for parameters requires a measure of explanation quality, which is still being researched

● Focus on 2 main explanation properties: fidelity and understandability


○ Fidelity is the accuracy of the explanation’s representation of the output neuron
■ To visually assess fidelity, we must assume the network properly solved the task (is using correct
visual features and avoiding distracting elements)
○ Understandability is the interpretability of the explanation to a human
Properties of Explanations (ii)

● Explanation is complex
○ Lacks understandability

● Fails to focus on castle


○ Lacks fidelity

red = positive relevance, blue = negative relevance (Model: VGG-16)
Properties of Explanations (iii)

● Explanation is very sparse


○ Lacks understandability

● Much of the noise is removed


○ Has fidelity

red = positive relevance, blue = negative relevance (Model: VGG-16)
Properties of Explanations (iv)

● Explanation is very clearly outlined


○ Has understandability

● Too much is highlighted (Ex. lamp post)


○ Lacks fidelity

red = positive relevance, blue = negative relevance (Model: VGG-16)
Properties of Explanations (v)
● Well outlined explanation
○ Has understandability
● Castle is correctly identified
○ Has fidelity

red = positive relevance, blue = negative relevance (Model: VGG-16)
Rule Choices with VGG-16

LAYER RULE EXPLANATION

Upper LRP-0 ● Upper layers have about 4000 neurons (4 per class)
● Relatively low neuron count entangles concepts that form classes
● LRP-0 is close to function and gradient, and can ignore entanglements

Middle LRP-ε ● Middle layers are less entangled, but layer stacking and convolution
weight sharing add spurious variations
● LRP-ε can filter out spurious variations

Lower LRP-γ ● Although similar to middle layers, LRP-γ at these layers uniformly spreads
relevance to whole features instead of individually calculating each pixel
● This helps make the explanation more understandable
Handling the Top Layer

Classification Task: Passenger Car (red = positive relevance, blue = negative relevance)
Conclusion
Conclusion

● Layer-wise Relevance Propagation (LRP) can explain SOTA predictions in terms of their input
features by propagating the prediction backwards through the model with various rules
● These rules can be implemented efficiently and modularly in most modern neural network software
● Through parameter tuning even complex models can have high quality explanations
● With Neuralization-Propagation (NEON), LRP can be applied beyond DNNs to other model types,
increasing its scope to help many other scenarios that require explainable machine learning
solutions
For

● LRP satisfies several properties of good explanatory ML techniques, and produces faithful and
understandable explanations
● LRP can be extended to a broad range of ML models beyond just DNNs
● LRP can be implemented efficiently compared to other explanation techniques
● LRP can be easily modified to fit a variety of use cases via different rules
Against

● Little empirical evidence presented, with no comparison to other SOTA methods


● Many types of LRP rules were left out and it is unclear why this is the case
● The rules presented apply only to networks with ReLU activations
● Explanation quality still requires human assessment, which can be time-consuming and error-prone
● No formal evaluation criteria, only fidelity and understandability, which are subject to bias
○ Fidelity requires the assumption that the model is functioning exactly as intended as well
● Authors offer heuristics for applying LRP rules (i.e., “use epsilon for middle layers”), but only
support these with intuition rather than with more rigorous forms of evidence
● The relationship to the DTD framework is clear, but it is not clear why this relationship is valuable
