Backpropagation Algorithm

The document explains how neural networks operate, focusing on the backpropagation algorithm used to train them by adjusting weights to minimize output error. It details the process of calculating net values, activations, and the application of gradient descent for weight updates. Additionally, it covers concepts like local and global minima in optimization, and the chain rule for calculating gradients in more complex networks.


How Neural Networks and Backpropagation Work
We have this input data:

Feature 1   Feature 2
0.5         -0.5
0.3         0.4
0.7         0.9

We wish to map it to these target outputs:

Target 1    Target 2
0.9         0.1
0.9         0.9
0.1         0.1
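As a minimal sketch (assuming NumPy; the array names X and D are ours, not from the slides), the three samples and their targets can be written as:

```python
# A minimal sketch of the training data above (assumes NumPy; X and D are our names).
import numpy as np

X = np.array([[0.5, -0.5],   # sample 1
              [0.3,  0.4],   # sample 2
              [0.7,  0.9]])  # sample 3

D = np.array([[0.9, 0.1],    # target for sample 1
              [0.9, 0.9],    # target for sample 2
              [0.1, 0.1]])   # target for sample 3
```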
Let's take our first sample: input (0.5, -0.5), with target (0.9, 0.1).
Consider this neural network (example taken from: Neural Networks: A Classroom Approach by Satish Kumar).

[Network diagram: 2 inputs → 2 hidden neurons → 2 output neurons]
Inputs: x1 = 0.5, x2 = -0.5
Hidden neuron 1: weights 0.1 (from x1) and -0.2 (from x2), bias 0.01
Hidden neuron 2: weights 0.3 (from x1) and 0.55 (from x2), bias -0.02
Output neuron 1: weights 0.37 (from hidden 1) and 0.9 (from hidden 2), bias 0.31
Output neuron 2: weights -0.22 (from hidden 1) and -0.12 (from hidden 2), bias 0.27
Targets: d1 = 0.9, d2 = 0.1
Let's start by moving forward.

The net value is the total input coming to the neuron.

Net value of the first neuron in the hidden layer:

z1 = x1(0.1) + x2(-0.2) + bias
z1 = 0.5(0.1) + (-0.5)(-0.2) + 0.01
z1 = 0.16
Net value of the second neuron in the hidden layer:

z2 = x1(0.3) + x2(0.55) + bias
z2 = 0.5(0.3) + (-0.5)(0.55) + (-0.02)
z2 = -0.145
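As a quick check, here is a minimal plain-Python sketch of these two net-value calculations (variable names are ours):

```python
# Net values of the two hidden neurons for the first sample (plain Python).
x1, x2 = 0.5, -0.5

# Hidden neuron 1: weights 0.1 and -0.2, bias 0.01
z1 = x1 * 0.1 + x2 * (-0.2) + 0.01    # ≈ 0.16

# Hidden neuron 2: weights 0.3 and 0.55, bias -0.02
z2 = x1 * 0.3 + x2 * 0.55 + (-0.02)   # ≈ -0.145

print(z1, z2)
```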
The activation of the neuron

Activation scales the input (net) value to a value between 0 and 1.
For example, the sigmoid function:
Activating the two neurons at the hidden layer:

δ(z) = 1 / (1 + e^(-λz)), where z is the input (net) value. For simplicity, we will consider λ = 1.

δ(z1) = 1 / (1 + e^(-0.16)) = 0.5399
δ(z2) = 1 / (1 + e^(0.145)) = 0.4638
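A small sketch of the sigmoid activation (with λ = 1) applied to the two net values; the helper name `sigmoid` is an assumption:

```python
import math

def sigmoid(z, lam=1.0):
    """Sigmoid activation: squashes the net value into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-lam * z))

a1 = sigmoid(0.16)    # ≈ 0.5399
a2 = sigmoid(-0.145)  # ≈ 0.4638
print(a1, a2)
```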
Let's continue with the output neurons.

Now, the hidden neurons' outputs become the inputs to the output neurons.

Net value of the first output neuron:
y1 = 0.5399(0.37) + 0.4638(0.9) + 0.31
y1 = 0.9271

Similarly, for the second output neuron:
y2 = 0.5399(-0.22) + 0.4638(-0.12) + 0.27
y2 = 0.0955
Now, activating the output neurons:

δ(y1) = 1 / (1 + e^(-0.9271)) = 0.7164
δ(y2) = 1 / (1 + e^(-0.0955)) = 0.5238

So the output layer produces 0.7164 and 0.5238 for this sample.
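Continuing the same sketch through the output layer (weights and biases taken from the example network; names are ours):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

a1, a2 = 0.5399, 0.4638   # hidden activations from the previous step

# Output neuron 1: weights 0.37 and 0.9, bias 0.31
y1 = a1 * 0.37 + a2 * 0.9 + 0.31          # ≈ 0.9271
# Output neuron 2: weights -0.22 and -0.12, bias 0.27
y2 = a1 * (-0.22) + a2 * (-0.12) + 0.27   # ≈ 0.0955

out1, out2 = sigmoid(y1), sigmoid(y2)     # ≈ 0.7164, 0.5238
print(out1, out2)
```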

Complete Guide to Neural Networks with Python: Theory and Applications


Definition of Backpropagation
• A method to train the neural network by adjusting the weights of the neurons in order to reduce the output error.
Gradient Descent

• The base algorithm used to minimize the error with respect to the weights of the neural network. The learning rate determines the step size of the update used to reach the minimum.
• An epoch is one complete pass through all the samples.

https://www.learnopencv.com/understanding-activation-functions-in-deep-learning/
https://sebastianraschka.com/faq/docs/closed-form-vs-gd.html
The Backpropagation

Remember, our objective is to minimize the error by changing the weights.

We move in the direction opposite to the derivative (opposite to the slope):

• Negative slope: when we increase w, the loss decreases → -(-) = + → the weight increases (moving right).
• Positive slope: when we increase w, the loss increases → -(+) = - → the weight decreases (moving left).
Weight Update Rule:

w ← w - η (dE/dw)

(old weight, minus the learning rate times the gradient, i.e. the slope)

η = learning rate: how fast we update the weights; in other words, the step size of the update.

https://towardsdatascience.com/gradient-descent-in-a-nutshell-eaf8c18212f0
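A minimal sketch of this update rule (the names `update_weight`, `grad`, and `eta` are ours):

```python
# Gradient descent step: move the weight opposite to the slope.
def update_weight(w, grad, eta):
    """w <- w - eta * (dE/dw)"""
    return w - eta * grad

print(update_weight(0.5,  0.2, 0.1))  # positive slope -> weight decreases (≈ 0.48)
print(update_weight(0.5, -0.2, 0.1))  # negative slope -> weight increases (≈ 0.52)
```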
Local Minimum and Global Minimum

Convex vs. non-convex optimization:
• Convex: one global/local minimum.
• Non-convex: one or more local minima and a global minimum; non-convex optimization can have multiple local minima.

Image credits: https://www.oreilly.com/radar/the-hard-thing-about-deep-learning/
Suppose z = w + 4 and y = z + 2. There is no w term in y, so dy/dw (and likewise dE/dw in the network) cannot be computed directly: w feeds the net value z, z feeds the activation a, and a feeds the error E.

Writing y = f(z) and z = g(w), the chain rule gives:

dy/dw = (dy/dz) · (dz/dw)
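A tiny sketch (using SymPy; purely illustrative) confirming that the chain rule and direct substitution agree for this toy example:

```python
# Verifying the toy example with SymPy: dy/dw via the chain rule equals
# dy/dw computed by substituting z = w + 4 directly into y.
import sympy as sp

w, z = sp.symbols("w z")
g = w + 4                                  # z = g(w)
f = z + 2                                  # y = f(z)

dy_dz = sp.diff(f, z)                      # 1
dz_dw = sp.diff(g, w)                      # 1
dy_dw_chain = dy_dz * dz_dw                # chain rule: 1
dy_dw_direct = sp.diff(f.subs(z, g), w)    # direct substitution: 1
print(dy_dz, dz_dw, dy_dw_chain, dy_dw_direct)
```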
What should be done is to follow the same chain backwards, w → net (z) → activation (a) → error (E), multiplying the local derivatives dz/dw, da/dz, and dE/da.

The Chain Rule

dE/dw = (dE/da) · (da/dz) · (dz/dw)
More Complex

For a deeper chain x → z1 → a1 → z2 → a2 → E, with weights w1 (into z1) and w2 (into z2):

dE/dw1 = (dE/da2) · (da2/dz2) · (dz2/da1) · (da1/dz1) · (dz1/dw1)
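A minimal sketch of this longer chain for a scalar pipeline with sigmoid activations and squared error; the numeric values are illustrative only, not the slide's 2-2-2 network:

```python
# A scalar x -> z1 -> a1 -> z2 -> a2 -> E pipeline with sigmoid activations
# and squared error E = 0.5 * (d - a2)**2; values are illustrative only.
import math

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

x, w1, b1, w2, b2, d = 0.5, 0.1, 0.01, 0.37, 0.31, 0.9

# Forward pass
z1 = w1 * x + b1
a1 = sigmoid(z1)
z2 = w2 * a1 + b2
a2 = sigmoid(z2)

# Local derivatives, multiplied right to left (the chain rule)
dE_da2  = -(d - a2)          # dE/da2
da2_dz2 = a2 * (1 - a2)      # sigmoid derivative at z2
dz2_da1 = w2                 # z2 = w2*a1 + b2
da1_dz1 = a1 * (1 - a1)      # sigmoid derivative at z1
dz1_dw1 = x                  # z1 = w1*x + b1

dE_dw1 = dE_da2 * da2_dz2 * dz2_da1 * da1_dz1 * dz1_dw1
print(dE_dw1)
```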
Consider these neurons to work with: the same network, inputs, initial weights, and targets as before.
Adjusting the weight of the output neuron

We start with the weight 0.37 connecting the first hidden neuron to the first output neuron (the targets are d1 = 0.9 and d2 = 0.1).
How much is the error changing with respect to the output?

The error (expected vs. actual), assuming one training sample per iteration (batch size of 1):

E = (1/2)[(d1 - δ(y1))² + (d2 - δ(y2))²]

dE/dδ(y1) = -(d1 - δ(y1)) = -(0.9 - 0.7164) = -0.1836

How much is the output changing with respect to the input?

dδ(y1)/dy1 = δ(y1)[1 - δ(y1)] = 0.7164(1 - 0.7164) = 0.2031

How much is the input changing with respect to the weight?

dy1/dw = δ(z1) = 0.5399

All together:

dE/dw = (-0.1836)(0.2031)(0.5399) = -0.0201
Weight update for the output neuron

The gradient dE/dw = -0.0201 was found from the chain rule. With the old weight 0.37 and a learning rate (how fast we move) assumed to be 1.2:

w ← 0.37 - 1.2(-0.0201) = 0.37 + 1.2(0.0201) = 0.3941

The new weight is 0.3941.
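Putting the three factors and the update together in a short sketch (numbers from the example; variable names are ours):

```python
# Gradient and update for the weight 0.37 (hidden neuron 1 -> output neuron 1).
d1   = 0.9       # target
out1 = 0.7164    # δ(y1), the actual output
a1   = 0.5399    # δ(z1), activation of hidden neuron 1
eta  = 1.2       # learning rate

dE_dout = -(d1 - out1)        # ≈ -0.1836
dout_dy = out1 * (1 - out1)   # ≈ 0.2031
dy_dw   = a1                  # ≈ 0.5399

dE_dw = dE_dout * dout_dy * dy_dw   # ≈ -0.0201
w_new = 0.37 - eta * dE_dw          # ≈ 0.3941
print(dE_dw, w_new)
```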
Adjusting the weight for the hidden layer

Now consider the weight 0.1 connecting input x1 to the first hidden neuron (call it w1). Its gradient is again a chain:

∂E/∂w1 = [∂E/∂(δ(z1))] · [∂(δ(z1))/∂z1] · [∂z1/∂w1]

How much is the net changing with respect to the weight?
z1 = x1(0.1) + x2(-0.2) + bias, so ∂z1/∂w1 = x1 = 0.5

How much is the activation changing with respect to the net?
δ(z1) = 1 / (1 + e^(-z1)), so ∂(δ(z1))/∂z1 = δ(z1)[1 - δ(z1)] = 0.5399(1 - 0.5399) = 0.2484
How much is the error changing with respect to the hidden activation?

The activation δ(z1) feeds both output neurons (in our case, p = 2 output paths), so its effect on the error is the sum of two chains:

∂E/∂(δ(z1)) = [∂E/∂(δ(y1))]·[∂(δ(y1))/∂y1]·[∂y1/∂(δ(z1))] + [∂E/∂(δ(y2))]·[∂(δ(y2))/∂y2]·[∂y2/∂(δ(z1))]

Path through the first output neuron:
∂E/∂(δ(y1)) = -(d1 - δ(y1)) = -0.1836
∂(δ(y1))/∂y1 = δ(y1)[1 - δ(y1)] = 0.2031
y1 = δ(z1)(0.37) + δ(z2)(0.9) + 0.31, so ∂y1/∂(δ(z1)) = 0.37

Path through the second output neuron:
∂E/∂(δ(y2)) = -(d2 - δ(y2)) = -(0.1 - 0.5238) = 0.4238
∂(δ(y2))/∂y2 = δ(y2)[1 - δ(y2)] = 0.5238(1 - 0.5238) = 0.2494
y2 = δ(z1)(-0.22) + δ(z2)(-0.12) + 0.27, so ∂y2/∂(δ(z1)) = -0.22

All together:
∂E/∂(δ(z1)) = (-0.1836)(0.2031)(0.37) + (0.4238)(0.2494)(-0.22) = -0.0370

And the full gradient for the hidden weight:
∂E/∂w1 = (-0.0370)(0.2484)(0.5) = -0.0045954
Weight update for the hidden neuron

The gradient ∂E/∂w1 = -0.0045954 was found from the chain rule. With the old weight 0.1 and a learning rate again assumed to be 1.2:

w1 ← 0.1 - 1.2(-0.0045954) = 0.1 + 1.2(0.0045954) = 0.1055

The new weight is 0.1055.
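The same calculation for the hidden weight, as a short sketch (numbers from the example; variable names are ours):

```python
# Gradient and update for the hidden weight 0.1 (x1 -> hidden neuron 1).
x1   = 0.5
a1   = 0.5399                  # δ(z1)
out1, out2 = 0.7164, 0.5238    # δ(y1), δ(y2)
d1, d2     = 0.9, 0.1          # targets
eta  = 1.2

# Path through output neuron 1 (weight from δ(z1) to y1 is 0.37)
path1 = -(d1 - out1) * out1 * (1 - out1) * 0.37
# Path through output neuron 2 (weight from δ(z1) to y2 is -0.22)
path2 = -(d2 - out2) * out2 * (1 - out2) * (-0.22)

dE_da1  = path1 + path2        # ≈ -0.0370
da1_dz1 = a1 * (1 - a1)        # ≈ 0.2484
dz1_dw1 = x1                   # 0.5

dE_dw1 = dE_da1 * da1_dz1 * dz1_dw1   # ≈ -0.0046
w1_new = 0.1 - eta * dE_dw1           # ≈ 0.1055
print(dE_dw1, w1_new)
```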
A final diagram to wrap it up:

https://www.jeremyjordan.me/neural-networks-training/

Weight updates for the network: in the linked diagram, the gradient for each weight is obtained by combining the paths (the blue path and the orange path) that flow back from the error to that weight.
Continue:
• A similar procedure is followed for all the other neurons.

Complete Guide to Neural Networks with Python: Theory and Applications


Take the second sample (iteration 2): input (0.3, 0.4), target (0.9, 0.9).

Take the third sample (iteration 3): input (0.7, 0.9), target (0.1, 0.1).
• That was ONE EPOCH. An epoch is one complete pass through all the samples. After repeating that for many epochs (e.g., 25), our neural network is expected to reach the minimum error and be considered trained (a minimal training-loop sketch follows below). We'll learn about optimization later!
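As a wrap-up, here is a compact sketch (assuming NumPy; all names, and the choice of 25 epochs, follow the example rather than any fixed recipe) of the whole procedure, one sample at a time, repeated for several epochs:

```python
# One epoch = one pass over all three samples; repeat for several epochs.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

X = np.array([[0.5, -0.5], [0.3, 0.4], [0.7, 0.9]])   # inputs
D = np.array([[0.9, 0.1], [0.9, 0.9], [0.1, 0.1]])    # targets

# Initial weights and biases from the example network (row i = neuron i)
W1 = np.array([[0.1, -0.2], [0.3, 0.55]]);    b1 = np.array([0.01, -0.02])
W2 = np.array([[0.37, 0.9], [-0.22, -0.12]]); b2 = np.array([0.31, 0.27])
eta = 1.2

for epoch in range(25):                     # e.g. 25 epochs
    for x, d in zip(X, D):                  # batch size of 1
        # Forward pass
        a1 = sigmoid(W1 @ x + b1)
        a2 = sigmoid(W2 @ a1 + b2)
        # Backward pass (chain rule)
        delta2 = -(d - a2) * a2 * (1 - a2)          # output-layer error term
        delta1 = (W2.T @ delta2) * a1 * (1 - a1)    # hidden-layer error term
        # Gradient-descent updates
        W2 -= eta * np.outer(delta2, a1); b2 -= eta * delta2
        W1 -= eta * np.outer(delta1, x);  b1 -= eta * delta1

print(sigmoid(W2 @ sigmoid(W1 @ X[0] + b1) + b2))   # network output for sample 1
```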
