ANN5

The document describes the backpropagation algorithm for training a neural network. It defines the terms used, such as weights, activations, targets, and derivatives of the error function. The backpropagation algorithm computes the error terms for the output layer first, then propagates these errors back through the hidden layers to update the weights between layers in order to minimize the error. This process of forward propagation of inputs and backward propagation of errors is repeated iteratively to gradually adjust the weights.

Example 2

A single output neuron receives inputs x1 = 0.982 and x2 = 0.5 through weights w1 = 2 and w2 = 4, plus a bias weight w0 = -3.93 on a constant input of 1. The target is d = t = 1 and the learning rate is η = 0.1.

The transfer function is unipolar continuous (logsig):

    o = 1 / (1 + e^-net),    f'(net) = (1 - o) o

Delta rule for the weights:

    Δw = η (d - o)(1 - o) o x = η δ x

Forward pass:
    net = 2(0.982) + 4(0.5) - 3.93(1) = 0.034
    o = 1 / (1 + exp(-0.034)) ≈ 0.51
    Error = 1 - 0.51 = 0.49

Error term:
    δ = (1 - 0.51)(1 - 0.51)(0.51) ≈ 0.1225

Weight updates:
    Δw1 = η δ x1 = 0.1 (0.1225)(0.982) ≈ 0.012      w1 = 2 + 0.012 = 2.012
    Δw2 = η δ x2 = 0.1 (0.1225)(0.5) ≈ 0.0061       w2 = 4 + 0.0061 = 4.0061
    Δw0 = η δ (1) = 0.1 (0.1225)(1) = 0.01225       w0 = -3.93 + 0.01225 ≈ -3.9178

Check with the updated weights:
    net = 2.012(0.982) + 4.0061(0.5) - 3.9178(1) ≈ 0.061
    o = 1 / (1 + exp(-0.061)) ≈ 0.5152
    Error = 1 - 0.5152 = 0.4848
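These numbers can be checked with a few lines of NumPy. The sketch below is an illustration only (NumPy and the variable names are my own, not part of the slides); it reproduces the single-neuron update above to within rounding.

```python
# Single logistic neuron updated with the delta rule, using the Example 2 numbers.
# Illustrative sketch; not code from the original slides.
import numpy as np

def logsig(net):
    return 1.0 / (1.0 + np.exp(-net))

x = np.array([0.982, 0.5, 1.0])   # x1, x2, constant bias input
w = np.array([2.0, 4.0, -3.93])   # w1, w2, bias weight
d, eta = 1.0, 0.1                 # target and learning rate

o = logsig(w @ x)                 # ≈ 0.51 (0.5085 at full precision)
delta = (d - o) * (1 - o) * o     # ≈ 0.1225
w = w + eta * delta * x           # ≈ [2.012, 4.0061, -3.9178]

o_new = logsig(w @ x)             # ≈ 0.5152, so the error drops from ≈0.49 to ≈0.485
print(w, o_new)
```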
1 
x  d  t  y 1   0.1
0 
3
1 2

0
4 -3.93
0 5
1
1

1 -6

x1 2

4 -3.93
x2
1

A two layer network
1 
x  d 1   0.1
0 

Transfer function is unipolar continuous


1
o
1  e net
net3=u3= 3*1+4*0+1*1=4 o3=1/(1+exp(-4))=0.982
net4=u4= 6*1+5*0+-6*1=0 o4=1/(1+exp(0))=0.5
net5=u5=2*0.982+4*0.5-3.93*1=0.034 o5=1/(1+exp(-0.04))
=0.51
f ' ( net )  ( 1  o )o
w   ( d  o )( 1  o )o x  w    x  5  (1  .51)(1  .51)(.51)  .1225
δ
w53   *  5 * 0.982  0.1* .1225 * .982  0.012
w53  w53  .012  2  .012  2.012
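As with Example 2, the forward pass and the output error term can be checked numerically. The sketch below is illustrative (NumPy and the variable names are assumptions, not from the slides):

```python
# Forward pass through the two-layer network and the output error term δ5.
# Illustrative sketch; not code from the original slides.
import numpy as np

def logsig(net):
    return 1.0 / (1.0 + np.exp(-net))

x  = np.array([1.0, 0.0, 1.0])    # x1, x2, constant bias input
w3 = np.array([3.0, 4.0, 1.0])    # weights into hidden neuron 3
w4 = np.array([6.0, 5.0, -6.0])   # weights into hidden neuron 4
w5 = np.array([2.0, 4.0, -3.93])  # weights into output neuron 5 (from o3, o4, bias)

o3 = logsig(w3 @ x)                        # ≈ 0.982
o4 = logsig(w4 @ x)                        # = 0.5
o5 = logsig(w5 @ np.array([o3, o4, 1.0]))  # ≈ 0.51

d, eta = 1.0, 0.1
delta5 = (d - o5) * (1 - o5) * o5          # ≈ 0.1225 (with the slides' rounding of o5 to 0.51)
print(o3, o4, o5, delta5)
```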
Derivation of Backprop

[Figure: a layered network with an input layer, a hidden layer, and an output layer.]

Define:
    ai = activation of neuron i
    wij = synaptic weight from neuron j to neuron i
    ni = excitation of neuron i (the sum of the weighted activations coming into neuron i, before squashing) = net
    di = target output of neuron i (= ti)
    oi = output of neuron i

By definition:
    ni = ∑j wij aj
    oi = 1 / (1 + e^-ni)

Summed, squared error at the output layer:
    E = ½ ∑i (di - oi)²
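A minimal sketch of these definitions in code (illustrative only; the function and array names are assumptions, not from the slides):

```python
# The definitions above for one layer of logistic neurons.
# Illustrative sketch; not code from the original slides.
import numpy as np

def layer_forward(W, a_prev):
    """n_i = sum_j w_ij a_j ;  o_i = 1 / (1 + e^-n_i)"""
    n = W @ a_prev
    o = 1.0 / (1.0 + np.exp(-n))
    return n, o

def sse(d, o):
    """E = 1/2 * sum_i (d_i - o_i)^2"""
    return 0.5 * np.sum((d - o) ** 2)

# e.g. the hidden layer of the earlier two-layer example:
W_hidden = np.array([[3.0, 4.0, 1.0],
                     [6.0, 5.0, -6.0]])
n, o = layer_forward(W_hidden, np.array([1.0, 0.0, 1.0]))
print(n, o)   # n = [4, 0], o ≈ [0.982, 0.5]
```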
Derivation of Backprop

By the chain rule:

    ∂E/∂wij = (∂E/∂oi)(∂oi/∂ni)(∂ni/∂wij)

First term, from E = ½ ∑i (di - oi)²:

    ∂E/∂oi = ½ · 2 (di - oi)(-1) = (oi - di)

Second term:

    ∂oi/∂ni = ∂/∂ni [1 / (1 + e^-ni)]
            = -[1 / (1 + e^-ni)²](-e^-ni)
            = e^-ni / (1 + e^-ni)²
            = [(1 + e^-ni) - 1] / (1 + e^-ni) · [1 / (1 + e^-ni)]
            = [1 - 1 / (1 + e^-ni)] · [1 / (1 + e^-ni)]
            = (1 - oi) oi

Third term, since ni = ∑j wij aj:

    ∂ni/∂wij = aj
Derivation of Backprop

Combining the three terms:

    ∂E/∂wij = (oi - di) (1 - oi) oi aj

where (oi - di) is the raw error term, (1 - oi) oi is due to the sigmoid, and aj is the incoming (pre-synaptic) activation.

Gradient descent on E then gives

    Δwij = -η ∂E/∂wij        (where η is an arbitrary learning rate)

    wij(t+1) = wij(t) + η (di - oi)(1 - oi) oi aj
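A quick way to convince yourself of this result is to compare the analytic gradient (oi - di)(1 - oi) oi aj with a finite-difference estimate of ∂E/∂wij for a single output neuron. The sketch below is illustrative only; the numbers reuse the earlier example and are not part of the derivation.

```python
# Finite-difference check of dE/dw_j = (o - d)(1 - o) o a_j for one logistic output neuron.
# Illustrative sketch; not code from the original slides.
import numpy as np

def logsig(n):
    return 1.0 / (1.0 + np.exp(-n))

def error(w, a, d):
    o = logsig(w @ a)
    return 0.5 * (d - o) ** 2

a = np.array([0.982, 0.5, 1.0])   # incoming activations (last entry is the bias input)
w = np.array([2.0, 4.0, -3.93])   # weights into the output neuron
d = 1.0                           # target

o = logsig(w @ a)
analytic = (o - d) * (1 - o) * o * a   # gradient from the derivation above

eps = 1e-6
numeric = np.array([
    (error(w + eps * np.eye(3)[j], a, d) - error(w - eps * np.eye(3)[j], a, d)) / (2 * eps)
    for j in range(3)
])

print(analytic)   # the two vectors agree to ~6 decimal places
print(numeric)
```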


Derivation of Backprop
We now need to compute the weight changes in the hidden layer, so, as before,
we write out the equation for the slope of the error function w.r.t. a
particular weight leading into the hidden layer:

    ∂E/∂wij = (∂E/∂ai)(∂ai/∂ni)(∂ni/∂wij)

(where i now corresponds to a unit in the hidden layer and j now corresponds
to a unit in the input or an earlier hidden layer)

From the previous derivation, the last two terms can simply be written down:

    ∂ai/∂ni = (1 - ai) ai

    ∂ni/∂wij = aj
Derivation of Backprop
However, the first term is more difficult to understand for this hidden
layer. It is what Minsky called the credit assignment problem, and it is
what stumped connectionists for two decades. The trick is to realize
that the hidden nodes do not themselves make errors; rather, they
contribute to the errors of the output nodes. So, the derivative of the
total error w.r.t. a hidden neuron's activation is the sum of that hidden
neuron's contributions to the errors in all of the output neurons:

    ∂E/∂ai = ∑k (∂E/∂ok)(∂ok/∂nk)(∂nk/∂ai)        (where k indexes over all output units)

The three factors are: the contribution of each output neuron (∂E/∂ok), the contribution of all the inputs to that output neuron from the hidden layer (∂ok/∂nk), and the contribution of the particular neuron in the hidden layer (∂nk/∂ai).
Derivation of Backprop
From our previous derivations, the first two terms are easy:
    ∂E/∂ok = (ok - dk)

    ∂ok/∂nk = (1 - ok) ok

For the third term, remember that nk = ∑i wki ai, and since only one member of the sum involves ai:

    ∂nk/∂ai = wki
Derivation of Backprop
Combining these terms then yields:

    ∂E/∂ai = -∑k (dk - ok)(1 - ok) ok wki = -∑k δk wki

where δk = (dk - ok)(1 - ok) ok and wki is the weight between the hidden and output layers.

And combining with the previous results yields:

    ∂E/∂wij = -(∑k δk wki)(1 - ai) ai aj = -ei (1 - ai) ai aj

where ei = ∑k δk wki is the error backpropagated to hidden neuron i. The hidden-layer weight update is therefore

    wij(t+1) = wij(t) + η (∑k δk wki)(1 - ai) ai aj = wij(t) + η δi aj,   with δi = (∑k δk wki)(1 - ai) ai
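In vector form this says: backpropagate the output δ's through the hidden-to-output weights, then multiply by the sigmoid derivative at each hidden neuron. A small sketch (illustrative only, reusing the numbers of the two-layer example; not code from the slides) is:

```python
# Hidden-layer error terms δi = (Σk δk wki)(1 - ai) ai, computed from the output δ.
# Illustrative sketch; not code from the original slides.
import numpy as np

a_hidden  = np.array([0.982, 0.5])   # o3, o4
delta_out = np.array([0.1225])       # δ5, the single output neuron's error term
W_out     = np.array([[2.0, 4.0]])   # w53, w54: weights from the hidden neurons to neuron 5

e_hidden = W_out.T @ delta_out                       # Σk δk wki for each hidden neuron
delta_hidden = e_hidden * (1 - a_hidden) * a_hidden  # δ3 and δ4

print(delta_hidden)   # ≈ [0.0043, 0.1225], the values used in the worked updates below
```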
Forward Propagation of Activity
• Forward Direction layer by layer:
– Inputs applied
– Multiplied by weights
– Summed
– ‘Squashed’ by sigmoid activation function
– Output passed to each neuron in next layer
• Repeat above until network output produced

Back-propagation of error

• Compute error (delta or local gradient) for each output unit


• Layer-by-layer, compute error (delta or local gradient) for each
hidden unit by backpropagating errors (as shown previously)

The weights can then be updated using the Generalised Delta Rule (GDR), also
known as the Back-Propagation (BP) algorithm.
For an output neuron:

    wij(t+1) = wij(t) + η (di - oi)(1 - oi) oi aj,        with δi = (di - oi)(1 - oi) oi

For a hidden neuron:

    wij(t+1) = wij(t) + η (∑k δk wki)(1 - ai) ai aj,      with δi = (∑k δk wki)(1 - ai) ai

where δk = (dk - ok)(1 - ok) ok is the error term of output neuron k.
The chain rule does the following: it distributes the error of an output unit o to all
the hidden units that it is connected to, weighted by this connection. Put differently,
a hidden unit h receives a delta from each output unit o equal to the delta of that
output unit weighted with (i.e., multiplied by) the weight of the connection between
those units.
Algorithm (Backpropagation)

Start with random weights
while error is unsatisfactory
    for each input pattern
        compute hidden node input (net)
        compute hidden node output (o)
        compute input to output node (net)
        compute network output (o)
        modify output layer weights:
            wij(t+1) = wij(t) + η (di - oi)(1 - oi) oi aj
        modify hidden layer weights:
            wij(t+1) = wij(t) + η (∑k δk wki)(1 - ai) ai aj,   where δk = (dk - ok)(1 - ok) ok
    end
end
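The loop above can be written out in NumPy for a network with one hidden layer of logistic units. The sketch below is an illustration, not code from the slides: the network size, the XOR training patterns, the learning rate, and the stopping threshold are all assumptions.

```python
# Sketch of the backpropagation loop above for a single hidden layer of logistic units.
# Illustrative only: network size, training data, learning rate and stopping rule are
# assumptions, not taken from the slides.
import numpy as np

rng = np.random.default_rng(0)

def logsig(n):
    return 1.0 / (1.0 + np.exp(-n))

# Toy training set: XOR patterns; the third input column is a constant bias of 1.
X = np.array([[0, 0, 1], [0, 1, 1], [1, 0, 1], [1, 1, 1]], dtype=float)
D = np.array([[0], [1], [1], [0]], dtype=float)

eta = 0.5
W_h = rng.uniform(-1, 1, size=(2, 3))   # hidden weights: 2 hidden units x (2 inputs + bias)
W_o = rng.uniform(-1, 1, size=(1, 3))   # output weights: 1 output x (2 hidden units + bias)

for epoch in range(20000):               # "while error is unsatisfactory"
    total_error = 0.0
    for x, d in zip(X, D):               # "for each input pattern"
        a_h = logsig(W_h @ x)            # hidden node net inputs and outputs
        a_hb = np.append(a_h, 1.0)       # append the bias input for the output layer
        o = logsig(W_o @ a_hb)           # network output

        delta_o = (d - o) * (1 - o) * o                        # output error terms
        delta_h = (W_o[:, :2].T @ delta_o) * (1 - a_h) * a_h   # hidden error terms
                                                               # (bias weight not backpropagated)
        W_o += eta * np.outer(delta_o, a_hb)   # modify output layer weights
        W_h += eta * np.outer(delta_h, x)      # modify hidden layer weights

        total_error += 0.5 * np.sum((d - o) ** 2)
    if total_error < 1e-3:               # stopping rule (an assumption)
        break

print(epoch, total_error)   # with an unlucky initialization XOR may need more units/epochs
```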
Applying w_new = w_old + η δ a to the weights of the two-layer network (δ5 ≈ 0.1225 from above; for the hidden neurons, δ4 = (δ5 w54)(1 - o4) o4 = 0.1225 · 4 · 0.25 ≈ 0.1225 and δ3 = (δ5 w53)(1 - o3) o3 = 0.1225 · 2 · 0.0177 ≈ 0.0043):

    Δw50 = η δ5 (1) = 0.1 (0.1225)(1) = 0.01225       w50 = -3.93 + 0.01225 ≈ -3.9178
    Δw53 = η δ5 o3 = 0.1 (0.1225)(0.982) ≈ 0.012      w53 = 2 + 0.012 = 2.012
    Δw41 = η δ4 x1 = 0.1 (0.1225)(1) = 0.01225        w41 = 6 + 0.01225 = 6.01225
    Δw31 = η δ3 x1 = 0.1 (0.0043)(1) = 0.00043        w31 = 3 + 0.00043 = 3.00043
Verification that it works

With the updated weights the new output is o5 ≈ 0.5239, so the new error is 1 - 0.5239 = 0.476: the error has been reduced by 0.014 (from 0.490 to 0.476).
Update the weights of the multi-layer network using the backpropagation algorithm. The transfer functions of the neurons are tansig functions. The target outputs are y2* = 1 and y3* = 0.5. The learning rate is 0.5. Show that with the updated weights there is a reduction in the total error.

Homework: bring the solution tomorrow.


