
Neural Networks and Deep Learning

UNIT – 3
Feedforward Neural Networks

Dr. D. SUDHEER
Assistant Professor
Computer Science and Engineering
VNRVJIET
Introduction
• By a suitable choice of architecture for a feedforward network, it is possible to perform several pattern recognition tasks.
• Analysis of the linear associative network shows that the network is limited in its capabilities: it places a constraint on the number of input patterns that can be associated correctly.
• This constraint on the number of input patterns is overcome by using a two-layer feedforward network with nonlinear processing units in the output layer.
• This modification automatically leads to the consideration of pattern classification problems.
• Classification problems which are not linearly separable are called hard problems.
• In order to overcome the constraint of linear separability in pattern classification problems, a multilayer feedforward network with nonlinear processing units in all the intermediate (hidden) layers and in the output layer is proposed.
• Such a multilayer feedforward architecture can represent the solution of hard problems in a network.
• However, it introduces the problem of hard learning, i.e., the difficulty of adjusting the weights of the network to capture the functional relationship implied by the given input-output pattern pairs.
• The hard learning problem is solved by using the backpropagation learning algorithm.
Analysis of Pattern Association Networks

• The objective in pattern association is to design a network that can represent the association in the pairs of vectors (a_l, b_l), l = 1, 2, ..., L, through a set of weights to be determined by a learning law.
• The given set of input-output pattern pairs is called the training data.


a. Linear Associative Network

• Each output unit receives inputs from the M input units corresponding to the M-dimensional input vectors.
• Due to the linearity of the output function, the activation values (x_i) and the signal values of the units in the input layer are the same as the input data values a_li.
• The activation value of the jth output unit is

  y_j = Σ_{i=1}^{M} w_ji x_i = Σ_{i=1}^{M} w_ji a_li,   j = 1, 2, ..., N

• The weights are determined by using the criterion that the total mean squared error between the desired output and the actual output is to be minimized.
b. Determination of weights by computation
• For the linear associative network, the error in the output is given by the distance between the desired output vector and the actual output vector. Summed over all L pattern pairs, the total error is

  E(W) = Σ_{l=1}^{L} || b_l - W a_l ||^2

which is to be minimized with respect to the weight matrix W.
• The minimizing weight matrix is given by the pseudoinverse solution W = B A+, where A = [a_1 a_2 ... a_L] is the M x L matrix of input vectors and B = [b_1 b_2 ... b_L] is the N x L matrix of desired output vectors.


• The following singular value decomposition (SVD) of the M x L matrix A is used to compute the pseudoinverse and to evaluate the minimum error:

  A = Σ_{i=1}^{r} λ_i p_i q_i^T,   so that   A+ = Σ_{i=1}^{r} (1/λ_i) q_i p_i^T

where r is the rank of A, the λ_i are the nonzero singular values, and p_i, q_i are the corresponding left and right singular vectors.
• The minimum error is then E_min = || B - B A+ A ||^2, the component of B lying outside the space spanned by the input vectors; it is zero when the input vectors a_l are linearly independent.
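As a rough illustration of this computation, the sketch below (toy dimensions and randomly generated pattern pairs are assumed, not values from the text) forms the pseudoinverse A+ from the SVD of A and computes W = B A+; the residual corresponds to the minimum error discussed above.

```python
import numpy as np

# Minimal sketch (assumed toy data): pseudoinverse-based weights W = B A+ via SVD.
M, N, L = 4, 2, 3
rng = np.random.default_rng(0)
A = rng.standard_normal((M, L))   # columns are the input vectors a_l
B = rng.standard_normal((N, L))   # columns are the desired output vectors b_l

# SVD of A; the reciprocal nonzero singular values give the pseudoinverse A+.
U, s, Vt = np.linalg.svd(A, full_matrices=False)
A_pinv = Vt.T @ np.diag(1.0 / s) @ U.T   # equivalent to np.linalg.pinv(A) for full-rank A

W = B @ A_pinv                           # minimum mean squared error weight matrix

# Residual (minimum) error; zero when the a_l are linearly independent.
print(np.linalg.norm(B - W @ A) ** 2)
```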


c. Determination of weights by learning
• It is desirable to determine the weights of a network in an incremental manner.
• Each update of the weights with a new input data pair can be interpreted as network learning.
• Computationally also, learning is preferable because it does not require information about all the training set data at the same time.
• It is also preferable to have learning confined to a local operation.
• Two learning laws and their variations, as applicable to a linear associative network, are discussed:
  1. Hebb's law
  2. Widrow's law
• Let the input pattern vector a_l and the corresponding desired output pattern vector b_l be applied to the linear associative network.
• According to Hebb's law, the update of the weight of a connection depends only on the activations of the processing units it connects:

  Δw_ji = x_i y_j = a_li b_lj,   i.e., in matrix form   ΔW = b_l a_l^T


• Note that the computation of the increment x_i y_j = a_li b_lj is purely local to the processing unit and the input-output pattern pair.
• The updated weight matrix for the application of the lth pair (a_l, b_l) is given by

  W(l) = W(l - 1) + b_l a_l^T

where W(l - 1) refers to the weight matrix after presentation of the first (l - 1) pattern pairs, and W(l) refers to the weight matrix after presentation of the first l pattern pairs; W has dimension N x M.
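A minimal sketch of this incremental Hebbian update, using assumed toy dimensions and random pattern pairs (not values from the text):

```python
import numpy as np

# Incremental Hebbian learning for a linear associative network (toy data assumed).
M, N, L = 4, 2, 3
rng = np.random.default_rng(0)
pairs = [(rng.standard_normal(M), rng.standard_normal(N)) for _ in range(L)]

W = np.zeros((N, M))                 # W(0) = 0
for a_l, b_l in pairs:
    W += np.outer(b_l, a_l)          # W(l) = W(l-1) + b_l a_l^T

# Recall: apply a stored input a_k and compare with the desired output b_k.
a_k, b_k = pairs[0]
print(W @ a_k, b_k)                  # equal only if the stored a_l are orthonormal
```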


• To verify whether the network has learnt the association of the given set of input-output pattern vector pairs, apply the input pattern a_k and determine the actual output vector b'_k:

  b'_k = W a_k = Σ_{l=1}^{L} b_l (a_l^T a_k)

• If the input vectors are orthonormal, then b'_k = b_k and the desired pattern is recalled exactly; otherwise the recalled output contains contributions (crosstalk) from the other stored pairs.


Widrow's law:
• A form of Widrow learning can be used to obtain W = B A+ recursively.
• Let W(l - 1) be the weight matrix after presentation of (l - 1) samples.
• Then W(l - 1) = B(l - 1) A+(l - 1), where the matrices B(l - 1) and A(l - 1) are composed of the first (l - 1) vectors b_k and the first (l - 1) vectors a_k.
• The updated matrix is given by

  W(l) = W(l - 1) + (b_l - W(l - 1) a_l) p_l^T

where the vector p_l has to be computed recursively from all the previously presented input vectors.


• By starting with zero initial values for all the weights, and successively adding the pairs (a_1, b_1), (a_2, b_2), ..., (a_L, b_L), we can obtain the final pseudoinverse-based weight matrix W = B A+.
• The problem with this approach is that the recursive procedure cannot be implemented locally, because of the need to compute p_l from all the previous inputs.
• The same eventual effect can be approximately realized using the following variation of the above learning law:

  ΔW(l) = η (b_l - W(l - 1) a_l) a_l^T


• where η is a small positive constant called the learning rate parameter.
• The learning law can be implemented locally, for each output unit j, as

  w_j(l) = w_j(l - 1) + η (b_lj - w_j^T(l - 1) a_l) a_l

where w_j(l - 1) is the weight vector associated with the jth processing unit in the output layer of the linear associative network at the (l - 1)th iteration.
• With a sufficiently small learning rate the weights converge close to the minimum-error (pseudoinverse) solution, although the convergence is slow.
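A sketch of this local (LMS-style) version of the learning law on assumed synthetic data; with a small learning rate the weights approach the error-minimizing solution.

```python
import numpy as np

# Local Widrow-style (LMS) learning on assumed synthetic, consistent data.
M, N, L = 4, 2, 200
rng = np.random.default_rng(1)
A = rng.standard_normal((M, L))          # input vectors a_l as columns
W_true = rng.standard_normal((N, M))
B = W_true @ A                           # consistent targets, so zero error is attainable

eta = 0.01                               # small learning rate parameter
W = np.zeros((N, M))
for epoch in range(50):
    for l in range(L):
        a_l, b_l = A[:, l], B[:, l]
        err = b_l - W @ a_l              # desired output minus actual output
        W += eta * np.outer(err, a_l)    # local update for every output unit j

print(np.linalg.norm(W - W_true))        # shrinks towards 0 for a small eta
```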
Summary of pattern association network

• The rank r of a matrix is the maximum number of linearly independent rows (or columns) of the matrix. For the pattern association network, the error is zero only when the input vectors a_l are linearly independent, i.e., when r = L.
Analysis of Pattern Classification Networks

• Each input pattern is associated with a distinct class label.
• The distinct output patterns can be viewed as distinct classes.
• There is no restriction on the number of input patterns that may be associated with a given output pattern (class).
• The output patterns are points in a discrete N-dimensional space.
• Sometimes the input patterns may be corrupted by external noise.
• Even a noisy or approximate input is mapped onto one of the distinct output patterns; this is called accretive behaviour.


Pattern classification with Perceptron
• The number of units in the input layer corresponds to the dimensionality of the input pattern vectors.
• Typically, if the weighted sum of the input values to the output unit exceeds the threshold, the output signal is labelled as 1, otherwise as 0.
• Suppose a subset of the input patterns belongs to one class (say class A1) and the remaining subset belongs to another class (say class A2); the perceptron has to learn weights (and a threshold) such that it outputs 1 for patterns of A1 and 0 for patterns of A2.
• Note that the dividing surface between the two classes is given by

  Σ_{i=1}^{M} w_i a_i - θ = 0,   i.e.,   w^T a = θ

• This equation represents a linear hyperplane in the M-dimensional space. The hyperplane becomes a point if M = 1, a straight line if M = 2, and a plane if M = 3.


• Suppose the subsets A1 and A2 of points in the M-dimensional space contain the sample patterns belonging to the classes A1 and A2, respectively.
• We need to determine whether a given pattern belongs to A1 or A2, i.e., to find a hyperplane that separates the two subsets.


Perceptron learning as gradient descent

• The perceptron learning law can be written as

  w(m + 1) = w(m) + η e(m) x(m)

where x(m) is the input vector at step m, e(m) = b(m) - s(m) is the error signal, b(m) is the desired output, and s(m) is the actual (thresholded) output.
• The product of the output error e(m) and the activation value w^T(m) x(m) can be used as an instantaneous measure of performance:

  E(m) = - e(m) w^T(m) x(m)

• Gradient descent on this measure then gives the weight update

  Δw(m) = - η ∂E(m)/∂w(m) = η e(m) x(m)

which is exactly the perceptron learning law stated above.
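A minimal sketch of perceptron training by this rule, on assumed linearly separable 2-D data (class A1 labelled 1, class A2 labelled 0); the data and hyperparameters are illustrative only.

```python
import numpy as np

# Perceptron learning as gradient descent on assumed separable 2-D data.
rng = np.random.default_rng(2)
X1 = rng.standard_normal((50, 2)) + [2, 2]    # class A1 samples
X2 = rng.standard_normal((50, 2)) + [-2, -2]  # class A2 samples
X = np.vstack([X1, X2])
b = np.hstack([np.ones(50), np.zeros(50)])    # desired outputs

X_aug = np.hstack([X, np.ones((100, 1))])     # append 1 to absorb the threshold
w = np.zeros(3)
eta = 0.1

for epoch in range(20):
    for x_m, b_m in zip(X_aug, b):
        s = 1.0 if w @ x_m > 0 else 0.0       # hard-limiting output unit
        e = b_m - s                           # error signal e(m)
        w += eta * e * x_m                    # w(m+1) = w(m) + eta * e(m) * x(m)

preds = (X_aug @ w > 0).astype(float)
print("training accuracy:", (preds == b).mean())
```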


[Figure (slides 27-28): worked forward-pass example. A small feedforward network with input units x0 and x1 (input values 5.1 and 3.5), hidden units x2-x5, and output unit x6 is evaluated for a target Y = [0, 1, 2]; the activation of each unit is computed layer by layer as a weighted sum of the previous layer's outputs.]
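The specific weights from the slide diagram are not recoverable here, but the following sketch (with placeholder weights, not the slide's values) shows how such a forward pass is computed layer by layer:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Forward pass through a small feedforward network.
# Assumed layer sizes: 2 inputs -> 2 hidden -> 2 hidden -> 1 output.
x = np.array([5.1, 3.5])                       # input pattern
W1 = np.array([[0.1, 0.5], [0.3, 0.2]])        # layer-1 weights (hypothetical)
W2 = np.array([[0.2, 0.6], [0.7, 0.3]])        # layer-2 weights (hypothetical)
W3 = np.array([[0.7, 0.1]])                    # output weights (hypothetical)

h1 = sigmoid(W1 @ x)        # activations of the first hidden layer
h2 = sigmoid(W2 @ h1)       # activations of the second hidden layer
y = W3 @ h2                 # linear output unit
print(h1, h2, y)
```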
Pattern representation problem



Multi class classification



Geometrical representation of hard problems

• A pattern classification problem can be viewed as determining the hypersurfaces separating the multidimensional patterns belonging to different classes.
• A two-layer network consisting of two input units and N output units can produce only N distinct straight lines in the two-dimensional pattern space; such straight-line boundaries alone cannot separate the classes of a hard (linearly inseparable) problem.


• A multilayer perceptron with nonlinear units, such as sigmoid units, produces smooth decision surfaces instead of piecewise-linear hyperplanes.


Analysis of pattern mapping networks

• If a function transforms a point in the M-dimensional input pattern space to a point in the N-dimensional output pattern space, then the problem of capturing the implied functional relationship from examples is called a mapping problem.
• The network accomplishing this task is called a mapping network.
• The pattern mapping problem is a more general case of the classification problem.
• The objective of pattern mapping is to capture the generalization implied in the given input-output pattern pairs.
• It can also be viewed as approximating a function from the given data.
• In terms of function approximation, for an input close to the ones used in training, the output of the network should be close to the corresponding desired values.


Pattern Mapping Network

• A multilayer feedforward neural network with at least two hidden layers, along with the input and output layers, can perform the pattern classification task.
• The same model can also perform the pattern mapping task.
• The number of hidden layers depends on the nature of the mapping problem.
• Except for the input layer, the units in the other layers must be nonlinear in order to produce the generalization.


Ref: B. Yegnanarayana, Artificial Neural Networks, Table 4.6, for the backpropagation algorithm.
• The hard learning problem is solved by using a differentiable nonlinear output function for each unit in the hidden and output layers.
• The corresponding learning law is based on propagating the error from the output layer back to the hidden layers for updating the weights.
• This is an error-correcting learning law, also called the generalized delta rule.
• It is based on the principle of gradient descent along the error surface.
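A compact sketch of the generalized delta rule for a network with one hidden layer, trained on an assumed XOR-like (linearly inseparable) problem; this is only an illustration under assumed data and layer sizes, not the full algorithm of Table 4.6.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Backpropagation (generalized delta rule) sketch: one hidden layer, sigmoid units,
# assumed XOR-like data (class depends on the sign of x1 * x2).
rng = np.random.default_rng(3)
X = rng.standard_normal((200, 2))
T = (X[:, 0] * X[:, 1] > 0).astype(float)[:, None]     # a hard (nonlinear) problem

W1, b1 = rng.standard_normal((2, 8)) * 0.5, np.zeros(8)
W2, b2 = rng.standard_normal((8, 1)) * 0.5, np.zeros(1)
eta = 1.0

for epoch in range(3000):
    H = sigmoid(X @ W1 + b1)             # hidden layer activations
    Y = sigmoid(H @ W2 + b2)             # output layer activations
    dY = (Y - T) * Y * (1 - Y)           # output delta: error times f'(activation)
    dH = (dY @ W2.T) * H * (1 - H)       # delta propagated back to the hidden layer
    W2 -= eta * H.T @ dY / len(X); b2 -= eta * dY.mean(axis=0)
    W1 -= eta * X.T @ dH / len(X); b1 -= eta * dH.mean(axis=0)

print("misclassification rate:", np.mean((Y > 0.5) != T))  # should fall during training
```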


• In this so-called backpropagation network, the objective is to capture (in the weights) the complex nonlinear hypersurfaces separating the classes.
• The complexity of the surface that can be represented is determined by the number of hidden units in the network.
• In a classification problem, the input patterns belonging to a class are expected to have some common features which are different from those of patterns belonging to another class.
• For a classification problem, the trained neural network is expected to perform some kind of generalization, which is possible only if there are some features common among the input patterns belonging to each class.
• These common features are captured by the network during training.

