Multilayer Perceptron
Structure
6.1 Introduction
    Objectives
6.2 Multi-Layer Perceptron (MLP)
6.3 Backpropagation Learning
6.4 Summary
6.5 Solutions/Answers
6.6 Practical Assignment
6.7 References
6.1 INTRODUCTION
In this unit, we extend the single layer neural network architecture to the multilayer feedforward (MLFF) network with backpropagation (BP) learning. This network is also called the multilayer perceptron; as its name suggests, it has more than one layer. First, in Sec. 6.2, we briefly review the perceptron model discussed in Unit 4 and show how it is altered to form MLFF networks. In Sec. 6.3, we derive the generalized delta (backpropagation) learning rule and see how it is implemented in practice. We also examine variations in the learning process that improve its efficiency, and describe ways to avoid some potential problems that can arise during training.
Objectives
After studying this unit, you should be able to:
• define the multi-layer perceptron;
• formulate the multi-layer model for the given activation functions of input,
hidden and output layers;
• implement the backpropagation algorithm.
6.2 MULTI-LAYER PERCEPTRON (MLP)
The first approach to solving linearly inseparable problems such as XOR was to use more than one perceptron, each set up to identify a small, linearly separable section of the inputs. Their outputs were then combined by another perceptron, which produced a final indication of the class to which the input belongs.
Fig. 1: Combination of perceptrons to solve the XOR problem
However, such a combination cannot learn: there is no way of deciding which connections between active inputs should be strengthened in order to reinforce the correct parts of the network, because the actual inputs are effectively masked off from the output units by the intermediate layer. Moreover, the two states of a neuron, on or off, give no indication of the scale by which the weights should be adjusted. This is illustrated in Fig. 2, which shows step functions with thresholds at θ and at 0. Such hard-limiting threshold functions remove the very information that is needed if the network is to learn successfully. The network cannot determine which of the input weights should be increased and which should not, and so it is unable to produce a better solution next time. The way around this difficulty is to adjust the step function used for thresholding slightly and to use a slightly different nonlinearity.
Fig. 2: “Step” or “Heaviside” functions with thresholds at θ and at 0
Fig. 3: Model of a single neuron with inputs I1, …, In, weights w1, …, wn, a bias input I0 with weight w0 = θ, a summation unit and an activation function producing the output O
Now let us define a few nonlinear activation functions in addition to those defined earlier.
i) Linear Function: The output φ(I) is directly proportional to the input I. The corresponding graph is shown in Fig. 4.
Fig. 4: Linear activation function
ii) Piecewise Linear Function: The output is defined as
O = 1 if mI ≥ 1,
O = mI if −1 < mI < 1,
O = −1 if mI ≤ −1.
The corresponding graph is shown in Fig. 5.
Fig. 5: Piecewise linear activation function, saturating at +1 and −1
iii) Hard Limiter Function: The output is given as O = sgn[I], and the corresponding graph is shown in Fig. 6.
Fig. 6: Hard limiter (signum) function
iv) Unipolar Sigmoidal and Bipolar Sigmoidal Functions: The output of the unipolar sigmoidal function is given as O = 1 / (1 + exp(−λI)), whereas the output of the bipolar sigmoidal function is given as O = tanh[λI]. The corresponding graphs are shown in Fig. 7.
Fig. 7: Unipolar and bipolar sigmoidal functions
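As a quick illustration, these activation functions can be coded directly in C. The sketch below is ours (the function names, and the parameters m for the slope and lambda for the gain, are illustrative choices, not notation fixed by this unit); sgn(0) is taken as +1 here.

#include <math.h>

/* Piecewise linear function: output mI, saturating at +1 and -1. */
double piecewise_linear(double I, double m)
{
    double a = m * I;
    if (a >= 1.0)  return  1.0;
    if (a <= -1.0) return -1.0;
    return a;
}

/* Hard limiter: O = sgn(I), with sgn(0) taken as +1. */
double hard_limiter(double I)
{
    return (I >= 0.0) ? 1.0 : -1.0;
}

/* Unipolar sigmoid: O = 1 / (1 + exp(-lambda * I)). */
double unipolar_sigmoid(double I, double lambda)
{
    return 1.0 / (1.0 + exp(-lambda * I));
}

/* Bipolar sigmoid: O = tanh(lambda * I). */
double bipolar_sigmoid(double I, double lambda)
{
    return tanh(lambda * I);
}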
Consider the input and output vectors

I_I = [I_I1, I_I2, …, I_Im]^T and O_O = [O_O1, O_O2, …, O_On]^T.

Here the input layer consists of m neurons and the output layer consists of n neurons; the input layer uses the linear transfer function and the output layer uses the unipolar sigmoidal function. Accordingly,

{O_I} = {I_I}   (m × 1),

and the input to the jth output neuron is the weighted sum

I_Oj = Σ_{i=1}^{m} w_ij I_Ii.

In matrix form, the input to the output layer can be written as

[I_O] = [W]^T [O_I] = [W]^T [I_I],

where [I_O] is n × 1, [W]^T is n × m and [I_I] is m × 1.
The output of the kth output neuron is then

O_Ok = 1 / (1 + e^(−λ I_Ok)),

or, in matrix form, [O_O] = f([W]^T [I_I]),

where λ is the sigmoidal gain and [W] is the weight matrix, also known as the connection matrix.
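The relation [I_O] = [W]^T [I_I] followed by the sigmoid amounts to two nested loops. The following is a minimal sketch assuming m = 2 input and n = 2 output neurons, a weight array w[i][j] holding w_ij (input i to output j) and a sigmoidal gain lambda; these names and sizes are our own assumptions for illustration.

#include <math.h>

#define M 2   /* number of input neurons (assumed)  */
#define N 2   /* number of output neurons (assumed) */

/* O_out[j] = 1 / (1 + exp(-lambda * I_Oj)), with I_Oj = sum_i w[i][j] * I_in[i]. */
void forward_single_layer(const double I_in[M], const double w[M][N],
                          double lambda, double O_out[N])
{
    for (int j = 0; j < N; j++) {
        double I_Oj = 0.0;
        for (int i = 0; i < M; i++)
            I_Oj += w[i][j] * I_in[i];          /* [I_O] = [W]^T [I_I] */
        O_out[j] = 1.0 / (1.0 + exp(-lambda * I_Oj));
    }
}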
Fig. 8: Single layer network with m input neurons connected to n output neurons through the weights w_ij
In a multilayer perceptron, the adapted perceptrons are arranged in layers. The model considered here has three layers: an input layer, an output layer, and a layer between the input and the output called the hidden layer. We use a linear transfer function for the perceptrons in the input layer and sigmoidal (squashed-S) functions for the hidden layer and the output layer. The input layer performs no weighted sum or thresholding. Thus, we have modified the single layer perceptron by changing the nonlinearity from a step function to a sigmoidal function and by adding a hidden layer. This type of network can recognize more complex patterns. The input-output mapping of the multilayer perceptron is given by

O = N3[N2[N1[I]]],

where N1 is the mapping provided by the input layer, N2 is the nonlinear mapping provided by the hidden layer and N3 is the nonlinear mapping provided by the output layer. The multilayer perceptron supports many applications of neural networks, such as function approximation, learning and generalization. A multilayer network with one hidden layer is shown in Fig. 9. The activity of the output units is determined by the activity of the neurons in the hidden layer and the weights between the hidden and output layers, while the activity of the neurons in the hidden layer is determined by the activities of the neurons in the input layer and the connecting weights between the input and hidden units. In such networks, the neurons in the hidden layer are free to construct their own representations of the input.
Fig. 9: Multilayer network with one hidden layer (m input, p hidden and n output neurons)
E1) Consider the weights w11, w12, w21 and w22 on the connections from the input neurons to the hidden layer neurons, and let v1, v2 be the weights on the connections from the hidden layer neurons to the output neuron, with the following values:
w11 = 0.15, w12 = 0.3, w21 = 0.15, w22 = 0.3, v1 = −0.3 and v2 = 0.3. Also assume that the input (0, 0) generates (0, 0), (1, 1) generates (1, 1), and (1, 0) or (0, 1) generates (0, 1) as the hidden layer output.
i) Write the output set of the neurons at the hidden layer.
ii) Check whether the vectors in this set are linearly separable or not. Give a reason.
iii) Assume that the inputs (0, 0) and (1, 1) give 0, and (0, 1) generates 1 as output at the output layer; then draw the digraph of this network.
iv) Obtain the activations of the layers in the network.
E3) Consider the following table for the connections between the input neurons and
the hidden layer neurons.
6.3 BACKPROPAGATION LEARNING

Let us assume that the activation function relating the input of the input layer to its output is linear, so that

{O}_I = {I}_I   (n × 1).

The input to a hidden neuron is the weighted sum of the outputs of the input neurons. Thus, the input I_Hp to the pth hidden neuron is

I_Hp = Σ_{i=1}^{n} V_ip O_Ii,   p = 1, 2, …, (number of hidden neurons).

Denoting the weight matrix, or connectivity matrix, between the input neurons and the hidden neurons by [V], we can write the input to the hidden layer as
{I}_H = [V]^T {O}_I   in matrix notation, where {I}_H is p × 1, [V]^T is p × n and {O}_I is n × 1.
Let us again take the activation function for the output of the pth hidden neuron to be the sigmoidal (squashed-S) function. Then the output O_Hp is given by

O_Hp = 1 / (1 + e^(−λ(I_Hp − θ_Hp))),

where θ_Hp is the threshold of the pth hidden neuron. The computation of the hidden layer is shown in Fig. 12.
Fig. 12: Computation of the hidden layer from the input layer through the weights V, with a bias input O_I0 = −1 and thresholds θ_H
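In code, the hidden-layer computation with per-neuron thresholds can be sketched as follows; the array names and the fixed sizes are assumptions made for this illustration, not part of the unit's notation.

#include <math.h>

#define N_IN  3   /* input neurons (assumed)  */
#define P_HID 2   /* hidden neurons (assumed) */

/* O_H[p] = 1 / (1 + exp(-lambda * (I_Hp - theta_H[p]))),
   where I_Hp = sum_i V[i][p] * O_I[i].                     */
void hidden_layer(const double O_I[N_IN], const double V[N_IN][P_HID],
                  const double theta_H[P_HID], double lambda,
                  double O_H[P_HID])
{
    for (int p = 0; p < P_HID; p++) {
        double I_Hp = 0.0;
        for (int i = 0; i < N_IN; i++)
            I_Hp += V[i][p] * O_I[i];           /* {I}_H = [V]^T {O}_I */
        O_H[p] = 1.0 / (1.0 + exp(-lambda * (I_Hp - theta_H[p])));
    }
}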
Fig. 13: Computation of the output layer from the hidden layer through the weights W, with a bias input O_H0 = −1 and thresholds θ_O
The Euclidean norm of the error E_1 for the first training pattern is given by

E_1 = Σ_{i=1}^{n} E_1i = (1/2) Σ_{i=1}^{n} (O_Ti − O_Ci)²,

where E_1i is the error in the ith output neuron for the first training pattern, O_Ti is the target output and O_Ci is the calculated output.
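A small helper for this error measure might look as follows (the function name is our own choice):

/* E = (1/2) * sum over the n output neurons of (O_T[i] - O_C[i])^2. */
double pattern_error(const double O_T[], const double O_C[], int n)
{
    double E = 0.0;
    for (int i = 0; i < n; i++) {
        double diff = O_T[i] - O_C[i];
        E += 0.5 * diff * diff;
    }
    return E;
}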
Using this error, let us now write the modified values of the weight matrices using the steepest descent method:

[V]^(t+1) = [V]^t + [ΔV]^(t+1)
[W]^(t+1) = [W]^t + [ΔW]^(t+1)

where

[ΔW]^(t+1) = α [ΔW]^t + η [y]   (p × m),
[ΔV]^(t+1) = α [ΔV]^t + η [x]   (n × p),

and α is the momentum coefficient and η is the learning rate.
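In code, this update is an element-wise operation on the weight and weight-change matrices. The sketch below assumes the matrices are stored as flat arrays of length size; alpha and eta correspond to α and η above, and grad_term stands for the [y] or [x] matrix of the text.

/* delta = alpha * delta + eta * grad_term;  w = w + delta. */
void update_weights(double w[], double delta[], const double grad_term[],
                    int size, double alpha, double eta)
{
    for (int k = 0; k < size; k++) {
        delta[k] = alpha * delta[k] + eta * grad_term[k];
        w[k] += delta[k];
    }
}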
Step 2: For each training pair, assume that there are 'n' inputs given by [I]_I (n × 1) and 'm' outputs given by [O]_O (m × 1), in normalized form.

Step 3: Set the number of neurons in the hidden layer to lie between 1 < p < 21.
Step 9: Find
[V]^(t+1) = [V]^t + [ΔV]^(t+1)
[W]^(t+1) = [W]^t + [ΔW]^(t+1)

Step 10: Repeat Steps 5 to 9 until the error converges to within the tolerance value.
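As a rough starting point for the practical assignment at the end of this unit, the sketch below performs one forward and one backward pass for a small 2-2-1 network with unipolar sigmoidal units, assuming λ = 1, no thresholds, no momentum term and the standard generalized delta rule; it is a minimal sketch under those assumptions, not the only way to implement the steps above.

#include <math.h>

#define NI 2   /* input neurons (assumed)  */
#define NH 2   /* hidden neurons (assumed) */

static double sigmoid(double x) { return 1.0 / (1.0 + exp(-x)); }

/* One training iteration for a 2-2-1 network; returns the squared error.
   V[i][p]: weight from input i to hidden neuron p; W[p]: hidden p to output. */
double train_step(const double I[NI], double target,
                  double V[NI][NH], double W[NH], double eta)
{
    double O_H[NH], I_O = 0.0;

    /* Forward pass: the input layer is linear, so O_I = I. */
    for (int p = 0; p < NH; p++) {
        double I_Hp = 0.0;
        for (int i = 0; i < NI; i++)
            I_Hp += V[i][p] * I[i];
        O_H[p] = sigmoid(I_Hp);
        I_O += W[p] * O_H[p];
    }
    double O = sigmoid(I_O);

    /* Backward pass: delta at the output, then deltas at the hidden layer. */
    double d = (target - O) * O * (1.0 - O);
    for (int p = 0; p < NH; p++) {
        double d_hid = d * W[p] * O_H[p] * (1.0 - O_H[p]);
        W[p] += eta * d * O_H[p];            /* hidden-to-output update */
        for (int i = 0; i < NI; i++)
            V[i][p] += eta * d_hid * I[i];   /* input-to-hidden update  */
    }
    return (target - O) * (target - O);
}

Calling train_step repeatedly over all training pairs, and stopping when the error falls below a tolerance, gives the repetition described in Step 10.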
Let us consider the following example to understand the algorithm.
Example 1: Consider the three training sets given in Table 2.

Table 2
I1      I2      Output O
0.3     –0.2    0.2
0.4     0.6     0.3
0.6     –0.2    0.1
Solution: Let us find the improved weights. The architecture of this model is given in Fig. 14.

Fig. 14: Network for Example 1, with initial input-to-hidden weights V11 = 0.2, V21 = 0.1, V12 = 0.4, V22 = –0.2 and hidden-to-output weights W11 = –0.5, W21 = 0.2, for the first training pair (0.3, –0.2)
For the first training pair, the input to the hidden layer is

[I]_H = [V]^T [O]_I = [0.2  0.1; 0.4  –0.2][0.3; –0.2] = [0.04; 0.16].

The calculated output of the hidden layer (taking λ = 1) is

[O_C]_H = [1/(1 + e^(–0.04)); 1/(1 + e^(–0.16))] = [0.51; 0.54].

The input to the output layer is

[I]_O = [W]^T [O_C]_H = [–0.5  0.2][0.51; 0.54] = –0.15,

and the calculated output is

[O_C]_O = 1/(1 + e^(0.15)) = 0.462.
The error is

error = (O_TO – O_CO)² = (0.2 – 0.462)² = 0.069.

Next,

d = (O_TO – O_CO)(O_CO)(1 – O_CO) = (0.2 – 0.462)(0.462)(1 – 0.462) = –0.065,

so that

[y] = [O_C]_H [d] = [0.51; 0.54](–0.065) = [–0.033; –0.035].
Here assume that α = 1 and η = 0.5. Then

[Δw]^1 = α [Δw]^0 + η [y] = [–0.517; 0.182].
The error propagated back to the hidden layer is

[e] = [w][d] = [0.032; –0.013],

and

[d*] = [–0.001; 0.0004],

so that

[X] = [O]_I [d*]^T = [0.3; –0.2][–0.001  0.0004] = [–0.0003  0.0001; 0.0002  –0.00009].

The updated weights are then

[V]^1 = [0.1998  0.40007; 0.1001  –0.20004]

and [W]^1 = [–1.01; 0.38].
Using these modified [V]^1 and [W]^1, the error is calculated again, and the next training set can then be processed.
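To check the arithmetic of Example 1, the short program below reproduces the forward pass for the first training pair with the initial weights (λ = 1 assumed); it prints values close to those quoted above (0.04 and 0.16, 0.51 and 0.54, about –0.15, 0.46 and 0.069).

#include <stdio.h>
#include <math.h>

static double sigmoid(double x) { return 1.0 / (1.0 + exp(-x)); }

int main(void)
{
    /* Initial weights and first training pair of Example 1. */
    double V[2][2] = { {0.2, 0.4}, {0.1, -0.2} };  /* V[i][p]: input i -> hidden p */
    double W[2]    = { -0.5, 0.2 };                /* hidden -> output             */
    double I[2]    = { 0.3, -0.2 };
    double target  = 0.2;

    double I_H[2], O_H[2], I_O = 0.0;
    for (int p = 0; p < 2; p++) {
        I_H[p] = V[0][p] * I[0] + V[1][p] * I[1];
        O_H[p] = sigmoid(I_H[p]);
        I_O   += W[p] * O_H[p];
    }
    double O_C = sigmoid(I_O);

    printf("I_H = (%.3f, %.3f)\n", I_H[0], I_H[1]);
    printf("O_H = (%.3f, %.3f)\n", O_H[0], O_H[1]);
    printf("I_O = %.3f, O_C = %.3f\n", I_O, O_C);
    printf("error = %.3f\n", (target - O_C) * (target - O_C));
    return 0;
}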
E4) Find the modified weights for the training set having input I1 = 0.3, I2 = –0.5 and output O = 0.1, with

[V] = [0.1  0.4; –0.2  0.2] and [W] = [0.2; –0.5].
E5) How many layers are there in a multilayer neural network?
E6) What is the cycle used to modify the weight values?
6.4 SUMMARY
In this unit, we have covered the following points.
i) Neural network architectures are broadly classified into single layer feedforward networks and multilayer feedforward networks. If only the input and output layers are present, the network is a single layer network. If, in addition to the input and output layers, one or more intermediate layers exist, the network is a multilayer network.
ii) Backpropagation is a systematic method of training multilayer neural networks. It is built on a sound mathematical foundation and has very good application potential.
6.5 SOLUTIONS/ANSWERS
E1) i) Output set = {(0, 0), (1, 1), (0, 1)}
ii) The three vectors given in (i) are linearly separable, with (0, 0) and (1, 1) on one side of the separating line and (0, 1) on the other side.
iii) The digraph of the network has weights w11 = 0.15, w12 = 0.3, w21 = 0.15 and w22 = 0.3 on the input-to-hidden connections, v1 = –0.3 and v2 = 0.3 on the hidden-to-output connections, and thresholds of 0.2 at the hidden and output neurons.
iv)
Input    Activation (hidden layer)   Output (hidden layer)   Output neuron activation   Output of the network
(0, 0)   (0, 0)                      (0, 0)                  0                          0
(1, 1)   (0.3, 0.6)                  (1, 1)                  0                          0
(0, 1)   (0.15, 0.3)                 (0, 1)                  0.3                        1
(1, 0)   (0.15, 0.3)                 (0, 1)                  0.3                        1
E2) The points A(1, 1), B(–1, 1), C(–1, –1) and D(1, –1) plotted in the input plane for the output O.
E3) i) The network has three hidden neurons with input weights (1, 1, 0.2), (0.1, –1, 0.3) and (–1, –1, 0.6) and thresholds 1.8, 0.05 and –0.2 respectively; their contributions to the output neuron are 0.6, 0.3 and 0.6, and the output neuron has a threshold of 0.5. These values are used in the table below.
Vertex/        Hidden    Weighted sum          Comment    Activation   Contribution   Sum
Coordinates    neuron                                                  to output
O: 0, 0, 0       1       0+0+0 = 0             < 1.8         0            0
                 2       0+0+0 = 0             < 0.05        0            0
                 3       0+0+0 = 0             > –0.2        1            0.6         0.6*
A: 0, 0, 1       1       0+0+0.2 = 0.2         < 1.8         0            0
                 2       0+0+0.3 = 0.3         > 0.05        1            0.3
                 3       0+0+0.6 = 0.6         > –0.2        1            0.6         0.9*
B: 0, 1, 0       1       0+1+0 = 1             < 1.8         0            0
                 2       0–1+0 = –1            < 0.05        0            0
                 3       0–1+0 = –1            < –0.2        0            0           0
C: 0, 1, 1       1       0+1+0.2 = 1.2         < 1.8         0            0
                 2       0+0.1+0.2 = 0.2       > 0.05        1            0.3
                 3       0–1+0.6 = –0.4        < –0.2        0            0           0.3
D: 1, 0, 0       1       1+0+0 = 1             < 1.8         0            0
                 2       0.1+0+0 = 0.1         > 0.05        1            0.3
                 3       –1+0+0 = –1           < –0.2        0            0           0.3
E: 1, 0, 1       1       1–0+0.2 = 1.2         < 1.8         0            0
                 2       0.1+0+0.2 = 0.4       > 0.05        1            0.3
                 3       –1+0+0.6 = –0.4       < –0.2        0            0           0.3
F: 1, 1, 0       1       1+1+0 = 2             > 1.8         1            0.6
                 2       0.1–1+0 = –0.9        < 0.05        0            0
                 3       –1–1+0 = –2           < –0.2        0            0           0.6*
G: 1, 1, 1       1       1+1+0.2 = 2.2         > 1.8         1            0.6
                 2       0.1–1+0.3 = –0.8      < 0.05        0            0
                 3       –1–1+0.6 = –1.4       < –0.2        0            0           0.6*

* The output neuron fires, as this value is greater than 0.5 (the threshold value); the function value is +1.
6.6 PRACTICAL ASSIGNMENT

Write a program in 'C' language to implement the backpropagation algorithm. Show the step-by-step outputs of the input, hidden and output neurons, as well as the errors. How are the weights W and V modified? Use the data given in Example 1.
6.7 REFERENCES