27 Multilayer Perceptron (MLP)
H. Taud and J.F. Mas
Abstract Artificial neural networks have been found to be outstanding tools able
to generate generalizable models in many disciplines. In this technical note, we
present the multi-layer perceptron (MLP), the most common neural network.
Artificial Neural Networks (ANNs) are structures inspired by the functioning of the
brain. These networks can perform model function estimation and handle
linear and nonlinear functions by learning from data relationships and generalizing to
unseen situations. One of the most popular ANNs is the
multi-layer perceptron (MLP). It is a powerful modeling tool that applies a
supervised training procedure using examples of data with known outputs (Bishop
1995). This procedure generates a nonlinear function model that enables the prediction
of output data from given input data.
H. Taud (✉)
Centro de Innovación y Desarrollo Tecnológico en Cómputo,
Instituto Politécnico Nacional, Mexico City, Mexico
e-mail: [email protected]
J.F. Mas
Centro de Investigaciones en Geografía Ambiental, Universidad Nacional
Autónoma de México (UNAM), Morelia, Michoacán, Mexico
e-mail: [email protected]
2 Technical Details
In order to understand the MLP, a brief introduction to the one-neuron perceptron
and the single layer perceptron is provided. The former represents the simplest neural
network and has only one output, to which all inputs are connected. Given
$i = 0, 1, \ldots, n$, where $n$ is the number of inputs, the quantities $\{w_i\}$ are the weights of the
neuron. The inputs $\{x_i\}$ correspond to features or variables, and the output $y$ to their
predicted binary class. Figure 1 describes the three steps forming the perceptron
model, and Fig. 2 shows its simplified representation. The weighting step involves
the multiplication of each input feature value by its weight, $\{x_i w_i\}$; in the second
step these products are added together, $x_0 w_0 + x_1 w_1 + \cdots + x_n w_n$. The third is the transfer
step, where an activation function $f$ (also called a transfer function) is applied to the
sum, producing an output $y$:
$$y = f(z), \qquad z = \sum_{i=0}^{n} w_i x_i \qquad (1)$$
Fig. 1 Perceptron steps: from left to right, weighting, sum and transfer steps
Fig. 2 Perceptron model, from left to right: (a) steps model; (b) simplified model
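As a concrete illustration of these three steps and of Eq. (1), the following Python sketch implements a one-neuron perceptron with a step transfer function; the input and weight values are arbitrary examples, not taken from the chapter.

```python
import numpy as np

def perceptron_forward(x, w, f):
    """One-neuron perceptron: the weighting, sum, and transfer steps of Eq. (1)."""
    z = np.dot(w, x)   # weighting and sum: z = sum_{i=0..n} w_i * x_i
    return f(z)        # transfer: y = f(z)

def step(z):
    """Step activation, a common transfer function for a binary perceptron."""
    return 1 if z >= 0 else 0

# Illustrative values only; x[0] = 1 is the bias input paired with weight w[0]
x = np.array([1.0, 0.5, -0.2])
w = np.array([-0.1, 0.8, 0.3])
y = perceptron_forward(x, w, step)   # -> 1, since z = -0.1 + 0.4 - 0.06 = 0.24 >= 0
```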
In the simplest case the activation is linear, $f(z) = z$, and the perceptron acts as a
linear classifier: the decision boundary separating the two classes is the hyperplane

$$\sum_{i=0}^{n} w_i x_i = 0 \qquad (2)$$
Equation (2) can be expressed as the dot product between the weight vector
$W$ and the input vector $X$:

$$W \cdot X = 0 \qquad (3)$$
Given the known responses of the input training data, the learning step (also known
as the training step) can be carried out. The purpose of learning is to optimize the
weights by minimizing a cost function, usually the squared error between the
known response and the estimated one. Iterative optimization techniques such as gradient
descent determine the optimum weight vector, and the algorithm converges to a
solution, reaching an operational network configuration. The model is validated
on new data in order to show how well the configuration generalizes
to new situations.
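A minimal sketch of this learning step, assuming a linear activation and stochastic gradient descent on the squared error (the LMS/delta rule discussed below); the toy data and hyperparameters are illustrative only.

```python
import numpy as np

def train_lms(X, d, lr=0.05, epochs=200):
    """Stochastic gradient descent on the squared error (the LMS/delta rule),
    assuming a linear activation f(z) = z."""
    w = np.zeros(X.shape[1])
    for _ in range(epochs):
        for x, target in zip(X, d):
            y = np.dot(w, x)              # estimated response
            w += lr * (target - y) * x    # step along the negative gradient
    return w

# Toy data: the first column is the constant bias input x_0 = 1
X = np.array([[1, 0, 0], [1, 0, 1], [1, 1, 0], [1, 1, 1]], dtype=float)
d = np.array([0.0, 0.0, 0.0, 1.0])   # AND-like known responses
w = train_lms(X, d)                  # np.dot(X, w) now gives the least-squares fit to d
```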
The parallel connection of many perceptrons produces a single layer perceptron
(SLP) architecture, which is used when there are several outputs. Figure 4a shows an
example with an input and an output layer handling a linearly separable multiclass case.
The perceptron and the single layer perceptron cannot solve nonlinearly
separable problems (Fig. 3b). In this case, a solution can be found by stacking
additional layers in succession, creating an MLP architecture
(Fig. 4b). The output of one layer becomes the input of the next, and so on. The first
and the last layers are called input and output layers respectively, while the others
are the hidden layers of the neural network.
The MLP is a layered feedforward neural network in which information
flows unidirectionally from the input layer to the output layer, passing through the
hidden layers (Bishop 1995). Each connection between neurons has its own weight.
Neurons within the same layer share the same activation function, which is generally a
sigmoid for the hidden layers. Depending on the application, the output layer can
use a sigmoid or a linear function.
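A minimal NumPy sketch of this feedforward flow, assuming a sigmoid activation for every layer, including the output; the network shape and random weights are illustrative assumptions.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def mlp_forward(x, weights):
    """Feedforward pass: each layer's output becomes the next layer's input.
    `weights` holds one matrix per layer; a constant 1 is appended to every
    layer input to play the role of the bias term."""
    a = x
    for W in weights:
        a = sigmoid(W @ np.append(a, 1.0))   # same sigmoid activation per layer
    return a

# Illustrative 2-input, 3-hidden-unit, 1-output network with random weights
rng = np.random.default_rng(0)
weights = [rng.normal(size=(3, 3)),   # hidden layer: (2 inputs + bias) -> 3 units
           rng.normal(size=(1, 4))]   # output layer: (3 hidden + bias) -> 1 unit
y = mlp_forward(np.array([0.2, -0.7]), weights)
```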
Among many alternatives, the most widely used MLP learning algorithm is
backpropagation, a generalization of the least mean squares (LMS) rule (Du and
Swamy 2014). Weights are corrected by propagating the errors from layer to
layer, starting at the output layer and working backwards, hence the name
backpropagation.
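The following sketch shows one backpropagation weight correction for a single-hidden-layer network with sigmoid units and a squared-error cost; bias terms are omitted for brevity, and the variable names are ours, not the chapter's.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def backprop_step(x, d, W1, W2, lr=0.1):
    """One weight correction for a single-hidden-layer MLP with sigmoid units
    and squared-error cost; the error is propagated from the output layer
    back to the hidden layer."""
    # Forward pass
    h = sigmoid(W1 @ x)                             # hidden activations
    y = sigmoid(W2 @ h)                             # output activations
    # Backward pass: output-layer error first, then the hidden-layer error
    delta_out = (y - d) * y * (1.0 - y)             # dE/dz at the output layer
    delta_hid = (W2.T @ delta_out) * h * (1.0 - h)  # error propagated backwards
    # Gradient-descent corrections, layer by layer
    W2 -= lr * np.outer(delta_out, h)
    W1 -= lr * np.outer(delta_hid, x)
    return W1, W2
```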
MLP model performance depends not only on the choice of variables,
the number of hidden layers and nodes, and the training data, but also on training
parameters such as the learning rate, the momentum controlling the weight change, and
the number of iterations. An MLP with a single hidden layer may capture the nonlinear
function only with lower accuracy, whereas networks with many hidden layers are more
likely to overfit the training data. The learning rate and the momentum control the
speed and effectiveness of the learning process.
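As an illustration of how the momentum term enters the weight change, one common formulation (an assumption for illustration, not a prescription from this chapter) blends the previous update into the current one:

```python
def momentum_update(W, grad, velocity, lr=0.1, momentum=0.9):
    """Gradient-descent step with momentum: part of the previous weight
    change (velocity) is carried forward, smoothing and speeding learning."""
    velocity = momentum * velocity - lr * grad   # blend old change with new gradient
    return W + velocity, velocity

# Usage with scalars for clarity; the same code works on NumPy weight arrays
W, v = 0.5, 0.0
W, v = momentum_update(W, grad=0.2, velocity=v)   # first step: v = -0.02, W = 0.48
```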
In land change modeling, analyzing the complex relationships between land
transitions and the large number of variables acting as drivers requires advanced
empirical techniques to find a nonlinear function that describes such a complex
relationship (Mas et al. 2014). Variables such as distance, slope, soil type, and land
tenure are presented at the input nodes of the network. Each output node
represents a different land transition (e.g., forest to pasture, forest to cropland,
forest to urban) for which the explanatory variable values are known, as well as
the land transition observed in the past. After the training step, the MLP is able to
predict the change potential of each transition when new input data are presented to
the network (Pijanowski et al. 2002; Mas et al. 2004).
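A hedged sketch of this workflow using scikit-learn's MLPClassifier as a stand-in for the land change modeling packages cited above; all feature values, class labels, and hyperparameters are hypothetical.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier

# Hypothetical explanatory variables per location: distance to roads,
# slope, soil type (encoded), land tenure (encoded). Values are made up.
X_train = np.array([[120.0,  3.5, 1, 0],
                    [450.0, 12.0, 2, 1],
                    [ 80.0,  1.2, 1, 1],
                    [300.0,  8.0, 2, 0]])
# Observed past transitions as class labels, e.g. 0 = forest->pasture,
# 1 = forest->cropland, 2 = forest->urban (labels are illustrative)
y_train = np.array([0, 1, 2, 1])

model = MLPClassifier(solver='sgd', hidden_layer_sizes=(8,),
                      learning_rate_init=0.01, momentum=0.9, max_iter=2000)
model.fit(X_train, y_train)

# Change potential of each transition for new input data
X_new = np.array([[200.0, 5.0, 2, 0]])
potentials = model.predict_proba(X_new)
```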
References
Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press, Oxford
Du K-L, Swamy MNS (2014) Neural networks and statistical learning. Springer, Berlin
Mas JF, Puig H, Palacio JL, Sosa AA (2004) Modelling deforestation using GIS and artificial neural networks. Environ Model Softw 19(5):461–471
Mas JF, Kolb M, Paegelow M, Camacho Olmedo MT, Houet T (2014) Inductive pattern-based land use/cover change models: a comparison of four software packages. Environ Model Softw 51:94–111
Pijanowski BC, Brown DG, Shellito BA, Manik GA (2002) Using neural nets and GIS to forecast land use changes: a land transformation model. Comput Environ Urban Syst 26(6):553–575