Classification by Backpropagation: A Multilayer Feed-Forward Neural Network, Defining a Network Topology, and Backpropagation
Each layer is made up of units. The inputs to the network correspond to the attributes
measured for each training tuple. The inputs are fed simultaneously into the units
making up the input layer. These inputs pass through the input layer and are then
weighted and fed simultaneously to a second layer of “neuronlike” units, known as a
hidden layer. The outputs of the hidden layer units can be input to another hidden
layer, and so on. The number of hidden layers is arbitrary, although in practice, usually
only one is used. The weighted outputs of the last hidden layer are input to units making
up the output layer, which emits the network’s prediction for given tuples.
The units in the input layer are called input units. The units in the hidden layers and
output layer are sometimes referred to as neurodes, due to their symbolic biological
basis, or as output units. The multilayer neural network shown in Figure 9.2 has two
layers of weighted units: one hidden layer and one output layer. Therefore, we say that it is a two-layer neural network. (The
input layer is not counted because it serves only to pass the input values to the next
layer.) Similarly, a network containing two hidden layers is called a three-layer neural
network, and so on. It is a feed-forward network since none of the weights cycles back
to an input unit or to a previous layer’s output unit. It is fully connected in that each
unit provides input to each unit in the next forward layer.
Each output unit takes, as input, a weighted sum of the outputs from units in the
previous layer (see Figure 9.4 later). It applies a nonlinear (activation) function to the
weighted input. Multilayer feed-forward neural networks are able to model the class pre-
diction as a nonlinear combination of the inputs. From a statistical point of view, they
perform nonlinear regression. Multilayer feed-forward networks, given enough hidden
units and enough training samples, can closely approximate any function.
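To make this concrete, the following minimal sketch (in Python with NumPy; predict, layers, and the layer sizes are illustrative choices, not from the text) shows a fully connected feed-forward pass in which each unit applies a nonlinear activation to a weighted sum of the previous layer's outputs, so the prediction is a nonlinear combination of the inputs.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def predict(x, layers):
    # Fully connected feed-forward pass: each layer is a (weights, biases)
    # pair, and each unit applies a nonlinear activation to a weighted sum
    # of the previous layer's outputs.
    out = np.asarray(x, dtype=float)
    for W, b in layers:
        out = sigmoid(out @ W + b)
    return out

# A two-layer network (one hidden layer plus one output layer);
# the input layer is not counted because it only passes values through.
rng = np.random.default_rng(0)
layers = [(rng.uniform(-0.5, 0.5, (3, 2)), rng.uniform(-0.5, 0.5, 2)),
          (rng.uniform(-0.5, 0.5, (2, 1)), rng.uniform(-0.5, 0.5, 1))]
print(predict([1.0, 0.0, 1.0], layers))
```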
9.2.3 Backpropagation
“How does backpropagation work?” Backpropagation learns by iteratively processing a
data set of training tuples, comparing the network’s prediction for each tuple with the
actual known target value. The target value may be the known class label of the training
tuple (for classification problems) or a continuous value (for numeric prediction). For
each training tuple, the weights are modified so as to minimize the mean-squared error
between the network’s prediction and the actual target value. These modifications are
made in the “backwards” direction (i.e., from the output layer) through each hidden
layer down to the first hidden layer (hence the name backpropagation). Although it is
not guaranteed, in general the weights will eventually converge, and the learning process
stops. The algorithm is summarized in Figure 9.3. The steps involved are expressed in
terms of inputs, outputs, and errors, and may seem awkward if this is your first look at
neural network learning. However, once you become familiar with the process, you will
see that each step is inherently simple. The steps are described next.
Initialize the weights: The weights in the network are initialized to small random num-
bers (e.g., ranging from −1.0 to 1.0, or −0.5 to 0.5). Each unit has a bias associated with
it, as explained later. The biases are similarly initialized to small random numbers.
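A sketch of this initialization step, under the same illustrative NumPy setting as above (init_layer and the layer sizes are hypothetical names and choices):

```python
import numpy as np

rng = np.random.default_rng(42)

def init_layer(n_in, n_out, low=-0.5, high=0.5):
    # Weights and biases start as small random numbers, e.g. in [-0.5, 0.5].
    W = rng.uniform(low, high, size=(n_in, n_out))   # one weight per connection
    theta = rng.uniform(low, high, size=n_out)       # one bias per unit
    return W, theta

# For example: 3 input units, 2 hidden units, 1 output unit.
layers = [init_layer(3, 2), init_layer(2, 1)]
```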
Each training tuple, X, is processed by the following steps.
Propagate the inputs forward: First, the training tuple is fed to the network’s input
layer. The inputs pass through the input units, unchanged. That is, for an input unit, j,
its output, Oj, is equal to its input value, Ij.

Figure 9.4 Hidden or output layer unit j: the inputs to unit j are the outputs of the previous layer. These are multiplied by their corresponding weights (w1j, ..., wnj) to form a weighted sum Σ, which is added to the bias θj associated with unit j; a nonlinear activation function f is then applied to the net input to produce the unit's output. (For ease of explanation, the inputs to unit j are labeled y1, y2, ..., yn. If unit j were in the first hidden layer, these inputs would correspond to the input tuple (x1, x2, ..., xn).)

Next, the net input and output of each unit
in the hidden and output layers are computed. The net input to a unit in the hidden or
output layers is computed as a linear combination of its inputs. To help illustrate this
point, a hidden layer or output layer unit is shown in Figure 9.4. Each such unit has
a number of inputs to it that are, in fact, the outputs of the units connected to it in
the previous layer. Each connection has a weight. To compute the net input to the unit,
each input connected to the unit is multiplied by its corresponding weight, and this is
summed. Given a unit j in a hidden or output layer, the net input, Ij, to unit j is

    I_j = \sum_i w_{ij} O_i + \theta_j,    (9.4)
where wij is the weight of the connection from unit i in the previous layer to unit j; Oi is
the output of unit i from the previous layer; and θj is the bias of the unit. The bias acts
as a threshold in that it serves to vary the activity of the unit.
Each unit in the hidden and output layers takes its net input and then applies an acti-
vation function to it, as illustrated in Figure 9.4. The function symbolizes the activation
of the neuron represented by the unit. The logistic, or sigmoid, function is used. Given
the net input Ij to unit j, the output Oj of unit j is computed as

    O_j = \frac{1}{1 + e^{-I_j}}.    (9.5)
This function is also referred to as a squashing function, because it maps a large input
domain onto the smaller range of 0 to 1. The logistic function is nonlinear and
differentiable, allowing the backpropagation algorithm to model classification problems
that are linearly inseparable.
We compute the output values, Oj , for each hidden layer, up to and including the
output layer, which gives the network’s prediction. In practice, it is a good idea to
cache (i.e., save) the intermediate output values at each unit as they are required again
later when backpropagating the error. This trick can substantially reduce the amount of
computation required.
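As a sketch of this forward step (again illustrative Python/NumPy, with a forward helper that caches the per-layer outputs for reuse during backpropagation):

```python
import numpy as np

def forward(x, layers):
    # Propagate the inputs forward, caching every layer's outputs O_j,
    # since they are needed again when backpropagating the error.
    outputs = [np.asarray(x, dtype=float)]   # input units: O_j equals I_j
    for W, theta in layers:
        I = outputs[-1] @ W + theta                # net input, Eq. (9.4)
        outputs.append(1.0 / (1.0 + np.exp(-I)))   # logistic output, Eq. (9.5)
    return outputs                                 # outputs[-1] is the prediction
```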
Backpropagate the error: The error is propagated backward by updating the weights
and biases to reflect the error of the network’s prediction. For a unit j in the output
layer, the error Errj is computed by

    Err_j = O_j (1 - O_j)(T_j - O_j),    (9.6)

where Oj is the actual output of unit j, and Tj is the known target value of the given
training tuple. Note that Oj(1 − Oj) is the derivative of the logistic function.
To compute the error of a hidden layer unit j, the weighted sum of the errors of the
units connected to unit j in the next layer is considered. The error of a hidden layer
unit j is

    Err_j = O_j (1 - O_j) \sum_k Err_k w_{jk},    (9.7)
where wjk is the weight of the connection from unit j to a unit k in the next higher layer,
and Errk is the error of unit k.
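Eqs. (9.6) and (9.7) translate directly into code. A sketch that pairs with the forward helper above (backprop_errors is an illustrative name, not from the text):

```python
import numpy as np

def backprop_errors(outputs, target, layers):
    # outputs: per-layer activations cached by forward(); target: the T_j values.
    errs = [None] * len(layers)
    O = outputs[-1]
    errs[-1] = O * (1 - O) * (target - O)                 # output layer, Eq. (9.6)
    for l in range(len(layers) - 2, -1, -1):
        W_next, _ = layers[l + 1]
        O = outputs[l + 1]
        errs[l] = O * (1 - O) * (errs[l + 1] @ W_next.T)  # hidden layer, Eq. (9.7)
    return errs
```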
The weights and biases are updated to reflect the propagated errors. Weights are
updated by the following equations, where Δwij is the change in weight wij:

    \Delta w_{ij} = (l)\, Err_j\, O_i,    (9.8)
    w_{ij} = w_{ij} + \Delta w_{ij}.    (9.9)
“What is l in Eq. (9.8)?” The variable l is the learning rate, a constant typically having
a value between 0.0 and 1.0. Backpropagation learns using a gradient descent method
to search for a set of weights that fits the training data so as to minimize the mean-
squared distance between the network’s class prediction and the known target value of
the tuples.¹ The learning rate helps avoid getting stuck at a local minimum in decision
space (i.e., where the weights appear to converge, but are not the optimum solution) and
encourages finding the global minimum. If the learning rate is too small, then learning
will occur at a very slow pace. If the learning rate is too large, then oscillation between
inadequate solutions may occur. A rule of thumb is to set the learning rate to 1/t, where
t is the number of iterations through the training set so far.

¹A method of gradient descent was also used for training Bayesian belief networks, as described in Section 9.1.2.
Biases are updated by the following equations, where Δθj is the change in bias θj:

    \Delta \theta_j = (l)\, Err_j,    (9.10)
    \theta_j = \theta_j + \Delta \theta_j.    (9.11)
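Eqs. (9.8) through (9.11) then amount to one in-place update step per layer. A sketch under the same assumptions as the earlier helpers:

```python
import numpy as np

def update(layers, outputs, errs, lr):
    # Apply Delta w_ij = (l) Err_j O_i and Delta theta_j = (l) Err_j, then
    # add the increments to the weights and biases, Eqs. (9.8) to (9.11).
    for l, (W, theta) in enumerate(layers):
        W += lr * np.outer(outputs[l], errs[l])   # Eqs. (9.8)-(9.9)
        theta += lr * errs[l]                     # Eqs. (9.10)-(9.11)
```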
Note that here we are updating the weights and biases after the presentation of each
tuple. This is referred to as case updating. Alternatively, the weight and bias incre-
ments could be accumulated in variables, so that the weights and biases are updated
after all the tuples in the training set have been presented. This latter strategy is called
epoch updating, where one iteration through the training set is an epoch. In the-
ory, the mathematical derivation of backpropagation employs epoch updating, yet
in practice, case updating is more common because it tends to yield more accurate
results.
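The two strategies differ only in where the increments are applied. A sketch reusing the forward, backprop_errors, and update helpers from the earlier sketches:

```python
import numpy as np

def train_epoch(data, layers, lr, epoch_update=False):
    # Case updating applies the increments after each tuple; epoch
    # updating accumulates them and applies them once per epoch.
    if epoch_update:
        acc = [[np.zeros_like(W), np.zeros_like(t)] for W, t in layers]
    for x, target in data:
        outputs = forward(x, layers)
        errs = backprop_errors(outputs, np.asarray(target, dtype=float), layers)
        if epoch_update:
            for l in range(len(layers)):
                acc[l][0] += lr * np.outer(outputs[l], errs[l])
                acc[l][1] += lr * errs[l]
        else:
            update(layers, outputs, errs, lr)   # case updating
    if epoch_update:
        for (W, theta), (dW, dtheta) in zip(layers, acc):
            W += dW
            theta += dtheta
```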
Terminating condition: Training stops when

- All Δwij in the previous epoch are so small as to be below some specified threshold, or
- The percentage of tuples misclassified in the previous epoch is below some threshold, or
- A prespecified number of epochs has expired.
In practice, several hundreds of thousands of epochs may be required before the weights
will converge.
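A helper along these lines can check the three terminating conditions (the threshold values shown are illustrative placeholders, not recommendations):

```python
import numpy as np

def should_stop(weight_deltas, error_rate, epoch,
                delta_eps=1e-4, error_eps=0.05, max_epochs=500_000):
    # Stop when every weight change in the epoch was tiny, OR the
    # misclassification rate fell below a threshold, OR the epoch
    # budget is exhausted. Any single condition suffices.
    all_small = all(np.all(np.abs(d) < delta_eps) for d in weight_deltas)
    return all_small or error_rate < error_eps or epoch >= max_epochs
```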
“How efficient is backpropagation?” The computational efficiency depends on the
time spent training the network. Given |D| tuples and w weights, each epoch requires
O(|D| × w) time. However, in the worst-case scenario, the number of epochs can be
exponential in n, the number of inputs. In practice, the time required for the networks
to converge is highly variable. A number of techniques exist that help speed up the train-
ing time. For example, a technique known as simulated annealing can be used, which
also ensures convergence to a global optimum.
Example 9.1 Sample calculations for learning by the backpropagation algorithm. Figure 9.5 shows
a multilayer feed-forward neural network. Let the learning rate be 0.9. The initial weight
and bias values of the network are given in Table 9.1, along with the first training tuple,
X = (1, 0, 1), with a class label of 1.
This example shows the calculations for backpropagation, given the first training
tuple, X. The tuple is fed into the network, and the net input and output of each unit
are computed. These values are shown in Table 9.2. The error of each unit is computed
and propagated backward. The error values are shown in Table 9.3. The weight and bias
updates are shown in Table 9.4.
Figure 9.5 Example of a multilayer feed-forward neural network: input units 1, 2, and 3 (fed by x1, x2, x3), hidden units 4 and 5, and output unit 6, connected by weights w14, w15, w24, w25, w34, w35, w46, and w56.
Table 9.1 Initial input, weight, and bias values.

    x1  x2  x3  w14  w15   w24  w25  w34   w35  w46   w56   θ4    θ5   θ6
    1   0   1   0.2  −0.3  0.4  0.1  −0.5  0.2  −0.3  −0.2  −0.4  0.2  0.1
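The example's numbers can be verified directly from Table 9.1. The following self-contained sketch reproduces the forward pass and the backpropagated errors for this tuple; running it yields O4 ≈ 0.332, O5 ≈ 0.525, O6 ≈ 0.474, Err6 ≈ 0.1311, Err5 ≈ −0.0065, and Err4 ≈ −0.0087.

```python
import numpy as np

sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))

# Values from Table 9.1.
x = np.array([1.0, 0.0, 1.0])                 # training tuple X
W1 = np.array([[0.2, -0.3],                   # w14, w15
               [0.4,  0.1],                   # w24, w25
               [-0.5, 0.2]])                  # w34, w35
theta_h = np.array([-0.4, 0.2])               # theta_4, theta_5
W2 = np.array([[-0.3], [-0.2]])               # w46, w56
theta_o = np.array([0.1])                     # theta_6
T = 1.0                                       # known class label

# Forward pass, Eqs. (9.4) and (9.5).
O_h = sigmoid(x @ W1 + theta_h)               # O4 ~ 0.332, O5 ~ 0.525
O_o = sigmoid(O_h @ W2 + theta_o)             # O6 ~ 0.474

# Backpropagated errors, Eqs. (9.6) and (9.7).
err_o = O_o * (1 - O_o) * (T - O_o)           # Err6 ~ 0.1311
err_h = O_h * (1 - O_h) * (err_o @ W2.T)      # Err4 ~ -0.0087, Err5 ~ -0.0065

print(O_h, O_o, err_o, err_h)
```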