Backpropagation Algorithm
The backpropagation algorithm consists of two phases:
1. The forward pass, where our inputs are passed through the network and output
predictions are obtained (also known as the propagation phase).
2. The backward pass, where we compute the gradient of the loss function at the
final layer (i.e., the predictions layer) of the network and then use this gradient to
recursively apply the chain rule and update the weights in our network (also known
as the weight update phase).
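As a toy illustration of the two phases (with made-up numbers and only a single sigmoid neuron, rather than the multi-layer network built later in these slides), one training step looks like this:

import numpy as np

# Toy single-neuron example of the two phases (assumed example values).
x = np.array([0.0, 1.0, 1.0])      # input feature vector
y = 1.0                            # target output
W = np.array([0.5, -0.3, 0.8])     # current weights
alpha = 0.1                        # learning rate

# Phase 1: forward pass -- propagate the input and obtain a prediction
net = W.dot(x)
pred = 1.0 / (1.0 + np.exp(-net))  # sigmoid activation

# Phase 2: backward pass -- gradient of the squared loss 0.5*(pred - y)^2
# with respect to W, obtained via the chain rule
grad_W = (pred - y) * pred * (1 - pred) * x

# Weight update phase: take a small step against the gradient
W = W - alpha * grad_W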
Search for the related information of the following concepts:
1. Forward pass
2. Backward pass
3. Gradient
4. Loss function
5. Chain rule
The Forward Pass
Gradient
Think of the loss landscape as the surface of a bowl: it is a plot of the loss
function. The difference between the loss landscape and a cereal bowl is
that the bowl only exists in three dimensions, while the loss landscape can
exist in many dimensions, perhaps tens, hundreds, or thousands of
dimensions.
Each position along the surface of the bowl corresponds to a particular loss
value given a set of parameters W (weight matrix) and b (bias vector). Our
goal is to try different values of W and b, evaluate their loss, and then take a
step towards more optimal values that (ideally) have lower loss.
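That step towards more optimal values is the familiar gradient descent update. A minimal sketch, assuming the gradients dW and db of the loss with respect to W and b have already been computed:

def gradient_descent_step(W, b, dW, db, alpha=0.01):
    # move the parameters a small distance downhill on the loss landscape;
    # alpha (the learning rate) controls the size of the step
    W = W - alpha * dW
    b = b - alpha * db
    return W, b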
Loss function
Backpropagation Algorithm
We present the feature vector (0, 1, 1) (and target output value 1) to the
network. Here we can see that 0, 1, and 1 have been assigned to the three
input nodes in the network.
Applying the step function with net = 0.506, we see that our network
predicts 1, which is, in fact, the correct class label. However, our
network is not very confident in this class label: the predicted value
0.506 is very close to the step function's threshold. Ideally, this prediction
should be closer to 0.98-0.99, implying that our network has truly
learned the underlying pattern in the dataset. In order for our
network to actually “learn”, we need to apply the backward pass.
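As a quick illustration of that final thresholding (assuming the usual threshold of 0.5, which is consistent with 0.506 being described as very close to it):

def step(x):
    # step function: predict class 1 if the activation exceeds 0.5, else 0
    return 1 if x > 0.5 else 0

net = 0.506
print(step(net))   # 1 -- the correct label, but only barely above the threshold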
Line 5 then defines the constructor of our NeuralNetwork class. The constructor requires
a single argument, followed by a second optional one:
• layers: A list of integers representing the actual architecture of the feedforward
network. For example, a value of [2, 2, 1] implies that our input layer has two
nodes, our hidden layer has two nodes, and our final output layer has one node.
• alpha: Here we can specify the learning rate of our neural network. This value is
applied during the weight update phase.
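The code itself appears only as an image in the slides; a minimal sketch of such a constructor, consistent with the description above, might look like the following (the weight matrices appended to self.W are filled in on the next slides):

class NeuralNetwork:
    def __init__(self, layers, alpha=0.1):
        # store the list of weight matrices, the network architecture,
        # and the learning rate used during the weight update phase
        self.W = []
        self.layers = layers
        self.alpha = alpha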
On Line 14 we start looping over the number of layers in the network (i.e.,
len(layers)), but we stop before the final two layers.
Each layer in the network is randomly initialized by constructing an MxN
weight matrix with values sampled from a standard normal distribution (Line
18). The matrix is MxN since we wish to connect every node in the current layer
to every node in the next layer.
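Expressed as a standalone helper (an assumption mirroring the loop just described; the +1 entries account for the bias trick discussed on the next slide):

import numpy as np

def init_hidden_weights(layers):
    W = []
    # loop over the layers, stopping before the final two
    for i in range(0, len(layers) - 2):
        # MxN matrix connecting every node in the current layer (plus bias)
        # to every node in the next layer (plus bias), drawn from a
        # standard normal distribution
        w = np.random.randn(layers[i] + 1, layers[i + 1] + 1)
        # scale the weights; dividing by the square root of the node count
        # is one common normalization choice
        W.append(w / np.sqrt(layers[i]))
    return W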
The final code block of the constructor handles the special case where the
input connections need a bias term, but the output does not:
Again, these weight values are randomly sampled and then normalized.
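A sketch of that special case (again an assumption based on the description: the input side of the last weight matrix gets the extra bias entry, the output side does not):

import numpy as np

def init_output_weights(layers):
    # connections into the final layer: bias term on the input side only
    w = np.random.randn(layers[-2] + 1, layers[-1])
    return w / np.sqrt(layers[-2])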
Given a layers value of (2, 2, 1), the output of calling this function will be:
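The output itself is shown only as an image; a __repr__ method along the following lines, added to the class sketched earlier, would print something like NeuralNetwork: 2-2-1 for that architecture:

    def __repr__(self):
        # build a string such as "NeuralNetwork: 2-2-1" describing the
        # architecture stored in self.layers
        return "NeuralNetwork: {}".format(
            "-".join(str(l) for l in self.layers))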
We also define the derivative of the sigmoid, which we'll use during the
backward pass:
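A sketch of the two activation helpers as methods of the class (assuming numpy is imported as np; the derivative form assumes x has already been passed through the sigmoid, a common convention in this style of implementation):

    def sigmoid(self, x):
        # standard logistic activation used for each layer's output
        return 1.0 / (1.0 + np.exp(-x))

    def sigmoid_deriv(self, x):
        # derivative of the sigmoid, assuming x = sigmoid(net) already
        return x * (1 - x)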
We’ll draw inspiration from the scikit-learn library and define a function
named fit, which will be responsible for actually training our NeuralNetwork.
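A sketch of what such a fit method might look like (assuming numpy is imported as np, the bias trick of appending a column of ones to X, and a per-sample helper fit_partial defined on the slides that follow but not reproduced as text here):

    def fit(self, X, y, epochs=1000):
        # bias trick: append a column of 1's so the bias can be treated as
        # a trainable weight inside the weight matrices
        X = np.c_[X, np.ones((X.shape[0]))]

        # loop over the desired number of epochs, training the network on
        # each individual data point in turn
        for epoch in np.arange(0, epochs):
            for (x, target) in zip(X, y):
                self.fit_partial(x, target)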
The final entry in A (the list of layer activations built during the forward pass) is thus the output of the last layer in our network.
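The surrounding code appears only as images; a sketch of the forward-pass bookkeeping being described (inside a per-sample training helper, using the sigmoid method from earlier) is:

        # A collects the activation of every layer, starting with the input
        A = [np.atleast_2d(x)]

        for layer in np.arange(0, len(self.W)):
            # net input to the current layer, then its sigmoid activation
            net = A[layer].dot(self.W[layer])
            A.append(self.sigmoid(net))

        # A[-1] is the output (prediction) of the last layer of the network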
We also perform min/max normalization, scaling the pixel intensities of each digit image to the range [0, 1] (Line 14).
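A sketch of that loading and scaling step, assuming the scikit-learn digits dataset (which matches the 64-input, ten-class architecture used below):

from sklearn import datasets

# load the 8x8 digit images (64 features per sample) and scale the pixel
# intensities to the range [0, 1] via min/max normalization
digits = datasets.load_digits()
data = digits.data.astype("float")
data = (data - data.min()) / (data.max() - data.min())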
Next, let’s construct a training and testing split, using 75% of the data for training
and 25% for evaluation.
We’ll also encode our class label integers as vectors, a process called one-hot encoding that
we will discuss in detail later in this chapter.
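A sketch of the split and the one-hot encoding, using scikit-learn's train_test_split and LabelBinarizer (reasonable choices for the steps described, continuing from the snippet above):

from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelBinarizer

# 75% of the data for training, 25% held out for evaluation
(trainX, testX, trainY, testY) = train_test_split(
    data, digits.target, test_size=0.25)

# one-hot encode the integer class labels 0-9 as 10-dimensional vectors
trainY = LabelBinarizer().fit_transform(trainY)
testY = LabelBinarizer().fit_transform(testY)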
Here we can see that we are training a NeuralNetwork with a 64-32-16-10 architecture.
The output layer has ten nodes because there are ten possible output classes
for the digits 0-9. We then allow our network to train for 1,000 epochs.
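A sketch of that training call, assuming the NeuralNetwork class sketched earlier and the variables from the preceding snippets:

# define the 64-32-16-10 architecture and train for 1,000 epochs
nn = NeuralNetwork([trainX.shape[1], 32, 16, 10])
print("[INFO] {}".format(nn))
nn.fit(trainX, trainY, epochs=1000)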
Once our network has been trained, we can evaluate it on the testing set:
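A sketch of that evaluation, assuming the network exposes a predict method (shown on the slides but not reproduced as text here) and using scikit-learn's classification_report:

from sklearn.metrics import classification_report

# the network outputs one score per class; the argmax along each row gives
# the predicted digit, which we compare against the true labels
predictions = nn.predict(testX)
predictions = predictions.argmax(axis=1)
print(classification_report(testY.argmax(axis=1), predictions))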