
Backpropagation

ANN

[Slide image: artificial neural network diagram]
Backpropagation

● Backpropagation is an algorithm that propagates the errors from the output nodes back to the
input nodes

● It calculates the errors in the predictions and then adjusts the weights and biases of the network by
moving backwards through the layers, in an effort to train the model

● Together, forward propagation and backpropagation allow a neural network to make
predictions and correct for any errors accordingly. Over time, the algorithm becomes gradually
more accurate
Backpropagation

● The algorithm is used to effectively train a neural network by means of the chain rule. In
simple terms, after each forward pass through a network, backpropagation performs a backward
pass while adjusting the model's parameters (weights and biases)

● Neural networks learn through iterative tuning of parameters (weights and biases) during the
training stage

● At the start, the weights are initialized to randomly generated values, and the biases are set to
zero. This is followed by a forward pass of the data through the network to get the model output

● Lastly, back-propagation is conducted. The model training process typically entails several
iterations of a forward pass, back-propagation, and a parameter update, as sketched below
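To make this loop concrete, here is a minimal Python sketch of one such training process on a single-weight linear model; the synthetic data, learning rate, and epoch count are illustrative assumptions rather than anything from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy data: y = 2x + 1 plus a little noise
X = rng.uniform(-1, 1, size=100)
y_true = 2 * X + 1 + 0.05 * rng.normal(size=100)

w = rng.normal()   # weight initialized to a random value
b = 0.0            # bias initialized to zero
lr = 0.1           # learning rate

for epoch in range(200):
    # 1. Forward pass: compute the model output
    y_pred = w * X + b
    # 2. Back-propagation: gradients of the mean squared error
    grad_w = np.mean(2 * (y_pred - y_true) * X)
    grad_b = np.mean(2 * (y_pred - y_true))
    # 3. Parameter update
    w -= lr * grad_w
    b -= lr * grad_b

print(w, b)  # should approach 2 and 1
```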
Perceptron Learning Rule – Single-layer perceptron

● The Perceptron learning rule is a supervised learning algorithm used in single-layer perceptrons. It is primarily used for binary classification tasks,
where the goal is to separate data points into two classes based on their input features

**Basic Equation of Perceptron Learning Rule**

● For a single-layer perceptron with 'n' input features and a bias term (threshold) 'b', the weight update for the j-th weight (w_j) is given by:

w_j(new) = w_j(old) + learning_rate * (target_output - predicted_output) * input_j

● The perceptron learning rule iteratively updates the weights for each training sample until the model converges to a set of weights that can separate
the data points into the desired classes
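As a minimal sketch of this update rule (the function name and example values are hypothetical):

```python
import numpy as np

def perceptron_update(w, x, target, predicted, lr=0.1):
    # w_j(new) = w_j(old) + learning_rate * (target - predicted) * x_j
    return w + lr * (target - predicted) * x

w = np.array([0.2, -0.4])
print(perceptron_update(w, x=np.array([1.0, 0.5]), target=1, predicted=0))
# -> [0.3, -0.35]: each weight moves in proportion to its input
```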
SLP

● Assume (x1, x2, x3, …, xn) –> set of input vectors

● and (w1, w2, w3, …, wn) –> set of weights

● y = actual output

● wo = initial weight, wnew = new weight, δw = change in weight, α = learning rate

● net input = Σ wi·xi (the weighted sum of the inputs)

● learning signal (e) = t − y (difference between desired output t and actual output y)

● δw = α·x·e

● wnew = wo + δw

● The output is calculated by applying the activation function over the net input, and can be expressed as:

● y = 1, if net input >= θ

● y = 0, if net input < θ
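Putting these equations together, the sketch below trains a single-layer perceptron on the AND function; the dataset, threshold θ = 0.5, and learning rate α = 0.1 are assumptions chosen for illustration:

```python
import numpy as np

# Toy training set: the AND function
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
t = np.array([0, 0, 0, 1])   # desired outputs

w = np.zeros(2)   # initial weights wo
theta = 0.5       # threshold θ
alpha = 0.1       # learning rate α

for epoch in range(20):
    for xi, ti in zip(X, t):
        net = np.dot(w, xi)            # net input = Σ wi·xi
        y = 1 if net >= theta else 0   # y = 1 if net input >= θ, else 0
        e = ti - y                     # learning signal e = t − y
        w = w + alpha * e * xi         # δw = α·x·e, wnew = wo + δw

print(w)  # converges to weights that implement AND, e.g. [0.3, 0.3]
```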
Backpropagation – Multi-layer perceptron

● Backpropagation is a learning algorithm used in multi-layer perceptrons (MLPs) or deep neural networks (DNNs) with one or more hidden layers. It is a more general
and powerful learning algorithm compared to the perceptron learning rule

● Basic Equation of Backpropagation: In backpropagation, the weight update for the j-th weight (w_j) connecting two neurons in the neural network is given by:

w_j(new) = w_j(old) - learning_rate * gradient

Where:

w_j(new) is the new value of the weight after the update

w_j(old) is the current value of the weight

learning_rate is a hyperparameter controlling the step size of weight updates

gradient is the partial derivative of the loss function with respect to w_j, computed during the backward pass
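A minimal sketch of this update in code (the function name and the numbers are hypothetical; in practice the gradient comes from the backward pass described below):

```python
import numpy as np

def gradient_descent_step(w, grad, learning_rate):
    # w_j(new) = w_j(old) - learning_rate * gradient
    return w - learning_rate * grad

w = np.array([0.4, -0.2, 0.1])          # current weights
grad = np.array([0.05, -0.12, 0.30])    # assumed ∂L/∂w_j values
print(gradient_descent_step(w, grad, learning_rate=0.1))
# -> [0.395, -0.188, 0.07]
```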

MLP

● Learning Rate: The learning rate is a hyperparameter in machine learning algorithms, including
neural networks, that determines the step size at which the model's weights are updated during
training. It controls how much the model adapts its weights based on the gradients computed
during the optimization process

● When training a machine learning model, the goal is to minimize a loss function that quantifies
the difference between the predicted outputs and the true targets (ground truth) in the training
data
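For example, mean squared error is one common choice of such a loss function; a minimal sketch (the values are illustrative):

```python
import numpy as np

def mse(y_pred, y_true):
    # Average of the squared differences between predictions and targets
    return np.mean((y_pred - y_true) ** 2)

print(mse(np.array([0.9, 0.2]), np.array([1.0, 0.0])))  # 0.025
```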

MLP

● A higher learning rate means larger weight updates, which can help the model converge faster, but it also increases the risk of
overshooting the optimal weights and diverging from the optimal solution
● A lower learning rate results in smaller weight updates, leading to slower convergence but potentially better stability

● The learning rate is a critical hyperparameter that requires careful tuning


● Commonly used learning rates are in the range of 0.001 to 0.1, but the optimal value can vary depending on the problem,
architecture, and dataset
● Learning rate schedules, where the learning rate is adjusted during training, can also be employed to improve the convergence
behavior
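To see this trade-off concretely, the toy sketch below runs gradient descent on the one-dimensional loss L(w) = w² at three illustrative learning rates (the function and values are assumptions for demonstration):

```python
def minimize(lr, steps=20, w=1.0):
    # Gradient descent on L(w) = w**2, whose gradient is 2*w
    for _ in range(steps):
        w -= lr * 2 * w
    return w

print(minimize(0.01))  # ~0.67: small steps, slow convergence toward 0
print(minimize(0.1))   # ~0.01: larger steps, much faster convergence
print(minimize(1.1))   # ~38:   steps overshoot the minimum and diverge
```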

MLP

Gradient

● In the context of optimization algorithms like gradient descent and backpropagation, the gradient is a vector that contains the partial
derivatives of the loss function with respect to each model parameter (weight)

● It indicates the direction and magnitude of the steepest increase in the loss function

● The gradient is computed during the backward pass (backpropagation) using the chain rule of calculus

● The backward pass through the network calculates the gradients starting from the output layer and moving backward to the input layer.
Once the gradients are known, they are used in weight update rules, such as gradient descent, to iteratively update the model's weights to
minimize the loss function

● These gradients tell the model how much each weight should be adjusted to reduce the error and improve the model's performance
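The sketch below makes this concrete for a tiny two-layer network: the backward pass applies the chain rule layer by layer, and a finite-difference check confirms one of the gradients. All shapes, values, and names here are illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(3, 1))            # input vector
y = np.array([[1.0]])                  # target
W1 = 0.5 * rng.normal(size=(4, 3))     # hidden-layer weights
W2 = 0.5 * rng.normal(size=(1, 4))     # output-layer weights

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Forward pass
z1 = W1 @ x
h = sigmoid(z1)
y_hat = W2 @ h
loss = 0.5 * (y_hat - y) ** 2

# Backward pass: chain rule, from the output layer back toward the input
d_yhat = y_hat - y            # ∂L/∂y_hat
dW2 = d_yhat @ h.T            # ∂L/∂W2
d_h = W2.T @ d_yhat           # ∂L/∂h
d_z1 = d_h * h * (1 - h)      # ∂L/∂z1, using the sigmoid derivative
dW1 = d_z1 @ x.T              # ∂L/∂W1

# Finite-difference check on one entry of W1
eps = 1e-6
W1_shift = W1.copy()
W1_shift[0, 0] += eps
loss_shift = 0.5 * (W2 @ sigmoid(W1_shift @ x) - y) ** 2
print(dW1[0, 0], (loss_shift - loss).item() / eps)  # the two nearly match
```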
● In summary, the learning rate controls the step size of weight updates during training, while the
gradients indicate the direction and magnitude of adjustments required for each weight to
minimize the loss function and improve the model's performance

● The combination of learning rate and gradients plays a crucial role in the success of the
optimization process and the effectiveness of the learning algorithm

Local minima

● The point on a curve which is lower than its preceding and succeeding points is
called a local minimum

● It is the lowest point in a particular region, but not necessarily the absolute lowest point in the
entire function

Global minima

● The point on a curve which is lower than every other point on the curve is called the global
minimum

● A curve can have more than one local minimum, but it has only one global minimum.
While reaching the global minimum in backpropagation would be ideal, it is not a strict requirement for a
good model. A model that performs well on validation data and generalizes effectively is considered
good, even if it doesn't reach the global minimum of the loss function
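A small sketch of this behaviour: gradient descent on a curve with two minima settles in whichever valley the starting point falls into (the function and starting points are assumptions for illustration):

```python
def f(x):       # a curve with one local and one global minimum
    return x**4 - 3 * x**2 + x

def grad(x):    # its derivative f'(x)
    return 4 * x**3 - 6 * x + 1

def descend(x, lr=0.01, steps=500):
    for _ in range(steps):
        x -= lr * grad(x)
    return x

x_local = descend(2.0)     # settles near x ≈ 1.13, the local minimum
x_global = descend(-2.0)   # settles near x ≈ -1.30, the global minimum
print(x_local, f(x_local))
print(x_global, f(x_global))
```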

