Lecture 01:
Deep feedforward networks
Part II
[Figure: chain rule setup in which y depends on two variables y1 and y2, and y1, y2 each depend on x]
Ø Backpropagation Algorithm Cont…
[Figure: example network for the backpropagation example, showing input x2 = 0.10 feeding hidden unit h2; weights w3 = 0.25, w4 = 0.30, w7 = 0.50, w8 = 0.55; biases b1 = 0.35 and b2 = 0.60 (each with a constant input of 1); and target output o2 = 0.99]
Ø Backpropagation Example Cont…
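The figure above gives only some of the network's values (w3, w4, w7, w8, b1, b2, x2, and the target for o2). The NumPy sketch below runs one forward and one backward pass through such a 2-2-2 sigmoid network; the remaining inputs, weights, targets, and the learning rate are assumptions made purely for illustration.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Values taken from the figure: w3 = 0.25, w4 = 0.30, w7 = 0.50, w8 = 0.55,
# b1 = 0.35, b2 = 0.60, x2 = 0.10, target o2 = 0.99.
# The remaining values (x1, w1, w2, w5, w6, target o1, learning rate) are assumed.
x = np.array([0.05, 0.10])
W1 = np.array([[0.15, 0.20],    # w1, w2 (assumed)  -> h1
               [0.25, 0.30]])   # w3, w4 (figure)   -> h2
b1 = 0.35
W2 = np.array([[0.40, 0.45],    # w5, w6 (assumed)  -> o1
               [0.50, 0.55]])   # w7, w8 (figure)   -> o2
b2 = 0.60
target = np.array([0.01, 0.99])

# Forward pass
h = sigmoid(W1 @ x + b1)                      # hidden activations h1, h2
o = sigmoid(W2 @ h + b2)                      # output activations o1, o2
loss = 0.5 * np.sum((target - o) ** 2)        # squared-error loss

# Backward pass (repeated application of the chain rule)
delta_o = (o - target) * o * (1 - o)          # dL/d(net_o)
grad_W2 = np.outer(delta_o, h)                # dL/dW2
delta_h = (W2.T @ delta_o) * h * (1 - h)      # dL/d(net_h)
grad_W1 = np.outer(delta_h, x)                # dL/dW1 (bias gradients omitted for brevity)

# Gradient-descent update with an assumed learning rate
lr = 0.5
W2 -= lr * grad_W2
W1 -= lr * grad_W1
```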
[Figure: three model fits on the training dataset; the optimal fit achieves high training accuracy and high testing accuracy, while the other two fits show low accuracy on the test dataset]
Ø Solutions for Overfitting
• Increase the size of the dataset – e.g., data augmentation
• Regularization
• L1
• L2
• Dropout
• Bagging/Ensemble models
• Early stopping
Ø Data Augmentation
Data augmentation techniques in computer vision (a typical pipeline is sketched after the list):
• Cropping.
• Flipping.
• Rotation.
• Translation.
• Brightness.
• Contrast.
• Color Augmentation.
• Saturation.
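The listed operations correspond to standard image transforms. The torchvision pipeline below is one possible combination; the crop size, rotation angle, translation range, and jitter strengths are assumptions chosen for illustration.

```python
import torchvision.transforms as T

# One possible augmentation pipeline covering the operations listed above;
# the crop size, angles, translation range, and jitter strengths are illustrative.
augment = T.Compose([
    T.RandomResizedCrop(224),                          # cropping
    T.RandomHorizontalFlip(p=0.5),                     # flipping
    T.RandomRotation(degrees=15),                      # rotation
    T.RandomAffine(degrees=0, translate=(0.1, 0.1)),   # translation
    T.ColorJitter(brightness=0.2, contrast=0.2,
                  saturation=0.2, hue=0.05),           # brightness, contrast, color, saturation
    T.ToTensor(),
])

# Each call produces a differently transformed copy of the same image:
# augmented_tensor = augment(pil_image)
```

Each pass of an image through the pipeline yields a new, randomly transformed version, which effectively increases the size of the training dataset.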
Ø Regularization for deep learning
• Regularization is any modification made to the learning algorithm with the intention of lowering
the generalization error but not the training error.
• In the context of deep learning, most regularization strategies involve regularizing estimators.
This is done by reducing variance at the expense of increasing the estimator's bias.
• An effective regularizer is one that decreases the variance significantly while not overly
increasing the bias.
• Controlling the complexity of the model is not a simple matter of finding the right model size
and the right number of parameters.
• Instead, deep learning relies on finding the best-fitting model, a large model that has been
properly regularized.
Ø L1/L2 Regularization
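Both penalties add a norm of the weights to the training objective. As a brief sketch (the symbols J for the unregularized loss, w for the weight vector, and λ for the penalty strength are notational assumptions here):

```latex
% L2 (weight decay) and L1 regularized objectives; J is the unregularized
% training loss, w the weight vector, and \lambda the penalty strength.
\begin{align*}
  \tilde{J}_{L_2}(\mathbf{w}) &= J(\mathbf{w}) + \frac{\lambda}{2}\,\lVert\mathbf{w}\rVert_2^2
                               = J(\mathbf{w}) + \frac{\lambda}{2}\sum_i w_i^2 \\
  \tilde{J}_{L_1}(\mathbf{w}) &= J(\mathbf{w}) + \lambda\,\lVert\mathbf{w}\rVert_1
                               = J(\mathbf{w}) + \lambda\sum_i \lvert w_i \rvert
\end{align*}
```

The L2 penalty shrinks all weights toward zero (weight decay), while the L1 penalty drives many weights to exactly zero, yielding sparse models.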
Ø Dropout
• Dropout provides a computationally inexpensive but powerful method of
regularizing a broad family of models.
• Dropout provides an inexpensive approximation to training and evaluating a
bagged ensemble of exponentially many neural networks.
• Specifically, dropout trains the ensemble consisting of all sub-networks that can
be formed by removing non-output units from an underlying base network.
Ø Training with Dropout
• To train with dropout, we use a minibatch-based learning algorithm that makes
small steps, such as stochastic gradient descent.
• Each time we load an example into a minibatch, we randomly sample a different
binary mask to apply to all of the input and hidden units in the network.
• The mask for each unit is sampled independently from all of the others.
• Typically, the probability of including a hidden unit is 0.5, while the probability of
including an input unit is 0.8 (a sketch of this masking follows below).
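A minimal NumPy sketch of the mask sampling described above, assuming a single hidden layer with a sigmoid nonlinearity; the shapes and parameter names are illustrative, and deep learning frameworks provide the same behaviour through built-in layers such as torch.nn.Dropout.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def forward_with_dropout(x, W1, b1, W2, b2, rng, p_input=0.8, p_hidden=0.5):
    """One training-time forward pass with dropout for a one-hidden-layer network.

    A fresh binary mask is sampled for every example, independently for each
    unit: inputs are kept with probability 0.8 and hidden units with
    probability 0.5, as on the slide. Output units are never dropped.
    """
    input_mask = rng.random(x.shape) < p_input      # independent Bernoulli mask per input unit
    x = x * input_mask

    h = sigmoid(x @ W1 + b1)
    hidden_mask = rng.random(h.shape) < p_hidden    # independent Bernoulli mask per hidden unit
    h = h * hidden_mask

    return h @ W2 + b2                              # output layer (no mask)

# Example with assumed shapes: a minibatch of 32 examples, 784 inputs, 128 hidden units.
rng = np.random.default_rng(0)
x = rng.random((32, 784))
W1, b1 = 0.01 * rng.standard_normal((784, 128)), np.zeros(128)
W2, b2 = 0.01 * rng.standard_normal((128, 10)), np.zeros(10)
out = forward_with_dropout(x, W1, b1, W2, b2, rng)  # a new mask is drawn on every call
```

At test time no masks are sampled; instead the weights (or activations) are scaled by the inclusion probabilities so that the expected input to each unit matches what it saw during training.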
Ø Bagging/Ensemble models
• Bagging (short for bootstrap aggregating) is a technique for reducing
generalization error by combining several models.
• Bagging is defined as follows:
• Train k different models on k different subsets of training data, constructed to
have the same number of examples as the original dataset through random
sampling from that dataset with replacement.
• Have all of the models vote on the output for test examples.
• Techniques employing bagging are called ensemble models; a minimal sketch of the procedure follows below.
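The sketch below follows the recipe above; the base learner (a scikit-learn decision tree) and integer class labels are assumptions for illustration, and with neural networks the same idea means training k networks independently and voting or averaging over their predictions.

```python
import numpy as np
from sklearn.tree import DecisionTreeClassifier   # stand-in base model; any learner with fit/predict works

def bagging_fit(X, y, k=10, seed=0):
    """Train k models, each on a bootstrap sample the same size as the original dataset."""
    rng = np.random.default_rng(seed)
    n = len(X)
    models = []
    for _ in range(k):
        idx = rng.integers(0, n, size=n)            # sample n examples with replacement
        models.append(DecisionTreeClassifier().fit(X[idx], y[idx]))
    return models

def bagging_predict(models, X):
    """Have all models vote and return the majority class for each test example."""
    votes = np.stack([m.predict(X) for m in models]).astype(int)   # shape (k, n_test); integer labels assumed
    return np.apply_along_axis(lambda col: np.bincount(col).argmax(), 0, votes)
```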
Ø Bagging/Ensemble models
• Bagging works because different models will usually not all make the same errors
on the test set.
• This is a direct result of training on k different subsets of the training data, each
subset missing some of the examples from the original dataset.
• Other factors, such as differences in random initialization, random selection of
mini-batches, differences in hyperparameters, or different outcomes of non-
deterministic neural network implementations, are often enough to cause
different members of the ensemble to make partially independent errors.
Ø Early stopping
• When training models with sufficient representational capacity to overfit the
task, we often observe that training error decreases steadily over time while the
error on the validation set begins to rise again.
• This behaviour is almost certain to occur in the applications we care about.
• This means we can obtain a model with better validation set error (and thus,
hopefully, better test set error) by returning to the parameter setting at the
point in time with the lowest validation set error.
• This is termed Early Stopping; a sketch of the procedure follows below.
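A minimal sketch of this procedure; the train_step and validate callables, the patience threshold, and the use of deepcopy to snapshot parameters are assumptions made for illustration.

```python
import copy

def train_with_early_stopping(model, train_step, validate, max_epochs=100, patience=10):
    """Generic early-stopping loop.

    train_step(model) runs one epoch of training and validate(model) returns
    the validation error; both are hypothetical callables assumed here.
    """
    best_error = float("inf")
    best_model = copy.deepcopy(model)       # snapshot of the best parameters seen so far
    epochs_since_improvement = 0

    for epoch in range(max_epochs):
        train_step(model)
        val_error = validate(model)

        if val_error < best_error:
            best_error = val_error
            best_model = copy.deepcopy(model)
            epochs_since_improvement = 0
        else:
            epochs_since_improvement += 1
            if epochs_since_improvement >= patience:
                break                       # validation error stopped improving

    # Return to the parameter setting with the lowest validation set error.
    return best_model, best_error
```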
Thank you!