
Deep Learning

Lecture 2: Convolutional Neural Networks & Overfitting

• Specialist Diploma in Applied Generative AI
• Academic Year 2024/25



Topics
1. Introduction to Convolutional Neural Networks (CNN)
2. The CNN operations
3. Training a CNN from scratch on a small dataset
4. Handling Overfitting

1. Introduction to CNN

1. Introduction to CNN

https://www.youtube.com/watch?v=Gu0MkmynWkw

1. Introduction to CNN
 The MNIST classification problem using CNN
[Diagram: a convnet processing an MNIST digit, showing the convolution operation and the max-pooling operation]

2. The CNN operations



2.1 The convolution operation

 Dense layers vs. convolution layers
1. Dense layers learn global patterns
2. Convolution layers learn local patterns

 Images can be broken into local patterns, e.g. edges, textures, etc.

2.1 The convolution operation

 Patterns learnt by convnets are
1. Translation-invariant
2. Hierarchical
[Diagram: hierarchical features, with 1st-layer patterns combining into 2nd-layer patterns]

2.1 The convolution operation

 A simple example of 2D convolution (refer to the Excel spreadsheet for details)
Input image: (5, 5, 1), one filter: (3, 3)  Output: (3, 3, 1)

[Diagram: the 3x3 filter of weights sliding over the 5x5 grid of pixel values]
Filter weights:
0 1 2
2 2 0
0 1 2

Source: https://towardsdatascience.com/intuitively-understanding-convolutions-for-deep-learning-1f6f42faee1
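To make the sliding-window arithmetic concrete, here is a minimal NumPy sketch of the same valid (no padding, stride 1) 2D convolution. The 5x5 input values are placeholders of my own; only the 3x3 filter weights come from the slide.

```python
# Minimal sketch of a valid 2D convolution (stride 1, no padding)
import numpy as np

def conv2d_valid(image, kernel):
    """Slide the kernel over the image and sum the element-wise products."""
    ih, iw = image.shape
    kh, kw = kernel.shape
    oh, ow = ih - kh + 1, iw - kw + 1   # 5 - 3 + 1 = 3
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            # One output cell = patch * filter, summed
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

image = np.arange(25).reshape(5, 5)       # placeholder 5x5 input
kernel = np.array([[0, 1, 2],
                   [2, 2, 0],
                   [0, 1, 2]])            # the 3x3 filter from the slide
print(conv2d_valid(image, kernel).shape)  # (3, 3)
```

Each output cell is the sum of an element-wise product between one 3x3 patch and the filter, which is why the output shrinks to (3, 3).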

2.1 The convolution operation

 To calculate the output shape (no padding, stride 1):
Output depth = number of filters = 2
Output height or width = input width - filter width + 1 = 6 - 4 + 1 = 3

Source: https://towardsdatascience.com/demystifying-convolutional-neural-networks-384785791596

2.1 The convolution operation

 Padding
Input image: (5, 5, 1), one filter: (3, 3), padding = 1  Output: (5, 5, 1)

Source: https://towardsdatascience.com/intuitively-understanding-convolutions-for-deep-learning-1f6f42faee1

2.1 The convolution operation

 Strides
Input image: (5, 5, 1), one filter: (3, 3), strides = 2  Output: (2, 2, 1)

Source: https://towardsdatascience.com/intuitively-understanding-convolutions-for-deep-learning-1f6f42faee1

2.1 The convolution operation

 Take the dot product of the image patch with the filter, sum across the input channels, and add a bias
Example: (-2) + (-1) + 0 + 0 = -3

Input width = 5, padding = 1, strides = 2

 To calculate the output shape:
Output depth = number of filters = 2
Output height or width = (input width + 2*padding - filter width) / strides + 1
= (5 + 2*1 - 3) / 2 + 1 = 3
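The formula is easy to sanity-check in code. This helper is an illustrative sketch (the function name is my own, not from the slides); floor division matches how frameworks round when the stride does not divide exactly.

```python
# Hypothetical helper implementing the slide's output-size formula
def conv_output_size(input_width, filter_width, padding=0, strides=1):
    return (input_width + 2 * padding - filter_width) // strides + 1

print(conv_output_size(6, 4))                        # 3: the no-padding slide
print(conv_output_size(5, 3, padding=1))             # 5: the padding slide
print(conv_output_size(5, 3, strides=2))             # 2: the strides slide
print(conv_output_size(5, 3, padding=1, strides=2))  # 3: this slide
```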

2.1 The convolution operation

 Input image is 28 x 28 pixels, black-and-white
 Filter size is 3 x 3, and this layer uses a total of 32 filters
 The output shape is (26, 26, 32), where:
26 = 28 - 3 + 1
32 = number of filters
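A sketch of how this layer might look in Keras (assuming TensorFlow's bundled Keras); model.summary() confirms the (26, 26, 32) output shape.

```python
# Sketch of the slide's single convolution layer in Keras
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    keras.Input(shape=(28, 28, 1)),               # 28x28 black-and-white image
    layers.Conv2D(32, (3, 3), activation="relu"), # 32 filters of size 3x3
])
model.summary()  # output shape: (None, 26, 26, 32), since 26 = 28 - 3 + 1
```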

2.2 The max-pooling operation

 Downsample the tensor by taking the max value in each window (e.g. 2x2)

Source: https://computersciencewiki.org/index.php/Max-pooling_/_Pooling
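A minimal NumPy sketch of 2x2 max-pooling with stride 2, on a made-up 4x4 feature map:

```python
# 2x2 max-pooling with stride 2, via reshaping into 2x2 windows
import numpy as np

x = np.array([[1, 3, 2, 1],
              [4, 6, 5, 2],
              [7, 2, 9, 8],
              [1, 0, 3, 4]])

# Split the 4x4 map into non-overlapping 2x2 windows, keep each window's max
pooled = x.reshape(2, 2, 2, 2).max(axis=(1, 3))
print(pooled)  # [[6 5]
               #  [7 9]]
```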

2.2 The max-pooling operation

 MNIST model with max-pooling layers (see the sketch below)
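The slide's code screenshot did not survive extraction; below is a sketch along the lines of the standard Chollet-style MNIST convnet, alternating Conv2D and MaxPooling2D layers. The comments trace the feature-map shapes.

```python
# Sketch of an MNIST convnet with max-pooling layers (after Chollet's example)
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    keras.Input(shape=(28, 28, 1)),
    layers.Conv2D(32, (3, 3), activation="relu"),  # -> (26, 26, 32)
    layers.MaxPooling2D((2, 2)),                   # -> (13, 13, 32)
    layers.Conv2D(64, (3, 3), activation="relu"),  # -> (11, 11, 64)
    layers.MaxPooling2D((2, 2)),                   # -> (5, 5, 64)
    layers.Conv2D(64, (3, 3), activation="relu"),  # -> (3, 3, 64)
    layers.Flatten(),
    layers.Dense(64, activation="relu"),
    layers.Dense(10, activation="softmax"),        # 10 digit classes
])
```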

2.2 The max-pooling operation

 MNIST model without max-pooling layers

Demo 2 – MNIST using CNN



3. Training a CNN from scratch on a small dataset

 Datasets
1. Kaggle 2013 Dogs vs. Cats competition
 Original: 25,000 images (12,500 dogs and 12,500 cats)
 A small subset:
• Training: 2,000 images (1,000 dogs and 1,000 cats)
• Validation: 1,000 images (500 dogs and 500 cats)
• Testing: 1,000 images (500 dogs and 500 cats)

2. Data preprocessing  ImageDataGenerator in Keras (see the sketch below)
 Read the picture files
 Decode the JPEG content to RGB grids of pixels
 Convert these into floating-point tensors
 Rescale the pixel values from (0-255) to the [0, 1] interval
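A sketch of this preprocessing pipeline using Keras' ImageDataGenerator; the directory path is a placeholder for wherever the small dataset is stored.

```python
# Sketch of the preprocessing pipeline (directory path is a placeholder)
from tensorflow.keras.preprocessing.image import ImageDataGenerator

# Rescale pixel values from [0, 255] to the [0, 1] interval
train_datagen = ImageDataGenerator(rescale=1.0 / 255)

# Reads the picture files, decodes JPEG content to RGB pixel grids,
# and yields batches of floating-point tensors
train_generator = train_datagen.flow_from_directory(
    "cats_and_dogs_small/train",  # placeholder path
    target_size=(150, 150),       # resize all images to 150x150
    batch_size=20,
    class_mode="binary",          # binary labels: dog vs. cat
)
```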

Let's try your hand at your first CNN!
(Practical 2 – Part 1)

We will naively train a small CNN on our training samples to classify images of "dogs" and "cats".

[Diagram: feature-map shapes through the convnet]
Input (150, 150, 3 RGB)  Conv 3x3x32  (148, 148, 32)  MaxPool  (74, 74, 32)
 Conv 3x3x64  (72, 72, 64)  MaxPool  (36, 36, 64)
 Conv 3x3x128  (34, 34, 128)

[Diagram, continued]
(34, 34, 128)  MaxPool  (17, 17, 128)  Conv 3x3x128  (15, 15, 128)  MaxPool  (7, 7, 128)

[Diagram, continued]
(7, 7, 128)  Flatten: 7*7*128 = 6272 values, e.g. [0.1, 0.234, 0.521, 0.2, …, 0.454, 0.442, 0.984]
 Dense (512 nodes)  Output (sigmoid)
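A Keras sketch of the convnet traced in the diagram above (this layer stack follows the small-dataset example in Chollet's Deep Learning with Python; the comments show how each shape in the diagram arises):

```python
# Sketch of the dogs-vs-cats convnet from the diagram
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([
    keras.Input(shape=(150, 150, 3)),               # RGB input
    layers.Conv2D(32, (3, 3), activation="relu"),   # -> (148, 148, 32)
    layers.MaxPooling2D((2, 2)),                    # -> (74, 74, 32)
    layers.Conv2D(64, (3, 3), activation="relu"),   # -> (72, 72, 64)
    layers.MaxPooling2D((2, 2)),                    # -> (36, 36, 64)
    layers.Conv2D(128, (3, 3), activation="relu"),  # -> (34, 34, 128)
    layers.MaxPooling2D((2, 2)),                    # -> (17, 17, 128)
    layers.Conv2D(128, (3, 3), activation="relu"),  # -> (15, 15, 128)
    layers.MaxPooling2D((2, 2)),                    # -> (7, 7, 128)
    layers.Flatten(),                               # -> 7*7*128 = 6272
    layers.Dense(512, activation="relu"),
    layers.Dense(1, activation="sigmoid"),          # dog vs. cat
])
```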

[Diagram: recap of the full architecture, from the (150, 150, 3) input through Flatten (7*7*128 = 6272) and the 512-node dense layer to the sigmoid output]

4. Handling Overfitting

4.1 Fundamental Issues

 Optimization vs. generalization
• Optimization: adjusting the model to get the best possible performance on the training data
• Generalization: how well the trained model performs on data it has never seen before (test data)

 Underfitting
• The lower the error on the training data, the lower the error on the testing data  there is still progress to be made

 Overfitting
• Generalization stops improving:
 Training error keeps decreasing
 Validation/testing error starts to increase
• The model is beginning to learn patterns overly specific to the training data

4.1 Fundamental Issues

 Balancing optimization and generalization: a tradeoff of model complexity against training and testing accuracy
[Diagram: training vs. testing accuracy as model complexity increases, with regions labelled "Optimizing" and "Regularizing"]

4.2 To prevent overfitting

1. Reducing network size (tweak hyperparameters)
2. Adding weight regularization
3. Adding dropout
4. Get more training data

4.2 To prevent overfitting

1. Reducing network size
• Capacity: the number of learnable parameters
 The number of layers
 The number of units per layer

• High capacity
 Good at fitting the training data

• Limited capacity
 Good at generalizing to unseen data (prediction)

• Balance: too much capacity vs. not enough capacity
 Start with a small network size
 Increase the size and monitor the error on a validation dataset

4.2 To prevent overfitting

2. Adding weight regularization
• To avoid overfitting  a simpler model
 Force the weights to take only small values
 Add a cost (associated with the weights) to the loss function

• Two types of cost function (weight regularization)
 L1 regularization (Lasso)
• Cost is proportional to the absolute value of the weights
 L2 regularization (Ridge, a.k.a. weight decay)
• Cost is proportional to the square of the value of the weights

• Implemented in Keras (see the sketch below)
 In the layer function, add an argument to configure the weight regularizer
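A sketch of configuring a weight regularizer on a Keras layer; the 0.001 factor is illustrative, not from the slides.

```python
# Sketch of L2 weight regularization in Keras (0.001 is an illustrative factor)
from tensorflow.keras import layers, regularizers

dense = layers.Dense(
    512,
    # adds 0.001 * weight**2 per weight coefficient to the total loss
    kernel_regularizer=regularizers.l2(0.001),
    activation="relu",
)
# L1 (Lasso) would be regularizers.l1(0.001);
# both together: regularizers.l1_l2(l1=0.001, l2=0.001)
```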

4.2 To prevent overfitting

3. Adding dropout
• Applied to a layer during training time
 Randomly drops out (sets to zero) a fraction of the layer's outputs
 The dropout rate is normally between 0.2 and 0.5

• At test time, no outputs are dropped; instead, the outputs are scaled down by a factor equal to the dropout rate

• The technique helps to reduce overfitting
 Randomly removes a different subset of neurons on each pass
 Introduces noise and breaks up patterns that are not significant

• Implemented in Keras by adding a Dropout layer (see the sketch below)
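A sketch of where a Dropout layer might sit in the dogs-vs-cats classifier head; the 0.5 rate is within the usual 0.2-0.5 range.

```python
# Sketch of adding dropout to the classifier head (rate 0.5 is illustrative)
from tensorflow.keras import layers

head = [
    layers.Flatten(),
    layers.Dropout(0.5),  # during training, randomly zeroes 50% of the outputs
    layers.Dense(512, activation="relu"),
    layers.Dense(1, activation="sigmoid"),
]
```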

4.2 To prevent overfitting

4. Get more training data
 The best solution!
 A model trained on more data will naturally generalize better
 Why?

4. Get more training data – Data Augmentation

• Data augmentation is a strategy to significantly increase the diversity of data available for training models, without actually collecting new data.

• Data augmentation techniques such as cropping, padding, and horizontal flipping are commonly used to train large neural networks (see the sketch below).
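A sketch of configuring augmentation with Keras' ImageDataGenerator; these particular ranges follow common practice (e.g. Chollet's example) and are tunable.

```python
# Sketch of data augmentation via ImageDataGenerator (ranges are tunable)
from tensorflow.keras.preprocessing.image import ImageDataGenerator

augmented_datagen = ImageDataGenerator(
    rescale=1.0 / 255,
    rotation_range=40,       # random rotations up to 40 degrees
    width_shift_range=0.2,   # random horizontal shifts (fraction of width)
    height_shift_range=0.2,  # random vertical shifts (fraction of height)
    shear_range=0.2,         # random shearing transformations
    zoom_range=0.2,          # random zooming
    horizontal_flip=True,    # randomly flip images horizontally
)
```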

 Example of data augmentation
[Figure: an original image (150 x 150 pixels) alongside several variants after data augmentation]

Let's try your hand at your first CNN!
(Practical 2 – Part 2)

We will now add measures to prevent overfitting when training the CNN to classify images of "dogs" and "cats".

Example of data augmentation during training

 For every epoch, the 2,000 images presented to the model for learning are simulated new images
 These new images are not saved anywhere
[Diagram: across epochs 1-5, the convnet (150x150x3 input through to the sigmoid output) receives a freshly augmented set of 2,000 images each epoch, then validates]

Wrapping Up
 CNNs are the best type of machine-learning model for computer-vision tasks

 It's possible to train a model from scratch on a very small dataset  expect overfitting

 Data augmentation is a powerful way to fight overfitting when working with image data

Further Reading
 Demystifying Convolutional Neural Networks
• https://towardsdatascience.com/demystifying-convolutional-neural-networks-384785791596

 Intuitively Understanding Convolutions for Deep Learning
• https://towardsdatascience.com/intuitively-understanding-convolutions-for-deep-learning-1f6f42faee1

 Data Augmentation Increases Accuracy of your Model – But how?
• https://medium.com/secure-and-private-ai-writing-challenge/data-augmentation-increases-accuracy-of-your-model-but-how-aa1913468722

Q&A

References
Books:
 François Chollet, Deep Learning with Python (2018)
