CSE4261 Lecture-11

Fundamentals of Convolutional Neural Networks (CNN)

Prof. Dr. Shamim Akhter


Professor, Dept. of CSE
Ahsanullah University of Science and Technology
Kernels and Filters
• A main component of a CNN is the filter
– a square matrix of dimension nK × nK, where nK is an integer and usually a small number, like 3 or 5.
– Filters are also called kernels.
– In classical image processing, kernels are used for sharpening, blurring, embossing, and so on.
Example: Four different filters
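The four filters in the figure are shown as images; as a minimal sketch, such kernels are just small matrices. The coefficients below are standard image-processing examples (sharpen, box blur, emboss, identity) and not necessarily the exact ones in the figure:

```python
import numpy as np

# Four classic 3x3 kernels (nK = 3); coefficients are textbook examples.
sharpen = np.array([[ 0, -1,  0],
                    [-1,  5, -1],
                    [ 0, -1,  0]])

blur = np.full((3, 3), 1 / 9)        # box blur: average of the 3x3 neighborhood

emboss = np.array([[-2, -1, 0],
                   [-1,  1, 1],
                   [ 0,  1, 2]])

identity = np.array([[0, 0, 0],
                     [0, 1, 0],
                     [0, 0, 0]])     # leaves the image unchanged

for name, k in [("sharpen", sharpen), ("blur", blur),
                ("emboss", emboss), ("identity", identity)]:
    print(name, k.shape)             # every kernel is nK x nK = 3 x 3
```

Note that sharpen and identity both sum to 1, so they preserve overall brightness; the blur kernel sums to 1 for the same reason.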
Convolution
In the previous process, we always moved our 3 × 3 region one column to the right and one row down. The number of rows and columns we shift, 1 in this example, is called the stride and is often indicated with s. A stride of s = 2 simply means that we shift our 3 × 3 region two columns to the right and two rows down at each step.
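The sliding-window operation described above can be sketched in a few lines of numpy (strictly speaking this is a cross-correlation, as is customary in CNNs; `conv2d` is an illustrative helper, not a library function):

```python
import numpy as np

def conv2d(image, kernel, stride=1):
    """Valid 2D convolution (no padding) with a square kernel and stride s."""
    nk = kernel.shape[0]
    h = (image.shape[0] - nk) // stride + 1
    w = (image.shape[1] - nk) // stride + 1
    out = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            # Slide the nK x nK region by `stride` columns/rows at each step.
            region = image[i*stride:i*stride+nk, j*stride:j*stride+nk]
            out[i, j] = np.sum(region * kernel)
    return out

image = np.arange(36, dtype=float).reshape(6, 6)
kernel = np.ones((3, 3)) / 9.0                # averaging kernel

print(conv2d(image, kernel, stride=1).shape)  # (4, 4)
print(conv2d(image, kernel, stride=2).shape)  # (2, 2)
```

With s = 1 the 6 × 6 input shrinks to 4 × 4; with s = 2 the window skips every other position and the output shrinks to 2 × 2.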
Pooling: max pooling
• Pooling is the second operation in CNNs.

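Max pooling slides a small window over the input and keeps only the maximum of each window. A minimal sketch (`max_pool` is an illustrative helper, assuming the common 2 × 2 window with stride 2):

```python
import numpy as np

def max_pool(image, size=2, stride=2):
    """Max pooling: keep the maximum of each size x size window."""
    h = (image.shape[0] - size) // stride + 1
    w = (image.shape[1] - size) // stride + 1
    out = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            out[i, j] = image[i*stride:i*stride+size,
                              j*stride:j*stride+size].max()
    return out

x = np.array([[1, 3, 2, 4],
              [5, 6, 1, 2],
              [7, 2, 9, 1],
              [3, 4, 1, 8]], dtype=float)

print(max_pool(x))   # maxima of the four 2x2 windows: [[6, 4], [7, 9]]
```

Each 2 × 2 window of the 4 × 4 input collapses to its maximum, halving both dimensions.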
Padding
• Sometimes, when dealing with images, getting a result from a convolution operation with dimensions different from those of the original image is not optimal. This is when padding is necessary.
• Padding adds rows of pixels on the top and bottom, and columns of pixels on the left and right, of the image so that the resulting matrices are the same size as the original.
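The effect of padding on the output size follows the standard formula o = (n + 2p − k)/s + 1 for an n × n input, k × k kernel, padding p, and stride s. A quick check (`conv_output_size` is an illustrative helper):

```python
def conv_output_size(n, k, p=0, s=1):
    """Output width/height of a convolution on an n x n input
    with kernel size k, padding p, and stride s."""
    return (n + 2 * p - k) // s + 1

# No padding: a 3x3 kernel shrinks a 28x28 image to 26x26.
print(conv_output_size(28, 3))         # 26

# "Same" padding: p = (k - 1) // 2 keeps the size at 28x28 (odd k, stride 1).
print(conv_output_size(28, 3, p=1))    # 28
```

For an odd kernel size k and stride 1, padding with p = (k − 1)/2 rows/columns is exactly what keeps the output the same size as the input.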
Building Blocks of a CNN
• Convolution and pooling operations are used to
build the layers in CNNs.

• In CNNs typically you can find the following layers:


– Convolutional layers
– Pooling layers
– Fully connected layers: layers where each neuron is connected to all neurons of the previous and subsequent layers.
Convolutional Layers

However, what are the weights in this layer?


– The weights, or the parameters that the network learns during the training phase, are the elements of the kernels themselves.
– We have nc kernels, each of nK × nK dimensions. That means we have nK² × nc weights in a convolutional layer,
– plus, for each filter, a bias term that you will need to add.
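A quick sanity check of this count (assuming a single-channel input, as the nK²·nc formula implicitly does; `conv_params` is an illustrative helper):

```python
def conv_params(n_kernels, k, in_channels=1):
    """Learnable parameters of a conv layer: each kernel has
    k*k*in_channels weights plus one bias."""
    return n_kernels * (k * k * in_channels + 1)

# 32 kernels of size 5x5 on a single-channel (grayscale) input:
print(conv_params(32, 5))   # 32 * (25 + 1) = 832
```

For a multi-channel input, each kernel also spans the input channels, so the weight count becomes nK² × c_in × nc plus the nc biases.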
Convolution- Pooling layer

LeNet-5 with Softmax activation function


Dense/FC Layer

• The weights are the ones you know from traditional feed-forward networks.
• So their number depends on the number of neurons in the layer itself and the number of neurons in the preceding layer.
CNN Implementation

Why is the 1 here?

What is Dropout?
What is Flatten?
Dropout Regularization Techniques
• Dropout is a technique where randomly selected neurons are ignored during training: they are "dropped out" at random. This means that their contribution to the activation of downstream neurons is temporarily removed on the forward pass, and no weight updates are applied to the neuron on the backward pass.
• Generally, use a small dropout value of 20%-50% of
neurons with 20% providing a good starting point. A
low probability has minimal effect and a high value
results in under-learning by the network.
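The mechanics can be sketched in numpy with "inverted" dropout, the variant most frameworks use (the `dropout` helper and the 20% rate below are illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)

def dropout(activations, rate=0.2, training=True):
    """Inverted dropout: zero out roughly `rate` of the neurons at random
    during training, rescaling the rest so the expected activation is
    unchanged. Does nothing at inference time."""
    if not training:
        return activations
    mask = rng.random(activations.shape) >= rate
    return activations * mask / (1.0 - rate)

a = np.ones(10_000)
dropped = dropout(a, rate=0.2)
print((dropped == 0).mean())   # roughly 0.2 of the neurons are zeroed
print(dropped.mean())          # close to 1.0 thanks to the rescaling
```

The rescaling by 1/(1 − rate) is why no correction is needed at inference: the layer's expected output is the same whether dropout is active or not.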
Flatten Layer
• The intuition behind the flattening layer is to convert the data into a 1-dimensional array for feeding to the next layer: we flatten the output of the convolutional layers into a single long feature vector.
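In numpy terms, flattening is just a reshape. Using the feature-map dimensions from the example network in this lecture (32 maps of 12 × 12):

```python
import numpy as np

# Output of a conv/pool stage: 32 feature maps of 12 x 12 each.
feature_maps = np.zeros((32, 12, 12))

# Flatten into a single long feature vector for the dense layer.
flat = feature_maps.reshape(-1)
print(flat.shape)   # (4608,) since 32 * 12 * 12 = 4608
```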
FC and Dense Layer
• A fully-connected layer is a layer that has a connection/edge across every pair of nodes from two node sets. For example, if you build a layer with N1 input neurons and N2 output neurons, the number of connections/edges will be N1 × N2, which is also the shape of the weight matrix.
• As the number of connections can be very large (think of connecting thousands of neurons to one another), the layer is going to be highly dense, which is why these layers are also called Dense layers.
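The parameter count of a dense layer is therefore N1 × N2 weights plus N2 biases. Checking this against the dense layers of the example network (`dense_params` is an illustrative helper):

```python
def dense_params(n_in, n_out):
    """Weights (n_in x n_out) plus one bias per output neuron."""
    return n_in * n_out + n_out

# The two dense layers from the example network in this lecture:
print(dense_params(4608, 128))   # 4608*128 + 128 = 589,952
print(dense_params(128, 10))     # 128*10 + 10 = 1,290
```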
[Figure: example network. Conv layer: 32 kernels of 5 × 5, stride s = 1; 2 × 2 max pooling → output 32 × 12 × 12 = 4608 flattened features; dense layer: 4608 × 128 + 128 parameters; output layer: 128 × 10 + 10 parameters.]
Effect of Convolution
Effect of Max Pooling
Going Deeper with Convolutions

• Problems of “classical” CNNs


– It isn’t easy to get the right kernel size. Each image
is different. Typically, larger kernels are good for
more globally distributed information, and smaller
ones for locally distributed information.
– Deep CNNs are prone to overfitting.
– Training and inference of networks with many
parameters is computationally intensive.
Inception Module: Naïve Version
• Overcomes the difficulties of classical CNNs
– networks are made wider instead of deeper
• they perform convolutions with multiple kernel sizes in parallel, to detect features at different scales simultaneously, instead of adding convolutional layer after layer sequentially.
• convolutions with 1 × 1, 3 × 3, and 5 × 5 kernels, and even max pooling, run at the same time in parallel.

The 1 × 1 kernel looks at very localized features, while the 5 × 5 kernel captures more global features.
Number of Parameters @ Naïve Inception
• Let’s use 32 kernels for all layers.
– 1 × 1 convolutions: 64 parameters [32+32]
– 3 × 3 convolutions: 320 parameters [9x32+32]
– 5 × 5 convolutions: 832 parameters [25x32+32]
– max-pooling does not have learnable parameters
Models                                   # of Parameters
Sequential Processing (Classical)        64 + 9,248 + 25,632 = 34,944
Parallel Processing (Naïve Inception)    64 + 320 + 832 = 1,216 (≈30 times fewer)

[Figure annotations: in the sequential stack the 3 × 3 and 5 × 5 layers each see the previous layer's 32 channels, so their counts are 32×9×32+32 = 9,248 and 32×25×32+32 = 25,632, while the 1 × 1 layer costs 32+32 = 64; in the parallel module each branch sees the single-channel input directly.]
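These counts can be verified in a few lines (`conv_params` is an illustrative helper; the first layer is assumed to see a single-channel input, which is what makes the 1 × 1 count equal 64):

```python
def conv_params(n_kernels, k, in_channels):
    """Parameters of a conv layer: k*k*in_channels weights per kernel + bias."""
    return n_kernels * (k * k * in_channels + 1)

n = 32   # kernels per layer

# Sequential (classical) stack: the 3x3 and 5x5 layers each see the
# previous layer's 32 channels; the first layer sees 1 channel.
sequential = conv_params(n, 1, 1) + conv_params(n, 3, n) + conv_params(n, 5, n)
print(sequential)   # 64 + 9248 + 25632 = 34944

# Naive inception: all three branches read the same single-channel input.
parallel = conv_params(n, 1, 1) + conv_params(n, 3, 1) + conv_params(n, 5, 1)
print(parallel)     # 64 + 320 + 832 = 1216
```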
Inception Module: Dimension Reduction
• In the naïve inception module we get a smaller number of learnable parameters than in classical CNNs, but we can do even better.
• We can use 1 × 1 convolutions at the right places (mainly before the higher-dimension convolutions) to reduce dimensions.
• Suppose that the previous layer is the output of a previous operation and that its output has the dimensions 256 × 28 × 28.
[Figure: inception module with dimension reduction; each of the four branches uses 8 kernels.]
Models                     # of Parameters
Naïve Inception            256×1×8+8 = 2,056; 256×9×8+8 = 18,440; 256×25×8+8 = 51,208; Total = 71,704
With Dimension Reduction   2,056; (2,056 + 8×9×8+8 = 2,056+584 = 2,640); (2,056 + 8×25×8+8 = 2,056+1,608 = 3,664); 2,056; Total = 10,416

[An inception network is simply built by stacking many of these modules one after the other.]
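The dimension-reduction arithmetic can be checked the same way (assuming, as in the slide, a 256-channel input and 8 kernels per branch; `conv_params` is an illustrative helper):

```python
def conv_params(n_kernels, k, in_channels):
    """Parameters of a conv layer: k*k*in_channels weights per kernel + bias."""
    return n_kernels * (k * k * in_channels + 1)

c_in, r = 256, 8   # 256 input channels, 8 kernels per branch

# Naive inception directly on the 256-channel input:
naive = (conv_params(r, 1, c_in)      # 1x1 branch: 2,056
         + conv_params(r, 3, c_in)    # 3x3 branch: 18,440
         + conv_params(r, 5, c_in))   # 5x5 branch: 51,208
print(naive)                          # 71704

# With 1x1 reductions: the 3x3 and 5x5 branches first squeeze 256 -> 8
# channels, and a 1x1 convolution also follows the max-pooling branch.
reduce = conv_params(r, 1, c_in)      # 2,056 per 1x1 reduction
reduced = (conv_params(r, 1, c_in)            # plain 1x1 branch
           + reduce + conv_params(r, 3, r)    # 2,056 + 584
           + reduce + conv_params(r, 5, r)    # 2,056 + 1,608
           + reduce)                          # after max pooling
print(reduced)                        # 10416
```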
GoogLeNet: Multiple Cost Functions
• Stacks several inception modules one after the other
– the middle layers tend to "die".
• GoogLeNet therefore introduced two intermediate, auxiliary loss functions, and computed the total loss as a weighted sum of the main loss and the auxiliary losses.
• Of course, the auxiliary losses are used only during training, not during inference.
Pre-Trained Networks
• Pre-trained deep learning models are available to use.
• If weights is None, the weights are randomly initialized. That means you get the VGG16 architecture and you can train it yourself. But be aware that it has roughly 138 million parameters, so you will need a really big training dataset.
• If you use the value imagenet, the weights are the ones obtained by training the network on the imagenet dataset.
Transfer Learning
• Transfer learning is a technique where a model trained to
solve a specific problem is re-purposed for a new
challenge related to the first problem.
• A model trained on the imagenet dataset can be re-purposed to classify dogs' images, but should not be used for speech recognition.

• In image recognition with CNN typically,


– the first layers will learn to detect generic features, and
– the last layers will be able to detect more specific ones.
– In a classification problem, the last layer will have N softmax
neurons (for classifying N classes), and therefore must learn to
be very specific to your problem.
How does Transfer Learning work?
• A network with nL layers
– train a base network (or get a pre-trained model) on a big dataset (called the base dataset). The base dataset should be related to the problem.
– the new or target dataset will typically be much smaller than the base dataset.
– train a new network, called the target network, on the target dataset.
• The target network will typically have the same first nK (with nK < nL) layers as the base network.
• The learnable parameters of the first layers (say 1 to nK, with nK < nL) are inherited from the base pre-trained network and are not changed during the training of the target network.
• Only the last, new layers (from layer nK to nL) are trained.
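A toy numpy sketch of the freezing idea (not the lecture's actual code): layer 1 plays the role of the inherited base layers and is never updated, while the new layer 2 is trained on the small target dataset.

```python
import numpy as np

rng = np.random.default_rng(1)

W1 = rng.normal(size=(4, 8))          # "pre-trained" base weights -> frozen
W2 = rng.normal(size=(8, 3)) * 0.1    # new target layer -> trainable

x = rng.normal(size=(16, 4))          # a small "target dataset"
y = rng.normal(size=(16, 3))

def loss(W2):
    h = np.maximum(x @ W1, 0)         # frozen feature extractor (ReLU)
    return np.mean((h @ W2 - y) ** 2)

W1_before = W1.copy()
loss_before = loss(W2)
lr = 0.005
for _ in range(200):
    h = np.maximum(x @ W1, 0)
    grad_W2 = 2 * h.T @ (h @ W2 - y) / len(x)
    W2 -= lr * grad_W2                # only the new layer is updated

print(np.array_equal(W1, W1_before))  # True: inherited layers never change
print(loss(W2) < loss_before)         # True: the new layer still learns
```

In a real framework this corresponds to marking the base layers as non-trainable (e.g. freezing them) before fitting on the target dataset.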
Schematic representation of transfer learning
A Dog and Cat Problem
Dogs vs. Cats | Kaggle, 800MB
Training Image (3000, 150, 150, 3), Testing Image (1000, 150, 150, 3)

Naïve CNN Approach

With two epochs, 69% validation accuracy and 70% training accuracy.
A Dog and Cat Problem
Transfer Learning Approach 1

The result is an astounding 88% in two epochs, an incredible improvement over before!
A Dog and Cat Problem
Transfer Learning Approach 2

90% accuracy in a few seconds. One epoch takes only six seconds, compared to the 4.5 minutes of Approach 1.

100 epochs??
