Week 7

The document provides an overview of Convolutional Neural Networks (CNNs), covering key concepts such as padding (valid and same), strides, and the introduction of bias in kernels. It discusses pooling techniques like max and average pooling, typical CNN architecture, and popular CNN models like VGG, ResNet, Inception, and MobileNet. Additionally, it highlights the importance of data augmentation in enhancing training datasets for better model generalization.


Convolutional Neural Networks


Padding
• Padding preserves the spatial dimensions of the input feature map after a convolution operation by adding pixels around its boundary
• The most commonly used padding is zero padding
• Zero padding – simply add the value 0 around the borders of the input feature map
• Two types of padding
▪ Valid Padding
▪ Same Padding
Valid Padding
• No padding is added to the input feature map
• The convolution operation is performed only on the valid (existing) pixels of the input feature map, producing an output feature map with smaller dimensions

Example: Input 5 X 5 * filter 3 X 3 -> Output (5-3+1) X (5-3+1) = 3 X 3
Same Padding
• Padding is added to the input feature map.
• How is the padding calculated?
• The output size with padding is n + 2p - f + 1 (n: input size, p: padding per side, f: filter size)
• To keep the output the same size as the input, set n + 2p - f + 1 = n, which gives p = (f - 1) / 2
• The convolution operation is performed on the padded input feature map so that the output feature map has the same dimensions as the input (note: default stride = 1)
Same Padding
Padding p = (3-1)/2 = 1
Actual input 5 X 5 -> after padding 7 X 7
Filter 3 X 3 -> Output (7-3+1) X (7-3+1) = 5 X 5
Strides
• Stride is the step size with which the filter slides over the input feature

Example: Input 5 X 5 * filter 3 X 3, padding = 'valid', stride = 2 -> Output ((5-3)/2 + 1) X ((5-3)/2 + 1) = 2 X 2


Understanding output shape with padding and stride
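The padding and stride rules above can be collected into one small helper; a minimal sketch, with the function name and interface as assumptions:

```python
def conv_output_size(n, f, p=0, s=1):
    """Output size along one dimension of a convolution.

    n: input size, f: filter size, p: padding on each side, s: stride.
    Floor division drops window positions where the filter no longer fits.
    """
    return (n + 2 * p - f) // s + 1

# 'valid' padding, stride 1: 5 X 5 input, 3 X 3 filter -> 3 X 3
print(conv_output_size(5, 3))          # 3
# 'same' padding, p = (f - 1) // 2 = 1, stride 1 -> 5 X 5
print(conv_output_size(5, 3, p=1))     # 5
# 'valid' padding, stride 2 -> 2 X 2
print(conv_output_size(5, 3, s=2))     # 2
```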
Introducing bias in kernel & parameter
Considering: padding = 'valid', stride = 1

Each filter n is a 3 X 3 weight matrix wn[i, j] together with a bias b[n]. Stacking nf such filters (Filter 1 with bias b1, Filter 2 with bias b2, ..., Filter n with bias bn):

Input – 5 x 5
Filter – 3 X 3 X nf
Output – 3 X 3 X nf

Here, filter size = 3 x 3,
Number of parameters for each filter = 9 weights + 1 bias
Total parameters of the layer = (9 + 1) * nf, where nf is the number of filters
Convolution on colored images (RGB)
• The number of channels nc in the input layer will be 3 (Red, Green and Blue)
• Each filter has one weight slice per input channel and is applied across all channels of the input; every filter (Filter 1, Filter 2, ...) produces one channel of the output
Convolution on colored images (RGB)

Considering: padding = 'valid', stride = 1, number of filters = 1

Input – 5 x 5 x 3
Filter – 3 X 3 (X 3 channels)
Output – 3 X 3
Introducing bias in kernel & parameter
Considering: padding = 'valid', stride = 1, number of filters = 1

The single filter W1 has shape 3 x 3 x 3: one 3 X 3 weight slice w1[i, j, c] for each input channel c (c = 0, 1, 2), plus one shared bias b1.

Input – 5 x 5 x 3
Filter (W1) – 3 X 3 X 3
Output – 3 X 3

Here, we have 1 filter with filter size = 3 x 3 (height, width) spanning 3 input channels.
Number of parameters for this filter = 9 * 3 weights + 1 bias = 28
Parameter Calculation
• Input: h - height of the input, w - width of the input, nc - number of channels in the input
• Filter: f - height and width of the filter, nf - number of filters

Weights – (filter height, filter width, number of input channels, number of filters)
Bias – number of filters

Number of parameters of the layer = (f * f * nc + 1) * nf
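The formula can be sanity-checked with a tiny helper (a sketch; the function name is an assumption):

```python
def conv_layer_params(f, nc, nf):
    """Trainable parameters of a conv layer: (f * f * nc + 1) * nf."""
    return (f * f * nc + 1) * nf

# 3 X 3 filter over a 3-channel (RGB) input, 1 filter: 27 weights + 1 bias
print(conv_layer_params(3, 3, 1))   # 28
# Grayscale input (1 channel), 32 filters: (9 + 1) * 32
print(conv_layer_params(3, 1, 32))  # 320
```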


RGB image – Convolution Calculation

Convolution with filter 1:
Input size = 5 X 5 (x 3 channels), padding = 1, stride = 2, filter = 3 X 3

Slide 1:
channel 0: (0*0 + 0*-1 + 0*1) + (0*-1 + 2*1 + 0*-1) + (0*0 + 1*0 + 1*0)
+ channel 1: (0*1 + 0*0 + 0*-1) + (0*0 + 1*0 + 1*1) + (0*-1 + 1*-1 + 2*1)
+ channel 2: (0*1 + 0*1 + 0*-1) + (0*0 + 0*-1 + 0*0) + (0*1 + 0*-1 + 2*1)
+ bias: 1

In general: x[:,:,0] * w0[:,:,0] + x[:,:,1] * w0[:,:,1] + x[:,:,2] * w0[:,:,2] + b0


Slides 2 through 9 proceed the same way: with padding 1 the 5 X 5 input becomes 7 X 7, and with stride 2 the 3 X 3 filter stops at (7-3)/2 + 1 = 3 positions per dimension, i.e. 9 slides in total, producing a 3 X 3 output feature map.
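The slide-by-slide sum above can be written out in plain Python; a minimal sketch (no padding logic, names are assumptions), not an optimized implementation:

```python
def conv2d_multichannel(x, w, b, stride=1):
    """Convolve a multi-channel input with one filter.

    x: input as nested lists [H][W][C] (already padded if desired)
    w: filter as nested lists [f][f][C]; b: scalar bias
    Returns the 2-D output feature map for this single filter.
    """
    H, W, C = len(x), len(x[0]), len(x[0][0])
    f = len(w)
    out = []
    for i in range(0, H - f + 1, stride):
        row = []
        for j in range(0, W - f + 1, stride):
            # Sum products over the f x f window and all channels, add bias
            s = b
            for di in range(f):
                for dj in range(f):
                    for c in range(C):
                        s += x[i + di][j + dj][c] * w[di][dj][c]
            row.append(s)
        out.append(row)
    return out

# Toy example: 3 x 3 x 2 input, 2 x 2 x 2 filter, bias 1
x = [[[1, 0], [0, 1], [1, 1]],
     [[0, 0], [1, 1], [0, 1]],
     [[1, 1], [0, 0], [1, 0]]]
w = [[[1, 0], [0, 1]],
     [[0, 1], [1, 0]]]
print(conv2d_multichannel(x, w, 1))   # [[4, 3], [3, 4]]
```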
Pooling
• Progressively reduces the spatial size of the representation to reduce network complexity and computational cost
• Helps mitigate overfitting during the feature-extraction process
• Widely used pooling layers:
• Max Pooling
• Average Pooling
Max Pooling
• Max pooling applies a MAX filter to each window of the feature map
• Max pooling selects the brightest pixels from the image; apply this filter when only the lighter (most prominent) pixels of the image are of interest

Input 5 X 5:
1   2   5   5   3
1   6   1   5   0
10  6   5   3   1
7   5   11  4   4
1  -2   9   3   0

Pooling filter 3 X 3, stride = 2 -> Output ((5-3)/2 + 1) X ((5-3)/2 + 1) = 2 X 2:
10  5
11  11
Average Pooling
• This pooling layer works by taking the average of each pool
• It differs from max pooling in that it retains more information about the "less important" elements of a block, or pool

Input 4 X 4:
6   1   5   0
6   5   3   1
5   11  4   4
-2  9   3   0

Pooling filter 2 X 2, stride = 2 -> Output ((4-2)/2 + 1) X ((4-2)/2 + 1) = 2 X 2:
4.5   2.25
5.75  2.75
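Both worked examples can be reproduced by one small routine; a minimal sketch (the function name and 'mode' flag are assumptions):

```python
def pool2d(x, f, stride, mode="max"):
    """Max or average pooling over a 2-D feature map (nested lists)."""
    H, W = len(x), len(x[0])
    out = []
    for i in range(0, H - f + 1, stride):
        row = []
        for j in range(0, W - f + 1, stride):
            window = [x[i + di][j + dj] for di in range(f) for dj in range(f)]
            row.append(max(window) if mode == "max" else sum(window) / len(window))
        out.append(row)
    return out

# Max-pooling example from the slides: 5 X 5 input, 3 X 3 filter, stride 2
mx = [[1, 2, 5, 5, 3],
      [1, 6, 1, 5, 0],
      [10, 6, 5, 3, 1],
      [7, 5, 11, 4, 4],
      [1, -2, 9, 3, 0]]
print(pool2d(mx, 3, 2, "max"))   # [[10, 5], [11, 11]]

# Average-pooling example: 4 X 4 input, 2 X 2 filter, stride 2
av = [[6, 1, 5, 0],
      [6, 5, 3, 1],
      [5, 11, 4, 4],
      [-2, 9, 3, 0]]
print(pool2d(av, 2, 2, "avg"))   # [[4.5, 2.25], [5.75, 2.75]]
```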
Typical Architecture of CNN
• Convolutional layers: perform convolution operations by applying filters (sets of learnable parameters) to the input
• Pooling layers: perform pooling operations, reducing the spatial dimensions of the convolutional layers' output
• Fully connected layers: connect every neuron in one layer to every neuron in the next layer and compute the final classification or regression output
Typical Architecture of CNN
Input -> ((convolution -> relu) x n -> pooling) x m -> (fully-connected layer -> relu) x k -> (output layer)

The (convolution/pooling) blocks perform feature extraction; the fully-connected layers form the classifier. Common instantiations:

• (convolution/pooling) + (1 fully-connected layer) + (output layer)
• (convolution/pooling/convolution/pooling) + (1 fully-connected layer) + (output layer)
• (convolution/convolution/pooling) + (1 fully-connected layer) + (output layer)
• (convolution/convolution/pooling/convolution/convolution/pooling) + (1 fully-connected layer) + (output layer)
Typical Architecture of CNN
• Number of Filters/Kernels: Varies depending on the specific architecture but may start with 32
or 64 filters.
• Filter/Kernel Size: Default values often include 3x3 or 5x5.
• Stride: Default stride is usually set to 1.
• Padding: Often set to 'valid' (no padding) or 'same' (zero padding).
• Activation Function: The default activation function is often the rectified linear unit (ReLU).
• Pooling: Max pooling layers with a default pool size of 2x2 are commonly used.
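To see how these defaults interact, the sketch below traces shapes and parameter counts through an illustrative two-block stack (3 x 3 'same' convolutions, 2 x 2 max pooling; the filter counts 32 and 64 are assumed defaults, not values from the slides):

```python
def trace_cnn(h, w, c, filter_counts=(32, 64)):
    """Trace output shape and parameter count through (conv -> pool) blocks.

    Each conv uses a 3 x 3 filter, stride 1, 'same' padding (p = 1), so it
    preserves h and w; each 2 x 2 max pool with stride 2 halves h and w.
    """
    total_params = 0
    for nf in filter_counts:
        total_params += (3 * 3 * c + 1) * nf  # (f*f*nc + 1) * nf
        c = nf                                # output channels = filters
        h, w = h // 2, w // 2                 # pooling halves each side
    return (h, w, c), total_params

shape, params = trace_cnn(32, 32, 3)
print(shape, params)   # (8, 8, 64) 19392  (= 896 + 18496 parameters)
```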
Popular CNN Architectures
• VGG (Visual Geometry Group):
• VGG16 and VGG19 are popular models known for their simplicity and effectiveness. They have 16
and 19 weight layers, respectively.
• Pretrained models are available in various deep learning frameworks.
• ResNet (Residual Network):
• ResNet models, including ResNet-50, ResNet-101, and ResNet-152, introduced residual
connections that allow for training very deep networks.
• Pretrained ResNet models are widely used for various computer vision tasks.
• Inception (GoogLeNet):
• Inception models, including InceptionV3 and InceptionV4, use a network architecture with multiple
branches of different filter sizes to capture a wide range of features.
• Pretrained Inception models are useful for tasks that require multi-scale feature extraction.
• MobileNet:
• MobileNet models are designed for efficient inference on mobile devices. They offer a balance
between model size and accuracy.
• Pretrained MobileNet models are commonly used in mobile and embedded applications.
VGG 16 – Architecture
Data Augmentation
• Techniques for increasing the size of the training dataset by applying random transformations to the input data, such as rotation, flipping, and scaling
• Helps increase the amount of data available to a machine learning model by adding slightly modified copies of already existing data or newly created synthetic data derived from it
• Data augmentation enhances the diversity of the training dataset, which can improve the model's ability to generalize to new data
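A minimal sketch of the idea, using only flips and a 90-degree rotation on a nested-list image (real pipelines also apply scaling, shifts, etc.; the function and its transform set are assumptions):

```python
import random

def augment(image, seed=None):
    """Return a randomly transformed copy of a 2-D image (nested lists)."""
    rng = random.Random(seed)
    out = [row[:] for row in image]
    if rng.random() < 0.5:      # random horizontal flip
        out = [row[::-1] for row in out]
    if rng.random() < 0.5:      # random vertical flip
        out = out[::-1]
    if rng.random() < 0.5:      # random 90-degree rotation
        out = [list(r) for r in zip(*out[::-1])]
    return out

img = [[1, 2], [3, 4]]
# Each call yields one of 8 orientations; pixel values are preserved
print(augment(img, seed=0))
```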
