What Is Padding in CNN
Padding in Convolutional Neural Networks (CNNs) refers to the technique of adding extra border
pixels to an input image. This is done to control the spatial dimensions of the output feature maps
after convolutional operations.
1. Valid Padding: No padding is added, so the filter is applied only at positions where it fits entirely inside the input. The output feature map is therefore smaller than the input.
2. Same Padding: Here, padding is added to the input image so that the spatial dimensions of the output feature map remain the same as the input image (when the stride is 1). The extra pixels added around the border are typically zeros (zero-padding), but other padding strategies are also possible.
Padding is essential because it helps in preserving spatial information, prevents information loss at
the edges of the image, and ensures that the convolutional layers can process the entire image,
including its borders.
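As a quick illustration, the sketch below (PyTorch is assumed here; the notes do not name a framework) compares a 3x3 convolution with no padding against one with a one-pixel zero border on a 32x32 input:

import torch
import torch.nn as nn

x = torch.randn(1, 3, 32, 32)                            # a batch with one 32x32 RGB image

valid_conv = nn.Conv2d(3, 8, kernel_size=3, padding=0)   # no padding: the output shrinks
same_conv = nn.Conv2d(3, 8, kernel_size=3, padding=1)    # 1-pixel zero border keeps 32x32

print(valid_conv(x).shape)   # torch.Size([1, 8, 30, 30])
print(same_conv(x).shape)    # torch.Size([1, 8, 32, 32])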
2. Stride in CNN
Stride in Convolutional Neural Networks (CNNs) is a hyperparameter that determines how much the
filter/kernel moves across the input image or feature map during the convolution operation.
When a filter is applied to an input image or feature map, it slides across the input by a certain
number of pixels defined by the stride. The stride value determines the amount of movement for the
filter in both the horizontal and vertical directions.
1. Stride of 1: In this case, the filter moves one pixel at a time in both the horizontal and
vertical directions. This means the filter shifts by one pixel for each convolution operation,
leading to overlapping receptive fields and higher computational cost but preserving more
spatial information.
2. Stride of N (where N > 1): When the stride is greater than 1, the filter moves N pixels at a
time in both directions. This results in a reduction of the output feature map size because
the filter covers a larger area with fewer overlapping regions. Using a larger stride can reduce
computational complexity and memory usage but may lead to information loss and reduced
spatial resolution in the output feature map.
The choice of stride in a CNN depends on the specific task and architecture design considerations.
Smaller strides preserve more spatial information but increase computational cost, while larger
strides reduce spatial resolution but can improve efficiency in certain scenarios.
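The relationship between input size, kernel size, padding, and stride can be written as out = floor((n + 2p - k) / s) + 1. The small helper below (an illustrative sketch, not part of the original notes) makes the effect of the stride concrete:

def conv_output_size(n, k, p=0, s=1):
    # out = floor((n + 2*p - k) / s) + 1
    return (n + 2 * p - k) // s + 1

print(conv_output_size(32, k=3, p=1, s=1))   # 32: same padding, stride 1
print(conv_output_size(32, k=3, p=1, s=2))   # 16: stride 2 halves the spatial resolution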
3. ReLU in CNN
ReLU (Rectified Linear Unit) is an activation function commonly used in Convolutional Neural
Networks (CNNs) and other deep learning architectures. It introduces non-linearity into the network,
enabling it to learn complex patterns and relationships in the data.
1. Definition: ReLU applies the function f(x) = max(0, x) element-wise: positive values pass through unchanged, and negative values are set to zero.
2. Activation: During the forward pass of a CNN, each neuron's output is calculated as the result of applying the ReLU function to the weighted sum of its inputs and biases.
3. Benefits:
Non-linearity: ReLU introduces non-linearity into the network, allowing it to model
and learn complex relationships in the data that linear functions cannot capture.
Sparsity: ReLU produces sparsity in the network because any negative input is
transformed to zero. This sparsity can help in reducing overfitting by preventing the
network from becoming too reliant on specific features.
4. Training: During training, ReLU helps in mitigating the vanishing gradient problem by
allowing gradients to flow more freely through the network compared to saturating
activation functions like sigmoid or tanh.
However, ReLU is not without its limitations. One major drawback is the "dying ReLU" problem,
where some neurons may become inactive (output zero) for all inputs during training, effectively
"dying" and not contributing to the learning process. This can happen if the learning rate is set too
high or due to unlucky weight initialization, especially in deeper networks. Variants like Leaky ReLU,
Parametric ReLU, and Exponential Linear Units (ELUs) are designed to address some of these issues.
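The following NumPy sketch (illustrative only; deep learning frameworks provide these as built-ins) shows ReLU alongside the Leaky ReLU variant mentioned above:

import numpy as np

def relu(x):
    return np.maximum(0.0, x)              # f(x) = max(0, x)

def leaky_relu(x, alpha=0.01):
    return np.where(x > 0, x, alpha * x)   # small negative slope keeps "dead" units trainable

z = np.array([-2.0, -0.5, 0.0, 1.5, 3.0])
print(relu(z))         # [0.   0.   0.   1.5  3. ]
print(leaky_relu(z))   # [-0.02  -0.005  0.     1.5    3.   ]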
Key advantages of using ReLU layers in a CNN include:
1. Computational Efficiency: ReLU involves only a simple thresholding operation, making it cheaper to compute than sigmoid or tanh.
2. Non-Linearity: ReLU introduces non-linearity into the network, allowing CNNs to learn complex patterns and relationships in the data. This non-linearity is crucial for the network's ability to model and approximate complex functions.
3. Sparse Activation: ReLU produces sparsity in the network by setting all negative values to
zero. This sparsity can be beneficial as it helps in reducing the computational load and
memory requirements, especially in deep networks with a large number of parameters.
4. Avoids Vanishing Gradient: ReLU helps in mitigating the vanishing gradient problem, which can occur in deep networks with saturating activation functions like sigmoid or tanh. By allowing gradients to flow more freely during backpropagation, ReLU facilitates better and more stable training of deep CNNs (a small gradient comparison follows this list).
5. Faster Convergence: Due to its non-saturating nature and avoidance of the vanishing
gradient problem, ReLU layers can lead to faster convergence during training. This can result
in shorter training times and quicker model development.
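To make the gradient-flow point above concrete, the short NumPy comparison below (an added illustration, not from the notes) evaluates the derivatives of sigmoid and ReLU: the sigmoid gradient collapses toward zero for large |x|, while the ReLU gradient stays at 1 for any positive input.

import numpy as np

def sigmoid_grad(x):
    s = 1.0 / (1.0 + np.exp(-x))
    return s * (1.0 - s)                   # never larger than 0.25, nearly 0 for large |x|

def relu_grad(x):
    return (x > 0).astype(float)           # exactly 1 for positive inputs, 0 otherwise

x = np.array([-5.0, -1.0, 0.5, 5.0])
print(sigmoid_grad(x))   # approx. [0.0066 0.1966 0.235  0.0066]
print(relu_grad(x))      # [0. 0. 1. 1.]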
Pooling Layer in CNN
Here are some key points about pooling layers in CNNs (a short sketch follows this list):
1. Downsampling: Pooling layers reduce the spatial dimensions of the input feature maps, effectively downsampling the information. This helps in controlling the model's complexity, reducing computational requirements, and preventing overfitting by focusing on the most important features.
2. Max Pooling: In max pooling, the maximum value within each region of the feature map is
taken as the representative value. Max pooling helps in preserving the most prominent
features and highlighting important spatial information.
3. Average Pooling: In average pooling, the average value within each region of the feature
map is computed. Average pooling can provide a smoother downsampling effect and may be
less sensitive to noise compared to max pooling.
4. Stride: Pooling layers often use a stride parameter, which determines the step size at which
pooling operations are applied across the feature map. A larger stride leads to more
aggressive downsampling and reduces the output size further.
5. Pooling Size: The size of the pooling window (pooling size) defines the region over which the
pooling operation is performed. Commonly used pooling window sizes are 2x2 or 3x3.
6. Spatial Hierarchy: Pooling layers help in creating a spatial hierarchy of features, where lower
layers capture fine-grained details, and higher layers capture more abstract and generalized
features.
7. Reduced Overfitting: By reducing the spatial dimensions and focusing on the most important
features, pooling layers can help in reducing overfitting, especially in deep CNN
architectures.
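A brief sketch of the downsampling behaviour described above (PyTorch assumed): a 2x2 pooling window with stride 2 halves each spatial dimension of the feature maps.

import torch
import torch.nn as nn

x = torch.randn(1, 16, 32, 32)                     # 16 feature maps of size 32x32

max_pool = nn.MaxPool2d(kernel_size=2, stride=2)
avg_pool = nn.AvgPool2d(kernel_size=2, stride=2)

print(max_pool(x).shape)   # torch.Size([1, 16, 16, 16])
print(avg_pool(x).shape)   # torch.Size([1, 16, 16, 16])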
Fully Connected Layer in CNN
Here are some key points about fully connected layers in CNNs:
1. Role in CNNs: Fully connected layers are typically used towards the end of the CNN
architecture, after the convolutional and pooling layers. They help in learning complex
patterns by combining spatially extracted features into higher-level representations suitable
for the final classification or regression tasks.
2. Flattening: Before passing the outputs of the convolutional and pooling layers to the fully connected layers, the feature maps are typically flattened. Flattening means reshaping each example's 3D stack of feature maps (channels x height x width) into a 1D vector (or, with a batch dimension, the 4D output into a 2D matrix), where each element represents a feature.
3. Parameters: Fully connected layers have a large number of parameters, especially if the
preceding layers have produced a high-dimensional output. Each neuron in a fully connected
layer is connected to every neuron in the previous layer, resulting in a dense connectivity
pattern that contributes to the model's capacity to learn complex relationships.
4. Activation Functions: Each neuron in a fully connected layer typically uses an activation
function such as ReLU, sigmoid, or tanh to introduce non-linearity and enable the model to
learn non-linear mappings between input features and output classes.
5. Output Layer: The last fully connected layer in a CNN is usually the output layer, which has
neurons corresponding to the number of classes in a classification task or the number of
output dimensions in a regression task. The output layer usually employs an appropriate
activation function (e.g., softmax for classification or linear activation for regression).
6. Training and Optimization: Fully connected layers are trained using backpropagation along
with the rest of the CNN layers. Optimization techniques such as stochastic gradient descent
(SGD), Adam, or RMSprop are commonly used to update the weights and biases of fully
connected layers during training.
Fully connected layers are crucial for learning high-level abstractions and making final predictions in
CNNs. However, their large number of parameters can also make them prone to overfitting, so
proper regularization techniques and optimization strategies are essential for effective training and
generalization.
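The flatten-then-dense pattern described above can be sketched as follows (PyTorch assumed; the layer sizes and the 10-class output are hypothetical):

import torch
import torch.nn as nn

feature_maps = torch.randn(4, 64, 7, 7)    # batch of 4 examples, 64 channels of 7x7 features

head = nn.Sequential(
    nn.Flatten(),                  # reshape each example's 64x7x7 maps into a 3136-element vector
    nn.Linear(64 * 7 * 7, 128),    # fully connected: every input connects to every neuron
    nn.ReLU(),
    nn.Linear(128, 10),            # output layer: one logit per class
)

logits = head(feature_maps)
probs = torch.softmax(logits, dim=1)       # softmax activation for classification
print(logits.shape, probs.shape)           # torch.Size([4, 10]) for both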
Max Pooling vs. Average Pooling
The two most common pooling operations work as follows:
1. Max Pooling:
In max pooling, each region of the input feature map (usually non-overlapping
regions defined by a pooling size and a stride) is reduced to a single value, which is
the maximum value within that region.
Max pooling helps in retaining the most prominent features within each region,
thereby preserving important spatial information.
Example: If a 2x2 max pooling layer with a stride of 2 is applied to a feature map,
each 2x2 region in the feature map is reduced to a single maximum value, and the
output feature map size is halved in both dimensions.
2. Average Pooling:
In average pooling, each region of the input feature map is reduced to a single value,
which is the average (mean) value of all the values within that region.
Average pooling helps in smoothing out the features and can be less sensitive to
outliers or noisy information compared to max pooling.
Example: If a 2x2 average pooling layer with a stride of 2 is applied to a feature map,
each 2x2 region in the feature map is reduced to a single average value, and the
output feature map size is halved in both dimensions.
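A worked NumPy version of the two examples above (an added sketch; the input values are arbitrary) applies 2x2 max and average pooling with stride 2 to a 4x4 feature map, halving both dimensions:

import numpy as np

fmap = np.array([[1, 3, 2, 4],
                 [5, 6, 1, 2],
                 [7, 2, 9, 0],
                 [1, 4, 3, 8]], dtype=float)

def pool2x2(x, mode="max"):
    h, w = x.shape
    blocks = x.reshape(h // 2, 2, w // 2, 2)     # group into non-overlapping 2x2 blocks
    return blocks.max(axis=(1, 3)) if mode == "max" else blocks.mean(axis=(1, 3))

print(pool2x2(fmap, "max"))   # [[6. 4.]
                              #  [7. 9.]]
print(pool2x2(fmap, "avg"))   # [[3.75 2.25]
                              #  [3.5  5.  ]]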
Dropout Layer in CNN
A Dropout layer is a regularization technique commonly used in Convolutional Neural Networks
(CNNs) and other deep learning models to prevent overfitting. The main idea behind Dropout is to
randomly deactivate (set to zero) a fraction of neurons in the network during training, which forces
the network to learn more robust and generalized representations.
Here's how Dropout works in a CNN:
1. Random Deactivation: During each training iteration, every neuron in the dropout layer is set to zero with a probability given by the dropout rate (e.g., 0.5), while the remaining activations are passed on to the next layer.
2. Variability: Dropout introduces variability and randomness into the training process. By randomly dropping neurons, the network learns to be less dependent on specific neurons or features, thus reducing the risk of overfitting to the training data.
3. Ensemble Effect: Dropout can be seen as training multiple subnetworks within the main network. Each dropout iteration trains a different subset of neurons, effectively creating an ensemble of models. During inference (testing or prediction), all neurons are active, and the activations are rescaled (by the keep probability, or by scaling up during training in the inverted-dropout formulation) so that the expected outputs match those seen in training.
4. Usage: Dropout layers are typically inserted between fully connected layers in a CNN architecture, although they can also be applied after convolutional layers in certain cases. The dropout rate is a hyperparameter that needs to be tuned based on the specific dataset and model complexity.
5. Training vs. Inference: During training, Dropout is active, while during inference (when making predictions) it is turned off; depending on the formulation, activations are scaled by the keep probability at inference or scaled up during training, so that the expected output range is preserved, as illustrated in the sketch below.
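The training/inference difference is easy to see in code (PyTorch assumed; it uses the inverted-dropout formulation, so surviving activations are scaled by 1/(1-p) during training and nothing is rescaled at inference):

import torch
import torch.nn as nn

drop = nn.Dropout(p=0.5)
x = torch.ones(1, 8)

drop.train()        # training mode: roughly half the values become 0, the rest become 2.0
print(drop(x))

drop.eval()         # inference mode: dropout is disabled, the input passes through unchanged
print(drop(x))      # tensor of ones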
Local Response Normalization
Local Response Normalization (LRN) is a technique used in Convolutional Neural Networks (CNNs) to
normalize the activations of neurons within a local neighborhood across different feature maps. LRN
was popularized by the AlexNet architecture, which won the ImageNet Large Scale Visual Recognition
Challenge in 2012.
1. Local Neighborhood:
For each location in a feature map, LRN considers a local neighborhood defined by a
specified window size along the channel dimension. The window size determines
how many neighboring activations are included in the normalization.
2. Normalization Formula:
The normalized activation of a neuron is computed using the LRN formula, which divides the activation of the neuron by a term based on the sum of squared activations within the local neighborhood. Concretely, for an activation a_i at a given spatial position in channel i, the normalized value is b_i = a_i / (k + alpha * sum_j a_j^2)^beta, where the sum runs over a window of n neighboring channels, alpha is a scaling factor, beta is an exponent, and k is a small constant that avoids division by zero. A small implementation sketch appears at the end of this section.
The LRN formula helps in normalizing the responses of neurons based on their relative contributions within the local context.
3. Normalization Benefits:
LRN is intended to enhance the contrast between activated neurons and suppress
responses that are not significantly higher than their neighbors. This can lead to
more selective and robust feature representations.
LRN also helps in reducing the sensitivity of the network to variations in input data
and can improve generalization performance.
4. LRN in AlexNet:
In AlexNet, LRN was applied after the ReLU activation function in some convolutional
layers. This helped in normalizing the activations before passing them to subsequent
layers, contributing to the overall performance of the network on image
classification tasks.
5. Limitations and Alternatives:
LRN has some limitations, such as being less stable in deep networks and having
hyperparameters that need careful tuning.
Modern CNN architectures often use alternative normalization techniques like Batch
Normalization (BatchNorm) or Group Normalization (GroupNorm), which offer more
stable and effective normalization strategies, especially in deeper networks.
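As a rough illustration of the formula from the "Normalization Formula" point, the NumPy sketch below implements across-channel LRN directly; the parameter values mirror AlexNet's commonly cited settings (size=5, alpha=1e-4, beta=0.75, k=2) and are assumptions, not values taken from these notes:

import numpy as np

def local_response_norm(a, size=5, alpha=1e-4, beta=0.75, k=2.0):
    # a has shape (channels, height, width); normalization runs across channels
    c = a.shape[0]
    half = size // 2
    out = np.empty_like(a)
    for i in range(c):
        lo, hi = max(0, i - half), min(c, i + half + 1)          # local channel neighborhood
        denom = k + alpha * np.sum(a[lo:hi] ** 2, axis=0)        # sum of squared neighbor activations
        out[i] = a[i] / denom ** beta                            # b_i = a_i / (k + alpha * sum a_j^2)^beta
    return out

activations = np.random.rand(16, 8, 8).astype(np.float32)        # 16 feature maps of size 8x8
print(local_response_norm(activations).shape)                    # (16, 8, 8)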