Understanding of Convolutional Neural Network (CNN) - Deep Learning

Convolutional neural networks (CNNs) are a type of neural network used for image recognition and classification. A CNN takes an input image and passes it through multiple convolution and pooling layers to extract features, followed by fully connected layers to classify the image. Key aspects of CNNs include the use of filters in convolution layers to detect features like edges, max pooling to reduce dimensionality, and multiple layers to learn increasingly complex patterns in the data. CNNs are widely used for computer vision tasks like image classification.

Uploaded by

Mark Alwin Caimbre


4/11/2019 Understanding of Convolutional Neural Network (CNN) — Deep Learning

Understanding of Convolutional
Neural Network (CNN) —
Deep Learning
Prabhu
Mar 4, 2018 · 5 min read

In neural networks, the convolutional neural network (ConvNet or CNN) is one of the main architectures for image recognition and image classification. Object detection and face recognition are some of the areas where CNNs are widely used.

CNN image classification takes an input image, processes it and classifies it under certain categories (e.g., Dog, Cat, Tiger, Lion). A computer sees an input image as an array of pixels, and the size of that array depends on the image resolution. Based on the image resolution, it will see h x w x d (h = height, w = width, d = dimension, i.e. the number of color channels). For example, an RGB image can be a 6 x 6 x 3 array (3 refers to the RGB values) and a grayscale image can be a 4 x 4 x 1 array.
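As a minimal sketch of the h x w x d layout described above, the two example images can be written as nested Python lists. The pixel values and the helper name `shape` are made up for illustration only.

```python
def shape(image):
    """Return (height, width, depth) of an image stored as nested lists."""
    return (len(image), len(image[0]), len(image[0][0]))

# 6 x 6 RGB image: each pixel holds three values (red, green, blue).
# The pixel values are hypothetical.
rgb_image = [[[255, 0, 0] for _ in range(6)] for _ in range(6)]

# 4 x 4 grayscale image: each pixel holds a single intensity value.
gray_image = [[[128] for _ in range(4)] for _ in range(4)]

print(shape(rgb_image))   # (6, 6, 3)
print(shape(gray_image))  # (4, 4, 1)
```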

Figure 1 : Array of RGB Matrix

Technically, to train and test deep learning CNN models, each input image passes through a series of convolution layers with filters (kernels), pooling layers and fully connected (FC) layers, and then a softmax function is applied to classify an object with probabilistic values between 0 and 1. The figure below shows the complete flow of a CNN that processes an input image and classifies the objects based on these values.

https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning-99760835f148

Figure 2 : Neural network with many convolutional layers

Convolution Layer

Convolution is the first layer to extract features from an input image. Convolution preserves the relationship between pixels by learning image features using small squares of input data. It is a mathematical operation that takes two inputs: an image matrix and a filter (kernel).

Figure 3: Image matrix multiplies kernel or filter matrix

Consider a 5 x 5 image whose pixel values are 0 or 1 and a 3 x 3 filter matrix, as shown below.

Figure 4: Image matrix multiplies kernel or filter matrix

Then the convolution of the 5 x 5 image matrix with the 3 x 3 filter matrix produces an output called the “Feature Map”, as shown below.

Figure 5: 3 x 3 Output matrix
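The computation can be sketched in plain Python as follows. The figures are not reproduced in this text, so the image and filter values below are assumed example values, and `convolve2d` is a hypothetical helper name. Like most deep learning libraries, the sketch slides the filter without flipping it (strictly speaking a cross-correlation), which makes no difference for the symmetric filter used here.

```python
def convolve2d(image, kernel):
    """Slide a square kernel over a square image (stride 1, no padding)
    and sum the element-wise products at each position."""
    n, f = len(image), len(kernel)
    out_size = n - f + 1
    output = []
    for i in range(out_size):
        row = []
        for j in range(out_size):
            total = 0
            for a in range(f):
                for b in range(f):
                    total += image[i + a][j + b] * kernel[a][b]
            row.append(total)
        output.append(row)
    return output

# Assumed 5 x 5 binary image and 3 x 3 filter (illustrative values).
image = [
    [1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 1, 1, 1],
    [0, 0, 1, 1, 0],
    [0, 1, 1, 0, 0],
]
kernel = [
    [1, 0, 1],
    [0, 1, 0],
    [1, 0, 1],
]

feature_map = convolve2d(image, kernel)
for row in feature_map:
    print(row)
# [4, 3, 4]
# [2, 4, 3]
# [2, 3, 4]
```

A 5 x 5 input and a 3 x 3 filter give a 3 x 3 feature map because the filter fits in 5 - 3 + 1 = 3 positions along each axis.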

Convolving an image with different filters can perform operations such as edge detection, blurring and sharpening. The example below shows the resulting images after applying different types of filters (kernels).


Figure 7 : Some common filters
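To make the effect of different filters concrete, here is a small sketch using three widely used 3 x 3 kernels (edge detection, sharpen, box blur). The figure's exact kernels are not reproduced in this text, so these values, the test image and the helper name `convolve2d` are assumptions for illustration.

```python
def convolve2d(image, kernel):
    """Stride-1, no-padding convolution of an image with a square kernel."""
    f = len(kernel)
    rows = len(image) - f + 1
    cols = len(image[0]) - f + 1
    return [[sum(image[i + a][j + b] * kernel[a][b]
                 for a in range(f) for b in range(f))
             for j in range(cols)]
            for i in range(rows)]

# Commonly used kernels (assumed here, not taken from the figure).
EDGE = [[-1, -1, -1], [-1, 8, -1], [-1, -1, -1]]
SHARPEN = [[0, -1, 0], [-1, 5, -1], [0, -1, 0]]
BOX_BLUR = [[1 / 9] * 3 for _ in range(3)]

# Hypothetical 4 x 4 image with a vertical edge: dark left, bright right.
step = [[0, 0, 10, 10] for _ in range(4)]

print(convolve2d(step, EDGE))      # [[-30, 30], [-30, 30]]
print(convolve2d(step, SHARPEN))   # [[-10, 20], [-10, 20]]
print(convolve2d(step, BOX_BLUR))  # values between 0 and 10: the edge is smoothed
```

The edge kernel sums to zero, so it outputs zero on flat regions and responds only where neighbouring pixels differ.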

Strides

Stride is the number of pixels by which the filter shifts over the input matrix. When the stride is 1, we move the filter 1 pixel at a time. When the stride is 2, we move the filter 2 pixels at a time, and so on. The figure below shows how convolution works with a stride of 2.


Figure 6 : Stride of 2 pixels
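A rough sketch of how the stride changes the output: without padding, an n x n input and an f x f filter give floor((n - f) / stride) + 1 positions per axis. The 7 x 7 input, the identity kernel and the function names below are illustrative assumptions.

```python
def output_size(n, f, stride=1):
    """Number of filter positions along one axis (no padding)."""
    return (n - f) // stride + 1

def convolve2d_strided(image, kernel, stride=1):
    """Convolution of a square image where the filter moves `stride` pixels per step."""
    f = len(kernel)
    size = output_size(len(image), f, stride)
    return [[sum(image[i * stride + a][j * stride + b] * kernel[a][b]
                 for a in range(f) for b in range(f))
             for j in range(size)]
            for i in range(size)]

# Hypothetical 7 x 7 input holding the values 0..48, row by row.
image = [[r * 7 + c for c in range(7)] for r in range(7)]
# Identity kernel: each output is simply the pixel at the window centre.
kernel = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]

print(output_size(7, 3, stride=1))  # 5
print(output_size(7, 3, stride=2))  # 3
print(convolve2d_strided(image, kernel, stride=2))
# [[8, 10, 12], [22, 24, 26], [36, 38, 40]]
```

With a stride of 2 the window centres land on every second pixel, so the output shrinks from 5 x 5 to 3 x 3.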

Padding

Sometimes the filter does not fit the input image perfectly. We have two options:

• Pad the picture with zeros (zero-padding) so that it fits

• Drop the part of the image where the filter does not fit. This is called valid padding, which keeps only the valid part of the image.
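The two options can be sketched as follows. `zero_pad` and `output_size` are hypothetical helper names, and the 5 x 5 image is made up. With padding p, the number of filter positions per axis becomes floor((n + 2p - f) / stride) + 1.

```python
def zero_pad(image, p):
    """Surround the image with a border of p zeros on every side."""
    width = len(image[0]) + 2 * p
    padded = [[0] * width for _ in range(p)]          # top border
    for row in image:
        padded.append([0] * p + list(row) + [0] * p)  # left and right borders
    padded.extend([0] * width for _ in range(p))      # bottom border
    return padded

def output_size(n, f, stride=1, padding=0):
    """Filter positions along one axis for input size n and filter size f."""
    return (n + 2 * padding - f) // stride + 1

image = [[1] * 5 for _ in range(5)]  # hypothetical 5 x 5 image
padded = zero_pad(image, 1)

print(len(padded), len(padded[0]))   # 7 7
print(output_size(5, 3))             # 3  (valid padding: border positions dropped)
print(output_size(5, 3, padding=1))  # 5  (zero-padding keeps the input size)
```

Padding a 5 x 5 image by one pixel lets a 3 x 3 filter be centred on every original pixel, which is why the output keeps the input size.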

Non Linearity (ReLU)

ReLU stands for Rectified Linear Unit, a non-linear operation. Its output is ƒ(x) = max(0, x).

Why ReLU is important: ReLU’s purpose is to introduce non-linearity in our ConvNet, since most of the real-world data we would want our ConvNet to learn is non-linear, while convolution itself is a linear operation. ReLU does this by replacing every negative value in the feature map with zero.

Figure 7 : ReLU operation
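Applied to a feature map, ƒ(x) = max(0, x) acts on each element separately. A minimal sketch with made-up feature map values:

```python
def relu(feature_map):
    """Apply f(x) = max(0, x) to every element of a 2D feature map."""
    return [[max(0, v) for v in row] for row in feature_map]

# Hypothetical feature map containing negative responses.
feature_map = [
    [15, -20, 3],
    [-7, 0, 42],
    [-1, 8, -100],
]

print(relu(feature_map))
# [[15, 0, 3], [0, 0, 42], [0, 8, 0]]
```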


Other non-linear functions such as tanh or sigmoid can also be used instead of ReLU. Most data scientists use ReLU since, performance-wise, it is usually better than the other two.
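For comparison, the three activation functions can be evaluated side by side. This is only an illustrative sketch using the standard `math` module, with arbitrarily chosen sample inputs.

```python
import math

def relu(x):
    """Rectified Linear Unit: zero for negative inputs, identity otherwise."""
    return max(0.0, x)

def sigmoid(x):
    """Logistic sigmoid: squashes any input into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))

# math.tanh squashes any input into the range (-1, 1).
for x in (-2.0, 0.0, 2.0):
    print(f"x={x:5.1f}  relu={relu(x):.3f}  "
          f"tanh={math.tanh(x):.3f}  sigmoid={sigmoid(x):.3f}")
```

Unlike tanh and sigmoid, ReLU does not flatten out for large positive inputs, and it needs only a comparison rather than an exponential.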

Pooling Layer

The pooling layer reduces the number of parameters when the images are too large. Spatial pooling, also called subsampling or downsampling, reduces the dimensionality of each feature map but retains the important information. Spatial pooling can be of different types:

• Max Pooling

• Average Pooling

• Sum Pooling

Max pooling takes the largest element from each window of the rectified feature map. Average pooling takes the average of the elements in each window instead. Taking the sum of all elements in the window is called sum pooling.

Figure 8 : Max Pooling
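The three pooling types differ only in how each window is reduced to one number. A minimal sketch, assuming a 2 x 2 window with stride 2 and a made-up 4 x 4 feature map (`pool2d` is a hypothetical helper name):

```python
def pool2d(feature_map, size=2, stride=2, mode="max"):
    """Reduce each size x size window of a square feature map to one value."""
    reducers = {
        "max": max,
        "sum": sum,
        "avg": lambda window: sum(window) / len(window),
    }
    reduce_window = reducers[mode]
    out = (len(feature_map) - size) // stride + 1
    return [[reduce_window([feature_map[i * stride + a][j * stride + b]
                            for a in range(size) for b in range(size)])
             for j in range(out)]
            for i in range(out)]

# Hypothetical 4 x 4 rectified feature map.
fm = [
    [1, 1, 2, 4],
    [5, 6, 7, 8],
    [3, 2, 1, 0],
    [1, 2, 3, 4],
]

print(pool2d(fm, mode="max"))  # [[6, 8], [3, 4]]
print(pool2d(fm, mode="avg"))  # [[3.25, 5.25], [2.0, 2.0]]
print(pool2d(fm, mode="sum"))  # [[13, 21], [8, 8]]
```

Each 2 x 2 window collapses to a single value, so the 4 x 4 map becomes 2 x 2.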

Fully Connected Layer

In the layer we call the FC layer, we flatten our matrix into a vector and feed it into a fully connected layer, as in a regular neural network.


Figure 9 : After pooling layer, flattened as FC layer

In the above diagram, the feature map matrix is converted into a vector (x1, x2, x3, …). With the fully connected layers, we combine these features together to create a model. Finally, we have an activation function such as softmax or sigmoid to classify the outputs as cat, dog, car, truck, etc.

Figure 10 : Complete CNN architecture
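The flatten, fully connected and softmax steps can be sketched as below. The weights, biases and class names are hypothetical values picked by hand for illustration; in a real network they would be learned during training.

```python
import math

def flatten(feature_map):
    """Turn a 2D feature map into a vector (x1, x2, x3, ...)."""
    return [v for row in feature_map for v in row]

def dense(x, weights, biases):
    """Fully connected layer: one weighted sum of all inputs per output."""
    return [sum(w * xi for w, xi in zip(w_row, x)) + b
            for w_row, b in zip(weights, biases)]

def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

pooled = [[6, 8], [3, 4]]          # hypothetical pooled feature map
classes = ["cat", "dog", "car"]    # hypothetical class labels
weights = [                        # hypothetical, not trained
    [0.1, 0.0, 0.2, 0.0],
    [0.0, 0.1, 0.0, 0.2],
    [0.05, 0.05, 0.05, 0.05],
]
biases = [0.0, 0.0, 0.0]

x = flatten(pooled)                # [6, 8, 3, 4]
probs = softmax(dense(x, weights, biases))
predicted = classes[probs.index(max(probs))]
print(predicted)                   # dog
```

Softmax keeps the ranking of the raw scores and rescales them into values between 0 and 1 that sum to 1, which is why they can be read as class probabilities.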

Summary

• Provide the input image to the convolution layer

• Choose parameters, and apply filters with strides and padding if required. Perform convolution on the image and apply ReLU activation to the matrix.

• Perform pooling to reduce dimensionality

• Add as many convolutional layers as needed until satisfied

• Flatten the output and feed it into a fully connected layer (FC Layer)

• Output the class using an activation function (Logistic Regression with cost functions) and classify the image.
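The steps above can be sketched end to end as a single forward pass. This is a toy illustration under stated assumptions, not a trained model: one convolution layer, a hand-picked vertical-edge filter, hand-picked FC weights, and made-up class names.

```python
import math

def conv2d(image, kernel, stride=1):
    """Convolution of a square image with a square kernel (no padding)."""
    f = len(kernel)
    out = (len(image) - f) // stride + 1
    return [[sum(image[i * stride + a][j * stride + b] * kernel[a][b]
                 for a in range(f) for b in range(f))
             for j in range(out)]
            for i in range(out)]

def relu(fm):
    return [[max(0, v) for v in row] for row in fm]

def max_pool(fm, size=2, stride=2):
    out = (len(fm) - size) // stride + 1
    return [[max(fm[i * stride + a][j * stride + b]
                 for a in range(size) for b in range(size))
             for j in range(out)]
            for i in range(out)]

def flatten(fm):
    return [v for row in fm for v in row]

def dense(x, weights, biases):
    return [sum(w * xi for w, xi in zip(w_row, x)) + b
            for w_row, b in zip(weights, biases)]

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def forward(image, kernel, weights, biases):
    """Convolution -> ReLU -> max pooling -> flatten -> FC -> softmax."""
    fm = relu(conv2d(image, kernel))
    pooled = max_pool(fm)
    return softmax(dense(flatten(pooled), weights, biases))

# Hypothetical 6 x 6 image with a vertical edge and a vertical-edge filter.
image = [[0, 0, 0, 1, 1, 1] for _ in range(6)]
kernel = [[-1, 0, 1] for _ in range(3)]
# Hypothetical FC weights for two made-up classes.
classes = ["edge", "no edge"]
weights = [[0.5] * 4, [-0.5] * 4]
biases = [0.0, 0.0]

probs = forward(image, kernel, weights, biases)
print(classes[probs.index(max(probs))])  # edge
```

Here the 6 x 6 image becomes a 4 x 4 feature map after convolution, a 2 x 2 map after pooling, and a 4-element vector after flattening.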


In the next post, I would like to talk about some popular CNN
architectures such as AlexNet, VGGNet, GoogLeNet and ResNet.


