Understanding of Convolutional Neural Network (CNN) - Deep Learning - by Prabhu - Medium

The document discusses Convolutional Neural Networks (CNNs) for image classification. It explains that CNNs take an input image and classify it into categories. The CNN architecture uses successive layers of convolutions, ReLU activations, pooling and fully connected layers to extract features and classify the image. In convolution layers, filters are convolved across the image to extract features. Max pooling reduces the size of feature maps output from convolution. Finally, fully connected layers flatten and combine the features for classification.


30/8/2020 Understanding of Convolutional Neural Network (CNN) — Deep Learning | by Prabhu | Medium

Understanding of Convolutional Neural Network (CNN) - Deep Learning
Prabhu
Mar 4, 2018 · 5 min read

In neural networks, the convolutional neural network (ConvNet or CNN) is one of the
main architectures for image recognition and image classification. Object detection and
face recognition are some of the areas where CNNs are widely used.

CNN image classification takes an input image, processes it and classifies it under certain
categories (e.g., Dog, Cat, Tiger, Lion). A computer sees an input image as an array of pixels
whose size depends on the image resolution. Based on the image resolution, it will see h x w
x d (h = height, w = width, d = depth, i.e. the number of channels). For example, an RGB image
is a 6 x 6 x 3 array (3 refers to the RGB channels) and a grayscale image is a 4 x 4 x 1
array.
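
To make the h x w x d layout concrete, here is a minimal sketch using NumPy (an assumption; the article does not name a library). The all-zero pixel values are placeholders chosen only to show the shapes.

```python
import numpy as np

# A 6 x 6 RGB image: height 6, width 6, depth 3 (one channel each for R, G, B).
# The zero pixel values are placeholders, not data from the article.
rgb_image = np.zeros((6, 6, 3), dtype=np.uint8)

# A 4 x 4 grayscale image: a single channel, so the depth is 1.
gray_image = np.zeros((4, 4, 1), dtype=np.uint8)

h, w, d = rgb_image.shape
print(h, w, d)           # 6 6 3
print(gray_image.shape)  # (4, 4, 1)
```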

https://medium.com/@RaghavPrabhu/understanding-of-convolutional-neural-network-cnn-deep-learning-99760835f148 1/8

Figure 1 : Array of RGB Matrix

Technically, to train and test a deep learning CNN model, each input image is passed
through a series of convolution layers with filters (kernels), pooling layers and fully
connected (FC) layers, and a softmax function is then applied to classify the object with
probabilistic values between 0 and 1. The figure below shows the complete flow of a CNN that
processes an input image and classifies the objects based on these values.

Figure 2 : Neural network with many convolutional layers

Convolution Layer

Convolution is the first layer to extract features from an input image. Convolution
preserves the relationship between pixels by learning image features using small
squares of input data. It is a mathematical operation that takes two inputs: an
image matrix and a filter (or kernel).


Figure 3: Image matrix multiplies kernel or filter matrix

Consider a 5 x 5 image matrix whose pixel values are 0 or 1 and a 3 x 3 filter matrix, as
shown below.

Figure 4: Image matrix multiplies kernel or filter matrix

Then the convolution of the 5 x 5 image matrix with the 3 x 3 filter matrix produces an
output called the "feature map", as shown below.

Figure 5: 3 x 3 Output matrix
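
The sliding-window computation described above can be sketched in a few lines of NumPy. The pixel and filter values below are assumptions chosen for illustration (the figures are not reproduced in the text), and `convolve2d` is a hypothetical helper name. As in most CNN write-ups, the filter is applied without flipping, which is strictly a cross-correlation.

```python
import numpy as np

def convolve2d(image, kernel):
    """Slide `kernel` over `image` (stride 1, no padding) and
    sum the element-wise products at each position."""
    ih, iw = image.shape
    kh, kw = kernel.shape
    out = np.zeros((ih - kh + 1, iw - kw + 1), dtype=image.dtype)
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# Assumed 5 x 5 binary image (values are illustrative).
image = np.array([
    [1, 1, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 1, 1, 1],
    [0, 0, 1, 1, 0],
    [0, 1, 1, 0, 0],
])

# Assumed 3 x 3 filter (values are illustrative).
kernel = np.array([
    [1, 0, 1],
    [0, 1, 0],
    [1, 0, 1],
])

feature_map = convolve2d(image, kernel)
print(feature_map)
# [[4 3 4]
#  [2 4 3]
#  [2 3 4]]
```

A 5 x 5 input and a 3 x 3 filter give a 3 x 3 feature map because the filter fits in 5 - 3 + 1 = 3 positions along each axis.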

Convolving an image with different filters can perform operations such as edge
detection, blurring and sharpening. The example below shows the images that result
from applying different types of filters (kernels).


Figure 7 : Some common filters
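
As a rough illustration of such filters, the sketch below defines a few widely used 3 x 3 kernels and applies them to a tiny synthetic image with one vertical edge. The kernel values are common textbook choices and may differ from those in the figure; `apply_filter` and the test image are hypothetical.

```python
import numpy as np

def apply_filter(image, kernel):
    """Hypothetical helper: stride-1 convolution without padding."""
    kh, kw = kernel.shape
    oh, ow = image.shape[0] - kh + 1, image.shape[1] - kw + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            out[i, j] = np.sum(image[i:i + kh, j:j + kw] * kernel)
    return out

# Common 3 x 3 kernels (assumed values, not taken from the figure).
filters = {
    "identity": np.array([[0, 0, 0], [0, 1, 0], [0, 0, 0]], dtype=float),
    "edge_detection": np.array([[-1, -1, -1], [-1, 8, -1], [-1, -1, -1]], dtype=float),
    "sharpen": np.array([[0, -1, 0], [-1, 5, -1], [0, -1, 0]], dtype=float),
    "box_blur": np.ones((3, 3)) / 9.0,
}

# Synthetic 5 x 6 image: dark on the left, bright on the right.
image = np.zeros((5, 6))
image[:, 3:] = 10.0

results = {name: apply_filter(image, k) for name, k in filters.items()}
for name, result in results.items():
    print(name)
    print(result)
```

The edge-detection kernel sums to zero, so it responds only where neighbouring pixels differ, while the box blur averages each 3 x 3 neighbourhood and smooths the edge.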

Strides

Stride is the number of pixels by which the filter shifts over the input matrix. When the
stride is 1, we move the filter 1 pixel at a time. When the stride is 2, we move the filter 2
pixels at a time, and so on. The figure below shows how convolution works with a
stride of 2.


Figure 6 : Stride of 2 pixels
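
A small sketch of how stride changes the computation, assuming a 7 x 7 input and a 3 x 3 filter (sizes chosen for illustration; `strided_convolve2d` is a hypothetical name). Without padding, the output size along each axis is floor((n - f) / s) + 1 for input size n, filter size f and stride s.

```python
import numpy as np

def strided_convolve2d(image, kernel, stride=1):
    """Convolution without padding, moving the filter `stride` pixels per step."""
    kh, kw = kernel.shape
    oh = (image.shape[0] - kh) // stride + 1
    ow = (image.shape[1] - kw) // stride + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            r, c = i * stride, j * stride  # top-left corner of the current window
            out[i, j] = np.sum(image[r:r + kh, c:c + kw] * kernel)
    return out

image = np.arange(49, dtype=float).reshape(7, 7)  # assumed 7 x 7 input
kernel = np.ones((3, 3))                          # assumed 3 x 3 filter

out_stride1 = strided_convolve2d(image, kernel, stride=1)
out_stride2 = strided_convolve2d(image, kernel, stride=2)
print(out_stride1.shape)  # (5, 5)
print(out_stride2.shape)  # (3, 3)
```

Doubling the stride roughly halves the output in each dimension, since the filter visits every second position.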

Padding

Sometimes the filter does not fit the input image perfectly. We have two options:

Pad the picture with zeros (zero-padding) so that it fits

Drop the part of the image where the filter did not fit. This is called valid padding,
which keeps only the valid part of the image.
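
The two options can be compared with a short sketch. It uses `np.pad` for zero-padding and a hypothetical `output_size` helper implementing floor((n + 2p - f) / s) + 1; the sizes are assumptions chosen for illustration.

```python
import numpy as np

def output_size(n, f, stride=1, pad=0):
    """Output length along one axis for input n, filter f, given stride and padding."""
    return (n + 2 * pad - f) // stride + 1

image = np.ones((5, 5))  # assumed 5 x 5 input

# Option 1: zero-padding. One ring of zeros turns the 5 x 5 input into 7 x 7,
# so a 3 x 3 filter with stride 1 yields an output the same size as the input.
padded = np.pad(image, pad_width=1, mode="constant", constant_values=0)
print(padded.shape)                 # (7, 7)
print(output_size(5, 3, pad=1))     # 5

# Option 2: valid padding. No zeros are added and positions where the filter
# would hang over the border are dropped.
print(output_size(5, 3, pad=0))     # 3

# With a 6 x 6 input, a 3 x 3 filter and stride 2, the filter covers rows 0-2
# and 2-4; the last row and column do not fit and are dropped.
print(output_size(6, 3, stride=2))  # 2
```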

Non Linearity (ReLU)

ReLU stands for Rectified Linear Unit, a non-linear operation. Its output is ƒ(x) =
max(0, x).

Why ReLU is important: ReLU's purpose is to introduce non-linearity into our ConvNet,
since most of the real-world data we would want our ConvNet to learn is non-linear,
whereas convolution itself is a linear operation.

Figure 7 : ReLU operation
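
In code, ReLU is a single element-wise operation. The sketch below applies it to a small feature map with made-up values; negative entries become 0 and non-negative entries pass through unchanged.

```python
import numpy as np

def relu(x):
    """Element-wise max(0, x)."""
    return np.maximum(0, x)

# Assumed feature map values, chosen only to show the effect.
feature_map = np.array([
    [ 15, -20,  23],
    [-10,   0,  18],
    [  7, -35,  -1],
])

rectified = relu(feature_map)
print(rectified)
# [[15  0 23]
#  [ 0  0 18]
#  [ 7  0  0]]
```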

There are other non-linear functions, such as tanh or sigmoid, that can be used
instead of ReLU. Most data scientists use ReLU because, performance-wise, it usually
works better than the other two.

Pooling Layer

The pooling layer reduces the number of parameters when the images are
too large. Spatial pooling, also called subsampling or downsampling, reduces the
dimensionality of each feature map but retains the important information. Spatial
pooling can be of different types:

Max Pooling

Average Pooling

Sum Pooling

Max pooling takes the largest element from each window of the rectified feature map.
Instead of the largest element, we could take the average of the elements in each window,
which is average pooling. Taking the sum of all elements in each window is called sum
pooling.

Figure 8 : Max Pooling
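
The three pooling types differ only in how each window is reduced. A minimal sketch, assuming non-overlapping 2 x 2 windows (stride 2) and a made-up 4 x 4 feature map; `pool2d` is a hypothetical helper:

```python
import numpy as np

def pool2d(feature_map, size=2, mode="max"):
    """Pool over non-overlapping size x size windows."""
    h, w = feature_map.shape
    if h % size or w % size:
        raise ValueError("feature map must divide evenly into windows")
    # Axes 1 and 3 index the rows and columns inside each window.
    windows = feature_map.reshape(h // size, size, w // size, size)
    if mode == "max":
        return windows.max(axis=(1, 3))
    if mode == "average":
        return windows.mean(axis=(1, 3))
    if mode == "sum":
        return windows.sum(axis=(1, 3))
    raise ValueError(f"unknown pooling mode: {mode}")

# Assumed rectified feature map (illustrative values).
feature_map = np.array([
    [1, 1, 2, 4],
    [5, 6, 7, 8],
    [3, 2, 1, 0],
    [1, 2, 3, 4],
])

max_pooled = pool2d(feature_map, mode="max")
avg_pooled = pool2d(feature_map, mode="average")
sum_pooled = pool2d(feature_map, mode="sum")
print(max_pooled)  # [[6 8]
                   #  [3 4]]
print(avg_pooled)  # [[3.25 5.25]
                   #  [2.   2.  ]]
print(sum_pooled)  # [[13 21]
                   #  [ 8  8]]
```

Each variant turns the 4 x 4 map into a 2 x 2 map, which is the dimensionality reduction described above.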

Fully Connected Layer

In the layer we call the FC layer, we flatten our matrix into a vector and feed it into a
fully connected layer, as in a regular neural network.


Figure 9 : After pooling layer, flattened as FC layer

In the above diagram, the feature map matrix is converted into a vector (x1, x2, x3,
…). With the fully connected layers, we combine these features together to create a
model. Finally, we have an activation function such as softmax or sigmoid to classify
the outputs as cat, dog, car, truck, etc.
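
The flatten, fully connected and softmax steps can be sketched as follows. The pooled feature maps and the weights are random placeholders (an untrained layer), so the predicted class is meaningless; the point is only the shapes and the fact that softmax returns probabilities between 0 and 1 that sum to 1.

```python
import numpy as np

def softmax(z):
    """Turn raw scores into probabilities that sum to 1."""
    z = z - np.max(z)  # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

rng = np.random.default_rng(0)

# Hypothetical pooled output: 2 x 2 feature maps from 3 filters.
pooled = rng.random((2, 2, 3))

# Flatten the feature maps into a vector (x1, x2, x3, ...).
x = pooled.flatten()
print(x.shape)  # (12,)

# One fully connected layer with placeholder (untrained) weights.
classes = ["cat", "dog", "car", "truck"]
W = rng.normal(size=(len(classes), x.size))
b = np.zeros(len(classes))

probs = softmax(W @ x + b)
predicted = classes[int(np.argmax(probs))]
print(dict(zip(classes, probs.round(3))), predicted)
```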

Figure 10 : Complete CNN architecture

Summary

Provide the input image to the convolution layer

Choose parameters, apply filters with strides, and add padding if required. Perform convolution on the image and apply ReLU activation to the matrix.

Perform pooling to reduce dimensionality

Add as many convolutional layers as needed

Flatten the output and feed it into a fully connected layer (FC layer)

Output the class using an activation function (logistic regression with cost
functions) to classify the images.
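
The steps above can be strung together in one forward pass. This is a minimal NumPy sketch under stated assumptions, not a trainable model: a single convolution layer, a random 8 x 8 grayscale input, two random 3 x 3 filters, four output classes and untrained weights. All function names are hypothetical.

```python
import numpy as np

def conv2d(image, kernel, stride=1, pad=0):
    """Convolution with optional zero-padding and stride."""
    image = np.pad(image, pad, mode="constant")
    kh, kw = kernel.shape
    oh = (image.shape[0] - kh) // stride + 1
    ow = (image.shape[1] - kw) // stride + 1
    out = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            r, c = i * stride, j * stride
            out[i, j] = np.sum(image[r:r + kh, c:c + kw] * kernel)
    return out

def relu(x):
    return np.maximum(0, x)

def max_pool(x, size=2):
    """Max pooling over non-overlapping windows; leftover rows/columns are dropped."""
    h, w = x.shape[0] // size * size, x.shape[1] // size * size
    x = x[:h, :w]
    return x.reshape(h // size, size, w // size, size).max(axis=(1, 3))

def softmax(z):
    e = np.exp(z - np.max(z))
    return e / e.sum()

def forward(image, kernels, W, b):
    # Convolution -> ReLU -> pooling, once per filter.
    maps = [max_pool(relu(conv2d(image, k, pad=1))) for k in kernels]
    # Flatten all pooled maps into a single feature vector.
    features = np.concatenate([m.flatten() for m in maps])
    # Fully connected layer followed by softmax.
    return softmax(W @ features + b)

rng = np.random.default_rng(0)
image = rng.random((8, 8))             # assumed grayscale input
kernels = rng.normal(size=(2, 3, 3))   # two untrained 3 x 3 filters
W = rng.normal(size=(4, 2 * 4 * 4))    # 4 classes, 2 maps of 4 x 4 after pooling
b = np.zeros(4)

probs = forward(image, kernels, W, b)
print(probs, probs.sum())
```

In a real network the filters and the FC weights are learned by minimising a cost function, and several convolution and pooling stages are usually stacked before the flatten step.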

In the next post, I would like to talk about some popular CNN architectures such as
AlexNet, VGGNet, GoogLeNet, and ResNet.

