0% found this document useful (0 votes)

3 views10 pages

Understanding of Convolutional Neural Network (CNN) - Deep Learning - by Prabhu Raghav - Medium

The document provides an overview of Convolutional Neural Networks (CNNs), which are primarily used for image recognition and classification tasks. It explains the structure and function of CNNs, detailing layers such as convolution, pooling, and fully connected layers, along with operations like strides and padding. The summary includes the process of inputting an image, applying filters, and classifying the output using activation functions.

Uploaded by

Gowtham Don

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views10 pages

Understanding of Convolutional Neural Network (CNN) - Deep Learning - by Prabhu Raghav - Medium

Uploaded by

Gowtham Don

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Get unlimited access to the best of Medium for less than $1/week.

Become a member

Understanding of Convolutional Neural Network (CNN) — Deep Learning

5 min read · Mar 4, 2018

Prabhu Raghav Follow

Listen Share More

In neural networks, Convolutional neural network (ConvNets or CNNs) is one of the main categories to do images recognition, images classifications.
Objects detections, recognition faces etc., are some of the areas where CNNs are widely used.

CNN image classifications takes an input image, process it and classify it under certain categories (Eg., Dog, Cat, Tiger, Lion). Computers sees an input
image as array of pixels and it depends on the image resolution. Based on the image resolution, it will see h x w x d( h = Height, w = Width, d = Dimension ).
Eg., An image of 6 x 6 x 3 array of matrix of RGB (3 refers to RGB values) and an image of 4 x 4 x 1 array of matrix of grayscale image.
Figure 1 : Array of RGB Matrix

Technically, deep learning CNN models to train and test, each input image will pass it through a series of convolution layers with filters (Kernals), Pooling,
fully connected layers (FC) and apply Softmax function to classify an object with probabilistic values between 0 and 1. The below figure is a complete flow
of CNN to process an input image and classifies the objects based on values.

Figure 2 : Neural network with many convolutional layers

Convolution Layer

Convolution is the first layer to extract features from an input image. Convolution preserves the relationship between pixels by learning image features
using small squares of input data. It is a mathematical operation that takes two inputs such as image matrix and a filter or kernel.

Figure 3: Image matrix multiplies kernel or filter matrix

Consider a 5 x 5 whose image pixel values are 0, 1 and filter matrix 3 x 3 as shown in below

Figure 4: Image matrix multiplies kernel or filter matrix

Then the convolution of 5 x 5 image matrix multiplies with 3 x 3 filter matrix which is called “Feature Map” as output shown in below

Figure 5: 3 x 3 Output matrix

Convolution of an image with different filters can perform operations such as edge detection, blur and sharpen by applying filters. The below example
shows various convolution image after applying different types of filters (Kernels).
Figure 7 : Some common filters

Strides

Stride is the number of pixels shifts over the input matrix. When the stride is 1 then we move the filters to 1 pixel at a time. When the stride is 2 then we
move the filters to 2 pixels at a time and so on. The below figure shows convolution would work with a stride of 2.

Figure 6 : Stride of 2 pixels

Padding

Sometimes filter does not fit perfectly fit the input image. We have two options:

Pad the picture with zeros (zero-padding) so that it fits

Drop the part of the image where the filter did not fit. This is called valid padding which keeps only valid part of the image.

Non Linearity (ReLU)

ReLU stands for Rectified Linear Unit for a non-linear operation. The output is ƒ(x) = max(0,x).

Why ReLU is important : ReLU’s purpose is to introduce non-linearity in our ConvNet. Since, the real world data would want our ConvNet to learn would be
non-negative linear values.

Figure 7 : ReLU operation

There are other non linear functions such as tanh or sigmoid that can also be used instead of ReLU. Most of the data scientists use ReLU since performance
wise ReLU is better than the other two.

Pooling Layer
Pooling layers section would reduce the number of parameters when the images are too large. Spatial pooling also called subsampling or downsampling
which reduces the dimensionality of each map but retains important information. Spatial pooling can be of different types:

Max Pooling

Average Pooling

Sum Pooling

Max pooling takes the largest element from the rectified feature map. Taking the largest element could also take the average pooling. Sum of all elements
in the feature map call as sum pooling.

Figure 8 : Max Pooling

Fully Connected Layer

The layer we call as FC layer, we flattened our matrix into vector and feed it into a fully connected layer like a neural network.

Figure 9 : After pooling layer, flattened as FC layer

In the above diagram, the feature map matrix will be converted as vector (x1, x2, x3, …). With the fully connected layers, we combined these features
together to create a model. Finally, we have an activation function such as softmax or sigmoid to classify the outputs as cat, dog, car, truck etc.,

Figure 10 : Complete CNN architecture

Summary

Provide input image into convolution layer

Choose parameters, apply filters with strides, padding if requires. Perform convolution on the image and apply ReLU activation to the matrix.

Perform pooling to reduce dimensionality size

Add as many convolutional layers until satisfied

Flatten the output and feed into a fully connected layer (FC Layer)

Output the class using an activation function (Logistic Regression with cost functions) and classifies images.

In the next post, I would like to talk about some popular CNN architectures such as AlexNet, VGGNet, GoogLeNet, and ResNet.
References :

https://www.mathworks.com/discovery/convolutional-neural-network.html

https://adeshpande3.github.io/adeshpande3.github.io/A-Beginner's-Guide-To-Understanding-Convolutional-Neural-Networks/

https://ujjwalkarn.me/2016/08/11/intuitive-explanation-convnets/

https://blog.datawow.io/interns-explain-cnn-8a669d053f8b.

Machine Learning Cnn Convolution Neural Net Image Recognition Neural Networks

Written by Prabhu Raghav

1.97K followers · 183 following

SuperAgentX - https://www.superagentx.ai/ OpenSource Agentic AGI Framework — Decisionfacts.ai, Deeplore.io

Responses (45)

Gowthambreeze

What are your thoughts?

Saliya Ekanayake
Jun 23, 2018

Really nice explanation. Thanks!

18 Reply

Hamid Haghdoost
Dec 8, 2018

Thank you for your clear and fluent tutorial, I enjoyed that and I learned the base of CNN in less than 10 minutes :)

14 Reply

veena tapaswi
Feb 21, 2019

neat explanation… thanks

6 Reply

See all responses

More from Prabhu Raghav

Open in app

Search
Prabhu Raghav

Master Agentic AI: A Beginner’s Step-by-Step Guide with SuperAgentX — Tutorial Series (Part 1)
Hello Everyone, Welcome to the Agent AI Tutorial Series — Part 1! 🚀

Nov 20, 2024 247 1

In DecisionFacts by Prabhu Raghav

Understanding Mathematics behind floating-point precisions

Introduction

May 11, 2024 57 1

Prabhu Raghav

CNN Architectures — LeNet, AlexNet, VGG, GoogLeNet and ResNet

In my previous blog post, explained about my understanding of Convolution Neural Network (CNN). In this post, I am going to detailing about…

Mar 15, 2018 348 3

In TDS Archive by Prabhu Raghav

Linear Regression Simplified - Ordinary Least Square vs Gradient Descent

What is Linear Regression? Linear regression is a statistical method of finding the relationship between independent and dependent…

May 15, 2018 2.2K 13

See all from Prabhu Raghav

Recommended from Medium

Hugman Sangkeun Jung

Automatic Differentiation and the Computational Graph

Understanding Automatic Differentiation and Computational Graphs in Modern Machine Learning

Apr 18 4

LM Po

Basics of Convolutional Neural Networks (CNNs)

If you’re not a Medium subscriber, click here to read the full article.

Feb 17 26 2
Mohana Roy Chowdhury

Everything you need to know about CNNs Part 5: Batch Normalization

So far in the series, we’ve covered the Convolution Layer, the Pooling layer, the Dense Layer, and the Activation Function. We’ve discussed…

Jan 23 145

In Self Study Notes by Cevher Dogan

What is CNN (Convolutional Neural Networks)?

A Convolutional Neural Network (CNN) is a type of deep learning algorithm specifically designed for processing structured data like images…

Nov 30, 2024 11

Emir Uzun

Common CNN Architectures: LeNet, AlexNet, VGG, and ResNet

We will summarize the logic and functionality of popular CNN architectures. LeNet,AlexNet,VGG, ResNet.

Jan 12 53

Kaouthar EL BAKOURI

ANN vs DNN
ANN and DNN for (Deep Neural Network) (Artificial Neural Network): it’s a very broad term that encompasses any form of Deep Learning model…

Feb 3

See more recommendations

Operating System Os Notes New Cs 2nd Year
No ratings yet
Operating System Os Notes New Cs 2nd Year
89 pages
Module5 ML
No ratings yet
Module5 ML
112 pages
Unit 5th Ig Ann
No ratings yet
Unit 5th Ig Ann
112 pages
AI Facilitators Handbook X
No ratings yet
AI Facilitators Handbook X
42 pages
Sorting PDF
No ratings yet
Sorting PDF
2,495 pages
Computer Vision: Field of AI That Enables Computers To Derive Meaningful Information From
No ratings yet
Computer Vision: Field of AI That Enables Computers To Derive Meaningful Information From
26 pages
07 Ais302 CNN
No ratings yet
07 Ais302 CNN
56 pages
Intro To CNN
No ratings yet
Intro To CNN
93 pages
Unit 3
No ratings yet
Unit 3
59 pages
Topic 3ii - Convolutional Neural Network
No ratings yet
Topic 3ii - Convolutional Neural Network
43 pages
Lecture 3 Updated
No ratings yet
Lecture 3 Updated
56 pages
Some Important Question
No ratings yet
Some Important Question
59 pages
DL Unit Iii
No ratings yet
DL Unit Iii
13 pages
Convolutional Networks 2024
No ratings yet
Convolutional Networks 2024
44 pages
Deep Learning Series CNN - 2
No ratings yet
Deep Learning Series CNN - 2
15 pages
Unit III
No ratings yet
Unit III
89 pages
Convolution Neural Networks
No ratings yet
Convolution Neural Networks
80 pages
Convolution in CNN and GCN (Related Work)
No ratings yet
Convolution in CNN and GCN (Related Work)
12 pages
Unit 3 CNN 2024
No ratings yet
Unit 3 CNN 2024
58 pages
E-Note 33951 Content Document 20250328020322PM
No ratings yet
E-Note 33951 Content Document 20250328020322PM
29 pages
Lecture 3
No ratings yet
Lecture 3
48 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
35 pages
NN 07
No ratings yet
NN 07
24 pages
CNN 2
No ratings yet
CNN 2
47 pages
Introduction To Convolutional Neural Networks (CNNS)
No ratings yet
Introduction To Convolutional Neural Networks (CNNS)
28 pages
Scan 30 Sep 23 18 20 44
No ratings yet
Scan 30 Sep 23 18 20 44
30 pages
Unit III
No ratings yet
Unit III
8 pages
Unit 3 CNN
No ratings yet
Unit 3 CNN
47 pages
Deep Learning: Seungsang Oh
No ratings yet
Deep Learning: Seungsang Oh
39 pages
Unit Iii Deep Learning
No ratings yet
Unit Iii Deep Learning
31 pages
CVlecture 5
No ratings yet
CVlecture 5
56 pages
DL Unit 3 2019PAT
No ratings yet
DL Unit 3 2019PAT
66 pages
Lecture 6
No ratings yet
Lecture 6
17 pages
Convolutional Neural Network (CNN)
No ratings yet
Convolutional Neural Network (CNN)
27 pages
CNN Interview Question
No ratings yet
CNN Interview Question
16 pages
Unit3 2023 NNDL
No ratings yet
Unit3 2023 NNDL
69 pages
CNN
No ratings yet
CNN
10 pages
Unit - 2
No ratings yet
Unit - 2
51 pages
20 Questions To Test Your Skills On CNN Convolutional Neural Networks
No ratings yet
20 Questions To Test Your Skills On CNN Convolutional Neural Networks
11 pages
L09-10 DL and CNN
No ratings yet
L09-10 DL and CNN
56 pages
Module 3 Notes
No ratings yet
Module 3 Notes
22 pages
Convolutional Neural Network (CNN)
No ratings yet
Convolutional Neural Network (CNN)
38 pages
UNIT-III DeepLearning Notes
No ratings yet
UNIT-III DeepLearning Notes
30 pages
21CS743 Module4 Notes
No ratings yet
21CS743 Module4 Notes
15 pages
Umerical Ifferentiation AND Ntegration: Chapter Objectives
No ratings yet
Umerical Ifferentiation AND Ntegration: Chapter Objectives
58 pages
What Is A Convolutional Neural Network-Unit3
No ratings yet
What Is A Convolutional Neural Network-Unit3
12 pages
Lab Manual EC DSP 3171003
No ratings yet
Lab Manual EC DSP 3171003
61 pages
CNN Notes Unit-3
No ratings yet
CNN Notes Unit-3
12 pages
Convolutional Neural Network
No ratings yet
Convolutional Neural Network
11 pages
M4 Ia2
No ratings yet
M4 Ia2
6 pages
DL Unit4
No ratings yet
DL Unit4
31 pages
Convolutional Neural Networks
No ratings yet
Convolutional Neural Networks
8 pages
Ch-3 Convolutional Neural Networks (CNNS)
No ratings yet
Ch-3 Convolutional Neural Networks (CNNS)
11 pages
Understanding of Convolutional Neural Network (CNN)
No ratings yet
Understanding of Convolutional Neural Network (CNN)
9 pages
CNN Architecture
No ratings yet
CNN Architecture
24 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Assignment #1: Afzal Ali (11282) Muhammad Hammad (11293) Muhammad Bilal (11291) Mehran Ahmed (11287) Date 20/03/2019
No ratings yet
Assignment #1: Afzal Ali (11282) Muhammad Hammad (11293) Muhammad Bilal (11291) Mehran Ahmed (11287) Date 20/03/2019
7 pages
Deep LearningUNIT-IV
No ratings yet
Deep LearningUNIT-IV
16 pages
Chap 2
No ratings yet
Chap 2
48 pages
Back Propagation
No ratings yet
Back Propagation
20 pages
IML Trees
No ratings yet
IML Trees
66 pages
Chapter 15 - Time Series Regression and Forecasting
No ratings yet
Chapter 15 - Time Series Regression and Forecasting
47 pages
Efficient DFT Calculation
100% (1)
Efficient DFT Calculation
9 pages
Theory of CNN (Convolutional Neural Network)
No ratings yet
Theory of CNN (Convolutional Neural Network)
4 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning
9 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning
7 pages
22eil384 - Basic Programming With Matlab
No ratings yet
22eil384 - Basic Programming With Matlab
2 pages
Covering and Coloring Mat175
No ratings yet
Covering and Coloring Mat175
9 pages
Course Syllabus and Schedule/Map - Fall 2020 (Session A) : CSE 551: Foundations of Algorithms
No ratings yet
Course Syllabus and Schedule/Map - Fall 2020 (Session A) : CSE 551: Foundations of Algorithms
17 pages
CH 4-Design Optimization-Optimum Design Concepts-B PDF
No ratings yet
CH 4-Design Optimization-Optimum Design Concepts-B PDF
41 pages
Assignment Unit 2 Problem Solving by Search
No ratings yet
Assignment Unit 2 Problem Solving by Search
2 pages
Lecture 12
No ratings yet
Lecture 12
19 pages
Image Recognition Using Neural Networks
No ratings yet
Image Recognition Using Neural Networks
18 pages
Understanding of Convolutional Neural Network (CNN) - Deep Learning - by Prabhu - Medium
No ratings yet
Understanding of Convolutional Neural Network (CNN) - Deep Learning - by Prabhu - Medium
8 pages
What Is Convolutional Neural Network
No ratings yet
What Is Convolutional Neural Network
16 pages
DSA Codes For CAT 1
No ratings yet
DSA Codes For CAT 1
7 pages
ML Week 1 Slides
No ratings yet
ML Week 1 Slides
14 pages
Loyd Lesson Plan1
No ratings yet
Loyd Lesson Plan1
4 pages
MPCA LAB3 Programs
No ratings yet
MPCA LAB3 Programs
8 pages
Parkinson Disease Prediction Using Feature Selection Technique in Machine Learning
No ratings yet
Parkinson Disease Prediction Using Feature Selection Technique in Machine Learning
5 pages
Elchanan Mossel (UC Berkeley) CS 170:spring 2014: April 3, 2014 1 / 16
No ratings yet
Elchanan Mossel (UC Berkeley) CS 170:spring 2014: April 3, 2014 1 / 16
21 pages
1 s2.0 S0167865510001169 Main
No ratings yet
1 s2.0 S0167865510001169 Main
11 pages
Simulink Implementationof Human Voice Filter 2
No ratings yet
Simulink Implementationof Human Voice Filter 2
7 pages
18BIT0467 18BIT0473 Paper Under HEMALATHA S
No ratings yet
18BIT0467 18BIT0473 Paper Under HEMALATHA S
15 pages
Regression
No ratings yet
Regression
7 pages
Vehicle Segmentation Using K-Means With Fuzzy Logic
No ratings yet
Vehicle Segmentation Using K-Means With Fuzzy Logic
6 pages
Pulse Code Modulator
No ratings yet
Pulse Code Modulator
3 pages
Assignment No 2 Cs 502 Solution
No ratings yet
Assignment No 2 Cs 502 Solution
5 pages
MA19455 Syllabus
No ratings yet
MA19455 Syllabus
1 page
OTE Assignment-1 PDF
No ratings yet
OTE Assignment-1 PDF
2 pages