
Convolutional Neural Networks (Deep Learning)
What is a Convolutional Neural Network (CNN)?

A Convolutional Neural Network (CNN), also known as a ConvNet, is a specialized type of deep learning algorithm designed mainly for tasks that require object recognition, such as image classification, detection, and segmentation.

CNNs are employed in a variety of practical scenarios, such as autonomous vehicles, security camera systems, and others.

Why are CNNs important?

CNNs are distinguished from classic machine learning algorithms such as SVMs and decision trees by their ability to extract features autonomously and at scale, bypassing the need for manual feature engineering and thereby improving efficiency.

The convolutional layers grant CNNs their translation-invariant characteristics, empowering them to identify and extract patterns and features from data irrespective of variations in position, orientation, scale, or translation.

Beyond image classification tasks, CNNs are versatile and can be
applied to a range of other domains, such as natural language
processing, time series analysis, and speech recognition.

Parallels With The Human Visual System

Convolutional neural networks were inspired by the layered architecture of the human visual cortex, and below are some key similarities and differences:

Illustration of the correspondence between the areas associated with the primary visual cortex and the layers in a convolutional neural network.

Hierarchical architecture: Both CNNs and the visual cortex have a hierarchical structure, with simple features extracted in early layers and more complex features built up in deeper layers. This allows increasingly sophisticated representations of visual inputs.

Local connectivity: Neurons in the visual cortex connect only to a local region of the input, not the entire visual field. Similarly, the neurons in a CNN layer are connected only to a local region of the input volume through the convolution operation. This local connectivity enables efficiency.

Translation invariance: Visual cortex neurons can detect features regardless of their location in the visual field. Pooling layers in a CNN provide a degree of translation invariance by summarizing local features.

Multiple feature maps: At each stage of visual processing, many different feature maps are extracted. CNNs mimic this through multiple filters in each convolution layer, each producing its own feature map.

Non-linearity: Neurons in the visual cortex exhibit non-linear response properties. CNNs achieve non-linearity through activation functions such as ReLU applied after each convolution.
Key Components of a CNN

The convolutional neural network is made of four main parts. But how do CNNs learn with those parts? They help the CNN mimic how the human brain operates to recognize patterns and features in images:

Convolutional layers
Rectified Linear Unit (ReLU for short)
Pooling layers
Fully connected layers

This section dives into each of these components through the example of classifying a handwritten digit.
Convolution layers
This is the first building block of a CNN. As the name suggests, the
main mathematical task performed is called convolution, which is the
application of a sliding window function to a matrix of pixels
representing an image. The sliding function applied to the matrix is called a kernel or a filter, and the two terms can be used interchangeably.

In the convolution layer, several filters of equal size are applied, and
each filter is used to recognize a specific pattern from the image,
such as the curving of the digits, the edges, the whole shape of the
digits, and more.

Put simply, in the convolution layer, we use small grids (called filters
or kernels) that move over the image. Each small grid is like a mini
magnifying glass that looks for specific patterns in the photo, like
lines, curves, or shapes. As it moves across the photo, it creates a
new grid that highlights where it found these patterns.

For example, one filter might be good at finding straight lines, another might find curves, and so on. By using several different filters, the CNN can get a good idea of all the different patterns that make up the image.

Let’s consider this 32x32 grayscale image of a handwritten digit, where the values in the matrix are given for illustration purposes. Also, let’s consider the kernel used for the convolution: a matrix with a dimension of 3x3. The weight of each element of the kernel is shown in the grid, with zero weights represented by the black cells and weights of one by the white cells.

Do we have to manually find these weights?

In real life, the weights of the kernels are determined during the
training process of the neural network.

Using these two matrices, we can perform the convolution operation by applying the dot product, which works as follows:

1. Apply the kernel matrix starting from the top-left corner of the image.
2. Perform element-wise multiplication.
3. Sum the values of the products.
4. The resulting value corresponds to the first value (top-left corner) in the convolved matrix.
5. Slide the kernel to the right, and then down, with a step equal to the size of the sliding window.
6. Repeat steps 2 to 5 until the image matrix is fully covered.

The dimension of the convolved matrix depends on the size of the sliding window: the larger the sliding window, the smaller the resulting dimension.
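To make the sliding-window arithmetic concrete, here is a minimal NumPy sketch of the operation described above, with stride 1 and no padding. The function name conv2d_valid, the 5x5 "image", and the 3x3 kernel of zeros and ones are illustrative assumptions, not the exact values from the original figures.

```python
import numpy as np

def conv2d_valid(image: np.ndarray, kernel: np.ndarray) -> np.ndarray:
    """Slide the kernel over the image (stride 1, no padding) and
    return the convolved matrix (feature map)."""
    ih, iw = image.shape
    kh, kw = kernel.shape
    # Output dimension for stride 1: (input size - kernel size) + 1,
    # which is why a larger window yields a smaller feature map.
    oh, ow = ih - kh + 1, iw - kw + 1
    output = np.zeros((oh, ow))
    for i in range(oh):
        for j in range(ow):
            window = image[i:i + kh, j:j + kw]       # local region under the kernel
            output[i, j] = np.sum(window * kernel)   # element-wise product, then sum
    return output

# Hypothetical 5x5 image and 3x3 kernel, for illustration only.
image = np.array([
    [0, 0, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 1, 0, 0],
    [0, 1, 1, 1, 0],
    [0, 1, 0, 1, 0],
])
kernel = np.array([
    [1, 0, 1],
    [0, 1, 0],
    [1, 0, 1],
])
print(conv2d_valid(image, kernel))  # 3x3 feature map
```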
Another name associated with the kernel in the literature is feature
detector because the weights can be fine-tuned to detect specific
features in the input image.

For instance:

A kernel that averages neighboring pixels can be used to blur the input image.
A kernel that subtracts neighboring pixels can be used to perform edge detection.
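As a rough sketch of those two cases, the kernels below are common choices (a 3x3 averaging "box blur" and a Laplacian-style edge detector); the exact kernel values and the random stand-in image are assumptions, not taken from the original figures. This assumes SciPy is available.

```python
import numpy as np
from scipy.signal import convolve2d

# Averaging (box blur) kernel: each output pixel is the mean of its 3x3 neighborhood.
blur_kernel = np.ones((3, 3)) / 9.0

# Laplacian-style edge kernel: subtracts neighboring pixels from the center,
# so flat regions go to ~0 and edges stand out.
edge_kernel = np.array([
    [ 0, -1,  0],
    [-1,  4, -1],
    [ 0, -1,  0],
])

image = np.random.rand(32, 32)  # stand-in for the 32x32 grayscale digit
blurred = convolve2d(image, blur_kernel, mode="valid")
edges = convolve2d(image, edge_kernel, mode="valid")
print(blurred.shape, edges.shape)  # (30, 30) (30, 30)
```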

The more convolution layers the network has, the better it becomes at detecting increasingly abstract features.

Activation function
A ReLU activation function is applied after each convolution operation. This function helps the network learn non-linear relationships between the features in the image, making the network more robust at identifying different patterns. It also helps to mitigate the vanishing gradient problem.
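A minimal sketch of ReLU itself, which simply zeroes out negative values element-wise (the array values below are arbitrary examples):

```python
import numpy as np

def relu(x: np.ndarray) -> np.ndarray:
    """Rectified Linear Unit: keep positive values, replace negatives with 0."""
    return np.maximum(0, x)

feature_map = np.array([[-2.0, 1.5],
                        [ 0.3, -0.7]])
print(relu(feature_map))
# [[0.  1.5]
#  [0.3 0. ]]
```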

Pooling layer
The goal of the pooling layer is to extract the most significant features from the convolved matrix (feature map). This is done by applying an aggregation operation that reduces the dimension of the feature map, hence reducing the memory used while training the network. Pooling is also relevant for mitigating overfitting.

The most common aggregation functions that can be applied are:

Max pooling, which takes the maximum value of each region of the feature map
Sum pooling, which corresponds to the sum of all the values in each region
Average pooling, which takes the average of all the values in each region

Below is an illustration of each, applied to the previous example.
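Here is a small NumPy sketch of max, sum, and average pooling over non-overlapping 2x2 regions; the window size and the example feature map are assumptions made for illustration.

```python
import numpy as np

def pool2d(feature_map: np.ndarray, size: int = 2, op=np.max) -> np.ndarray:
    """Apply the aggregation `op` (np.max, np.sum, or np.mean) to each
    non-overlapping size x size region of the feature map."""
    h, w = feature_map.shape
    out = np.zeros((h // size, w // size))
    for i in range(0, h - size + 1, size):
        for j in range(0, w - size + 1, size):
            out[i // size, j // size] = op(feature_map[i:i + size, j:j + size])
    return out

fm = np.array([
    [1, 3, 2, 0],
    [4, 6, 1, 1],
    [0, 2, 5, 7],
    [1, 1, 3, 2],
])
print(pool2d(fm, op=np.max))   # max pooling
print(pool2d(fm, op=np.sum))   # sum pooling
print(pool2d(fm, op=np.mean))  # average pooling
```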


Fully connected layers
These are the last layers of the convolutional neural network, and their input corresponds to the flattened one-dimensional vector generated by the last pooling layer. ReLU activation functions are applied to them for non-linearity.

Finally, a softmax prediction layer is used to generate probability values for each of the possible output labels, and the final predicted label is the one with the highest probability score.
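Putting the four components together, here is a minimal PyTorch sketch of such a network for 32x32 grayscale digit images with 10 classes; the layer sizes and channel counts are illustrative assumptions, not the architecture from the figures above.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DigitCNN(nn.Module):
    """Convolution -> ReLU -> pooling blocks, then flatten + fully connected layers."""
    def __init__(self, num_classes: int = 10):
        super().__init__()
        self.conv1 = nn.Conv2d(1, 16, kernel_size=3, padding=1)   # 1x32x32 -> 16x32x32
        self.conv2 = nn.Conv2d(16, 32, kernel_size=3, padding=1)  # 16x16x16 -> 32x16x16
        self.pool = nn.MaxPool2d(2)                               # halves height and width
        self.fc1 = nn.Linear(32 * 8 * 8, 128)
        self.fc2 = nn.Linear(128, num_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        x = self.pool(F.relu(self.conv1(x)))  # -> 16x16x16
        x = self.pool(F.relu(self.conv2(x)))  # -> 32x8x8
        x = torch.flatten(x, start_dim=1)     # flatten to a 1-D vector per image
        x = F.relu(self.fc1(x))
        return self.fc2(x)                    # raw class scores (logits)

model = DigitCNN()
dummy_batch = torch.randn(4, 1, 32, 32)        # 4 fake grayscale images
probs = F.softmax(model(dummy_batch), dim=1)   # softmax turns logits into probabilities
predicted_labels = probs.argmax(dim=1)         # label with the highest probability
print(probs.shape, predicted_labels)
```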

Overfitting and Regularization in CNNs

Overfitting is a common challenge in machine learning models and CNN deep learning projects. It happens when the model learns the training data too well (“learning by heart”), including its noise and outliers. Such learning leads to a model that performs well on the training data but poorly on new, unseen data.

This can be observed when the performance on training data is much higher than the performance on validation or testing data, as in the graphical illustration below.

Deep learning models, especially Convolutional Neural Networks (CNNs), are particularly susceptible to overfitting due to their capacity for high complexity and their ability to learn detailed patterns in large-scale data.

Several regularization techniques can be applied to mitigate overfitting in CNNs; some are illustrated below, with a combined code sketch after the list:

Dropout: This consists of randomly dropping some neurons during the training process, which forces the remaining neurons to learn new features from the input data.

Batch normalization: Overfitting is reduced to some extent by normalizing the inputs to a layer, adjusting and scaling the activations. This approach is also used to speed up and stabilize the training process.

Pooling layers: These can be used to reduce the spatial dimensions of the input, providing the model with an abstracted form of representation and hence reducing the chance of overfitting.

Early stopping: This consists of consistently monitoring the model’s performance on validation data during the training process and stopping the training whenever the validation error stops improving.

Noise injection: This consists of adding noise to the inputs or to the outputs of hidden layers during training to make the model more robust and prevent weak generalization.

L1 and L2 regularization: Both L1 and L2 add a penalty to the loss function based on the size of the weights. More specifically, L1 encourages the weights to be sparse, leading to better feature selection. On the other hand, L2 (also called weight decay) encourages the weights to be small, preventing them from having too much influence on the predictions.

Data augmentation: This is the process of artificially increasing the size and diversity of the training dataset by applying random transformations like rotation, scaling, flipping, or cropping to the input images.
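The sketch below combines several of these techniques in PyTorch: data augmentation via torchvision transforms, batch normalization and dropout inside the model, and L2 regularization through the optimizer’s weight_decay. The specific values and the tiny model are illustrative assumptions, and early stopping is only indicated in comments.

```python
import torch
import torch.nn as nn
from torchvision import transforms

# Data augmentation: random transformations applied to each training image.
train_transform = transforms.Compose([
    transforms.RandomRotation(10),                       # small random rotation
    transforms.RandomHorizontalFlip(),                   # random flip
    transforms.RandomResizedCrop(32, scale=(0.8, 1.0)),  # random crop and rescale
    transforms.ToTensor(),
])

# A small model using batch normalization and dropout as regularizers.
model = nn.Sequential(
    nn.Conv2d(1, 16, kernel_size=3, padding=1),
    nn.BatchNorm2d(16),       # normalizes activations, stabilizes training
    nn.ReLU(),
    nn.MaxPool2d(2),
    nn.Flatten(),
    nn.Dropout(p=0.5),        # randomly drops 50% of the units during training
    nn.Linear(16 * 16 * 16, 10),
)

# L2 regularization ("weight decay") is added through the optimizer.
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)

# Early stopping (sketch): after each epoch, compute the validation loss and
# stop training if it has not improved for a chosen number of epochs.
```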
Practical Applications of CNNs
Convolutional Neural Networks have revolutionized the field of
computer vision, leading to significant advancements in many real-
world applications. Below are a few examples of how they are
applied.
If you find this helpful, Repost

Follow for Data Science content

linkedin.com/in/ileonjose
