Basic Introduction to Convolutional Neural Network in Deep Learning

ADVANCED DEEP LEARNING PYTHON

This article was published as a part of the Data Science Blogathon.

The field of Deep Learning has matured considerably over the past few decades, thanks to its ability to handle massive datasets efficiently and to make computer systems capable of solving complex computational problems.

Networks with hidden layers have ushered in a new era, as older techniques proved inefficient, particularly for problems such as Pattern Recognition, Object Detection, Image Segmentation, and other image-processing tasks. The CNN is one of the most widely deployed deep learning neural networks.

Source: Medium.com

Table of Contents

1. Background of CNNs
2. What is CNN
3. CNN’s Basic Architecture
4. Training the Convolutional Neural Network
5. Limitations
6. Python Code Implementation of CNN for Classification
7. Conclusion

Background of CNNs

CNNs were first developed and deployed around the 1980s. At the time, a CNN could do little more than recognize handwritten digits, and it was used primarily to read zip codes, pin codes, and similar numbers. Like most A.I. models, a CNN requires a massive amount of data to train. The lack of such data was one of the biggest problems CNNs faced at the time, and because of it they were used mainly in the postal industry.

Kunihiko Fukushima, a renowned Japanese scientist, invented the neocognitron in 1980, a very simple neural network used for image identification. Yann LeCun built on this earlier work and, in the late 1980s, introduced convolutional neural networks trained with backpropagation.

What is CNN?

In deep learning, a convolutional neural network (CNN) is a class of deep neural network that is most commonly applied to image analysis and recognition.

A convolutional neural network uses a special mathematical operation known as convolution.

Mathematically, convolution is an operation applied to two functions that produces a third function, which expresses how the shape of one function is modified by the other.
Source: Towardsdatascience
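
To make the definition concrete, here is a minimal pure-Python sketch of the discrete, finite form of convolution, (f * g)[n] = sum over k of f[k] * g[n - k]. The function name `convolve_1d` and the sample sequences are illustrative assumptions and do not come from the original article.

```python
def convolve_1d(f, g):
    """Full discrete convolution of two finite sequences."""
    out = [0.0] * (len(f) + len(g) - 1)
    for i, f_val in enumerate(f):
        for j, g_val in enumerate(g):
            # f[i] * g[j] contributes to output index n = i + j
            out[i + j] += f_val * g_val
    return out


signal = [1, 2, 3]        # hypothetical input sequence
kernel = [0, 1, 0.5]      # hypothetical second sequence
result = convolve_1d(signal, kernel)
print(result)  # [0.0, 1.0, 2.5, 4.0, 1.5]
```

Each output value mixes neighbouring input values according to the second sequence, which is the sense in which one function "modifies the shape" of the other.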

A convolutional neural network (CNN) consists of several layers of artificial neurons. Artificial neurons, a rough imitation of the neuron cells the human brain uses to pass sensory input signals and other responses, are mathematical functions that calculate the weighted sum of their inputs and output an activation value.

The behaviour of each CNN neuron is defined by the values of its weights. When fed with pixel values, the artificial neurons of a CNN recognize various visual features.

When we feed an input image into a CNN, each of its inner layers generates several activation maps. Activation maps highlight the relevant features of the input image. Each CNN neuron generally takes a group (patch) of pixels as input, multiplies their values (colours) by its weights, adds them up, and passes the result through its activation function.
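
The computation of a single neuron described above can be sketched in a few lines of plain Python. This is only an illustration: the 3x3 patch, the weight values, the optional `bias` term, and the choice of ReLU as the activation are assumptions made for the example.

```python
def relu(x):
    """ReLU activation: negative sums are clipped to zero."""
    return max(0.0, x)


def neuron_activation(patch, weights, bias=0.0):
    """Weighted sum of a flattened pixel patch, passed through ReLU."""
    weighted_sum = sum(p * w for p, w in zip(patch, weights)) + bias
    return relu(weighted_sum)


# Hypothetical 3x3 patch (flattened row by row) that gets brighter to the right
patch = [0.0, 0.5, 1.0,
         0.0, 0.5, 1.0,
         0.0, 0.5, 1.0]
# Hypothetical weights that respond to a dark-to-bright transition
weights = [-1.0, 0.0, 1.0,
           -1.0, 0.0, 1.0,
           -1.0, 0.0, 1.0]

activation = neuron_activation(patch, weights)
print(activation)  # 3.0
```

With the signs of the weights flipped, the weighted sum becomes negative and ReLU returns 0.0, so the same patch would not activate that neuron.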

The first (or bottom) layer of the CNN usually recognizes basic features of the input image, such as horizontal, vertical, and diagonal edges.

The output of the first layer is fed as input to the next layer, which in turn extracts more complex features of the input image, such as corners and combinations of edges.

The deeper one moves into the convolutional neural network, the more the layers detect higher-level features such as objects, faces, etc.

CNN’s Basic Architecture

A CNN architecture consists of two key components:

• A convolution tool that separates and identifies the distinct features of an image for analysis in a process
known as Feature Extraction

• A fully connected layer that takes the output of the convolution process and predicts the image’s class
based on the features retrieved earlier.

The CNN is made up of three types of layers: convolutional layers, pooling layers, and fully-connected (FC)
layers.

source: Upgrad.com

Convolution Layers

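This is the first layer of the CNN, and it is responsible for extracting the different features from the input images. In this layer, the mathematical operation of convolution is performed between the input image and a filter of a specific size MxM: the filter slides over the image, and at each position the dot product between the filter and the underlying MxM patch of the image is taken.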
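
As a rough sketch of this layer, the function below slides an MxM filter over a small grayscale image with no padding and a stride of 1. Like most deep learning libraries, it does not flip the filter, so strictly speaking it computes a cross-correlation. The name `conv2d_valid`, the 4x4 image, and the vertical-edge filter are assumptions chosen for illustration.

```python
def conv2d_valid(image, kernel):
    """Slide an MxM kernel over a 2D image (no padding, stride 1)."""
    m = len(kernel)
    out_h = len(image) - m + 1
    out_w = len(image[0]) - m + 1
    feature_map = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            # Dot product between the kernel and the image patch at (r, c)
            total = sum(
                image[r + i][c + j] * kernel[i][j]
                for i in range(m)
                for j in range(m)
            )
            row.append(total)
        feature_map.append(row)
    return feature_map


# Hypothetical 4x4 image: dark on the left, bright on the right
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
# Hypothetical 3x3 filter that responds to vertical edges
vertical_edge = [
    [-1, 0, 1],
    [-1, 0, 1],
    [-1, 0, 1],
]

feature_map = conv2d_valid(image, vertical_edge)
print(feature_map)  # [[3, 3], [3, 3]]
```

An HxW image and an MxM filter give a feature map of size (H - M + 1) x (W - M + 1), which is why the 4x4 input shrinks to 2x2 here.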

The Fully Connected Layer

The Fully Connected (FC) layer consists of neurons together with their weights and biases, and is used to connect the neurons of two separate layers. FC layers usually form the last few layers of a CNN architecture, placed just before the output layer.
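
A fully connected layer can be sketched as one weighted sum per output neuron. The function name `dense_forward` and all the numbers below are hypothetical, and the activation function is left out to keep the example short.

```python
def dense_forward(inputs, weights, biases):
    """One fully connected layer: every output neuron sees every input.

    weights[j] holds the weights of output neuron j, and biases[j] its bias.
    """
    return [
        sum(x * w for x, w in zip(inputs, neuron_weights)) + b
        for neuron_weights, b in zip(weights, biases)
    ]


features = [1.0, 2.0, 3.0]            # hypothetical flattened feature vector
weights = [[0.5, 0.0, -0.5],          # weights of output neuron 0
           [1.0, 1.0, 1.0]]           # weights of output neuron 1
biases = [0.0, -1.0]

scores = dense_forward(features, weights, biases)
print(scores)  # [-1.0, 5.0]
```

In a classifier, scores like these would typically be passed to an activation such as Softmax to obtain class probabilities.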

Pooling Layer
The Pooling layer is responsible for reducing the spatial size of the convolved feature. This significant reduction in dimensions decreases the computing power required to process the data.
The two most common types of pooling are:
1. Average pooling
2. Max pooling

A Pooling Layer is usually applied after a Convolutional Layer. This layer’s major goal is to lower the size of
the convolved feature map to reduce computational expenses. This is accomplished by reducing the
connections between layers and operating independently on each feature map. There are numerous sorts
of Pooling operations, depending on the mechanism utilised.

Source: Analytics Vidhya.com

In Max Pooling, the largest element is taken from each region of the feature map. Average Pooling calculates the average of the elements in an image section of a predefined size. Sum Pooling calculates the total sum of the elements in the predefined section. The Pooling Layer typically serves as a bridge between the Convolutional Layer and the FC Layer.
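
The three pooling variants differ only in how each window is reduced to one number, as the sketch below tries to show. The function name `pool2d`, the 2x2 window with a stride equal to the window size, and the sample feature map are assumptions made for illustration.

```python
def pool2d(feature_map, size=2, mode="max"):
    """Non-overlapping pooling (stride equal to the window size)."""
    reducers = {
        "max": max,
        "avg": lambda values: sum(values) / len(values),
        "sum": sum,
    }
    reduce_fn = reducers[mode]
    pooled = []
    for r in range(0, len(feature_map) - size + 1, size):
        row = []
        for c in range(0, len(feature_map[0]) - size + 1, size):
            # Collect the size x size window whose top-left corner is (r, c)
            window = [
                feature_map[r + i][c + j]
                for i in range(size)
                for j in range(size)
            ]
            row.append(reduce_fn(window))
        pooled.append(row)
    return pooled


# Hypothetical 4x4 feature map
fmap = [
    [1, 3, 2, 4],
    [5, 6, 1, 2],
    [7, 2, 9, 0],
    [1, 8, 3, 4],
]

print(pool2d(fmap, mode="max"))  # [[6, 4], [8, 9]]
print(pool2d(fmap, mode="avg"))  # [[3.75, 2.25], [4.5, 4.0]]
print(pool2d(fmap, mode="sum"))  # [[15, 9], [18, 16]]
```

In every mode the 4x4 map shrinks to 2x2, which is the reduction in spatial size that lowers the computational cost of the following layers.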

Dropout

To avoid overfitting (when a model performs well on training data but not on new data), a dropout layer is used, in which a few neurons are randomly dropped from the neural network during the training phase, resulting in a smaller model for that training step.
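
A minimal sketch of dropout on a list of activations is given below. It uses the common "inverted dropout" variant, which rescales the surviving activations so that their expected sum is unchanged; that rescaling, the function name `dropout`, and the fixed seed are assumptions of this example and are not described in the article. Dropout is applied only during training and is switched off when the model makes predictions.

```python
import random


def dropout(activations, rate, rng):
    """Inverted dropout: zero each activation with probability `rate`
    and rescale the survivors by 1 / (1 - rate)."""
    keep_prob = 1.0 - rate
    return [
        a / keep_prob if rng.random() < keep_prob else 0.0
        for a in activations
    ]


rng = random.Random(0)        # fixed seed so the sketch is repeatable
activations = [1.0] * 10      # hypothetical layer outputs
dropped = dropout(activations, rate=0.5, rng=rng)
print(dropped)                # a mix of 0.0 (dropped) and 2.0 (kept, rescaled)
```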

Activation Functions
Activation functions allow the network to learn and approximate continuous and complex relationships between variables.

They give the network its non-linearity. ReLU, Softmax, and tanh are some of the most commonly used activation functions.
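
The three activation functions named above can be written in a few lines using only the standard library. This is an illustrative sketch; subtracting the maximum score inside `softmax` is a common numerical-stability trick and an addition of this example.

```python
import math


def relu(x):
    """Keeps positive values and maps negative values to zero."""
    return max(0.0, x)


def tanh(x):
    """Squashes any real number into the range (-1, 1)."""
    return math.tanh(x)


def softmax(scores):
    """Turns a list of scores into probabilities that sum to 1."""
    shift = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - shift) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


print(relu(-2.0), relu(3.0))       # 0.0 3.0
print(tanh(0.0))                   # 0.0
probs = softmax([2.0, 1.0, 0.1])   # hypothetical class scores
print(probs)
```

ReLU and tanh are typically applied to individual neurons in hidden layers, while Softmax is applied to the whole output layer of a classifier so that the outputs can be read as class probabilities.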

Training the convolutional neural network

The process of adjusting the value of the weights is defined as the “training” of the neural network.

First, the CNN starts with random weights. During training, the network is fed a large dataset of images labelled with their corresponding class labels (cat, dog, horse, etc.). The CNN processes each image using its current, initially random, weights and then compares its output with the class label of the input image.

If the output does not match the class label (which happens often at the beginning of the training process), the network makes a small adjustment to the weights of its neurons so that its output moves closer to the correct class label.

Source: Medium.com

The corrections to the weights are made through a technique known as backpropagation. Backpropagation works out how much each weight contributed to the error, which makes the tuning process efficient and steers the adjustments toward better accuracy. Every full pass over the training image dataset is called an “epoch.”

The CNN goes through several epochs during training, adjusting its weights by small amounts each time.

After each epoch step, the neural network becomes a bit more accurate at classifying and correctly
predicting the class of the training images. As the CNN improves, the adjustments being made to the
weights become smaller and smaller accordingly.
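
The loop of "predict, compare with the label, adjust the weights a little" can be illustrated on a much smaller model than a CNN. The sketch below trains a single sigmoid neuron with gradient descent on a tiny made-up dataset; the dataset, the learning rate, the number of epochs, and the zero initial weights (used instead of random ones so the run is repeatable) are all assumptions of this example.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


# Toy dataset (hypothetical): one feature, label is 1 when the feature is large
data = [(0.0, 0), (1.0, 0), (2.0, 1), (3.0, 1)]

weight, bias = 0.0, 0.0
learning_rate = 0.1
losses = []

for epoch in range(100):          # one epoch = one pass over the training set
    total_loss = 0.0
    for x, label in data:
        prediction = sigmoid(weight * x + bias)
        # Cross-entropy loss for this example
        total_loss += -(label * math.log(prediction)
                        + (1 - label) * math.log(1.0 - prediction))
        # For a sigmoid neuron with cross-entropy loss, the gradient with
        # respect to the weighted sum is simply (prediction - label)
        error = prediction - label
        weight -= learning_rate * error * x
        bias -= learning_rate * error
    losses.append(total_loss / len(data))

print(f"first epoch loss: {losses[0]:.3f}, last epoch loss: {losses[-1]:.3f}")
```

In a real CNN, backpropagation applies the same idea layer by layer, passing the error backwards so that every weight in the network receives its own small correction.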

After training the CNN, we use a test dataset to verify its accuracy. The test dataset is a set of labelled images that were not included in the training process. Each image is fed to the CNN, and the output is compared to the actual class label of the test image. Essentially, the test dataset evaluates the prediction performance of the CNN.

If a CNN's accuracy is good on its training data but bad on the test data, the model is said to be “overfitting.” This often happens when the training dataset is too small.
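
The comparison between training and test accuracy can be sketched as follows. The labels and predictions are made up for illustration, and the 0.2 gap used to flag possible overfitting is an arbitrary threshold, not a standard value.

```python
def accuracy(true_labels, predicted_labels):
    """Fraction of predictions that match the true class labels."""
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


# Hypothetical labels and predictions, for illustration only
train_true = [0, 1, 2, 1, 0, 2, 1, 0]
train_pred = [0, 1, 2, 1, 0, 2, 1, 0]
test_true = [0, 1, 2, 1]
test_pred = [0, 2, 1, 1]

train_acc = accuracy(train_true, train_pred)
test_acc = accuracy(test_true, test_pred)
print(train_acc, test_acc)  # 1.0 0.5

# Arbitrary illustrative threshold for the train/test gap
if train_acc - test_acc > 0.2:
    print("possible overfitting")
else:
    print("generalizes reasonably")
```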

Limitations

CNNs use massive computing power and resources to recognize visual patterns/trends, including some that are very difficult for the human eye to pick out.

Training a convolutional neural network usually takes a very long time, especially with large image datasets.

Training generally requires specialized hardware (like a GPU).

Python Code Implementation of CNN for Classification

Importing Relevant Libraries

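import numpy as np
import matplotlib.image as mpimg
import matplotlib.pyplot as plt
import seaborn as sn
import tensorflow as tf
from tensorflow import keras

# Jupyter notebook magic; remove this line when running as a plain script
%matplotlib inline

tf.compat.v1.set_random_seed(2019)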

Loading MNIST Dataset

(X_train,Y_train),(X_test,Y_test) = keras.datasets.mnist.load_data()

Scaling The Data

X_train = X_train / 255
X_test = X_test / 255

# Flattening each 28x28 image into a vector of 784 values

X_train_flattened = X_train.reshape(len(X_train), 28*28)
X_test_flattened = X_test.reshape(len(X_test), 28*28)

Designing The Neural Network

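For simplicity, the model here is a single fully connected (Dense) layer applied to the flattened images; it acts as a baseline classifier and does not contain convolutional or pooling layers.

model = keras.Sequential([
    keras.layers.Dense(10, input_shape=(784,), activation='sigmoid')
])

model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])

model.fit(X_train_flattened, Y_train, epochs=5)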

Output:

Epoch 1/5
1875/1875 [==============================] - 8s 4ms/step - loss: 0.7187 - accuracy: 0.8141
Epoch 2/5
1875/1875 [==============================] - 6s 3ms/step - loss: 0.3122 - accuracy: 0.9128
Epoch 3/5
1875/1875 [==============================] - 6s 3ms/step - loss: 0.2908 - accuracy: 0.9187
Epoch 4/5
1875/1875 [==============================] - 6s 3ms/step - loss: 0.2783 - accuracy: 0.9229
Epoch 5/5
1875/1875 [==============================] - 6s 3ms/step - loss: 0.2643 - accuracy: 0.9262

Confusion Matrix for visualization of predictions

Y_predict = model.predict(X_test_flattened)
# Pick the class with the highest score for each test image
Y_predict_labels = [np.argmax(i) for i in Y_predict]

cm = tf.math.confusion_matrix(labels=Y_test, predictions=Y_predict_labels)

# sn refers to seaborn (import seaborn as sn)
plt.figure(figsize=(10, 7))
sn.heatmap(cm, annot=True, fmt='d')
plt.xlabel('Predicted')
plt.ylabel('Truth')

Output

Source: Author
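
To show what the confusion matrix counts, here is a small pure-Python sketch that builds one by hand. It follows the same layout as the code above (rows are the true labels, columns are the predicted labels); the function name `confusion_matrix` and the six sample labels are hypothetical.

```python
def confusion_matrix(true_labels, predicted_labels, num_classes):
    """Rows are true classes, columns are predicted classes."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(true_labels, predicted_labels):
        matrix[t][p] += 1
    return matrix


y_true = [0, 1, 2, 2, 1, 0]   # hypothetical true labels
y_pred = [0, 2, 2, 2, 1, 0]   # hypothetical predictions
cm = confusion_matrix(y_true, y_pred, num_classes=3)
for row in cm:
    print(row)
# [2, 0, 0]
# [0, 1, 1]
# [0, 0, 2]
```

Values on the diagonal are correct predictions; the single off-diagonal 1 in the second row means one image of class 1 was predicted as class 2.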

Conclusion

In this article, we covered a basic introduction to CNN architecture and a basic implementation for a practical task, classification. We also covered other key terms related to CNNs, such as pooling, activation functions, and dropout, as well as the limitations of CNNs and how they are trained.

With this, I finish this blog.

Hello Everyone, Namaste
My name is Pranshu Sharma and I am a Data Science Enthusiast

Thank you so much for taking your precious time to read this blog. Feel free to point out any mistake(I’m
a learner after all) and provide respective feedback or leave a comment.
Dhanyvaad!!
Feedback:Email: [email protected]

The media shown in this article are not owned by Analytics Vidhya and are used at the Author’s discretion.

Article Url - https://www.analyticsvidhya.com/blog/2022/03/basic-introduction-to-convolutional-neural-network-in-deep-learning/

Pranshu Sharma
