Convolutional Neural Network
A convolutional neural network (CNN) is a class of deep neural networks, most commonly applied to analyzing visual imagery. CNNs use relatively little pre-processing compared to other image classification algorithms. To see how they work, consider a toy problem: deciding whether a small image contains an X or an O.
A naïve approach to solving this problem is to save an image of an X and an O and compare
every new image to our exemplars to see which is the better match. What makes this task
tricky is that computers are extremely literal. To a computer, an image looks like a two-dimensional array of pixels (think giant checkerboard) with a number in each position. In our
example a pixel value of 1 is white, and -1 is black. When comparing two images, if any pixel
values don’t match, then the images don’t match, at least to the computer. Ideally, we would
like to be able to see X’s and O’s even if they’re shifted, shrunken, rotated or deformed. This
is where CNNs come in.
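To see why the naive scheme fails, here is a minimal sketch in NumPy, using the text's convention that 1 is white and -1 is black. The tiny 3x3 exemplar is made up for illustration.

```python
import numpy as np

# A 3x3 stand-in for an X: black (-1) diagonals on a white (1) background.
exemplar_x = np.array([[-1,  1, -1],
                       [ 1, -1,  1],
                       [-1,  1, -1]])

def exact_match(image, exemplar):
    # The computer's literal view: a single differing pixel means "no match".
    return np.array_equal(image, exemplar)

shifted = np.roll(exemplar_x, 1, axis=1)    # the same X, shifted one pixel right
print(exact_match(exemplar_x, exemplar_x))  # True
print(exact_match(shifted, exemplar_x))     # False: literal comparison fails
```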
CNNs compare images piece by piece. The pieces they look for are called features. By finding rough feature matches in roughly the same positions in two images, CNNs get a lot better at seeing similarity than whole-image matching schemes do.
Each feature is like a mini-image—a small two-dimensional array of values. Features match
common aspects of the images. In the case of X images, features consisting of diagonal lines
and a crossing capture all the important characteristics of most X’s. These features will
probably match up to the arms and center of any image of an X.
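As a sketch, such features are just small arrays. The exact values below are assumptions for illustration; in a trained CNN the features are learned from data.

```python
import numpy as np

# Illustrative 3x3 features for an X (1 = white, -1 = black).
diag_down = np.array([[-1,  1,  1],
                      [ 1, -1,  1],
                      [ 1,  1, -1]])   # a "\" diagonal line
diag_up   = np.fliplr(diag_down)       # a "/" diagonal line
crossing  = np.array([[-1,  1, -1],
                      [ 1, -1,  1],
                      [-1,  1, -1]])   # the center where the arms cross
```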
When presented with a new image, the CNN doesn't know exactly where these features will match, so it tries them everywhere, in every possible position. Matching a feature against every patch of the image in this way turns the feature into a filter. The math we use to do this is called convolution, from which Convolutional Neural Networks take their name.
The math behind convolution is nothing that would make a sixth-grader uncomfortable. To
calculate the match of a feature to a patch of the image, simply multiply each pixel in the
feature by the value of the corresponding pixel in the image. Then add up the answers and
divide by the total number of pixels in the feature. If both pixels are white (a value of 1) then
1 * 1 = 1. If both are black, then (-1) * (-1) = 1. Either way, every matching pixel results in a
1. Similarly, any mismatch gives a -1. If all the pixels in a feature match, then adding up the products and dividing by the total number of pixels gives a 1. If none of the pixels in the feature match the image patch, the answer is a -1.
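The whole calculation fits in a couple of lines. Here is a sketch of the match score for one feature against one equally sized image patch:

```python
import numpy as np

def match_score(feature, patch):
    # Multiply corresponding pixels, add up the products, and divide
    # by the number of pixels in the feature (same as np.mean).
    return np.sum(feature * patch) / feature.size

feature = np.array([[-1,  1,  1],
                    [ 1, -1,  1],
                    [ 1,  1, -1]])

print(match_score(feature, feature))    # 1.0  -> perfect match
print(match_score(feature, -feature))   # -1.0 -> perfect mismatch
```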
To complete our convolution, we repeat this process, lining up the feature with every possible
image patch. We can take the answer from each convolution and make a new two-dimensional array from it, based on where in the image each patch is located. This map of
matches is also a filtered version of our original image. It’s a map of where in the image the
feature is found. Values close to 1 show strong matches, values close to -1 show strong
matches for the photographic negative of our feature, and values near zero show no match of
any sort.
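Here is a rough sketch of that sweep for a single feature over a single-channel image. (Strictly speaking, this computes a cross-correlation, since the feature is not flipped; that is also what most CNN libraries compute under the name convolution.)

```python
import numpy as np

def convolve(image, feature):
    # Slide the feature over every valid position and record the match
    # score (mean of pixelwise products) at each location.
    fh, fw = feature.shape
    out_h = image.shape[0] - fh + 1
    out_w = image.shape[1] - fw + 1
    feature_map = np.zeros((out_h, out_w))
    for r in range(out_h):
        for c in range(out_w):
            patch = image[r:r + fh, c:c + fw]
            feature_map[r, c] = np.mean(feature * patch)
    return feature_map
```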
The next step is to repeat the convolution process in its entirety for each of the other features.
The result is a set of filtered images, one for each of our filters. It’s convenient to think of this
whole collection of convolution operations as a single processing step. In CNNs this is
referred to as a convolution layer, hinting at the fact that it will soon have other layers added
to it.
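In code, a convolution layer is then just the sweep repeated once per feature. This short sketch reuses the convolve() function from the previous example:

```python
def conv_layer(image, features):
    # One filtered image per feature; together they form the layer's output.
    return [convolve(image, f) for f in features]
```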
It’s easy to see how CNNs get their reputation as computation hogs. Although we can sketch
our CNN on the back of a napkin, the number of additions, multiplications and divisions can add up fast. In math speak, it scales linearly with the number of pixels in the image, with
the number of pixels in each feature and with the number of features. With so many factors,
it’s easy to make this problem many millions of times larger without breaking a sweat. Small
wonder that microchip manufacturers are now making specialized chips in an effort to keep up with the demands of CNNs.
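A quick back-of-napkin count under that scaling, with image and feature sizes chosen purely for illustration:

```python
# multiply-adds ~ (pixels in image) x (pixels per feature) x (features)
image_pixels   = 256 * 256   # a modest 256x256 image
feature_pixels = 5 * 5       # 5x5 features
num_features   = 10
print(image_pixels * feature_pixels * num_features)   # 16,384,000
```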
Another power tool that CNNs use is called pooling. Pooling is a way to take large images
and shrink them down while preserving the most important information in them. The math
behind pooling is second-grade level at most. It consists of stepping a small window across
an image and taking the maximum value from the window at each step. In practice, a window
2 or 3 pixels on a side and steps of 2 pixels work well.
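A minimal max-pooling sketch with a 2x2 window and a stride of 2:

```python
import numpy as np

def max_pool(image, size=2, stride=2):
    # Step a size-by-size window across the image, keeping only the
    # maximum value seen in each window.
    rows = range(0, image.shape[0] - size + 1, stride)
    cols = range(0, image.shape[1] - size + 1, stride)
    return np.array([[image[r:r + size, c:c + size].max() for c in cols]
                     for r in rows])

x = np.array([[0.1, 0.9, 0.3, 0.2],
              [0.4, 0.2, 0.8, 0.1],
              [0.7, 0.1, 0.2, 0.6],
              [0.2, 0.3, 0.1, 0.5]])
print(max_pool(x))   # [[0.9 0.8]
                     #  [0.7 0.6]] -- a 4x4 image shrinks to 2x2
```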
After pooling, an image has about a quarter as many pixels as it started with. Because it keeps
the maximum value from each window, it preserves the best fits of each feature within the
window. This means that it doesn’t care so much exactly where the feature fit as long as it fit
somewhere within the window. The result of this is that CNNs can find whether a feature is
in an image without worrying about where it is. This helps solve the problem of computers
being hyper-literal.
A pooling layer is just the operation of performing pooling on an image or a collection of
images. The output will have the same number of images, but they will each have fewer
pixels. This is also helpful in managing the computational load. Taking an 8 megapixel image
down to a 2 megapixel image makes life a lot easier for everything downstream.
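As a sketch, a pooling layer is just the max_pool() function from the previous example applied to every image in the collection:

```python
def pool_layer(images, size=2, stride=2):
    # Same number of images out as in; each one shrunk.
    return [max_pool(img, size, stride) for img in images]
```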
A small but important player in this process is the Rectified Linear Unit, or ReLU. Its math is also very simple: wherever a negative number occurs, swap it out for a 0. This helps the CNN stay mathematically healthy by keeping learned values from getting stuck near 0 or blowing up toward infinity. It's the axle grease of CNNs: not particularly glamorous, but without it they don't get very far.
The output of a ReLU layer is the same size as whatever is put into it, just with all the negative values removed.
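The entire operation is one line in NumPy:

```python
import numpy as np

def relu(x):
    # Wherever a negative number occurs, swap it out for a 0.
    return np.maximum(x, 0)

print(relu(np.array([[0.5, -0.3],
                     [-1.0, 0.8]])))
# [[0.5 0. ]
#  [0.  0.8]]
```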
You’ve probably noticed that the input to each layer (two-dimensional arrays) looks a lot like
the output (two-dimensional arrays). Because of this, we can stack them like Lego bricks.
Raw images get filtered, rectified and pooled to create a set of shrunken, feature-filtered
images. These can be filtered and shrunken again and again. Each time, the features become
larger and more complex, and the images become more compact. This lets lower layers
represent simple aspects of the image, such as edges and bright spots. Higher layers can
represent increasingly sophisticated aspects of the image, such as shapes and patterns. These
tend to be readily recognizable. For instance, in a CNN trained on human faces, the highest
layers represent patterns that are clearly face-like.
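A minimal sketch of such a stack, with the operations from the earlier examples redefined compactly so the snippet stands alone, applied to a made-up 16x16 image and a single made-up feature:

```python
import numpy as np

def convolve(image, feature):
    fh, fw = feature.shape
    out = np.zeros((image.shape[0] - fh + 1, image.shape[1] - fw + 1))
    for r in range(out.shape[0]):
        for c in range(out.shape[1]):
            out[r, c] = np.mean(image[r:r + fh, c:c + fw] * feature)
    return out

def relu(x):
    return np.maximum(x, 0)

def max_pool(x, size=2, stride=2):
    rows = range(0, x.shape[0] - size + 1, stride)
    cols = range(0, x.shape[1] - size + 1, stride)
    return np.array([[x[r:r + size, c:c + size].max() for c in cols]
                     for r in rows])

# Stack the layers like Lego bricks: filter, rectify, pool -- then repeat.
rng = np.random.default_rng(0)
image = rng.choice([-1.0, 1.0], size=(16, 16))   # a made-up binary image
feature = np.eye(3) * 2 - 1                      # a diagonal-line feature

x = max_pool(relu(convolve(image, feature)))     # 16x16 -> 14x14 -> 7x7
x = max_pool(relu(convolve(x, feature)))         # 7x7 -> 5x5 -> 2x2
print(x.shape)                                   # the image grows more compact
```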
CNNs have one more arrow in their quiver. Fully connected layers take the high-level filtered
images and translate them into votes. In our case, we only have to decide between two
categories, X and O. Fully connected layers are the primary building block of traditional
neural networks. Instead of treating inputs as a two-dimensional array, they treat them as a single list, with every value handled identically. Every value gets its own vote on whether the current image is an X or an O. However, the process isn't entirely democratic. Some values are
much better than others at knowing when the image is an X, and some are particularly good
at knowing when the image is an O. These get larger votes than the others. These votes are
expressed as weights, or connection strengths, between each value and each category.
When a new image is presented to the CNN, it percolates through the lower layers until it
reaches the fully connected layer at the end. Then an election is held. The answer with the
most votes wins and is declared the category of the input.
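A minimal sketch of the voting and the election together. The values and weights below are invented for illustration; in a real CNN the weights are learned during training.

```python
import numpy as np

values  = np.array([0.9, 0.65, 0.45, 0.87])   # flattened high-level outputs
weights = np.array([[ 1.0, -0.2],             # column 0 votes for "X",
                    [ 0.9,  0.1],             # column 1 votes for "O"
                    [-0.3,  1.0],
                    [ 0.8, -0.1]])

votes = values @ weights                  # each value casts weighted votes
categories = ["X", "O"]
print(votes)                              # [2.046 0.248]
print(categories[int(np.argmax(votes))])  # the election: "X" wins
```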
Fully connected layers, like the rest, can be stacked because their outputs (a list of
votes) look a whole lot like their inputs (a list of values). In practice, several fully
connected layers are often stacked together, with each intermediate layer voting on
phantom “hidden” categories. In effect, each additional layer lets the network learn
ever more sophisticated combinations of features that help it make better decisions.
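A sketch of two stacked fully connected layers, with a rectified hidden layer in between. All sizes and weights here are assumptions for illustration:

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0)

# The hidden layer's 4 "categories" are the phantom intermediate votes
# described above; the output layer turns them into the final X-vs-O votes.
rng = np.random.default_rng(0)
values   = rng.random(8)             # flattened outputs of the conv/pool stack
hidden_w = rng.normal(size=(8, 4))   # 8 values -> 4 hidden categories
output_w = rng.normal(size=(4, 2))   # 4 hidden values -> 2 final categories

hidden = relu(values @ hidden_w)     # intermediate votes, rectified
votes  = hidden @ output_w           # final votes: one score for X, one for O
print(votes)
```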