Convolutional Layer: Web-Based Demo
The activations of an example ConvNet architecture. The initial volume stores the raw image pixels (left) and
the last volume stores the class scores (right). Each volume of activations along the processing path is
shown as a column. Since it's difficult to visualize 3D volumes, we lay out each volume's slices in rows. The
last layer volume holds the scores for each class, but here we only visualize the sorted top 5 scores, and
print the labels of each one. The full web-based demo is shown in the header of our website. The
architecture shown here is a tiny VGG Net, which we will discuss later.
We now describe the individual layers and the details of their hyperparameters and their
connectivities.
Convolutional Layer
The Conv layer is the core building block of a Convolutional Network that does most of the
computational heavy lifting.
Overview and intuition without brain stuff. Let's first discuss what the CONV layer computes
without brain/neuron analogies. The CONV layer’s parameters consist of a set of learnable filters.
Every filter is small spatially (along width and height), but extends through the full depth of the
input volume. For example, a typical filter on a first layer of a ConvNet might have size 5x5x3 (i.e.
5 pixels width and height, and 3 because images have depth 3, the color channels). During the
forward pass, we slide (more precisely, convolve) each filter across the width and height of the
input volume and compute dot products between the entries of the filter and the input at any
position. As we slide the filter over the width and height of the input volume we will produce a 2-
dimensional activation map that gives the responses of that filter at every spatial position.
Intuitively, the network will learn filters that activate when they see some type of visual feature
such as an edge of some orientation or a blotch of some color on the first layer, or eventually
entire honeycomb or wheel-like patterns on higher layers of the network. Now, we will have an
entire set of filters in each CONV layer (e.g. 12 filters), and each of them will produce a separate 2-
dimensional activation map. We will stack these activation maps along the depth dimension and
produce the output volume.
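To make the sliding-dot-product picture concrete, here is a minimal numpy sketch of the forward pass for a single filter with stride 1 and no padding. The array names and sizes are illustrative (a hypothetical 32x32x3 input and one 5x5x3 filter), not part of any particular library:

```python
import numpy as np

X = np.random.randn(32, 32, 3)   # input volume (height, width, depth)
w = np.random.randn(5, 5, 3)     # one filter: small spatially, full depth
b = 0.0                          # bias for this filter

H, W, _ = X.shape
F = w.shape[0]
activation_map = np.zeros((H - F + 1, W - F + 1))  # 28x28 for these sizes

for i in range(H - F + 1):
    for j in range(W - F + 1):
        # dot product between the filter and the local input region
        activation_map[i, j] = np.sum(X[i:i+F, j:j+F, :] * w) + b

# With a set of 12 such filters, stacking the 12 activation maps along
# the depth dimension would yield an output volume of shape (28, 28, 12).
```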
Local Connectivity. When dealing with high-dimensional inputs such as images, as we saw above,
it is impractical to connect neurons to all neurons in the previous volume. Instead, we will connect
each neuron to only a local region of the input volume. The spatial extent of this connectivity is a
hyperparameter called the receptive field of the neuron (equivalently this is the filter size). The
extent of the connectivity along the depth axis is always equal to the depth of the input volume. It
is important to emphasize again this asymmetry in how we treat the spatial dimensions (width
and height) and the depth dimension: The connections are local in space (along width and
height), but always full along the entire depth of the input volume.
Example 1. For example, suppose that the input volume has size [32x32x3] (e.g. an RGB CIFAR-10
image). If the receptive field (or the filter size) is 5x5, then each neuron in the Conv Layer will have
weights to a [5x5x3] region in the input volume, for a total of 5*5*3 = 75 weights (and +1 bias
parameter). Notice that the extent of the connectivity along the depth axis must be 3, since this is
the depth of the input volume.
Example 2. Suppose an input volume had size [16x16x20]. Then using an example receptive field
size of 3x3, every neuron in the Conv Layer would now have a total of 3*3*20 = 180 connections
to the input volume. Notice that, again, the connectivity is local in space (e.g. 3x3), but full along
the input depth (20).
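The following short sketch checks the counts from the two examples and shows what a single neuron actually computes: a dot product over its local 3D region. All names here are hypothetical, chosen only to mirror Example 2:

```python
import numpy as np

# Example 1: a [32x32x3] input with a 5x5 receptive field.
F1, depth1 = 5, 3
print(F1 * F1 * depth1)        # 75 weights per neuron (plus 1 bias)

# Example 2: a [16x16x20] input with a 3x3 receptive field.
F2, depth2 = 3, 20
print(F2 * F2 * depth2)        # 180 connections per neuron

# One neuron's computation on the Example 2 volume.
X = np.random.randn(16, 16, 20)
w = np.random.randn(3, 3, 20)  # weights span the full input depth
b = 0.0
region = X[0:3, 0:3, :]        # local in space (3x3), full along depth (20)
activation = np.sum(region * w) + b
```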
Left: An example input volume in red (e.g. a 32x32x3 CIFAR-10 image), and an example volume of neurons in
the first Convolutional layer. Each neuron in the convolutional layer is connected only to a local region in the
input volume spatially, but to the full depth (i.e. all color channels). Note, there are multiple neurons (5 in this
example) along the depth, all looking at the same region in the input - see discussion of depth columns in
text below. Right: The neurons from the Neural Network chapter remain unchanged: They still compute a dot
product of their weights with the input followed by a non-linearity, but their connectivity is now restricted to
be local spatially.
Spatial arrangement. We have explained the connectivity of each neuron in the Conv Layer to the
input volume, but we haven't yet discussed how many neurons there are in the output volume or
how they are arranged. Three hyperparameters control the size of the output volume: the depth,
stride and zero-padding. We discuss these next:
1. First, the depth of the output volume is a hyperparameter: it corresponds to the number of
filters we would like to use, each learning to look for something different in the input. For
example, if the first Convolutional Layer takes as input the raw image, then different
neurons along the depth dimension may activate in presence of various oriented edges, or
blobs of color. We will refer to a set of neurons that are all looking at the same region of the
input as a depth column (some people also prefer the term fibre).
2. Second, we must specify the stride with which we slide the filter. When the stride is 1 then
we move the filters one pixel at a time. When the stride is 2 (or, uncommonly, 3 or more) then
the filters jump 2 pixels at a time as we slide them
around. This will produce smaller output volumes spatially.
3. As we will soon see, sometimes it will be convenient to pad the input volume with zeros
around the border. The size of this zero-padding is a hyperparameter. The nice feature of
zero padding is that it will allow us to control the spatial size of the output volumes (most
commonly as we’ll see soon we will use it to exactly preserve the spatial size of the input
volume so the input and output width and height are the same).
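As a quick illustration of the third point, here is a sketch of zero-padding with numpy, assuming a hypothetical 32x32x3 input: with a 5x5 filter and stride 1, padding with 2 zeros on each border preserves the spatial size of the output.

```python
import numpy as np

X = np.random.randn(32, 32, 3)
P = 2  # amount of zero-padding on each border

# Pad the two spatial dimensions with zeros; leave the depth untouched.
X_padded = np.pad(X, ((P, P), (P, P), (0, 0)), mode='constant')
print(X_padded.shape)             # (36, 36, 3)

# Sliding a 5x5 filter over the padded input at stride 1:
print(X_padded.shape[0] - 5 + 1)  # 32: output width/height matches the input
```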
We can compute the spatial size of the output volume as a function of the input volume size (W),
the receptive field size of the Conv Layer neurons (F), the stride with which they are applied (S),
and the amount of zero padding used (P) on the border. You can convince yourself that the