
Convolutional Neural Networks

Dajiang Liu and Sen Yang


PHS 597
References
• Most of the materials in this lecture are taken from Chapter 9 of the Goodfellow et al. book
• Some introductions to convolution also come from
• https://towardsdatascience.com/a-comprehensive-introduction-to-different-types-of-convolutions-in-deep-learning-669281e58215
What is Convolution?
• Convolution is a commonly used operation for obtaining the distribution of the sum of two random variables
  • It is used for other purposes as well
• Consider two independently distributed random variables X and Y, with cumulative distribution functions $F(x)$ and $G(y)$ and density functions $f(x)$ and $g(y)$
• If we want to calculate the distribution of $Z = X + Y$,
  $$\Pr(Z \le z) = \Pr(X + Y \le z) = \int \Pr(Y \le z - x)\, f(x)\, dx$$
• If the density of Z exists, the density function equals
  $$p(z) = \int g(z - x)\, f(x)\, dx$$
• The operation above that involves $f$ and $g$ is called convolution, which is often denoted by $*$:
  $$p(z) = (f * g)(z)$$
Convolution

• Convolution can be thought of as an "average" filter
• Consider the convolution
  $$s(t) = \int f(t - a)\, g(a)\, da$$
• We can think of $s(t)$ as an average of $f(t - a)$, where $a$ is the distance from $t$
  • The values of $f(t - a)$ are weighted by the weights $g(a)$
  • The farther away a point is from $t$ (i.e., the bigger $|a|$ is), the less weight it typically carries
• "Average" here is a general term. For it to be a true average, we need $g(a)$ to be a probability density function
  • I.e., non-negative and integrating to 1
Convolution
• Convolution is commutative: $f * g$ is the same as $g * f$
• Yet, we often give different names to $f$ and $g$ in a convolution $s = f * g$
  • We call $f$ the input and $g$ the kernel
• In practice we usually deal with discrete functions, so the integral is replaced by a summation
  • In math, a summation can be viewed as an integral over the discrete measure
  $$s(t) = \sum_{x} f(x)\, g(t - x)$$
• Also, in practice, all functions have finite support, i.e., they are non-zero only over a finite set of points. So the summation above is over a finite number of different $x$'s (see the small sketch below)
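A minimal NumPy sketch of the discrete convolution above (the input and kernel values are illustrative; the kernel sums to 1, so this is a true "average" filter):

import numpy as np

f = np.array([1.0, 2.0, 3.0, 4.0, 5.0])   # input with finite support
g = np.array([0.25, 0.5, 0.25])           # kernel (weights sum to 1)

# Direct implementation of s(t) = sum_x f(x) g(t - x), keeping "valid" positions only
def conv1d_valid(f, g):
    k = len(g)
    return np.array([np.sum(f[t:t + k] * g[::-1]) for t in range(len(f) - k + 1)])

print(conv1d_valid(f, g))                 # [2. 3. 4.]
print(np.convolve(f, g, mode="valid"))    # same result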
Multi-dimensional Convolution
• In image applications, the convolution is done using multi-dimensional arrays (which we call tensors)
• A two-dimensional convolution has the form:
  $$S(i, j) = \sum_{m} \sum_{n} I(m, n)\, K(i - m, j - n)$$
• The convolution is commutative, i.e.
  $$S(i, j) = \sum_{m} \sum_{n} I(i - m, j - n)\, K(m, n)$$
• If we flip the kernel, $K^*(-m, -n) = K(m, n)$, the convolution becomes
  $$S(i, j) = \sum_{m} \sum_{n} I(i + m, j + n)\, K^*(m, n)$$
  In this case, we call it the cross-correlation between $I$ and $K^*$
• Given this equivalence, we use convolution and cross-correlation interchangeably in the following discussions.
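A minimal NumPy sketch of the 2D cross-correlation formula above (input and kernel are illustrative), which is the operation deep learning libraries typically implement as "convolution":

import numpy as np

def cross_correlate2d(I, K):
    # S(i, j) = sum_m sum_n I(i + m, j + n) K*(m, n), valid positions only
    H, W = I.shape
    kh, kw = K.shape
    out = np.zeros((H - kh + 1, W - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(I[i:i + kh, j:j + kw] * K)
    return out

I = np.arange(16, dtype=float).reshape(4, 4)
K = np.array([[1.0, 0.0], [0.0, -1.0]])
print(cross_correlate2d(I, K))   # 3x3 output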
Convolution as Matrix Multiplication
• Convolution can be expressed as matrix multiplication.
• Take a simple kernel $K(0) = 0$, $K(1) = 1$, $K(-1) = -1$
• For a given function $I(x)$, the convolution takes the form
  $$S(t) = \sum_{x} I(t - x)\, K(x)$$
• We can write the input as a vector and the kernel as a matrix, and the resulting convolution as a matrix multiplication
  $$\vec{S} = \tilde{K}\, \vec{I}, \qquad \tilde{K} = \begin{pmatrix} 1 & 0 & -1 & & \\ & 1 & 0 & -1 & \\ & & 1 & 0 & -1 \end{pmatrix}$$
• In algebra, $\tilde{K}$ is called a Toeplitz matrix (each row is a shift by 1 of the row above)
• **: we need to be careful about the boundary values
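A minimal NumPy sketch of the Toeplitz-matrix view (the input vector is illustrative; only "valid" positions are kept, so boundary values are not an issue):

import numpy as np

# Kernel K(1) = 1, K(0) = 0, K(-1) = -1, so S(t) = I(t - 1) - I(t + 1)
I = np.array([3.0, 1.0, 4.0, 1.0, 5.0, 9.0])
n_out = len(I) - 2

K_mat = np.zeros((n_out, len(I)))
for r in range(n_out):
    K_mat[r, r:r + 3] = [1.0, 0.0, -1.0]   # each row is the row above shifted by one

print(K_mat @ I)                                        # convolution as matrix multiplication
# Same values; np.convolve takes the kernel listed as (K(-1), K(0), K(1))
print(np.convolve(I, [-1.0, 0.0, 1.0], mode="valid"))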
Example: Convolution as edge detection
• Input: a black-and-white image of size 320x280
• Output: an image of size 319x280
• Using the kernel $K(0) = 1$, $K(1) = -1$ to process the image requires 319*280*3 operations
• Using a full matrix multiplication instead requires on the order of 320*280*319*280 floating point operations
2D Convolution Example
Why convolution?
• Parameter sharing
• Sparse interaction
• Equivariance to translation
• Flexible handling of inputs of different sizes
Sparse Connectivity
• Previously, in all of our examples, different nodes were linked by the multiplication of weights and inputs
  • In this case, each node depends on all nodes from the previous layer (called fully connected)
• In many real problems, data is organized into structures with sparse connectivity
  • For example, pixels in a picture are only correlated with nearby pixels; the farther apart two pixels are, the less dependent they become
• Sparse connectivity also reduces the computational burden
  • For fully connected networks, connecting a layer with m nodes to a layer with n nodes requires $O(mn)$ parameters
  • For sparsely connected neural networks, each node may only be connected to k nodes in the next layer (with k < n). In this case, only $O(mk)$ parameters are needed
Example:
[Figure: sparse connectivity vs. full connectivity]

Example: Sparse connectivity in multi-layer neural networks
[Figure: the receptive field of a unit $g$ in a deeper layer]
Parameter Sharing
• In a fully connected NN, the parameters that link a node with its receptive field are used only once.
• For a CNN, the entries of the kernel are reused across positions, which greatly reduces the number of parameters that need to be estimated
[Figure: CNN with a kernel with 3 parameters vs. fully connected NN]
Equivariant to Translation Property
• Translating the input before taking the convolution gives the same result as taking the convolution first and then translating.
• Mathematically,
  $$s(t) = (f * g)(t) = \int f(t - x)\, g(x)\, dx$$
• Translating $s$ by $\Delta t$ we get
  $$s(t + \Delta t) = \int f(t + \Delta t - x)\, g(x)\, dx = \int \tilde{f}(t - x)\, g(x)\, dx$$
  where $\tilde{f}(t) := f(t + \Delta t)$ (which can be considered a shifted image)
• Convolution is not necessarily equivariant to other transformations (e.g., scaling)
Standard CNN Architecture
• A CNN layer usually follows the pattern:
  • Convolution
  • Detector (applying a non-linear activation function to the output of the convolution)
  • Pooling
    • E.g., max pooling: selecting the max value of a rectangular region.
Pooling
• Pooling helps to make the convolution output stable
  • E.g., more invariant (stable) to translations
• This is very useful in image analysis
  • E.g., a small shift in the image should not affect the determination of whether the image is a cat or a dog.
Pooling
• Pooling is a down-sampling procedure, intended to make the detector output invariant to small shifts in the data
• Pooling is almost always done using a 2x2 filter
• Within each filter window, we retain either
  • The average (average pooling), or
  • The max (max pooling)
• For image analysis, many tasks involve edge detection
  • Max pooling works much better than average pooling here
  • Edges are where pixel values change rapidly; averaging tends to make these changes go away.
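A minimal NumPy sketch of 2x2 max pooling with stride 2 on a single-channel feature map (the input values are illustrative):

import numpy as np

def max_pool_2x2(x):
    # Assumes H and W are even; groups the map into 2x2 blocks and keeps the max of each
    H, W = x.shape
    return x.reshape(H // 2, 2, W // 2, 2).max(axis=(1, 3))

x = np.array([[1, 3, 2, 0],
              [4, 2, 1, 5],
              [6, 0, 7, 1],
              [2, 8, 3, 4]], dtype=float)
print(max_pool_2x2(x))
# [[4. 5.]
#  [8. 7.]]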
Stride
• Stride controls how the filter convolves around the input volume.
• The amount by which the filter shifts is the stride.
• Stride is normally set so that the output size is an integer and not a fraction.
• Stride, similar to pooling, is a downsampling technique.
Stride is a down-sampling procedure

[Figure: a stride-2 convolution is equivalent to a stride-1 convolution followed by down-sampling]
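A small numerical check of this equivalence (the input and kernel are illustrative):

import numpy as np

I = np.arange(8, dtype=float)
K = np.array([1.0, -1.0])

stride1 = np.array([np.dot(I[t:t + 2], K) for t in range(len(I) - 1)])
stride2 = np.array([np.dot(I[t:t + 2], K) for t in range(0, len(I) - 1, 2)])

# Stride 2 gives the same values as stride 1 followed by keeping every other output
print(np.allclose(stride2, stride1[::2]))   # True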
Padding
• Applying convolutional filters often leads to a reduction of the image size
• If the filter is applied multiple times, the image could shrink to nothing.
• To avoid this, we can apply a very simple technique called zero padding (padding for short).
• The general formula for the output size is
  $$O = \frac{W - K + 2P}{S} + 1$$
  • W is the width of the input
  • K is the kernel size
  • P is the amount of padding
  • S is the size of the stride
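A small helper evaluating this formula (the function name is hypothetical; it assumes the stride divides evenly, otherwise the result is floored):

def conv_output_size(W, K, P=0, S=1):
    # O = (W - K + 2P) / S + 1
    return (W - K + 2 * P) // S + 1

# AlexNet-style first layer: width 224, 11x11 kernel, stride 4, no padding
print(conv_output_size(224, 11, P=0, S=4))   # 54
# The same kernel with padding 5 and stride 1 keeps the width at 224
print(conv_output_size(224, 11, P=5, S=1))   # 224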
Impact of Padding

[Figure: without padding, each convolution reduces the dimension by 6 and only 3 layers are possible; with padding, the size is preserved]
Standard CNN Architectures
• CNNs typically run many convolutions in parallel
• Images are often multi-channel
  • Examples include RGB images
2D convolution Illustration
Convolution in RGB images
• RGB images contain 3 channels representing the intensities of the red, green and blue components
• More generally, data may be presented with multiple channels
How 2D-Convolution Works for Multiple
Channels

• For multi-channel data, when we say filters of size k x k, unless otherwise stated, we mean filters of size k x k x D_in
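A minimal NumPy sketch showing that one k x k filter on multi-channel input is really k x k x D_in: the products are summed over all input channels (shapes are illustrative):

import numpy as np

def conv2d_multichannel(I, K):
    # I: (H, W, D_in), K: (k, k, D_in) -> output: (H - k + 1, W - k + 1)
    H, W, D = I.shape
    k = K.shape[0]
    out = np.zeros((H - k + 1, W - k + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(I[i:i + k, j:j + k, :] * K)
    return out

I = np.random.rand(5, 5, 3)     # e.g. a tiny RGB patch
K = np.random.rand(3, 3, 3)     # one 3 x 3 x 3 filter -> one output channel
print(conv2d_multichannel(I, K).shape)   # (3, 3)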
How to change dimensions between different
layers with 2D Convolution
• Let us say that the input layer has height, width and depth $H_{in}, W_{in}, D_{in}$
• Let us say that the output layer has height, width and depth $H_{out}, W_{out}, D_{out}$
• The idea is to apply $D_{out}$ filters and stack their outputs
3D convolution
• In addition to 2D filters, 3D filters can be used.
1 x 1 Convolution
• While called a 1 x 1 convolution, it is really a 3D convolution with a 1 x 1 x D filter, where D is the depth of the input layer
• Applying one such 1 x 1 x D filter yields an output of size W x H x 1
• Applying N such filters gives an output of size W x H x N
Why 1 x 1 Convolution
• Google's Inception network used this structure
• Several notable benefits
  • Dimension reduction
    • Applying a 1 x 1 x D filter collapses the input to dimension W x H x 1
  • Information embedding
    • Even if the entire depth is collapsed to 1, a considerable amount of information can still be retained
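A minimal NumPy sketch of the 1 x 1 convolution (sizes are illustrative): at every spatial position it is just a weighted sum across the D input channels, so N filters map W x H x D to W x H x N:

import numpy as np

H, W, D, N = 4, 4, 64, 8
x = np.random.rand(H, W, D)
filters = np.random.rand(N, D)          # each row is one 1 x 1 x D filter

out = np.einsum("hwd,nd->hwn", x, filters)
print(out.shape)                        # (4, 4, 8): depth reduced from 64 to 8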
Transposed
convolution
• Recall that convolution is a linear operation that can be written as a matrix multiplication
[Figure: a length-16 vector (Z1, ..., Z16) and the corresponding 4x4 grid (Z1 ... Z16), illustrating the reshaping between the flattened and spatial representations]
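A minimal NumPy sketch of this view (sizes are illustrative): if the convolution is $\vec{y} = \tilde{K}\vec{x}$, the transposed convolution multiplies by $\tilde{K}^{T}$, mapping the smaller output back to the input's shape (it restores the shape, not the original values):

import numpy as np

x = np.arange(6, dtype=float)                    # input of length 6
K_mat = np.zeros((4, 6))                         # valid convolution with a length-3 kernel
for r in range(4):
    K_mat[r, r:r + 3] = [1.0, 0.0, -1.0]

y = K_mat @ x                                    # forward convolution: length 4
x_up = K_mat.T @ y                               # transposed convolution: back to length 6
print(y.shape, x_up.shape)                       # (4,) (6,)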
Computation of Convolution – Separable
Kernels
• Separable kernel: a kernel that can be written as the outer product of two vectors
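A minimal sketch (using SciPy, with a Sobel kernel as the example): a separable kernel is the outer product of two vectors, so the 2D convolution can be replaced by two cheaper 1D convolutions:

import numpy as np
from scipy.signal import convolve2d

col = np.array([[1.0], [2.0], [1.0]])    # smoothing along one axis
row = np.array([[1.0, 0.0, -1.0]])       # differencing along the other
K = col @ row                            # 3x3 Sobel kernel as an outer product

I = np.random.rand(6, 6)
full_2d = convolve2d(I, K, mode="valid")
two_1d = convolve2d(convolve2d(I, col, mode="valid"), row, mode="valid")
print(np.allclose(full_2d, two_1d))      # True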


ImageNet Competition
• 14 million images in over 20,000 categories
  • E.g., cat, dog, balloon, strawberry
• Over 1 million images have boxes around the object of interest
• Annotated by crowdsourcing
• Led by Fei-Fei Li
ImageNet Competition
ImageNet Competition
• Classification Error of winners over the years
AlexNet Architecture
Keras implementation:

import keras
from keras.models import Sequential
from keras.layers import Conv2D, MaxPooling2D, Activation, Flatten, Dense, Dropout

model = Sequential()

# 1st Convolutional Layer
model.add(Conv2D(filters=96, input_shape=(224,224,3), kernel_size=(11,11),
                 strides=(4,4), padding='valid'))
model.add(Activation('relu'))
# Max Pooling
model.add(MaxPooling2D(pool_size=(2,2), strides=(2,2), padding='valid'))

# 2nd Convolutional Layer
model.add(Conv2D(filters=256, kernel_size=(11,11), strides=(1,1), padding='valid'))
model.add(Activation('relu'))
# Max Pooling
model.add(MaxPooling2D(pool_size=(2,2), strides=(2,2), padding='valid'))

# 3rd Convolutional Layer
model.add(Conv2D(filters=384, kernel_size=(3,3), strides=(1,1), padding='valid'))
model.add(Activation('relu'))

# 4th Convolutional Layer
model.add(Conv2D(filters=384, kernel_size=(3,3), strides=(1,1), padding='valid'))
model.add(Activation('relu'))

# 5th Convolutional Layer
model.add(Conv2D(filters=256, kernel_size=(3,3), strides=(1,1), padding='valid'))
model.add(Activation('relu'))
# Max Pooling
model.add(MaxPooling2D(pool_size=(2,2), strides=(2,2), padding='valid'))

# Passing it to a Fully Connected layer
model.add(Flatten())
# 1st Fully Connected Layer
model.add(Dense(4096, input_shape=(224*224*3,)))
model.add(Activation('relu'))
# Add Dropout to prevent overfitting
model.add(Dropout(0.4))

# 2nd Fully Connected Layer
model.add(Dense(4096))
model.add(Activation('relu'))
# Add Dropout
model.add(Dropout(0.4))

# 3rd Fully Connected Layer
model.add(Dense(1000))
model.add(Activation('relu'))
# Add Dropout
model.add(Dropout(0.4))

# Output Layer
model.add(Dense(17))
model.add(Activation('softmax'))

model.summary()

# Compile the model
model.compile(loss=keras.losses.categorical_crossentropy,
              optimizer='adam', metrics=["accuracy"])
My GPU Machine
Local Response Normalization (LRN)
• LRN was a technique introduced in AlexNet
• The original motivation was to make sure that the output from each layer is bounded, and that its magnitude does not change with the depth of the network
• Two types of LRN
  • Inter-channel normalization:
  $$b^{i}_{x,y} = \frac{a^{i}_{x,y}}{\left(k + \alpha \sum_{j=\max(0,\, i-n/2)}^{\min(N-1,\, i+n/2)} \left(a^{j}_{x,y}\right)^{2}\right)^{\beta}}$$
Inter-channel vs. Intra-channel LRN
Inter Channel LRN Example

• $k = 0$, $\alpha = 1$, $\beta = 1$, $n = 2$
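A minimal NumPy sketch of inter-channel LRN using the toy settings above (k = 0, alpha = 1, beta = 1, n = 2); the input values are illustrative:

import numpy as np

def inter_channel_lrn(a, k=0.0, alpha=1.0, beta=1.0, n=2):
    # a has shape (channels, H, W); each channel is normalized by its n/2 neighbors
    C = a.shape[0]
    b = np.zeros_like(a)
    for i in range(C):
        lo, hi = max(0, i - n // 2), min(C - 1, i + n // 2)
        denom = (k + alpha * np.sum(a[lo:hi + 1] ** 2, axis=0)) ** beta
        b[i] = a[i] / denom
    return b

a = np.random.rand(4, 3, 3) + 0.1   # 4 channels; offset avoids dividing by ~0 since k = 0
print(inter_channel_lrn(a).shape)   # (4, 3, 3)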
Intra-Channel LRN Example
• The intra-channel normalization works as follows:
  $$b^{k}_{x,y} = \frac{a^{k}_{x,y}}{\left(k + \alpha \sum_{i=\max(0,\, x-n/2)}^{\min(W,\, x+n/2)} \sum_{j=\max(0,\, y-n/2)}^{\min(H,\, y+n/2)} \left(a^{k}_{i,j}\right)^{2}\right)^{\beta}}$$
• In the example, $k = 0$, $\alpha = 1$, $\beta = 1$, $W = H = 8$, $n = 2$
• $a^{k}_{x,y}$ is the value at position $(x, y)$ in channel $k$
  • The activation $a$ should be distinguished from $\alpha$
Batch Normalization
• Batch normalization helps reduce internal covariate shift
Batch Normalization (BN) Algorithm
• For the values $x$ in a mini-batch, we scale and shift them so that they have the same mean and standard deviation
• The inputs in a batch are $x_1, \ldots, x_m$
• We calculate the mean and variance of the batch, which we denote $\mu_B$ and $\sigma_B^2$
• We calculate the Z-score for the batch:
  $$z_i = \sigma_B^{-1}\left(x_i - \mu_B\right)$$
• The normalized output is given by (a sketch follows below)
  $$y_i = \gamma z_i + \beta$$
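A minimal NumPy sketch of the batch-normalization forward pass described above (a small epsilon is added for numerical stability; the batch is illustrative):

import numpy as np

def batch_norm(x, gamma=1.0, beta=0.0, eps=1e-5):
    mu = x.mean(axis=0)                      # mini-batch mean
    var = x.var(axis=0)                      # mini-batch variance
    z = (x - mu) / np.sqrt(var + eps)        # z-score within the batch
    return gamma * z + beta                  # learned scale and shift

x = np.random.randn(32, 10) * 5 + 3          # a batch of 32 examples, 10 features
y = batch_norm(x)
print(y.mean(axis=0).round(3), y.std(axis=0).round(3))  # ~0 and ~1 per feature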
ZFNet
• ZFNet was the winner of the 2013 competition
• It is heavily based upon AlexNet
VGGNet
VGGNet
• M: max pooling
• LRN: local response normalization
• Different columns represent different versions of VGGNet
  • A and C represent smaller networks
  • VGG16 and VGG19 are columns D and E
VGGNet Properties
• VGGNet, while not winning the contest, proposed useful ideas that were widely used later on
• One unique feature is that VGGNet uses small filters at great depth
  • This could save parameters and potentially retrieve interesting features.
  • Greater depth means more ReLU activations and more non-linearity
• VGGNet is very slow to train
  • A workaround is to train a smaller network first, and then use the output from the smaller network as input for further training
GoogLeNet
• GoogLeNet introduced a novel concept, the inception module
• Image features can occur at different resolutions
• Using filters of different sizes can help extract features of different granularity
GoogLeNet Implementation - Bottleneck
• For filters of different sizes, we first apply a 1 x 1 convolution bottleneck to collapse the channels and reduce the dimension of the data (a sketch follows below)
• Recall: Inception V1
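A minimal Keras sketch of a GoogLeNet-style inception module (the filter counts are illustrative, not the exact Inception V1 numbers): 1 x 1 bottlenecks collapse the channels before the more expensive 3 x 3 and 5 x 5 convolutions, and the branches are concatenated along the channel axis:

from tensorflow.keras import layers

def inception_block(x):
    b1 = layers.Conv2D(64, (1, 1), padding="same", activation="relu")(x)

    b2 = layers.Conv2D(32, (1, 1), padding="same", activation="relu")(x)   # bottleneck
    b2 = layers.Conv2D(64, (3, 3), padding="same", activation="relu")(b2)

    b3 = layers.Conv2D(16, (1, 1), padding="same", activation="relu")(x)   # bottleneck
    b3 = layers.Conv2D(32, (5, 5), padding="same", activation="relu")(b3)

    b4 = layers.MaxPooling2D((3, 3), strides=(1, 1), padding="same")(x)
    b4 = layers.Conv2D(32, (1, 1), padding="same", activation="relu")(b4)

    return layers.Concatenate(axis=-1)([b1, b2, b3, b4])

inputs = layers.Input(shape=(28, 28, 192))   # illustrative input size
outputs = inception_block(inputs)
print(outputs.shape)                         # (None, 28, 28, 192)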
GoogLeNet – Output Layer
• Another interesting feature of GoogLeNet lies in the output layer
• Instead of using a fully connected layer at the end, GoogLeNet uses average pooling, which reduces the number of parameters and improves performance (by 0.7%).
GoogLeNet Dimensions
GoogLeNet
• Overall architecture
ResNet
• ResNet is the first deep learning model that attains human-level accuracy
• It mostly benefits from fitting a much deeper network
• It is generally believed that training deeper networks should ALWAYS be helpful for solving complex problems
  • Shallow networks are special cases of deep networks in which the extra layers are identity mappings
• Yet fitting deep networks is often not easy
  • Convergence can take a very long time;
  • There may be exploding and vanishing gradient problems
ResNet Key Algorithm
• One key idea that motivates ResNet is that, by copying the input between layers (skip connections), we allow portions of the data to be fitted with shallower networks than other parts of the data (a sketch follows below)
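A minimal Keras sketch of a residual block (filter counts are illustrative, not the exact ResNet configuration): the input is carried around two convolutional layers through an identity skip connection, so the layers only need to learn the residual:

from tensorflow.keras import layers

def residual_block(x, filters=64):
    # Assumes x already has `filters` channels so the addition is well-defined
    shortcut = x
    y = layers.Conv2D(filters, (3, 3), padding="same")(x)
    y = layers.BatchNormalization()(y)
    y = layers.Activation("relu")(y)
    y = layers.Conv2D(filters, (3, 3), padding="same")(y)
    y = layers.BatchNormalization()(y)
    y = layers.Add()([y, shortcut])          # the identity skip connection
    return layers.Activation("relu")(y)

inputs = layers.Input(shape=(32, 32, 64))    # illustrative input size
outputs = residual_block(inputs, filters=64)
print(outputs.shape)                         # (None, 32, 32, 64)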
ResNet Implementation
Other Image Processing Applications
• In addition to classification, there are often more complex tasks that
are based upon similar models
• Object localization
• Object detection
• Segmentation
• Video processing
Object Localization
• Question: can we draw a box over the object that we deem present in the image?
• It seems natural that we can answer object localization based upon a classification model, e.g., AlexNet
• For the last couple of fully connected layers that are used for classification, we instead train a regression-type model
• The box surrounding the object is defined by 4 numbers: $(x, y, w, h)$
  • $(x, y)$ are the coordinates of the top-left corner of the box; $w$ and $h$ are its width and height
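A minimal Keras sketch of swapping the classification head for a box-regression head (layer sizes and the feature-map shape are illustrative assumptions, not from a specific localization paper):

from tensorflow.keras import layers

def localization_head(features):
    h = layers.Flatten()(features)
    h = layers.Dense(4096, activation="relu")(h)
    return layers.Dense(4, activation="linear")(h)   # predicts (x, y, w, h)

features = layers.Input(shape=(6, 6, 256))           # e.g. the last conv feature map
box = localization_head(features)
print(box.shape)                                     # (None, 4)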
Object Detection
• Often there is an unknown number of objects in the image
• If we somehow know a region that contains some object, we can run a standard CNN model to classify it
• So the problem boils down to finding region proposals
Region-based CNN
• One intuitive idea is to scan different regions of the image at different resolutions (but limit the number of regions to about 2,000)
• The basic algorithm is the following:
  • Run Selective Search to generate probable object regions.
  • Feed these patches to a CNN, followed by an SVM to predict the class of each patch.
  • Optimize the patches by training a bounding-box regression separately.
Spatial Pyramid Pooling
• Regions similar to those in the original image can be identified in the CNN output (feature map)
• Pooling is then designed based upon the feature map
Spatial Pyramid Pooling
YOLO (You Only
Look Once)
