Convolutional Networks1
Convolutional Networks1
Computer Vision
Computer Vision Problems:
1. Image classification
Computer Vision
Computer Vision Problems:
Computer Vision
The biggest challenge in Computer vision is that the input can become
very big
And that the goal is to learn the parameters, so that if you take the
image and convolve with the filter this give a good edge detector.
Edge Detection
Edge Detection
This can come out being better at capturing the statistic of the data,
• nxn image convolved with fxf filter will give n-f+1 x n-f+1
If you pad the original image with say one more line outside,
meaning p = 1
e.g. 6x6 becomes 8x8, and the result of the convolution with the 3x3,
the result is 6x6
Padding
n+2p-f+1 x n+2p-f+1
Padding
• Types of padding can be Valid or Same
• Valid: nxn * fxf -> n-f+1 x n-f+1
Padding
Same: Pad so that output size is same as input size
n + 2p - f + 1 x n + 2p - f + 1
n + 2p - f + 1 = n
p = (f-1)/2
What of 5x5 filter?
Padding
In Computer Vision f is usually odd
Makes padding simpler
Has a central pixel
Strided convolution
Padding
nxn*fxf with pad p and stride s then
(n + 2p – f)/s + 1 x (n + 2p – f)/s + 1
Convolution
Convolutions used to generate feature maps
Take an image
+
A filter
=
Feature map
CNN for Classification
Layers in Convolution Neural Networks
Convolution Layer
It has a number of filters that perform convolution
Every image is considered a matrix of pixel values
Local Connectivity
Local Connectivity
Instead of the dot product of the feed forward, apply convolution
Every single neuron in the hidden layer only sees a patch of input in its
previous layer