Deep Learning based Computer Vision
Deep Learning based Computer Vision
r ( x, y )
f ( x, y ) = g ( x, y )
b ( x, y )
The spatial filter mask is moved from point to point in an image. At each point (x, y),
the response of the filter is calculated
x
Neighbourhood
(x, y) Origin
y Sengar
Dr. Sandeep Singh Image f (x, y)
Neighbourhood Operations
Neighbourhood
(x, y)
Origin
y Image f (x, y)
Dr. Sandeep Singh Sengar
The Spatial Filtering Process
Origin x
a b c j k l
d
g
e
h
f
i
* m
p
n
q
o
r
Original Filter (w)
Simple 3*3
e 3*3 Filter Image
Neighbourhood Pixels
eprocessed = n*e + j*a + k*b +
l*c + m*d + o*f + p*g + q*h +
r*i
y Image f (x, y)
99 106 98
95 90 85
*
1/ 100108
9 /9 /9
104 1 1 Original Filter
Simple 3*3 /9 1106
199 /9 198
/9
3*3 Smoothing Image
Neighbourhood /9 190
195 /9 185
/9
Filter Pixels
Standard average
e e e
y
Image f (x, y)
border padding
Medical
Imaging
Ref:https://www.forbes.com/sites/kalevleetaru/2019/01/15/why-machine-learning-needs-semantics-not-
just-statistics/?sh=730fa3aa77b5 36
Dr. Sandeep Singh Sengar
Branch of Machine Learning
37
Dr. Sandeep Singh Sengar
Ref: https://www.wordstream.com/blog/ws/2017/07/28/machine-learning-applications
Deep Learning
Deep Learning is a subfield of machine learning concerned
with algorithms inspired by the structure and function of the
brain called artificial neural networks.
DL/ML is used to find the algorithm (model)
Large data High performance
Architecture Application
CNN Image recognition, video analysis, natural language processing
Stride s=2
#filters=2
#channels=3
Padding p=1
Stride: s
O/P size= [(n+2p-f)/s+1]* [(n+2p-f)/s+1]
Sparser representations:
Detect less (more abstract) features
https://towardsdatascience.com/applied-deep-learning-part-4-
convolutional-neural-networks-584bc134c1e2
Dr. Sandeep Singh Sengar
Non-linear Activation Function
23.8
12.7 18.3 22.3 7.9 8.3
Convolved Features
23.8 23.8
12.7 18.3 22.3 7.9 8.3
Convolved Features
Convolved Features
18.3
11.3 9.2 11.8 18.9 10.3
Convolved Features
18.3 18.9
11.3 9.2 11.8 18.9 10.3
Convolved Features
14.8
12.7 18.3 22.3 7.9 8.3
Convolved Features
14.8 15.6
12.7 18.3 22.3 7.9 8.3
Convolved Features
9 4x4 max
Hidden Layer i
-4 5 4 6
5 6
0 -3 2 -3 2x2 max,
8 9 non overlapping
7 8 -5 9
3 0 -4 1
5 5 6 2x2 max,
overlapping
8 8 9 (contains non-
I/P size: n*n overlapping, so
8 8 9 no need for both)
Filter size: f*f
Padding=p, Stride=s
O/P size: (n+2p-f)/s+1 Dr. Sandeep Singh Sengar
Fully Connected Layer
Ref: Ronneberger, Olaf, Philipp Fischer, and Thomas Brox. "U-net: Convolutional networks for biomedical image segmentation." In International Conference on Medical image computing and computer-assisted
intervention, pp. 234-241. Springer, Cham, 2015. Dr. Sandeep Singh Sengar
Train, Validation and Test Datasets
Make sure, validation and test set come from same distribution
Computer vision has its present challenges, but the humans working on this technology are steadily
improving it. Dr. Sandeep Singh Sengar
CNN: A Real Example