CVI Week 2 1 Pre Note
CVI Week 2 1 Pre Note
1
Feature Detection/Extraction
2
Deep Learning-based Features
10
11
Traditional visual feature descriptors
Points and patches
Affine invariance
12
13
I = imread('image.jpg');
points = detectSURFFeatures(I);
SIFT advantages.
Locality: features are local, so robust to
occlusion and clutter (no prior segmentation)
Distinctiveness: individual features can be
matched to a large database of objects
Quantity: many features can be generated for
even small objects
Efficiency: close to real-time performance
Extensibility: can easily be extended to wide
range of differing feature types
SIFT Mapping in Action…
SIFT Mapping in Action…
Traditional visual feature descriptors
Points and patches
Applications: Large-scale matching and retrieval
28
29
30
https://en.wikipedia.org/wiki/Sobel_operator
Traditional visual feature descriptors
Edges and contours
Canny edge detector
1.Apply Gaussian filter to smooth the image
in order to remove the noise
https://en.wikipedia.org/wiki/Canny_edge_detector
Traditional visual feature descriptors
Edges and contours
Contour detection
32
33
34
https://en.wikipedia.org/wiki/Hough_transform
Slide credit:
Training Training
Images
Training Labels
35
Slide credit:
Testing
Image Trained Prediction
Features Classifier Outdoor
Test Image 36
Slide credit:
Hand-crafted
SIFT [Loewe IJCV 04] HOG [Dalal and Triggs CVPR 05]
DPM [Felzenszwalb et al. PAMI 10]
3rd layer
Input data “Objects”
2nd layer
“Object parts”
1st layer
“Edges”
[Lee et al., ICML
2009; CACM 2011]
Pixels
39
40
41
Layer 1 … Layer k
42
43
https://towardsdatascience.com/what-is-deep-learning-and-how-does-it-work-2ce44bb692ac
Types of Learning & History
Brain
Supervised
learning
Unsupervised
learning
Modern
architectures
44
https://towardsdatascience.com/supervised-vs-unsupervised-learning-in-2-minutes-72dad148f242 https://medium.com/analytics-vidhya/cnns-architectures-lenet-alexnet-vgg-googlenet-resnet-and-more-666091488df5
Slide credit:
A brief history
46
Slide credit:
A brief history
47
Slide adapted from:
A brief history
Human Brain
Video 49
50
Basic definition
• Nonlinear
• Can approximate any continuous
function to arbitrary accuracy given
sufficiently many hidden units
• Layer 3 (final)
53
Basic definition
• Lots of hidden layers
• Depth = power (usually)
Weights to learn!
Weights to learn!
Weights to learn!
Weights to learn!
54
55