1.ObjectDetection Introduction
1.ObjectDetection Introduction
(x,y)
+
class
Template
Image
CV3DST | Prof. Leal-Taixé 5
Traditional object detection methods
• 1. Template matching + sliding window
Image
CV3DST | Prof. Leal-Taixé 6
Traditional object detection methods
• 1. Template matching + sliding window
Image
CV3DST | Prof. Leal-Taixé 7
Traditional object detection methods
• 1. Template matching + sliding window
Image HIGH
CV3DST | Prof. Leal-Taixé
correlation 8
Traditional object detection methods
• Problems of 1. Template matching + sliding window
Image LOW
CV3DST | Prof. Leal-Taixé
correlation 9
Traditional object detection methods
• Problems of 1. Template matching + sliding window
– Occlusions: we need to see the WHOLE object
– This works to detect a given instance of an object but not
a class of objects
Appearance and
shape changes
Pose changes
CV3DST | Prof. Leal-Taixé 10
Traditional object detection methods
• Problems of 1. Template matching + sliding window
– Occlusions: we need to see the WHOLE object
– This works to detect a given instance of an object but not
a class of objects
– Objects have an unknown position, scale and aspect
ratio, the search space is searched inefficiently with
sliding window
Viola and Jones. Rapid object detection using a boosted cascade of simple features. CVPR 2001.
Haar features
Viola and Jones. Rapid object detection using a boosted cascade of simple features. CVPR 2001.
Viola and Jones. Rapid object detection using a boosted cascade of simple features. CVPR 2001.
CV3DST | Prof. Leal-Taixé Dalal and Triggs. Histogram of oriented gradients for human detection. CVPR 2005. 17
Histogram of Oriented Gradients
• 2. Feature extraction + classification
CV3DST | Prof. Leal-Taixé Dalal and Triggs. Histogram of oriented gradients for human detection. CVPR 2005. 19
Histogram of Oriented Gradients
• 2. Feature extraction + classification
CV3DST | Prof. Leal-Taixé Dalal and Triggs. Histogram of oriented gradients for human detection. CVPR 2005. 20
Deformable Part Model
• Also based on HOG features, but based on body part
detection à more robust to different body poses
CV3DST | Prof. Leal-Taixé Felzenszwalb et al. A discriminatively trained, multiscale, deformable part model. CVPR 2008. 21
How to move
towards general
object detection?
CV3DST | Prof. Leal-Taixé 22
What defines an object?
• We need a generic, class-agnostic objectness
measure: how likely it is for an image region to
contain an object
Very likely to be
an object
Maybe it is an
object
+ classifier
A A A
Intersection Union
CV3DST | Prof. Leal-Taixé 29
Non-Maximum Suppression (NMS)
CV3DST | Prof. Leal-Taixé Hosang, Benenson and Schiele. A Convnet for Non-Maximum Suppression. 2015 31
NMS: the problem
• Choosing a narrow threshold
Ground truth positions
False positives
Low Precision
CV3DST | Prof. Leal-Taixé Hosang, Benenson and Schiele. A Convnet for Non-Maximum Suppression. 2015 32
NMS: the problem
• Choosing a wider threshold
Ground truth position
False negative
False positive
Low Recall
CV3DST | Prof. Leal-Taixé Hosang, Benenson and Schiele. A Convnet for Non-Maximum Suppression. 2015 33
Non-Maximum Suppression (NMS)
• NMS will be used at test time. Most detection
methods (even Deep Learning ones) use NMS!
• Two-stage detectors
Class score (cat,
Classification
Extraction of dog, person)
Feature
Image object
extraction
proposals Localization Refine bounding box
(Δx, Δy, Δw, Δh)
CV3DST | Prof. Leal-Taixé 36
Types of object detectors
• One-stage detectors
– YOLO, SSD, RetinaNet
– CenterNet, CornerNet, ExtremeNet
• Two-stage detectors
– R-CNN, Fast R-CNN, Faster R-CNN
– SPP-Net, R-FCN, FPN