Object Detection1
Object Detection1
deer
cat
Object Detection
Class Scores
Deer: 0.9
Fully Connected: Cat: 0.05
4096 to k Umbrella: 0.01
…
Fully Connected:
4096 to 4 Box Coordinates
(x, y, w, h)
Object Detection
Penguin: (x, y, w, h)
4096 Penguin: (x, y, w, h)
Penguin: (x, y, w, h)
Penguin: (x, y, w, h)
…
Object Detection as Classification
deer?
CNN cat?
background?
Object Detection as Classification
deer?
CNN cat?
background?
Object Detection as Classification
deer?
CNN cat?
background?
Object Detection as Classification with Sliding
Window
deer?
CNN cat?
background?
Object Detection as Classification with Box
Proposals
RCNN
https://people.eecs.berkeley.edu/~rbg/papers/r-cnn-cvpr.pdf
Rich feature hierarchies for accurate object detection and semantic segmentation.
Girshick et al. CVPR 2014.
RCNN
First stage: generate category-
independent region proposals.
• 2000 Region proposals for every image
• A multistage pipeline.
• Training is expensive in
space and time (features
are extracted from each
region proposal in each
?
image and written into
disk).
• Object detection is slow.
Fast-RCNN
+ four real-valued
FC+ numbers for each of
regressor the K object classes.
…
RCNN vs Fast-RCNN
k anchors boxes
2k scores 4k coordinates
Fast-RCNN
feature map
sliding window, nxn
…
RCNN vs Fast-RCNN
?
disk).
• Object detection is slow.
YOLO- You Only Look Once
• extremely fast
• reason globally
• learn generalizable represen-
tations
https://arxiv.org/abs/1506.02640
Redmon et al. CVPR 2016.
YOLO- You Only Look Once