0% found this document useful (0 votes)

55 views32 pages

Object Detection With Deformable Part-Based Models: Many Slides Based On

The document discusses object detection using deformable part-based models. It describes how histograms of oriented gradients (HOG) can be used to detect objects like pedestrians using a linear support vector machine. It then introduces discriminative part-based models that use a root filter and part filters, along with deformation weights, to represent objects with parts that can vary. The models are trained using a latent structural SVM approach on labeled image data. The trained models achieve state-of-the-art results on object detection benchmarks like the PASCAL VOC challenge. Later, deep convolutional neural networks are incorporated into object detection systems to further improve performance.

Uploaded by

JAGANNATHAN S

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

55 views32 pages

Object Detection With Deformable Part-Based Models: Many Slides Based On

Uploaded by

JAGANNATHAN S

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Object detection with

deformable part-based models

Many slides based on P. Felzenszwalb

Challenge: Generic object detection
Histograms of oriented gradients (HOG)
Partition image into blocks and compute histogram of
gradient orientations in each block

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection,

CVPR 2005

Image credit: N. Snavely

Pedestrian detection with HOG
Train a pedestrian template using a linear
support vector machine
posi%ve training examples

nega%ve training examples

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection,

CVPR 2005
Pedestrian detection with HOG
Train a pedestrian template using a linear support vector
machine
At test time, convolve feature map with template
Find local maxima of response
For multi-scale detection, repeat over multiple levels of a
HOG pyramid
HOG feature map Template Detector response map

N. Dalal and B. Triggs, Histograms of Oriented Gradients for Human Detection,

CVPR 2005
Example detections

[Dalal and Triggs, CVPR 2005]

Are we done?
Single rigid template usually not enough to
represent a category
Many objects (e.g. humans) are articulated, or
have parts that can vary in configuration

Many object categories look very different from

different viewpoints, or from instance to instance

Slide by N. Snavely
Discriminative part-based models

Root Part Deformation

filter filters weights

P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan,

Object Detection with Discriminatively Trained Part Based Models, PAMI 32(9), 2010
Discriminative part-based models

Multiple components

P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan,

Object Detection with Discriminatively Trained Part Based Models, PAMI 32(9), 2010
Discriminative part-based models

P. Felzenszwalb, R. Girshick, D. McAllester, D. Ramanan,

Object Detection with Discriminatively Trained Part Based Models, PAMI 32(9), 2010
Object hypothesis
Multiscale model: the resolution of part
filters is twice the resolution of the root
Scoring an object hypothesis
The score of a hypothesis is the sum of filter scores
minus the sum of deformation costs
Subwindow
n features n Displacements
score(p 0 ,..., p n ) = Fi H (p i ) Di (dxi , dyi ,dxi2 , dyi2 )
i =0 i =1

Filters Deformation weights

Scoring an object hypothesis
The score of a hypothesis is the sum of filter scores
minus the sum of deformation costs
Subwindow
n features n Displacements
score(p 0 ,..., p n ) = Fi H (p i ) Di (dxi , dyi ,dxi2 , dyi2 )
i =0 i =1

Filters Deformation weights

score(z ) = w H (z )

Concatenation of filter Concatenation of

and deformation subwindow features
weights and displacements
Detection
Define the score of each root filter location as the
score given the best part placements:
score(p 0 ) = max score(p 0 ,..., p n )
p1 ,...,p n
Detection
Define the score of each root filter location as the
score given the best part placements:

score(p 0 ) = max score(p 0 ,..., p n )

p1 ,...,p n
Efficient computation: generalized distance transforms
For each default part location, find the score of the
best displacement

(
Ri ( x, y ) = max Fi H ( x + dx, y + dy ) Di (dx, dy, dx 2 , dy 2 )
dx , dy
)

Head filter Deformation

cost
Detection
Define the score of each root filter location as the
score given the best part placements:

score(p 0 ) = max score(p 0 ,..., p n )

p1 ,...,p n
Efficient computation: generalized distance transforms
For each default part location, find the score of the
best displacement

(
Ri ( x, y ) = max Fi H( x + dx, y + dy ) Di (dx, dy , dx 2 , dy 2 )
dx , dy
)

Head
Distance
filter transform
responses
Head filter
Detection
Detection result
Training
Training data consists of images with labeled
bounding boxes
Need to learn the filters and deformation parameters
Training
Our classifier has the form

f (x) = max z w H (x, z )

w are model parameters, z are latent hypotheses

Latent SVM training:

Initialize w and iterate:
Fix w and find the best z for each training example (detection)
Fix z and solve for w (standard SVM training)

Issue: too many negative examples

Do data mining to find hard negatives
Car model

Component 1

Component 2
Car detections
Person model
Person detections
Cat model
Cat detections
Bottle model
More detections
PASCAL VOC Challenge (2005-2012)
http://host.robots.ox.ac.uk/pascal/VOC/

Challenge classes:
Person: person
Animal: bird, cat, cow, dog, horse, sheep
Vehicle: aeroplane, bicycle, boat, bus, car, motorbike, train
Indoor: bottle, chair, dining table, potted plant, sofa, tv/monitor

Dataset size (by 2012):

11.5K training/validation images, 27K bounding boxes, 7K
segmentations
Quantitative results (PASCAL 2008)
7 systems competed in the 2008 challenge
Out of 20 classes, first place in 7 classes and
second place in 8 classes

Bicycles Person Bird

DPM DPM

DPM
Object detection progress
PASCAL VOC

80%

70%
mean0Average0Precision0 (mAP)

60% Before deep convnets

50%

40%
Using deep convnets
30%

20%

10%

0%
2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016
year
Detection with deep networks

Object detection system overview. Our system (1) takes an input image, (2) extracts
around 2000 bottom-up region proposals, (3) computes features for each proposal using
a large convolutional neural network (CNN), and then (4) classifies each region using
class-specific linear SVMs. R-CNN achieves a mean average precision (mAP) of
53.7% on PASCAL VOC 2010. For comparison, Uijlings et al. (2013) report 35.1% mAP
using the same region proposals, but with a spatial pyramid and bag-of-visual-words
approach. The popular deformable part models perform at 33.4%.
R. Girshick, J. Donahue, T. Darrell, and J. Malik,
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation,
CVPR 2014.

Object Detection With Discriminatively Trained Part Based Models
No ratings yet
Object Detection With Discriminatively Trained Part Based Models
20 pages
Object Detection Using The Statistics
No ratings yet
Object Detection Using The Statistics
27 pages
Oxford-IIIT TRECVID 2010 - Notebook Paper
No ratings yet
Oxford-IIIT TRECVID 2010 - Notebook Paper
5 pages
Lec36 Obj Detn
No ratings yet
Lec36 Obj Detn
60 pages
Classifier
No ratings yet
Classifier
39 pages
Ref 2
No ratings yet
Ref 2
19 pages
EScholarship UC Item 3rd9150m
No ratings yet
EScholarship UC Item 3rd9150m
128 pages
Research Article: An Evaluation of Deep Learning Methods For Small Object Detection
No ratings yet
Research Article: An Evaluation of Deep Learning Methods For Small Object Detection
18 pages
1.ObjectDetection Introduction
No ratings yet
1.ObjectDetection Introduction
38 pages
Document 1
No ratings yet
Document 1
4 pages
Object Detection With Deep Learning: A Review
No ratings yet
Object Detection With Deep Learning: A Review
21 pages
An Evaluation of Deep Learning Methods For Small Object
No ratings yet
An Evaluation of Deep Learning Methods For Small Object
18 pages
Introduction To Object Detection
No ratings yet
Introduction To Object Detection
24 pages
MarszalekSchmid CVPR06 SpatialWeighting
No ratings yet
MarszalekSchmid CVPR06 SpatialWeighting
9 pages
Scalable Object Detection
No ratings yet
Scalable Object Detection
8 pages
Parkhi 2011 The Truth About Cats and Dogs
No ratings yet
Parkhi 2011 The Truth About Cats and Dogs
8 pages
Overfeat
No ratings yet
Overfeat
58 pages
1 Realtimeobjectdetection
No ratings yet
1 Realtimeobjectdetection
6 pages
1 ObjectDetection
No ratings yet
1 ObjectDetection
46 pages
A Comprehensive Study of Camouflaged Object Detection Using Deep Learning
No ratings yet
A Comprehensive Study of Camouflaged Object Detection Using Deep Learning
8 pages
Region-Based Convolutional Networks For Accurate Object Detection and Segmentation
No ratings yet
Region-Based Convolutional Networks For Accurate Object Detection and Segmentation
17 pages
IT5409 - Ch7 - Part2 - Object Recognition - v2 - 4pages
No ratings yet
IT5409 - Ch7 - Part2 - Object Recognition - v2 - 4pages
38 pages
Lesson 07
No ratings yet
Lesson 07
59 pages
Object Detection
No ratings yet
Object Detection
96 pages
Object Detection and Recognition: CS 534 Spring 2005: A. Elgammal Rutgers University
No ratings yet
Object Detection and Recognition: CS 534 Spring 2005: A. Elgammal Rutgers University
25 pages
Object Detection With Deep Learning: A Review
No ratings yet
Object Detection With Deep Learning: A Review
21 pages
Learning To Detect Objects in Images Via A Sparse, Part-Based Representation
No ratings yet
Learning To Detect Objects in Images Via A Sparse, Part-Based Representation
28 pages
Object Detection Using TensorFlow
No ratings yet
Object Detection Using TensorFlow
21 pages
Literature Survey For Robotics
No ratings yet
Literature Survey For Robotics
6 pages
Tensor Flow
No ratings yet
Tensor Flow
5 pages
RO47002 - Lecture 2A - Case Study Visual Object Detection
No ratings yet
RO47002 - Lecture 2A - Case Study Visual Object Detection
24 pages
Rich Feature Hierarchies For Accurate Object Detection and Semantic Segmentation
No ratings yet
Rich Feature Hierarchies For Accurate Object Detection and Semantic Segmentation
8 pages
机器学习读书会嘉宾分享计算机视觉目标检测
No ratings yet
机器学习读书会嘉宾分享计算机视觉目标检测
52 pages
Computer Vision Application
No ratings yet
Computer Vision Application
2 pages
Image and Video Analytics Unit 3
No ratings yet
Image and Video Analytics Unit 3
18 pages
On Hyperbolic Embeddings in Object Detection
No ratings yet
On Hyperbolic Embeddings in Object Detection
19 pages
Recent Advances in Deep Learning For Object Detection
No ratings yet
Recent Advances in Deep Learning For Object Detection
26 pages
General Framework For Object Detection
No ratings yet
General Framework For Object Detection
9 pages
Attribute-Centric Recognition For Cross-Category Generalization
No ratings yet
Attribute-Centric Recognition For Cross-Category Generalization
8 pages
DL1 Ver1
No ratings yet
DL1 Ver1
49 pages
Ballsack, Rotate, Eat Poop
No ratings yet
Ballsack, Rotate, Eat Poop
1 page
Large-Scale Image Classification
No ratings yet
Large-Scale Image Classification
8 pages
DL Unit-5
No ratings yet
DL Unit-5
34 pages
PHD Visual Object Category Recognition
No ratings yet
PHD Visual Object Category Recognition
193 pages
Generalized Focal Loss Towards Efficient Representation Learning For Dense Object Detection
No ratings yet
Generalized Focal Loss Towards Efficient Representation Learning For Dense Object Detection
15 pages
Region-Based Convolutional Networks For Accurate Object Detection and Segmentation
No ratings yet
Region-Based Convolutional Networks For Accurate Object Detection and Segmentation
21 pages
03-3 Feature Descriptors
No ratings yet
03-3 Feature Descriptors
58 pages
Large-Scale Image Classification: Fast Feature Extraction and SVM Training
No ratings yet
Large-Scale Image Classification: Fast Feature Extraction and SVM Training
8 pages
Object Detection and Identification
67% (3)
Object Detection and Identification
20 pages
Learning To Detect Objects in Images Via A Sparse, Part-Based Representation
No ratings yet
Learning To Detect Objects in Images Via A Sparse, Part-Based Representation
16 pages
Trainable COSFIRE Filters For Keypoint Detection and Pattern Recognition
No ratings yet
Trainable COSFIRE Filters For Keypoint Detection and Pattern Recognition
15 pages
Bai09 Descriptors
No ratings yet
Bai09 Descriptors
81 pages
Tinaface: Strong But Simple Baseline For Face Detection
No ratings yet
Tinaface: Strong But Simple Baseline For Face Detection
9 pages
The Fastest Deformable Part Model For Object Detection
No ratings yet
The Fastest Deformable Part Model For Object Detection
8 pages
Younis 2020
No ratings yet
Younis 2020
5 pages
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
From Everand
Scale Invariant Feature Transform: Unveiling the Power of Scale Invariant Feature Transform in Computer Vision
Fouad Sabry
No ratings yet
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Learn Statistics Fast: A Simplified Detailed Version for Students
From Everand
Learn Statistics Fast: A Simplified Detailed Version for Students
Hesbon R.M
No ratings yet
Pyramid Image Processing: Exploring the Depths of Visual Analysis
From Everand
Pyramid Image Processing: Exploring the Depths of Visual Analysis
Fouad Sabry
No ratings yet
Radiosity Computer Graphics: Advancing Visualization through Radiosity in Computer Vision
From Everand
Radiosity Computer Graphics: Advancing Visualization through Radiosity in Computer Vision
Fouad Sabry
No ratings yet
Modeling Intrusion Detection System Using Hybrid Intelligent Systems
No ratings yet
Modeling Intrusion Detection System Using Hybrid Intelligent Systems
21 pages
Debapriya Sengupta, Goutam Saha - Identification of The Major Language Families of India and Evaluation of Their Mutual Influence 2016
No ratings yet
Debapriya Sengupta, Goutam Saha - Identification of The Major Language Families of India and Evaluation of Their Mutual Influence 2016
16 pages
M.Tech - CSE Syllabus SIT Autonomy
No ratings yet
M.Tech - CSE Syllabus SIT Autonomy
88 pages
Deliverable Document - Template
No ratings yet
Deliverable Document - Template
7 pages
Frontal Lobe Real-Time EEG Analysis Using Machine Learning Techniques For Mental Stress Detection
No ratings yet
Frontal Lobe Real-Time EEG Analysis Using Machine Learning Techniques For Mental Stress Detection
11 pages
Crop Yield Report BT-4-1
No ratings yet
Crop Yield Report BT-4-1
23 pages
4-1 - Machine Learning - Intro-Classification
100% (1)
4-1 - Machine Learning - Intro-Classification
63 pages
Support Vector Machine (SVM)
No ratings yet
Support Vector Machine (SVM)
5 pages
Updated - Mini - Project - II - Predictive Maintenance Analysis For Industrial Machinary
No ratings yet
Updated - Mini - Project - II - Predictive Maintenance Analysis For Industrial Machinary
39 pages
AI Using Python
No ratings yet
AI Using Python
9 pages
A Survey of Machine Learning Algorithms For Big Data Analytics
No ratings yet
A Survey of Machine Learning Algorithms For Big Data Analytics
4 pages
A Comprehensive Analysis and Prediction of Earthquake Magnitude Based On Position and Depth Parameters Using Machine and Deep Learning Models
No ratings yet
A Comprehensive Analysis and Prediction of Earthquake Magnitude Based On Position and Depth Parameters Using Machine and Deep Learning Models
20 pages
SMV 3
No ratings yet
SMV 3
23 pages
Introduction To Data Mining Global Edition Pang Ning Tan Michael Steinbach Anuj Karpatne Vipin Kumar
No ratings yet
Introduction To Data Mining Global Edition Pang Ning Tan Michael Steinbach Anuj Karpatne Vipin Kumar
79 pages
An EEG-based Machine Learning Framework For Depression Detection Using Effective Connectivity Analysis
No ratings yet
An EEG-based Machine Learning Framework For Depression Detection Using Effective Connectivity Analysis
20 pages
Iris Recognition Using SVM and ANN
No ratings yet
Iris Recognition Using SVM and ANN
5 pages
License Plate Recognition System (LPR)
No ratings yet
License Plate Recognition System (LPR)
20 pages
Argument Reality
No ratings yet
Argument Reality
18 pages
Final Exam Review: Nishant Mehta
No ratings yet
Final Exam Review: Nishant Mehta
32 pages
Machine Learning A Bayesian and Optimization Perspective 1st Edition by Sergios Theodoridis
No ratings yet
Machine Learning A Bayesian and Optimization Perspective 1st Edition by Sergios Theodoridis
329 pages
A Machine Learning Model For Average Fuel Consumption in Heavy Vehicles
100% (1)
A Machine Learning Model For Average Fuel Consumption in Heavy Vehicles
70 pages
Image Analysis, Classification and Change Detection in Remote Sensing: With Algorithms For Python Fourth Edition Canty
100% (2)
Image Analysis, Classification and Change Detection in Remote Sensing: With Algorithms For Python Fourth Edition Canty
57 pages
Pyspark - Mllib Package
No ratings yet
Pyspark - Mllib Package
87 pages
M.Tech Software Engineering Full Syllabus
No ratings yet
M.Tech Software Engineering Full Syllabus
67 pages
Libsvm
No ratings yet
Libsvm
124 pages
Devika.v ML
No ratings yet
Devika.v ML
7 pages
Computer Vision Based Rice Leaf Disease Detection and Classification Using Multi Level Feature Extra
No ratings yet
Computer Vision Based Rice Leaf Disease Detection and Classification Using Multi Level Feature Extra
10 pages
REV-Boltasseva ACS Photonics ML For Quantum Photonics
No ratings yet
REV-Boltasseva ACS Photonics ML For Quantum Photonics
13 pages
CS 221 Paper
No ratings yet
CS 221 Paper
8 pages
Optimal Transport For Measures With Noisy Tree Metric
No ratings yet
Optimal Transport For Measures With Noisy Tree Metric
31 pages