A Robust Real Time Face Detection

This document discusses a robust real-time face detection system using AdaBoost. It begins with an introduction to AdaBoost and how it can be used to combine weak classifiers into a strong classifier. It then discusses how integral images allow features to be rapidly computed for detection. The key aspects are using Haar-like features as weak classifiers, boosting a cascade of classifiers using AdaBoost, and rejecting negatives early while detecting most positives.

A Robust Real Time Face Detection
Outline
• AdaBoost – Learning Algorithm
• Face Detection in real life
• Using AdaBoost for Face Detection
• Improvements
• Demonstration
AdaBoost
• A Short Introduction to Boosting (Freund & Schapire, 1999)
• Logistic Regression, AdaBoost and Bregman Distances (Collins, Schapire & Singer, 2002)
Boosting
• The Horse-Racing Gambler Problem
– Rules of thumb for a set of races
– How should we choose the set of races in order to get the best rules of thumb?
– How should the rules be combined into a single, highly accurate prediction rule?
• Boosting!
AdaBoost - the idea
• AdaBoost agglomerates many weak classifiers into one strong classifier.
• Initialize sample weights
• For each cycle:
– Find a classifier that performs well on the weighted sample
– Increase weights of misclassified examples
• Return a weighted list of classifiers
(Illustration: a toy two-class example plotted on IQ vs. shoe size axes)
AdaBoost - algorithm
Given $(x_1, y_1), \ldots, (x_m, y_m)$ where $x_i \in X$, $y_i \in Y = \{-1, +1\}$
Initialize $D_1(i) = 1/m$
For $t = 1, \ldots, T$:
– Select the best weak classifier using distribution $D_t$
– Get weak hypothesis $h_t : X \to \{-1, +1\}$ with error $\epsilon_t = \Pr_{i \sim D_t}[h_t(x_i) \neq y_i]$
– Choose $\alpha_t = \frac{1}{2}\ln\!\left(\frac{1 - \epsilon_t}{\epsilon_t}\right)$
– Update $D_{t+1}(i) = \frac{D_t(i)}{Z_t} \times \begin{cases} e^{-\alpha_t} & \text{if } h_t(x_i) = y_i \\ e^{\alpha_t} & \text{if } h_t(x_i) \neq y_i \end{cases}$, where $Z_t$ is a normalization factor
Output the final hypothesis: $H(x) = \operatorname{sign}\left(\sum_{t=1}^{T} \alpha_t h_t(x)\right)$
AdaBoost – training error
• Freund and Schapire (1997) proved that
$\widehat{\mathrm{err}}(H) \le e^{-2\sum_{t=1}^{T}\gamma_t^2}$, where $\gamma_t = \frac{1}{2} - \epsilon_t$
• AdaBoost ADApts to the error rates of the individual weak hypotheses.
AdaBoost – generalization error
• Freund and Schapire (1997) showed that
$\mathrm{err}(H) \le \widehat{\Pr}[H(x) \neq y] + \tilde{O}\!\left(\sqrt{\frac{Td}{m}}\right)$
where:
– $\widehat{\Pr}[H(x) \neq y]$ is the empirical probability on the training sample
– $d$ is the VC dimension of the weak hypothesis space
– $T$ is the number of rounds
– $m$ is the training set size
AdaBoost – generalization error
• The analysis implies that boosting will overfit if run for too many rounds
• However, it was observed empirically that AdaBoost does not overfit, even when run for thousands of rounds
• Moreover, it was observed that the generalization error continues to go down long after the training error has reached zero
AdaBoost – generalization error
• An alternative analysis, which fits the empirical findings, was presented by Schapire et al. (1998): with probability at least $1 - \delta$,
$\mathrm{err}(H) \le \widehat{\Pr}[\mathrm{margin}(x, y) \le \theta] + \tilde{O}\!\left(\sqrt{\frac{d}{m\theta^2}}\right)$
where
$\mathrm{margin}(x, y) = \frac{y \sum_t \alpha_t h_t(x)}{\sum_t \alpha_t}$
AdaBoost – different point of view
• We try to solve the problem of approximating the y’s using a linear combination of weak hypotheses
• In other words, we are interested in the problem of finding a vector of parameters $\alpha$ such that
$f(x_i) = \sum_{j=1}^{n} \alpha_j h_j(x_i)$
is a ‘good approximation’ of $y_i$
• For classification problems we try to match the sign of $f(x_i)$ to $y_i$
AdaBoost – different point of view
• Sometimes it is advantageous to minimize some other (non-negative) loss function instead of the number of classification errors
• For AdaBoost the loss function is
$\sum_{i=1}^{m} \exp(-y_i f(x_i))$
• This point of view was used by Collins, Schapire and Singer (2002) to demonstrate that AdaBoost converges to optimality
Face Detection
(not face recognition)
Face Detection in Monkeys
There are cells that ‘detect faces’
Face Detection in Humans
There are ‘processes of face detection’
Faces Are Special
We analyze faces in a ‘different way’
Face Recognition in Humans
We analyze faces ‘in a specific location’
Robust Real-Time Face Detection
Viola and Jones, 2003
Features
• Picture analysis, Integral Image
Features
• The system classifies images based on the value of simple features:
– Two-rectangle
– Three-rectangle
– Four-rectangle
• Value = ∑ (pixels in white area) - ∑ (pixels in black area)
Contrast Features
(Figure: a source image and the result of applying contrast features)
Features
• Notice that each feature is related to a specific location in the sub-window
• Why features and not pixels?
– Encode domain knowledge
– A feature-based system operates faster
– Inspiration from human V1
Features
• Later we will see that there are other features that can be used to implement an efficient face detector
• The original system of Viola and Jones used only rectangle features
Computing Features
• Given a detection resolution of 24x24 and an image size of ~200x200, the set of rectangle features is ~160,000!
• We need to find a way to rapidly compute the features
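As a rough check on the ~160,000 figure, the following hypothetical Python enumeration counts the five upright rectangle-feature shapes (two two-rectangle, two three-rectangle, one four-rectangle) over every position and scale inside a 24x24 window; the exact total depends on which shapes are included.

```python
def count_rectangle_features(window=24):
    """Count all placements and scales of the five basic feature shapes,
    given as (cells across, cells down)."""
    shapes = [(2, 1), (1, 2), (3, 1), (1, 3), (2, 2)]
    total = 0
    for cx, cy in shapes:
        for w in range(cx, window + 1, cx):        # feature widths
            for h in range(cy, window + 1, cy):    # feature heights
                total += (window - w + 1) * (window - h + 1)
    return total

print(count_rectangle_features(24))  # 162,336 with this enumeration, i.e. ~160,000
```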
Integral Image
• Intermediate representation of the image:
$ii(x, y) = \sum_{x' \le x,\, y' \le y} i(x', y')$
• Computed in one pass over the original image using the recurrences
$s(x, y) = s(x, y - 1) + i(x, y)$
$ii(x, y) = ii(x - 1, y) + s(x, y)$
with $s(x, -1) = 0$ and $ii(-1, y) = 0$
Integral Image
(Figure: image coordinates with the origin (0,0) at the top left; s(x,y) = s(x,y-1) + i(x,y) and ii(x,y) = ii(x-1,y) + s(x,y) at the point (x,y))
• Using the integral image representation one can compute the value of any rectangular sum in constant time.
• For example, the sum inside rectangle D can be computed from four array references as:
ii(4) + ii(1) – ii(2) – ii(3)
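A minimal NumPy sketch of the integral image and of the constant-time rectangle sum; the function names, the (x, y, width, height) rectangle convention and the horizontal two-rectangle feature at the end are illustrative assumptions rather than the paper's exact interface.

```python
import numpy as np

def integral_image(img):
    """ii[y, x] = sum of img[0..y, 0..x]; equivalent to the s/ii recurrences,
    computed here as a row-wise then column-wise cumulative sum."""
    return img.cumsum(axis=0).cumsum(axis=1)

def rect_sum(ii, x, y, w, h):
    """Sum of the pixels in the rectangle with top-left corner (x, y),
    width w and height h, using at most four integral-image lookups."""
    A = ii[y - 1, x - 1] if (x > 0 and y > 0) else 0
    B = ii[y - 1, x + w - 1] if y > 0 else 0
    C = ii[y + h - 1, x - 1] if x > 0 else 0
    D = ii[y + h - 1, x + w - 1]
    return D + A - B - C

def two_rect_feature(ii, x, y, w, h):
    """Horizontal two-rectangle feature: left (white) half minus
    right (black) half of a (2w x h) region at (x, y)."""
    return rect_sum(ii, x, y, w, h) - rect_sum(ii, x + w, y, w, h)
```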
Integral Image
(Figure: a two-rectangle feature evaluated directly from the integral image; the corner values around (x,y) are combined with the weights -1, +1, +2, -2, -1, +1)
Building a Detector
• Cascading, training a cascade
Main Ideas
• The features will be used as weak classifiers
• We will concatenate several detectors serially into a cascade
• We will boost (using a version of AdaBoost) a number of features to get ‘good enough’ detectors
Weak Classifiers
Weak classifier: a feature which best separates the examples.
Given a sub-window $x$, a feature $f$, a threshold $\theta$, and a polarity $p$ indicating the direction of the inequality:
$h(x, f, p, \theta) = \begin{cases} 1 & \text{if } p f(x) < p\theta \\ 0 & \text{otherwise} \end{cases}$
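Translated directly into Python (the argument names are mine), the weak classifier is a decision stump on a single feature value:

```python
def weak_classifier(feature_value, polarity, theta):
    """h(x, f, p, theta): 1 (face) if p * f(x) < p * theta, else 0."""
    return 1 if polarity * feature_value < polarity * theta else 0
```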
Weak Classifiers
• A weak classifier is a combination of a feature and a threshold
• We have K features
• We have N thresholds, where N is the number of examples
• Thus there are KN weak classifiers
Weak Classifier Selection
• For each feature, sort the examples based on feature value
• For each element, evaluate the total sum of positive/negative example weights (T+ / T-) and the sum of positive/negative weights below the current example (S+ / S-)
• The error for a threshold which splits the range between the current and previous example in the sorted list is
$e = \min\big(S^+ + (T^- - S^-),\; S^- + (T^+ - S^+)\big)$
An example
x  | y  | f | W   | T+  | T-  | S+  | S-  | A   | B   | e
X1 | -1 | 2 | 1/5 | 3/5 | 2/5 | 0   | 0   | 2/5 | 3/5 | 2/5
X2 | -1 | 3 | 1/5 | 3/5 | 2/5 | 0   | 1/5 | 1/5 | 4/5 | 1/5
X3 | +1 | 5 | 1/5 | 3/5 | 2/5 | 0   | 2/5 | 0   | 5/5 | 0
X4 | +1 | 7 | 1/5 | 3/5 | 2/5 | 1/5 | 2/5 | 1/5 | 4/5 | 1/5
X5 | +1 | 8 | 1/5 | 3/5 | 2/5 | 2/5 | 2/5 | 2/5 | 3/5 | 2/5
where A = S+ + (T- - S-), B = S- + (T+ - S+) and e = min(A, B).
The minimum error (e = 0) is obtained by placing the threshold between X2 and X3 (see the sketch below).
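A possible NumPy sketch of this selection step, following the table's A/B bookkeeping; the polarity convention matches h(x, f, p, θ) above, and the handling of the first split point is an implementation choice of the sketch.

```python
import numpy as np

def best_threshold(feature_values, labels, weights):
    """Best (error, threshold, polarity) for one feature.
    labels are in {-1, +1}; weights sum to 1."""
    order = np.argsort(feature_values)
    f, y, w = feature_values[order], labels[order], weights[order]
    T_pos, T_neg = w[y == 1].sum(), w[y == -1].sum()   # T+, T-
    S_pos = S_neg = 0.0                                # S+, S- below the split
    best = (np.inf, None, None)
    for i in range(len(f)):
        A = S_pos + (T_neg - S_neg)   # error if 'below the split' means negative
        B = S_neg + (T_pos - S_pos)   # error if 'below the split' means positive
        # p = +1 predicts positive below the threshold (error B), p = -1 above (error A)
        err, polarity = (A, -1) if A < B else (B, +1)
        thr = f[0] - 1.0 if i == 0 else 0.5 * (f[i] + f[i - 1])
        if err < best[0]:
            best = (err, thr, polarity)
        if y[i] == 1:                 # move example i below the split
            S_pos += w[i]
        else:
            S_neg += w[i]
    return best
```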


Main Ideas
• The features will be used as weak classifiers
• We will concatenate several detectors serially into a cascade
• We will boost (using a version of AdaBoost) a number of features to get ‘good enough’ detectors
Cascading
• We start with simple classifiers which reject many of the negative sub-windows while detecting almost all positive sub-windows
• Positive results from the first classifier trigger the evaluation of a second (more complex) classifier, and so on
• A negative outcome at any point leads to the immediate rejection of the sub-window
Cascading
(Figure: schematic of the detection cascade; each stage either rejects the sub-window or passes it on to the next, more complex stage)
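The run-time behaviour of the cascade can be sketched in a few lines of Python; representing a stage as a (boosted classifier, threshold) pair is an assumption of this sketch.

```python
def evaluate_cascade(window, cascade):
    """cascade: list of (strong_classifier, threshold) pairs, where each
    strong_classifier maps a sub-window to a real-valued score."""
    for strong_classifier, threshold in cascade:
        # A negative outcome at any layer rejects the sub-window immediately
        if strong_classifier(window) < threshold:
            return False
    return True  # only sub-windows accepted by every layer are reported as faces
```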
Main Ideas
• The features will be used as weak classifiers
• We will concatenate several detectors serially into a cascade
• We will boost (using a version of AdaBoost) a number of features to get ‘good enough’ detectors
Training a cascade
• The user selects values for:
– Maximum acceptable false positive rate per layer
– Minimum acceptable detection rate per layer
– Target overall false positive rate
• The user gives a set of positive and negative examples
Training a cascade (cont.)
• While the overall false positive rate is not met (see the sketch below):
– While the false positive rate of the current layer exceeds the maximum allowed per layer:
• Train a classifier with n features using AdaBoost on the set of positive and negative examples
• Decrease the threshold of the current classifier until the detection rate of the layer is more than the minimum
• Evaluate the current cascade classifier on a validation set
– Evaluate the current cascade detector on a set of non-face images and put any false detections into the negative training set
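A rough Python sketch of this loop; train_layer and harvest_negatives are assumed helper callables standing in for the AdaBoost training of one layer (including the threshold adjustment and the validation-set measurement) and for scanning non-face images for false detections.

```python
def train_cascade(train_layer, harvest_negatives,
                  positives, negatives, non_face_images,
                  max_fpr_per_layer, min_det_per_layer, target_fpr):
    """train_layer(pos, neg, n, min_det) -> (classifier, layer_fpr);
    harvest_negatives(cascade, images) -> false detections of the cascade."""
    cascade, overall_fpr = [], 1.0
    while overall_fpr > target_fpr:
        n, layer_fpr, clf = 0, 1.0, None
        # Keep adding features until the layer's false positive rate is low enough
        while layer_fpr > max_fpr_per_layer:
            n += 1
            clf, layer_fpr = train_layer(positives, negatives, n, min_det_per_layer)
        cascade.append(clf)
        overall_fpr *= layer_fpr
        # False detections of the current cascade become the negatives for the next layer
        negatives = harvest_negatives(cascade, non_face_images)
    return cascade
```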
Results
Training Data Set
• 4916 hand-labeled faces
• Aligned to base resolution (24x24)
• Non-faces for the first layer were collected from 9500 non-face images
• Non-faces for subsequent layers were obtained by scanning the partial cascade across non-face images and collecting false positives (max 6000 for each layer)
Structure of the Detector
• 38-layer cascade
• 6060 features in total

Layer number       | 1    | 2    | 3 to 4 | 5 to 38
Number of features | 2    | 10   | 50     | -
Detection rate     | 100% | 100% | -      | -
Rejection rate     | 50%  | 80%  | -      | -
Speed of the Final Detector
• On a 700 MHz Pentium III processor, the face detector can process a 384 by 288 pixel image in about 0.067 seconds
Improvements
• Learning Object Detection from a Small Number of Examples: the Importance of Good Features (Levy & Weiss, 2004)
Improvements
• Performance depends crucially on the features that are used to represent the objects (Levy & Weiss, 2004)
• Good features imply:
– Good results from small training databases
– Better generalization abilities
– Shorter (faster) classifiers
Edge Orientation Histogram
• Invariant to global illumination changes
• Captures geometric properties of faces
• Domain knowledge represented:
– The inner part of the face includes more horizontal edges than vertical ones
– The ratio between vertical and horizontal edges is bounded
– The area of the eyes includes mainly horizontal edges
– The chin has more or less the same number of oblique edges on both sides
Edge Orientation Histogram
• The EOH can be calculated using a variant of the integral image (see the sketch below):
– Find the gradients at the point (x, y) using Sobel masks
– Calculate the orientation of the edge at (x, y)
– Divide the edges into K orientation bins
– Store the result in K matrices
– Apply the integral image idea to each of the K matrices
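A possible NumPy/SciPy sketch of this computation; the number of bins, the use of the gradient magnitude as the accumulated quantity, and the Sobel implementation are assumptions rather than details taken from Levy & Weiss.

```python
import numpy as np
from scipy import ndimage

def eoh_integral_images(gray, k_bins=8):
    """Returns an array of shape (k_bins, H, W): channel k is the integral
    image of the edge magnitude whose orientation falls into bin k."""
    gx = ndimage.sobel(gray.astype(float), axis=1)   # gradients via Sobel masks
    gy = ndimage.sobel(gray.astype(float), axis=0)
    magnitude = np.hypot(gx, gy)
    orientation = np.arctan2(gy, gx)                 # in (-pi, pi]
    # Divide the edges into K orientation bins
    bins = ((orientation + np.pi) / (2 * np.pi) * k_bins).astype(int) % k_bins
    channels = np.zeros((k_bins,) + gray.shape)
    for k in range(k_bins):
        channel = np.where(bins == k, magnitude, 0.0)
        # Same trick as the plain integral image, one per orientation bin
        channels[k] = channel.cumsum(axis=0).cumsum(axis=1)
    return channels
```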
EOH Features
• The ratio between two orientations
• The dominance of a given orientation
• Symmetry features
Results
• Already with only 250 positive examples we see a detection rate above 90%
• Faster classifier
• Better performance on profile faces
Demo
Implementing the Viola & Jones system
Frank Fritze, 2004
