Artificial Intelligence Fundamentals: Learning: Boosting

This document discusses boosting algorithms for machine learning. It introduces the concept of combining multiple weak classifiers to create a strong classifier by having them vote. The key ideas are: (1) training classifiers on exaggerated versions of previous errors to reduce overlap, (2) weighting votes of classifiers, (3) minimizing error at each step to determine weight updates, and (4) stopping when all samples are correctly classified or no weak classifier remains. AdaBoost is presented as optimizing this process to exponentially decrease error over time. Face detection using Haar-like features and integral images is also briefly covered.


Artificial Intelligence Fundamentals

Learning: Boosting
• Binary classification – classify the elements of a given set into two
  groups according to a given rule

• Finding the classification rule can be a difficult task

• Can a crowd be smarter than any individual participant in the crowd?
Classifiers strong/weak
• Suppose we have a set of classifiers h which give {-1, +1} as their output

• Error rate: on a scale from 0 to 1, a strong classifier has an error rate
  close to 0, while a weak classifier has an error rate only slightly below 0.5

• Can we make a strong classifier by combining several of these weak
  classifiers and letting them vote?

  H(x) = \mathrm{sign}\big( h_1(x) + h_2(x) + \dots + h_n(x) \big), where x is a sample
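A minimal sketch of such an unweighted vote in Python (my own illustration, not from the slides), assuming each weak classifier is a function that returns -1 or +1 for a sample:

```python
def vote(classifiers, x):
    """Combine weak classifiers by an unweighted majority vote:
    the output is the sign of the sum of the individual votes."""
    total = sum(h(x) for h in classifiers)
    return 1 if total >= 0 else -1
```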
The perfect classifiers
H(x) = \mathrm{sign}\big( h_1(x) + h_2(x) + h_3(x) \big)

[Diagram: three non-overlapping regions, one where h_1 is wrong, one where
 h_2 is wrong, and one where h_3 is wrong.]

• If the error regions look like this – no sample is misclassified by more
  than one of the three classifiers – the majority vote will always have 0 error
A real situation

[Diagram: the same three error regions, but now partially overlapping.]

• Is the area where at least 2 classifiers are wrong at the same time
  sufficiently smaller than the area covered by the errors of each individual
  classifier?
Idea #1
• We use the undisturbed DATA to produce h_1
• We use DATA with an exaggeration of the h_1 errors
  (a disturbed set of data) to produce h_2
• We use DATA with an exaggeration of the samples on which h_1 gives
  a different answer than h_2 to produce h_3
Idea #2
                     H(x)
      h_1             h_2             h_3
h_11 h_12 h_13   h_21 h_22 h_23   h_31 h_32 h_33

• Get out the vote – the scheme can be applied recursively: each voter may
  itself be the vote of three other classifiers
Idea #3 – Example of classifiers

• Decision tree stumps – a single test (a threshold on a single dimension)

• For each horizontal stump we could have 2 cases:
  Up +, Down –   or   Up –, Down +

• Similarly for vertical stumps, with Left/Right in place of Up/Down

• An extra test: classify everything as + or everything as –

• There could be 12 decision tree stumps: for each dimension we have
  (# of lines) * 2; with 2 dimensions and 3 lines each, 2 * 3 * 2 = 12
  (see the sketch below)
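A minimal decision-stump sketch in Python (my own illustration, not from the slides), assuming 2-dimensional samples; each stump is a single threshold test on one dimension, with the two mirrored labellings from above:

```python
def make_stump(dim, threshold, sign):
    """Decision tree stump: one threshold test on one dimension.
    sign = +1 means 'above the threshold is +1, below is -1';
    sign = -1 is the mirrored case (Up -, Down +)."""
    def stump(x):
        return sign if x[dim] > threshold else -sign
    return stump

# Example: a 'horizontal' stump at height 2.0 (Up +, Down -)
h = make_stump(dim=1, threshold=2.0, sign=+1)
print(h((0.5, 3.0)))   # +1: the sample lies above the line
print(h((0.5, 1.0)))   # -1: the sample lies below the line
```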
• Error rate (unweighted) – every one of the N samples counts equally:

  \text{Error} = \sum_{\text{WRONG CASES}} \frac{1}{N}, \quad N - \text{\# of cases}

[Diagram: samples carrying individual weights \omega_1, \omega_2, \omega_3, ...]

• Weight the samples; in the beginning every sample has the same weight:

  \omega_i^1 = \frac{1}{N}

• The weighted error at step t is the sum of the weights of the misclassified samples:

  \text{Error}^t = \sum_{i \in \text{WRONG CASES}} \omega_i^t

• Enforce a distribution:

  \sum_{\text{ALL CASES}} \omega_i^t = 1
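A one-function sketch of the weighted error in Python (mine, not from the slides), assuming the weights form a distribution that sums to 1:

```python
def weighted_error(classifier, samples, labels, weights):
    """Weighted error: the sum of the weights of the misclassified samples."""
    return sum(w for x, y, w in zip(samples, labels, weights)
               if classifier(x) != y)
```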
Idea #4
H(x) = \mathrm{sign}\big( \alpha_1 h_1(x) + \alpha_2 h_2(x) + \alpha_3 h_3(x) + \dots \big)

• Build the classifier in multiple steps

• We no longer treat each member of the crowd equally
  -> the wisdom of a weighted crowd of experts
Idea #5

LET \omega_i^1 = \frac{1}{N}, where N - # of samples

Then repeat:
  • Pick the h^t that minimizes \text{Error}^t
  • Pick \alpha^t
  • Calculate \omega^{t+1}
Idea #6
• Suppose that:

  \omega_i^{t+1} = \frac{\omega_i^t}{Z}\, e^{-\alpha^t h^t(x)\, y(x)}

  h(x) = \begin{cases} +1 & \text{for samples the classifier thinks belong to the class} \\ -1 & \text{for samples the classifier thinks do not belong to the class} \end{cases}

  y(x) \in \{+1, -1\} - the desired output

  Z - the normalizer, so that the weights remain a distribution
Minimize the error
• The error BOUND for the combined classifier of Idea #4 is minimized if:

  \alpha^t = \frac{1}{2} \ln \frac{1 - E^t}{E^t}, \quad \text{where } E^t \text{ is the ERROR at step } t

• The error will be bounded by an exponential decay function
• It is guaranteed to converge to 0

[Plot: the error over time stays below an exponentially decaying boundary.]
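A worked example with illustrative numbers of my own (not from the slides), showing how the vote weight depends on the weighted error:

```latex
% \alpha^t = \tfrac{1}{2} \ln \tfrac{1 - E^t}{E^t}
E^t = 0.30:\quad \alpha^t = \tfrac{1}{2}\ln\tfrac{0.70}{0.30} \approx 0.424
E^t = 0.49:\quad \alpha^t = \tfrac{1}{2}\ln\tfrac{0.51}{0.49} \approx 0.020
E^t = 0.50:\quad \alpha^t = 0 \quad \text{(a coin-flip classifier gets no vote)}
```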
AdaBoost
• You use uniform weights to start.
• For each step, you find the classifier that yields the lowest error rate
  for the current weights, \omega_i^t.
• You use that best classifier, h^t(x_i), to compute the error rate
  associated with the step, E^t.
• You determine the alpha for the step, \alpha^t, from the error for the
  step, E^t.
• With the alpha in hand, you compute the weights for the next step,
  \omega_i^{t+1}, from the weights for the current step, \omega_i^t, taking
  care to include a normalizing factor, Z^t, so that the new weights add up to 1.
• You stop successfully when H(x_i) correctly classifies all the samples x_i;
  you stop unsuccessfully if you reach a point where there is no weak
  classifier left, i.e. none with an error rate < 1/2.
  (A compact sketch of this loop follows.)
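A compact Python sketch of the loop above (my own illustration; the function and variable names are not from the slides, and the stumps are the single-test classifiers introduced earlier):

```python
import math

def adaboost(samples, labels, stumps, max_steps=50):
    """AdaBoost as described above: pick the best stump for the current
    weights, give it a vote weight alpha, reweight the samples, repeat."""
    N = len(samples)
    weights = [1.0 / N] * N             # uniform weights to start
    ensemble = []                       # list of (alpha, stump) pairs

    def H(x):
        s = sum(a * h(x) for a, h in ensemble)
        return 1 if s >= 0 else -1

    def weighted_error(h):
        return sum(w for x, y, w in zip(samples, labels, weights) if h(x) != y)

    for t in range(max_steps):
        best = min(stumps, key=weighted_error)
        E = max(weighted_error(best), 1e-12)   # guard against log(1/0)

        if E >= 0.5:                    # no weak classifier left: stop unsuccessfully
            break

        alpha = 0.5 * math.log((1 - E) / E)
        ensemble.append((alpha, best))

        # omega_i^{t+1} = omega_i^t * exp(-alpha * h(x_i) * y_i) / Z
        weights = [w * math.exp(-alpha * best(x) * y)
                   for x, y, w in zip(samples, labels, weights)]
        Z = sum(weights)
        weights = [w / Z for w in weights]

        if all(H(x) == y for x, y in zip(samples, labels)):
            break                       # every sample classified correctly: stop successfully

    return H
```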
Change the weights
\omega_i^{t+1} = \frac{\omega_i^t}{Z} \cdot
  \begin{cases}
    \sqrt{\dfrac{E^t}{1 - E^t}} & \text{if it's correct} \\[2ex]
    \sqrt{\dfrac{1 - E^t}{E^t}} & \text{if it's wrong}
  \end{cases}

Z = \sqrt{\frac{E^t}{1 - E^t}} \sum_{\text{CORRECT}} \omega_i^t
  + \sqrt{\frac{1 - E^t}{E^t}} \sum_{\text{WRONG}} \omega_i^t
  = \sqrt{\frac{E^t}{1 - E^t}}\,(1 - E^t) + \sqrt{\frac{1 - E^t}{E^t}}\,E^t
  = 2\sqrt{E^t (1 - E^t)}

Substituting Z back:

\omega_i^{t+1} = \frac{\omega_i^t}{2} \cdot
  \begin{cases}
    \dfrac{1}{1 - E^t} & \text{if it's correct} \\[2ex]
    \dfrac{1}{E^t} & \text{if it's wrong}
  \end{cases}

\sum_{\text{CORRECT}} \omega_i^{t+1} = \frac{1}{2}\,\frac{1}{1 - E^t} \sum_{\text{CORRECT}} \omega_i^t = \frac{1}{2}
\qquad
\sum_{\text{WRONG}} \omega_i^{t+1} = \frac{1}{2}
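A quick numerical check (illustrative numbers of my own, not from the slides) that the update rebalances the weight mass so that correct and wrong samples each carry half of it, which forces the next classifier to focus on the previous mistakes:

```python
E = 0.2                                      # weighted error of the chosen classifier
correct_mass, wrong_mass = 1 - E, E          # total weight before the update
new_correct = correct_mass / (2 * (1 - E))   # 'correct' update factor applied
new_wrong = wrong_mass / (2 * E)             # 'wrong' update factor applied
print(new_correct, new_wrong)                # 0.5 0.5
```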
Improvements

• Tests that really matter


• Immune to overfitting
Face detection
• Haar-like features
– Edge features

– Line features

– Other features
Face detection
• Integral image – each pixel (x, y) of the integral image holds the sum of
  all pixels above and to the left of (x, y) in the original image:

  ii(x, y) = \sum_{x' \le x,\; y' \le y} i(x', y')

  where ii(x, y) is the integral image and i(x, y) is the original image.

[Diagram: four adjacent rectangles A, B, C, D, with corner points 1, 2, 3, 4;
 point 4 is the bottom-right corner of D.]

  v_1 = ii(\text{location 1}) = \sum_A i
  v_2 = ii(\text{location 2}) = \sum_A i + \sum_B i
  v_3 = ii(\text{location 3}) = \sum_A i + \sum_C i
  v_4 = ii(\text{location 4}) = \sum_A i + \sum_B i + \sum_C i + \sum_D i

  rect(D) = v_1 + v_4 - v_2 - v_3
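A small NumPy sketch (mine, not from the slides) of the integral image and the four-corner rectangle sum:

```python
import numpy as np

def integral_image(img):
    """ii(x, y) = sum of all original pixels above and to the left of (x, y),
    inclusive; computed with cumulative sums along both axes."""
    return img.cumsum(axis=0).cumsum(axis=1)

def rect_sum(ii, top, left, bottom, right):
    """Sum of the original pixels in [top..bottom] x [left..right],
    using rect(D) = v1 + v4 - v2 - v3."""
    v4 = ii[bottom, right]
    v2 = ii[top - 1, right] if top > 0 else 0
    v3 = ii[bottom, left - 1] if left > 0 else 0
    v1 = ii[top - 1, left - 1] if top > 0 and left > 0 else 0
    return v4 + v1 - v2 - v3

img = np.arange(16, dtype=float).reshape(4, 4)
ii = integral_image(img)
print(rect_sum(ii, 1, 1, 2, 2))   # central 2x2 block: 5 + 6 + 9 + 10 = 30
```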
Face detection
• For a 24x24-pixel image – 162,336 features
Face detection
• Choosing the threshold for each classifier
Face detection
• The first and second features selected by
AdaBoost
Face detection
• Sub-window – 24x24 pixels (width x height)
• The sub-window starts from the top-left corner of the image and slides
  1 pixel at a time along the row; when it reaches the end of the row, it
  moves down 1 pixel and starts again from the left (see the sketch below).
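A minimal sketch of that scan in Python (my own; classify_window stands in for the boosted face/non-face classifier and is not a name from the slides):

```python
def scan(image_height, image_width, classify_window, size=24):
    """Slide a size x size sub-window over the image one pixel at a time,
    row by row, and collect the positions classified as faces."""
    detections = []
    for top in range(image_height - size + 1):
        for left in range(image_width - size + 1):
            if classify_window(top, left, size):
                detections.append((top, left))
    return detections
```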
Face detection
• Cascade of classifiers
  – # of features in the first 5 layers: 1, 10, 25, 25 and 50
  – total # of features in all layers: 6061

[Diagram: all sub-windows enter layer 1; each of the 38 layers either passes
 a sub-window on (T) to the next layer or rejects it (F); only sub-windows
 that pass every layer are reported as faces (see the sketch below).]
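A rough sketch of the cascade logic in Python (mine, not from the slides); layers is assumed to be a list of boosted classifiers, each returning True (pass) or False (reject) for a sub-window:

```python
def cascade(layers, window):
    """Run a sub-window through the cascade: any layer may reject it early,
    so most non-face windows are discarded by the cheap early layers."""
    for layer in layers:
        if not layer(window):
            return False          # F: reject the sub-window immediately
    return True                   # passed every layer: report a face
```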
Related resources
• P. Viola, M. Jones, "Robust Real-Time Face Detection",
  http://www.vision.caltech.edu/html-files/EE148-2005-Spring/pprs/viola04ijcv.pdf
