
Machine Learning

Chapter 5. Evaluating Hypotheses

Tom M. Mitchell
Evaluating Hypotheses
• Sample error, true error
• Confidence intervals for observed hypothesis error
• Estimators
• Binomial distribution, Normal distribution, Central Limit Theorem
• Paired t tests
• Comparing learning methods

2
Two Definitions of Error
• The true error of hypothesis h with respect to target function f and distribution D is the probability that h will misclassify an instance drawn at random according to D:

  errorD(h) ≡ Pr_{x ∈ D}[f(x) ≠ h(x)]

• The sample error of h with respect to target function f and data sample S is the proportion of examples h misclassifies:

  errorS(h) ≡ (1/n) Σ_{x ∈ S} δ(f(x) ≠ h(x))

  where δ(f(x) ≠ h(x)) is 1 if f(x) ≠ h(x), and 0 otherwise.

• How well does errorS(h) estimate errorD(h)?
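A minimal sketch of the sample-error computation in Python (the names h, f, and S are illustrative; the hypothesis h and target function f are assumed to be callables, and S a list of instances):

```python
def sample_error(h, f, S):
    """Fraction of instances in sample S that hypothesis h misclassifies."""
    return sum(1 for x in S if h(x) != f(x)) / len(S)
```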
3
Problems Estimating Error
1. Bias: If S is the training set, errorS(h) is optimistically biased:

   bias ≡ E[errorS(h)] − errorD(h)

   For an unbiased estimate, h and S must be chosen independently.

2. Variance: Even with an unbiased S, errorS(h) may still vary from errorD(h).

4
Example
• Hypothesis h misclassifies 12 of the 40 examples in S:

  errorS(h) = 12/40 = .30

• What is errorD(h)?

5
Estimators
• Experiment:
  1. choose sample S of size n according to distribution D
  2. measure errorS(h)
• errorS(h) is a random variable (i.e., the result of an experiment)
• errorS(h) is an unbiased estimator for errorD(h)
• Given observed errorS(h), what can we conclude about errorD(h)?
6
Confidence Intervals
• If
  – S contains n examples, drawn independently of h and of each other
  – n ≥ 30
• Then, with approximately N% probability, errorD(h) lies in the interval

  errorS(h) ± zN √(errorS(h)(1 − errorS(h)) / n)

  where

  N%   50%    68%    80%    90%    95%    98%    99%
  zN   0.67   1.00   1.28   1.64   1.96   2.33   2.58
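A small sketch of this interval in code, applied to the earlier example (n = 40, errorS(h) = .30); zN = 1.96 gives the 95% interval:

```python
import math

def error_confidence_interval(error_s, n, z_n=1.96):
    """N% confidence interval for errorD(h) given observed errorS(h)."""
    half_width = z_n * math.sqrt(error_s * (1 - error_s) / n)
    return (error_s - half_width, error_s + half_width)

# 95% interval for the example: 12 of 40 examples misclassified
print(error_confidence_interval(0.30, 40))  # approx (0.158, 0.442)
```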
7
errorS(h) is a Random Variable
• Rerun the experiment with a different randomly drawn S (of size n)
• Probability of observing r misclassified examples:

  P(r) = (n! / (r!(n − r)!)) errorD(h)^r (1 − errorD(h))^(n−r)
8
Binomial Probability Distribution
• Probability P(r) of r heads in n coin flips, if p = Pr(heads):

  P(r) = (n! / (r!(n − r)!)) p^r (1 − p)^(n−r)

• Expected, or mean value of X, E[X], is E[X] = np
• Variance of X is Var(X) = np(1 − p)
• Standard deviation of X, σX, is σX = √(np(1 − p))

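A sketch of this distribution in code, using only the standard library (math.comb computes the binomial coefficient n!/(r!(n − r)!)):

```python
import math

def binomial_pmf(r, n, p):
    """Probability of exactly r successes in n independent trials."""
    return math.comb(n, r) * p**r * (1 - p)**(n - r)

# e.g., probability of observing exactly 12 misclassifications in 40 draws
# when the true error rate is 0.30
print(binomial_pmf(12, 40, 0.30))  # approx 0.137
```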
9
Normal Distribution Approximates Binomial
• errorS(h) follows a Binomial distribution, with
  – mean μ_errorS(h) = errorD(h)
  – standard deviation σ_errorS(h) = √(errorD(h)(1 − errorD(h)) / n)
• Approximate this by a Normal distribution with
  – mean μ_errorS(h) = errorD(h)
  – standard deviation σ_errorS(h) ≈ √(errorS(h)(1 − errorS(h)) / n)

10
Normal Probability Distribution (1/2)

  p(x) = (1 / √(2πσ²)) e^(−(x − μ)² / (2σ²))

• The probability that X will fall into the interval (a, b) is given by ∫ab p(x) dx
• Expected, or mean value of X, E[X], is E[X] = μ
• Variance of X is Var(X) = σ²
• Standard deviation of X, σX, is σX = σ

11
Normal Probability Distribution (2/2)
• 80% of area (probability) lies in μ ± 1.28σ
• N% of area (probability) lies in μ ± zN σ

  N%   50%    68%    80%    90%    95%    98%    99%
  zN   0.67   1.00   1.28   1.64   1.96   2.33   2.58
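The zN entries are two-sided quantiles of the standard Normal; a quick sketch reproducing them (assumes SciPy is installed):

```python
from scipy.stats import norm

# Two-sided z-value: N% of the probability mass lies within mu +/- z*sigma,
# so the upper tail above mu + z*sigma holds (100 - N)/2 percent.
for n_pct in (50, 68, 80, 90, 95, 98, 99):
    z = norm.ppf(0.5 + n_pct / 200)
    print(f"{n_pct}%: z = {z:.2f}")
# Note: the 68% row prints 0.99; the table's 1.00 corresponds to the
# exact 68.27% of mass that lies within one standard deviation.
```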
12
Confidence Intervals, More Correctly
• If
  – S contains n examples, drawn independently of h and of each other
  – n ≥ 30
• Then, with approximately 95% probability, errorS(h) lies in the interval

  errorD(h) ± 1.96 √(errorD(h)(1 − errorD(h)) / n)

  equivalently, errorD(h) lies in the interval

  errorS(h) ± 1.96 √(errorD(h)(1 − errorD(h)) / n)

  which is approximately

  errorS(h) ± 1.96 √(errorS(h)(1 − errorS(h)) / n)
13
Central Limit Theorem
• Consider a set of independent, identically distributed random variables Y1 . . . Yn, all governed by an arbitrary probability distribution with mean μ and finite variance σ². Define the sample mean

  Ȳ ≡ (1/n) Σ_{i=1}^{n} Yi

• Central Limit Theorem. As n → ∞, the distribution governing Ȳ approaches a Normal distribution, with mean μ and variance σ²/n.
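A small simulation sketch of the theorem: sample means of a decidedly non-Normal (uniform) distribution cluster around μ, with variance shrinking as σ²/n:

```python
import random
import statistics

n, trials = 50, 10_000
# Uniform(0, 1) has mean 0.5 and variance 1/12
sample_means = [statistics.mean(random.random() for _ in range(n))
                for _ in range(trials)]

print(statistics.mean(sample_means))      # approx 0.5 (= mu)
print(statistics.variance(sample_means))  # approx (1/12)/50 ~ 0.00167
```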

14
Calculating Confidence Intervals
1. Pick parameter p to estimate
   – errorD(h)
2. Choose an estimator
   – errorS(h)
3. Determine probability distribution that governs estimator
   – errorS(h) governed by Binomial distribution, approximated by Normal when n ≥ 30
4. Find interval (L, U) such that N% of probability mass falls in the interval
   – Use table of zN values

15
Difference Between Hypotheses
Test h1 on sample S1, test h2 on S2
1. Pick parameter to estimate

   d ≡ errorD(h1) − errorD(h2)

2. Choose an estimator

   d̂ ≡ errorS1(h1) − errorS2(h2)

3. Determine probability distribution that governs estimator

   σ_d̂ ≈ √(errorS1(h1)(1 − errorS1(h1))/n1 + errorS2(h2)(1 − errorS2(h2))/n2)

4. Find interval (L, U) such that N% of probability mass falls in the interval:

   d̂ ± zN σ_d̂
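A sketch of this interval in code, where n1 and n2 are the sizes of S1 and S2 (variable names and the example figures are illustrative):

```python
import math

def error_difference_interval(e1, n1, e2, n2, z_n=1.96):
    """N% confidence interval for d = errorD(h1) - errorD(h2)."""
    d_hat = e1 - e2
    sigma = math.sqrt(e1 * (1 - e1) / n1 + e2 * (1 - e2) / n2)
    return (d_hat - z_n * sigma, d_hat + z_n * sigma)

# e.g., h1 errs on 30% of 100 test examples, h2 on 20% of another 100
print(error_difference_interval(0.30, 100, 0.20, 100))  # approx (-0.02, 0.22)
```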

16
Paired t test to compare hA, hB
1. Partition data into k disjoint test sets T1, T2, . . ., Tk of equal size, where this size is at least 30.
2. For i from 1 to k, do
   δi ← errorTi(hA) − errorTi(hB)
3. Return the value δ̄, where

   δ̄ ≡ (1/k) Σ_{i=1}^{k} δi

N% confidence interval estimate for d:

   δ̄ ± t_{N,k−1} s_δ̄,   where   s_δ̄ ≡ √( (1/(k(k−1))) Σ_{i=1}^{k} (δi − δ̄)² )

Note δi approximately Normally distributed
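A sketch of the interval computation, given the per-fold differences δi as a list (assumes SciPy for the t quantile; the example δi values are made up):

```python
import math
import statistics
from scipy.stats import t

def paired_t_interval(deltas, confidence=0.95):
    """Confidence interval for the mean error difference d."""
    k = len(deltas)
    delta_bar = statistics.mean(deltas)
    # s_delta_bar: estimated standard deviation of the sample mean
    s = math.sqrt(sum((d - delta_bar) ** 2 for d in deltas) / (k * (k - 1)))
    t_val = t.ppf(0.5 + confidence / 2, df=k - 1)  # two-sided quantile
    return (delta_bar - t_val * s, delta_bar + t_val * s)

print(paired_t_interval([0.03, 0.05, -0.01, 0.02, 0.04]))
```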


17
Comparing learning algorithms LA and LB (1/3)
What we'd like to estimate:

  E_{S⊂D}[errorD(LA(S)) − errorD(LB(S))]

where L(S) is the hypothesis output by learner L using training set S,
i.e., the expected difference in true error between hypotheses output by
learners LA and LB, when trained using randomly selected training sets
S drawn according to distribution D.

But, given limited data D0, what is a good estimator?
– could partition D0 into training set S0 and test set T0, and measure

  errorT0(LA(S0)) − errorT0(LB(S0))

– even better, repeat this many times and average the results (next slide)
18
Comparing learning algorithms LA and LB (2/3)
1. Partition data D0 into k disjoint test sets T1, T2, . . ., Tk of equal size, where this size is at least 30.
2. For i from 1 to k, do
   use Ti for the test set, and the remaining data for training set Si
   – Si ← {D0 − Ti}
   – hA ← LA(Si)
   – hB ← LB(Si)
   – δi ← errorTi(hA) − errorTi(hB)
3. Return the value δ̄, where

   δ̄ ≡ (1/k) Σ_{i=1}^{k} δi

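A minimal sketch of this procedure, assuming learn_a and learn_b each take a training list and return a hypothesis callable, f is the target function, and sample_error is the function sketched earlier (all names illustrative):

```python
import statistics

def compare_learners(learn_a, learn_b, f, data, k):
    """Mean per-fold difference in test error between two learners."""
    fold = len(data) // k
    deltas = []
    for i in range(k):
        test = data[i * fold:(i + 1) * fold]             # Ti
        train = data[:i * fold] + data[(i + 1) * fold:]  # Si = D0 - Ti
        h_a, h_b = learn_a(train), learn_b(train)
        deltas.append(sample_error(h_a, f, test) - sample_error(h_b, f, test))
    return statistics.mean(deltas)  # delta-bar
```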
19
Comparing learning algorithms LA and LB (3/3)
Notice we'd like to use the paired t test on δ̄ to obtain a confidence interval
but this is not really correct, because the training sets in this algorithm are not independent (they overlap!)
more correct to view the algorithm as producing an estimate of

  E_{S⊂D0}[errorD(LA(S)) − errorD(LB(S))]

instead of

  E_{S⊂D}[errorD(LA(S)) − errorD(LB(S))]

but even this approximation is better than no comparison
20
