
Evaluating Hypotheses

• Sample error, true error
• Confidence intervals for observed hypothesis error
• Estimators
• Binomial distribution, Normal distribution, Central Limit Theorem
• Paired t-tests
• Comparing learning methods

Problems Estimating Error

1. Bias: if S is the training set, errorS(h) is optimistically biased

   bias ≡ E[errorS(h)] − errorD(h)

   For an unbiased estimate, h and S must be chosen independently.

2. Variance: even with an unbiased S, errorS(h) may still vary from errorD(h).

CS 5751 Machine Learning, Chapter 5: Evaluating Hypotheses

Two Definitions of Error

The true error of hypothesis h with respect to target function f and
distribution D is the probability that h will misclassify an instance
drawn at random according to D:

   errorD(h) ≡ Pr_{x∈D} [ f(x) ≠ h(x) ]

The sample error of h with respect to target function f and data sample S
is the proportion of examples h misclassifies:

   errorS(h) ≡ (1/n) Σ_{x∈S} δ( f(x) ≠ h(x) )

where δ( f(x) ≠ h(x) ) is 1 if f(x) ≠ h(x), and 0 otherwise.

How well does errorS(h) estimate errorD(h)?

Example

Hypothesis h misclassifies 12 of 40 examples in S:

   errorS(h) = 12/40 = 0.30

What is errorD(h)?
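The sample-error definition above can be sketched directly in code. Everything here (the toy S, f, and h) is hypothetical, constructed only so that h disagrees with the target on exactly 12 of 40 instances, as in the slide's example:

```python
def sample_error(h, f, S):
    """errorS(h) = (1/n) * sum over x in S of delta(f(x) != h(x))."""
    return sum(1 for x in S if f(x) != h(x)) / len(S)

# Toy setup mirroring the slide's example (hypothetical data)
S = list(range(40))
f = lambda x: 0                     # target function
h = lambda x: 1 if x < 12 else 0    # hypothesis, wrong on 12 examples
print(sample_error(h, f, S))        # 0.3
```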

Estimators

Experiment:
1. Choose sample S of size n according to distribution D
2. Measure errorS(h)

errorS(h) is a random variable (i.e., the result of an experiment).

errorS(h) is an unbiased estimator for errorD(h).

Given an observed errorS(h), what can we conclude about errorD(h)?

Confidence Intervals

If
• S contains n examples, drawn independently of h and of each other
• n ≥ 30
Then
• With approximately N% probability, errorD(h) lies in the interval

   errorS(h) ± zN √( errorS(h)(1 − errorS(h)) / n )

where
   N% : 50%  68%  80%  90%  95%  98%  99%
   zN : 0.67 1.00 1.28 1.64 1.96 2.33 2.58
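As a minimal sketch, the confidence-interval formula can be packaged as a function. The z-table is the standard two-sided Normal table; the function name and signature are my own choices, not from the slides:

```python
import math

# z-values for two-sided N% confidence intervals
Z_N = {50: 0.67, 68: 1.00, 80: 1.28, 90: 1.64, 95: 1.96, 98: 2.33, 99: 2.58}

def error_confidence_interval(error_s, n, confidence=95):
    """N% confidence interval for errorD(h) around an observed errorS(h).
    Valid for n >= 30, with S drawn independently of h."""
    z = Z_N[confidence]
    margin = z * math.sqrt(error_s * (1 - error_s) / n)
    return (error_s - margin, error_s + margin)

# Slide example: errorS(h) = 12/40 = 0.30 with n = 40
lo, hi = error_confidence_interval(0.30, 40)
print(round(lo, 3), round(hi, 3))  # approx 0.158 and 0.442
```

So after observing 12 errors on 40 test examples, we can be about 95% confident that the true error lies between roughly 0.16 and 0.44.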

Confidence Intervals

If
• S contains n examples, drawn independently of h and of each other
• n ≥ 30
Then
• With approximately 95% probability, errorD(h) lies in the interval

   errorS(h) ± 1.96 √( errorS(h)(1 − errorS(h)) / n )

errorS(h) is a Random Variable

• Rerun the experiment with different randomly drawn S (of size n)
• Probability of observing r misclassified examples:

   P(r) = [ n! / ( r!(n − r)! ) ] errorD(h)^r (1 − errorD(h))^(n−r)

[Figure: Binomial distribution for n = 40, p = 0.3]
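The formula for P(r) can be evaluated directly; this is a small sketch using the standard-library `math.comb`, with the n = 40, p = 0.3 values from the slide's figure:

```python
from math import comb

def binomial_p(r, n, p):
    """P(r) = n!/(r!(n-r)!) * p**r * (1-p)**(n-r):
    probability of observing exactly r misclassified examples."""
    return comb(n, r) * p**r * (1 - p)**(n - r)

# Peak of the slide's plot: n = 40, errorD(h) = 0.3, r = 12
print(round(binomial_p(12, 40, 0.3), 3))  # about 0.137
```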

Binomial Probability Distribution

   P(r) = [ n! / ( r!(n − r)! ) ] p^r (1 − p)^(n−r)

Probability P(r) of r heads in n coin flips, if p = Pr(heads)

• Expected, or mean, value of X: E[X] ≡ Σ_{i=0}^{n} i P(i) = np
• Variance of X: Var(X) ≡ E[(X − E[X])²] = np(1 − p)
• Standard deviation of X: σX ≡ √( E[(X − E[X])²] ) = √( np(1 − p) )

[Figure: Binomial distribution for n = 40, p = 0.3]

Normal Probability Distribution

Normal distribution with mean µ, standard deviation σ:

   p(x) = (1 / √(2πσ²)) e^( −(1/2) ((x − µ)/σ)² )

The probability that X will fall into the interval (a, b) is given by

   ∫_a^b p(x) dx

• Expected, or mean, value of X: E[X] = µ
• Variance of X: Var(X) = σ²
• Standard deviation of X: σX = σ

[Figure: Standard Normal distribution (mean 0, standard deviation 1)]
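The binomial mean and variance formulas can be checked empirically; this is a quick Monte Carlo sketch (the trial count and seed are arbitrary choices of mine) using the n = 40, p = 0.3 setting from the figure:

```python
import random
import statistics

# Monte Carlo check of E[X] = n*p and Var(X) = n*p*(1-p),
# where X = number of heads in n flips with Pr(heads) = p
random.seed(0)
n, p, trials = 40, 0.3, 20000
counts = [sum(random.random() < p for _ in range(n)) for _ in range(trials)]
print(statistics.mean(counts))       # close to n*p = 12.0
print(statistics.pvariance(counts))  # close to n*p*(1-p) = 8.4
```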

Normal Distribution Approximates Binomial

errorS(h) follows a Binomial distribution, with
• mean: µ_errorS(h) = errorD(h)
• standard deviation:

   σ_errorS(h) = √( errorD(h)(1 − errorD(h)) / n )

Approximate this by a Normal distribution with
• mean: µ_errorS(h) = errorD(h)
• standard deviation:

   σ_errorS(h) ≈ √( errorS(h)(1 − errorS(h)) / n )

Normal Probability Distribution

80% of the area (probability) lies in µ ± 1.28σ
N% of the area (probability) lies in µ ± zN σ

   N% : 50%  68%  80%  90%  95%  98%  99%
   zN : 0.67 1.00 1.28 1.64 1.96 2.33 2.58

[Figure: Standard Normal distribution with the central 80% region shaded]
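The z-table entries can be verified from the Normal CDF, which the standard library exposes via the error function: the mass within µ ± zσ equals erf(z/√2). A small sketch:

```python
from math import erf, sqrt

def central_coverage(z):
    """Probability mass of a Normal distribution within mu +/- z*sigma.
    Phi(z) - Phi(-z) = erf(z / sqrt(2))."""
    return erf(z / sqrt(2))

# Check two rows of the slide's z-table
print(round(central_coverage(1.28), 2))  # 0.8
print(round(central_coverage(1.96), 2))  # 0.95
```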

Confidence Intervals, More Correctly

If
• S contains n examples, drawn independently of h and of each other
• n ≥ 30
Then
• With approximately 95% probability, errorS(h) lies in the interval

   errorD(h) ± 1.96 √( errorD(h)(1 − errorD(h)) / n )

• equivalently, errorD(h) lies in the interval

   errorS(h) ± 1.96 √( errorD(h)(1 − errorD(h)) / n )

• which is approximately

   errorS(h) ± 1.96 √( errorS(h)(1 − errorS(h)) / n )

Calculating Confidence Intervals

1. Pick the parameter p to estimate
   • errorD(h)
2. Choose an estimator
   • errorS(h)
3. Determine the probability distribution that governs the estimator
   • errorS(h) is governed by a Binomial distribution, approximated by a
     Normal distribution when n ≥ 30
4. Find the interval (L, U) such that N% of the probability mass falls in
   the interval
   • Use the table of zN values

Central Limit Theorem

Consider a set of independent, identically distributed random variables
Y1 … Yn, all governed by an arbitrary probability distribution with mean µ
and finite variance σ². Define the sample mean

   Ȳ ≡ (1/n) Σ_{i=1}^{n} Yi

Central Limit Theorem: as n → ∞, the distribution governing Ȳ approaches
a Normal distribution, with mean µ and variance σ²/n.

Difference Between Hypotheses

Test h1 on sample S1, test h2 on sample S2.

1. Pick the parameter to estimate

   d ≡ errorD(h1) − errorD(h2)

2. Choose an estimator

   d̂ ≡ errorS1(h1) − errorS2(h2)

3. Determine the probability distribution that governs the estimator

   σ_d̂ ≈ √( errorS1(h1)(1 − errorS1(h1))/n1 + errorS2(h2)(1 − errorS2(h2))/n2 )

4. Find the interval (L, U) such that N% of the probability mass falls in
   the interval:

   d̂ ± zN √( errorS1(h1)(1 − errorS1(h1))/n1 + errorS2(h2)(1 − errorS2(h2))/n2 )
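The four steps above reduce to one formula; here is a minimal sketch, with the example error rates and sample sizes being hypothetical numbers of my own, not from the slides:

```python
import math

def difference_interval(e1, n1, e2, n2, z=1.96):
    """N% confidence interval for d = errorD(h1) - errorD(h2),
    estimated by d_hat = errorS1(h1) - errorS2(h2)."""
    d_hat = e1 - e2
    sigma = math.sqrt(e1 * (1 - e1) / n1 + e2 * (1 - e2) / n2)
    return (d_hat - z * sigma, d_hat + z * sigma)

# Hypothetical example: h1 errs 0.30 on 100 examples, h2 errs 0.20 on 100
lo, hi = difference_interval(0.30, 100, 0.20, 100)
print(round(lo, 3), round(hi, 3))  # approx -0.019 and 0.219
```

Note that this interval contains 0, so at the 95% level these (hypothetical) samples do not establish that h1 and h2 truly differ.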

Paired t Test to Compare hA, hB

1. Partition the data into k disjoint test sets T1, T2, ..., Tk of equal
   size, where this size is at least 30.
2. For i from 1 to k, do

   δi ← errorTi(hA) − errorTi(hB)

3. Return the value δ̄, where

   δ̄ ≡ (1/k) Σ_{i=1}^{k} δi

N% confidence interval estimate for d:

   δ̄ ± t_{N,k−1} s_δ̄

where

   s_δ̄ ≡ √( (1/(k(k−1))) Σ_{i=1}^{k} (δi − δ̄)² )

Note: the δi are approximately Normally distributed.

Comparing Learning Algorithms LA and LB

1. Partition data D0 into k disjoint test sets T1, T2, ..., Tk of equal
   size, where this size is at least 30.
2. For i from 1 to k, do
   use Ti for the test set, and the remaining data for training set Si:
   • Si ← {D0 − Ti}
   • hA ← LA(Si)
   • hB ← LB(Si)
   • δi ← errorTi(hA) − errorTi(hB)
3. Return the value δ̄, where

   δ̄ ≡ (1/k) Σ_{i=1}^{k} δi
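The paired t interval above can be sketched as follows. The t value must be looked up separately for the desired confidence level and k − 1 degrees of freedom (2.262 is the standard two-sided 95% value for 9 degrees of freedom); the per-fold differences below are hypothetical:

```python
import math
import statistics

def paired_t_interval(deltas, t_value):
    """Confidence interval delta_bar +/- t_{N,k-1} * s_delta_bar
    for the paired t test over k per-fold error differences."""
    k = len(deltas)
    delta_bar = statistics.mean(deltas)
    s = math.sqrt(sum((d - delta_bar) ** 2 for d in deltas) / (k * (k - 1)))
    return (delta_bar - t_value * s, delta_bar + t_value * s)

# Hypothetical per-fold differences errorTi(hA) - errorTi(hB), k = 10
deltas = [0.02, 0.01, 0.03, 0.00, 0.02, 0.01, 0.02, 0.03, 0.01, 0.02]
lo, hi = paired_t_interval(deltas, t_value=2.262)
print(round(lo, 4), round(hi, 4))  # approx 0.0102 and 0.0238
```

Here the interval excludes 0, so in this made-up example hA's error is significantly higher than hB's at the 95% level.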

Comparing Learning Algorithms LA and LB

What we would like to estimate:

   E_{S⊂D} [ errorD(LA(S)) − errorD(LB(S)) ]

where L(S) is the hypothesis output by learner L using training set S;
i.e., the expected difference in true error between the hypotheses output
by learners LA and LB, when trained using randomly selected training sets
S drawn according to distribution D.

But, given limited data D0, what is a good estimator? We could partition
D0 into training set S0 and test set T0, and measure

   errorT0(LA(S0)) − errorT0(LB(S0))

Even better, repeat this many times and average the results, as in the
k-fold procedure above.

Notice we would like to apply the paired t test to δ̄ to obtain a
confidence interval. But this is not really correct, because the training
sets in this algorithm are not independent (they overlap!).

It is more correct to view the algorithm as producing an estimate of

   E_{S⊂D0} [ errorD(LA(S)) − errorD(LB(S)) ]

instead of

   E_{S⊂D} [ errorD(LA(S)) − errorD(LB(S)) ]

but even this approximation is better than no comparison.
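The k-fold comparison procedure can be sketched end to end. The two learners, the labeled data, and the error function below are all hypothetical stand-ins of my own, chosen only so the procedure has something deterministic to run on:

```python
import statistics

def compare_learners(learner_a, learner_b, data, error_fn, k=10):
    """k-fold comparison: for each fold i, train both learners on
    D0 - Ti, test on Ti, and average the error differences."""
    folds = [data[i::k] for i in range(k)]
    deltas = []
    for i in range(k):
        test = folds[i]
        train = [x for j in range(k) if j != i for x in folds[j]]
        h_a, h_b = learner_a(train), learner_b(train)
        deltas.append(error_fn(h_a, test) - error_fn(h_b, test))
    return statistics.mean(deltas)

# Hypothetical data: labels are 1 for the first 210 of 300 instances,
# so every fold ends up 70% ones / 30% zeros
data = [(x, 1 if x < 210 else 0) for x in range(300)]

def majority_learner(train):
    label = 1 if sum(y for _, y in train) >= len(train) / 2 else 0
    return lambda x: label           # predicts the majority class

def zero_learner(train):
    return lambda x: 0               # always predicts 0

def error_fn(h, test):
    return sum(1 for x, y in test if h(x) != y) / len(test)

delta_bar = compare_learners(majority_learner, zero_learner, data, error_fn)
print(round(delta_bar, 2))  # -0.4: the majority learner errs less per fold
```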
