L8 Bootstrap Methods
L8 Bootstrap Methods
5 Confidenc
Bootstrap Methods
Karim Seghouane
School of Mathematics & Statistics
The University of Melbourne
Outline
§6.1 Introduction
Introduction
Introduction
Introduction
Introduction
Empirical distribution
Resample
I Each xi∗ has probability 1/n of being equal to any given one
xj0 s
1
P (xi∗ = xj |X ) =
, 1 ≤ i, j ≤ n
n
I The xi∗ ’s are i.i.d. conditional on X .
I X ∗ is likely to contain repeats, all of which must be listed in
X ∗.
I Example: X ∗ = {1.5, 1.7, 1.7, 1.8} is different from
{1.5, 1.7, 1.8} and X ∗ is the same as {1.5, 1.7, 1.8, 1.7},
{1.7, 1.5, 1.8, 1.7}.
Estimation as functional
Bootstrap principle
Bootstrap principle
Bootstrap principle
Bootstrap principle
Example:
Bootstrap principle
I The equation
n0 − tn1 = 0
I where ni is the number of cars on level “i”
Bootstrap principle
n1 − tn2 = 0
I we obtain the solution t̂0 of this equation and thereby
n12
n̂0 = t̂0 n1 =
n2
I is the estimate of n0
Bootstrap principle
Bootstrap principle
Bootstrap principle
Parametric vs Nonparametric
Nonparametric
Parametric
Example
Bias correction
I Here we have
Bias correction
Bias correction
Bias correction
B B
1 X −1
X
ûB = θ(F2b ) = B θ̂b∗
B
b=1 b=1
I converge to (as B → ∞)
n o
û = E {θ(F2 )|F1 } = E θ̂∗ |X
Example
I Let
Z
µ= xdF0 (x) and assume θ0 = θ(F0 ) = µ3
θ̂ = θ(F1 ) = x̄ 3
Example
I In nonparametric approach
!3
1Xn
E {θ(F1 )|F0 } = EF0 xi
n
i=1
!3
n
1 X
=E µ+ (xi − µ)
n
i=1
Example
Example
Example
Example
Remarks
Nonparametric bootstrap
where X∗
= {x1∗ , ..., xn∗ }
is obtained by sampling randomly
with replacement from the original X = {x1 , ..., xn }.
I Instead, B i.i.d. samples, each of size n, are drawn from
F̂ = F1 , producing B nonparametric bootstrap samples.
Denote them as Xi∗ = {xi1 ∗ , · · · , x ∗ } iid
in = F1 for i = 1, · · · , B.
Nonparametric bootstrap
Parametric bootstrap
Parametric bootstrap
I The empirical estimation {ft Xi∗ , F (x|θ̂) , i = 1, · · · , B} is
then used to approximate the ideal bootstrap equation
E {ft (F1 , F2 )|F1 } and further approximates the population
equation of E {ft (F0 , F1 )|F0 }
I If the parametric model is not good, the parametric
bootstrap can give misleading inference.
Confidence interval
Confidence interval
Confidence interval
I and
I is a solution
h i
I θ̂ − t̂0 , θ̂ + t̂0 is a bootstrap confidence interval for
θ0 = θ(F0 ), called a two sided symmetric percentile interval
Confidence interval
Confidence interval
Confidence interval
Normal approximation
h i
θ̂(∗α [B+1]) , θ̂((1−
∗
α
)[B+1])
2 2
where both y and x are drawn randomly from the population (or y
only when considering fixed x).
I
¯ ] + Errin − E [err
Errin = E [err ¯]
| {z }
op
I
N
2 X
Errin − E [err
¯ ]= cov(ŷi , yi )
N
i=1
I
ˆ = err
Err ¯ + op
ˆ
Bootstrap Methods 63/69
§6.1 Introduction §6.2 Bootstrap principle §6.3 Parametric vs Nonparametric §6.4 Bias correction §6.5 Confidenc
B B N
1 X 1 XX 2
errB = errb = yi − fˆ∗b (xi )
B NB
b=1 b=1 i=1
N
1 X ∗b ˆ∗b 2
errb∗ = yi − f (xi )
N
i=1
I
B N
1 X X ∗b ˆ∗b 2
errB∗ = yi − f (xi )
NB
b=1 i=1
I
B
1 X
ˆ b = errb − errb∗ and op
op ˆB = opb
B
b=1
I
ˆ = err
Err ¯ + op
ˆB
However errb is underestimating Err because these are common
observations.
I
N
1 X
errOOB = errOOB i
N
i=1
I h i
(0.632)
op
ˆB = 0.632 errOOB − err
I
ˆ (0.632) = 0.368err + op
Err ˆB
(0.632)