Lecture 19–20
The bootstrap, invented by Efron in 1979, is a method for estimating standard errors and computing confidence intervals. Let $\theta = T(F)$ be an interesting parameter of a distribution function $F$, where $T(\cdot)$ is a functional of $F$. One simple example is $T(F) = \int x \, dF(x)$, which is the mean parameter of the distribution $F$. Another example is $T(F) = \int \big(x - \int y \, dF(y)\big)^2 \, dF(x)$, which is the variance parameter.
Let $X_1, \ldots, X_n$ be an i.i.d. sample from $F$, and we use $\hat F_n$ to denote the empirical distribution, which puts mass $1/n$ on each $X_i$. A natural estimator of the mean $\theta = \int x \, dF(x)$ is the sample mean
$$\hat\theta = \bar X_n = \frac{1}{n} \sum_{i=1}^n X_i.$$
Now suppose we want to know the variance of our estimator $\hat\theta$. This variance usually depends on the unknown distribution $F$. For example, when $\hat\theta$ is the sample mean, we have
$$V_F(\hat\theta) = \frac{\sigma^2}{n}, \qquad \text{where } \sigma^2 = \int \Big(x - \int y \, dF(y)\Big)^2 dF(x).$$
The basic bootstrap has two steps:
1. Estimate $V_F(\hat\theta)$ with $V_{\hat F_n}(\hat\theta)$.
2. Approximate $V_{\hat F_n}(\hat\theta)$ using simulation.

Note that for a simple estimator we may be able to calculate $V_{\hat F_n}(\hat\theta)$ directly, without using simulation. For example, when $\hat\theta = \bar X_n = \frac{1}{n}\sum_{i=1}^n X_i$, we have
$$V_{\hat F_n}(\hat\theta) = \frac{1}{n} \hat\sigma^2 = \frac{1}{n} \cdot \frac{1}{n} \sum_{i=1}^n (X_i - \bar X_n)^2.$$
However, in more complicated cases it is not easy to write down a formula for $V_{\hat F_n}(\hat\theta)$, and we need to resort to simulation.
Simulation
Suppose that $Y \sim H$ and we want to estimate $E(h(Y))$. We can draw i.i.d. samples $Y_1, Y_2, \ldots, Y_B$ from $H$ and use the sample mean
$$\frac{1}{B} \sum_{j=1}^B h(Y_j) \approx E(h(Y)) = \int h(y) \, dH(y).$$
In the same way we can approximate the variance of $Y$:
$$\frac{1}{B} \sum_{j=1}^B (Y_j - \bar Y)^2 = \frac{1}{B} \sum_{j=1}^B Y_j^2 - \Big(\frac{1}{B} \sum_{j=1}^B Y_j\Big)^2 \approx V(Y).$$
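To make this concrete, here is a minimal Monte Carlo sketch in Python; the choices $H = \mathrm{Exponential}(1)$ and $h(y) = y^2$ are illustrative assumptions, not from the notes. For this choice $E(h(Y)) = E(Y^2) = 2$.

```python
import numpy as np

rng = np.random.default_rng(0)

# Estimate E(h(Y)) by the sample mean of h(Y_1), ..., h(Y_B).
B = 100_000
Y = rng.exponential(scale=1.0, size=B)  # Y ~ H = Exponential(1)
h = lambda y: y**2                      # h(y) = y^2, so E(h(Y)) = 2
print(np.mean(h(Y)))                    # close to 2 for large B
```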
In the bootstrap we want to estimate $V_{\hat F_n}(\hat\theta)$, which stands for the variance of $\hat\theta$ if the true population distribution were $\hat F_n$; recall that $\hat\theta = T(X_1, X_2, \ldots, X_n)$. Now think of $\hat\theta$ as $Y$ in the above example (i.e., $Y = T(X_1^*, \ldots, X_n^*)$); the distribution $H$ of $Y$ in this case is the distribution of $T(X_1^*, \ldots, X_n^*)$, where $X_1^*, \ldots, X_n^*$ are drawn i.i.d. from $\hat F_n$. This gives the following algorithm:
(1) For $b = 1, 2, \ldots, B$:

1. Draw $X_1^*, \ldots, X_n^* \sim \hat F_n$ (i.e., sample $n$ points with replacement from the original sample).
2. Compute $\hat\theta_b^* = T(X_1^*, \ldots, X_n^*)$.

(2) Compute
$$v_{boot} = \frac{1}{B} \sum_{b=1}^B \Big(\hat\theta_b^* - \frac{1}{B} \sum_{c=1}^B \hat\theta_c^*\Big)^2.$$
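A minimal Python sketch of this algorithm (the function name `bootstrap_variance` and the normal placeholder data are illustrative assumptions); for $\hat\theta = \bar X_n$ the simulated value can be checked against the closed form $\frac{1}{n^2}\sum_i (X_i - \bar X_n)^2$ derived above:

```python
import numpy as np

rng = np.random.default_rng(0)

def bootstrap_variance(x, T, B=2000):
    """Approximate V_{F_n}(theta_hat): draw n points with replacement from x
    (an i.i.d. sample from F_n), recompute T on each resample, and return
    the empirical variance v_boot of the B replicates."""
    n = len(x)
    reps = np.array([T(rng.choice(x, size=n, replace=True)) for _ in range(B)])
    return np.mean((reps - reps.mean()) ** 2)

x = rng.normal(size=50)                          # placeholder data
print(bootstrap_variance(x, np.mean))            # simulated v_boot
print(np.sum((x - x.mean()) ** 2) / len(x)**2)   # closed form, should be close
```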
Bagging*: So far, we have investigated the bootstrap as a means to assess estimation accuracy. An interesting question is whether the bootstrap can improve accuracy. Bagging is an attempt to do this. Bagging is an acronym for bootstrap aggregation. The idea is simple. Suppose we are estimating some quantity, e.g., the optimal portfolio weights to achieve an expected return of 0.012. We have one estimate from the original sample, and this estimate is the one ordinarily used. However, we also have $B$ additional estimates, one from each of the bootstrap samples. The bagging estimate is the average of all of these bootstrap estimates, as sketched below.
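A minimal sketch of bagging (the median and the $t$-distributed placeholder data are illustrative choices, not the portfolio example from the text):

```python
import numpy as np

rng = np.random.default_rng(0)

def bagging_estimate(x, T, B=1000):
    """Bootstrap aggregation: average T over B bootstrap resamples
    instead of using the single original-sample estimate T(x)."""
    n = len(x)
    return np.mean([T(rng.choice(x, size=n, replace=True)) for _ in range(B)])

x = rng.standard_t(df=5, size=100)      # placeholder data
print(bagging_estimate(x, np.median))   # bagged median; compare np.median(x)
```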
Confidence Interval:
There are different types of bootstrap confidence intervals available in the literature. We are going to discuss only two types.
The first is the percentile interval.

1. Draw $B$ bootstrap resamples from the original sample.
2. Compute the estimate $\hat\theta_b^*$ from the $b$-th resample, $b = 1, \ldots, B$.
3. Sort these $B$ values in increasing order.
4. Let $\hat\theta_{(r)}^*$ be the $r$-th order statistic.
5. Therefore the $100(1-\alpha)\%$ confidence interval will be $[\hat\theta_{(k)}^*, \hat\theta_{(k')}^*]$, where $k = \frac{\alpha}{2}B$ if $\frac{\alpha}{2}B$ is an integer and $k = \lfloor\frac{\alpha}{2}B\rfloor + 1$ if it is not; similarly, $k' = (1-\frac{\alpha}{2})B$ if $(1-\frac{\alpha}{2})B$ is an integer and $k' = \lfloor(1-\frac{\alpha}{2})B\rfloor + 1$ if it is not.
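A minimal Python sketch of the percentile interval (the helper name `percentile_ci` and the exponential placeholder data are illustrative assumptions):

```python
import math
import numpy as np

rng = np.random.default_rng(0)

def percentile_ci(x, T, alpha=0.05, B=2000):
    """Percentile interval: sort the B bootstrap replicates and take the
    order statistics at the ranks k and k' defined in step 5 above."""
    n = len(x)
    reps = np.sort([T(rng.choice(x, size=n, replace=True)) for _ in range(B)])
    k = math.ceil(alpha / 2 * B)              # = aB/2, rounded up if fractional
    k_prime = math.ceil((1 - alpha / 2) * B)  # = (1 - a/2)B, rounded up
    return reps[k - 1], reps[k_prime - 1]     # 1-based ranks -> 0-based index

x = rng.exponential(size=80)                  # placeholder data
print(percentile_ci(x, np.mean))
```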
The second is the bootstrap-$t$ interval. The usual $t$-based interval relies on the fact that
$$t = \frac{\bar X - \mu}{s/\sqrt{n}}$$
has a $t$-distribution when $X_1, \ldots, X_n$ are an i.i.d. sample from $N(\mu, \sigma^2)$. But a problem occurs if we are not sampling from a normal distribution, but rather from some other distribution. In that case the following bootstrap confidence interval can be constructed. Let $\bar X_{boot,b}$ and $s_{boot,b}$ be the sample mean and standard deviation of the $b$-th resample, $b = 1, \ldots, B$. Define
$$t_{boot,b} = \frac{\bar X - \bar X_{boot,b}}{s_{boot,b}/\sqrt{n}}.$$
Notice that $t_{boot,b}$ is defined in the same way as $t$ except for two changes. First, $\bar X$ and $s$ in $t$ are replaced by $\bar X_{boot,b}$ and $s_{boot,b}$. Second, $\mu$ in $t$ is replaced by $\bar X$ in $t_{boot,b}$. The last point is a bit subtle, and you should stop to think about it. A resample is taken using the original sample as the population; thus, for the resample, the population mean is $\bar X$! Because the resamples are independent of each other, the collection $t_{boot,1}, \ldots, t_{boot,B}$ can be treated as a random sample from the distribution of the $t$-statistic. After $B$ values of $t_{boot,b}$ have been calculated, one from each resample, we find the $100(\frac{\alpha}{2})\%$ and $100(1-\frac{\alpha}{2})\%$ percentiles of this collection of $t_{boot,b}$ values. Call these percentiles $t_L$ and $t_U$. More specifically, we find $t_L$ and $t_U$ as we described earlier: we sort all $B$ values from smallest to largest, then calculate $B\frac{\alpha}{2}$ and round it to the nearest integer. Suppose the result is $K_L$; then the $K_L$-th sorted value of $t_{boot,b}$ is $t_L$. Similarly, let $K_U$ be $B(1-\frac{\alpha}{2})$ rounded to the nearest integer; then $t_U$ is the $K_U$-th sorted value of $t_{boot,b}$. Finally, we can form the bootstrap confidence interval for $\mu$ as
$$\Big(\bar X + t_L \frac{s}{\sqrt{n}},\ \bar X + t_U \frac{s}{\sqrt{n}}\Big).$$
We get two advantages through the bootstrap:

1. We do not need to know the population distribution.
2. We do not need to calculate the distribution of the $t$-statistic using probability theory.
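A minimal Python sketch of the bootstrap-$t$ interval for the mean (the skewed lognormal placeholder data are an illustrative assumption):

```python
import numpy as np

rng = np.random.default_rng(0)

def bootstrap_t_ci(x, alpha=0.05, B=2000):
    """Bootstrap-t interval for the mean, following the recipe above:
    simulate t_boot,b, take the B*alpha/2-th and B*(1 - alpha/2)-th sorted
    values as t_L and t_U, and plug into (xbar + t_L*s/sqrt(n), xbar + t_U*s/sqrt(n))."""
    n = len(x)
    xbar, s = x.mean(), x.std(ddof=1)
    t_boot = np.empty(B)
    for b in range(B):
        xb = rng.choice(x, size=n, replace=True)
        t_boot[b] = (xbar - xb.mean()) / (xb.std(ddof=1) / np.sqrt(n))
    t_boot.sort()
    K_L = max(1, round(B * alpha / 2))        # rank of t_L (1-based)
    K_U = min(B, round(B * (1 - alpha / 2)))  # rank of t_U
    t_L, t_U = t_boot[K_L - 1], t_boot[K_U - 1]
    return xbar + t_L * s / np.sqrt(n), xbar + t_U * s / np.sqrt(n)

x = rng.lognormal(size=60)                    # skewed, non-normal data
print(bootstrap_t_ci(x))
```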
4.1 Jackknife Estimator:
Jackknifing, which is similar to bootstrapping, is used in statistical inference to estimate the bias and standard error (variance) of a statistic when a random sample of observations is used to calculate it. The basic idea behind the jackknife estimator lies in systematically recomputing the statistic, leaving out one or more observations at a time from the sample. Therefore, in the delete-1 jackknife, the resamples of the sample $(X_1, X_2, X_3)$ are $(X_2, X_3)$, $(X_1, X_3)$, and $(X_1, X_2)$. Suppose $\hat\theta_b$, $b = 1(1)n$, are the estimators based on the jackknife resamples of size $(n-1)$ from the original sample of size $n$. Then the jackknife estimate of $\theta$ can be written as
$$\hat\theta_{avg} = \frac{\sum_{b=1}^n \hat\theta_b}{n},$$
and the jackknife estimate of the standard error of $\hat\theta$ is
$$\widehat{SE}_{jack} = \Big[\frac{n-1}{n} \sum_{b=1}^n (\hat\theta_b - \hat\theta_{avg})^2\Big]^{1/2}.$$
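A minimal Python sketch of the delete-1 jackknife (the helper name `jackknife_se` and the normal placeholder data are illustrative assumptions); for $T$ equal to the mean, $\widehat{SE}_{jack}$ reduces exactly to $s/\sqrt{n}$, which gives a quick check:

```python
import numpy as np

rng = np.random.default_rng(0)

def jackknife_se(x, T):
    """Delete-1 jackknife: recompute T leaving out one observation at a
    time, then plug the n replicates into the SE formula above."""
    n = len(x)
    reps = np.array([T(np.delete(x, b)) for b in range(n)])
    return np.sqrt((n - 1) / n * np.sum((reps - reps.mean()) ** 2))

x = rng.normal(size=40)            # placeholder data
print(jackknife_se(x, np.mean))    # equals x.std(ddof=1)/sqrt(n) for the mean
```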