
Lecture 6: The Bootstrap

Reading: Chapter 5

STATS 202: Data mining and analysis

Rajan Patel

Cross-validation vs. the Bootstrap

Cross-validation: provides estimates of the (test) error.

The Bootstrap: provides the (standard) error of estimates.

- One of the most important techniques in all of Statistics.
- Computer intensive method.
- Popularized by Brad Efron, from Stanford.
Standard errors in linear regression
Standard error: the standard deviation of an estimate computed from a sample of size n.
Classical way to compute Standard Errors

Example: Estimate the variance of a sample $x_1, x_2, \ldots, x_n$:

$$\hat\sigma^2 = \frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar x)^2.$$

What is the Standard Error of $\hat\sigma^2$?

1. Assume that $x_1, \ldots, x_n$ are normally distributed.
2. Assume that the true variance is close to $\hat\sigma^2$ and the true mean is close to $\bar x$.
3. Then $(n-1)\,\hat\sigma^2/\sigma^2$ has a $\chi^2$ distribution with $n-1$ degrees of freedom.
4. The SD of this sampling distribution is the Standard Error.
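The classical recipe above can be sketched in a few lines. Under normality, $(n-1)\,\hat\sigma^2/\sigma^2 \sim \chi^2_{n-1}$ implies $\mathrm{SD}(\hat\sigma^2) = \sigma^2\sqrt{2/(n-1)}$; plugging in $\hat\sigma^2$ for the unknown $\sigma^2$ gives the standard error. The simulated sample below is illustrative, not from the lecture:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=0.0, scale=2.0, size=100)  # simulated normal sample, n = 100
n = len(x)

var_hat = x.var(ddof=1)  # unbiased sample variance (divides by n - 1)

# (n-1) * var_hat / sigma^2 ~ chi-squared with n-1 df, whose SD is sqrt(2(n-1)),
# so SD(var_hat) = sigma^2 * sqrt(2/(n-1)); plug in var_hat for sigma^2:
se_classical = var_hat * np.sqrt(2.0 / (n - 1))

print(f"variance estimate: {var_hat:.3f}, classical SE: {se_classical:.3f}")
```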
Limitations of the classical approach

This approach has served statisticians well for 90 years; however, what happens if:

- The distributional assumption (for example, $x_1, \ldots, x_n$ being normal) breaks down?
- The estimator does not have a simple form and its sampling distribution cannot be derived analytically?
Example. Investing in two assets
Suppose that X and Y are the returns of two assets.

These returns are observed every day: $(x_1, y_1), \ldots, (x_n, y_n)$.

[Figure: two scatterplots of the daily returns, Y versus X, for two samples.]
Example. Investing in two assets

We have a fixed amount of money to invest and we will invest a fraction $\alpha$ in X and a fraction $(1-\alpha)$ in Y. Therefore, our return will be

$$\alpha X + (1-\alpha)Y.$$

Our goal will be to minimize the variance of our return as a function of $\alpha$. One can show that the optimal $\alpha$ is:

$$\alpha = \frac{\sigma_Y^2 - \mathrm{Cov}(X, Y)}{\sigma_X^2 + \sigma_Y^2 - 2\,\mathrm{Cov}(X, Y)}.$$

Proposal: Use an estimate:

$$\hat\alpha = \frac{\hat\sigma_Y^2 - \widehat{\mathrm{Cov}}(X, Y)}{\hat\sigma_X^2 + \hat\sigma_Y^2 - 2\,\widehat{\mathrm{Cov}}(X, Y)}.$$
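The plug-in estimate $\hat\alpha$ is just a ratio of sample moments. A minimal sketch, using simulated returns with made-up covariance parameters (chosen so the true $\alpha$ works out to 0.6):

```python
import numpy as np

def alpha_hat(x, y):
    """Plug-in estimate of the variance-minimizing allocation alpha."""
    c = np.cov(x, y)  # 2x2 sample covariance matrix of the returns
    var_x, var_y, cov_xy = c[0, 0], c[1, 1], c[0, 1]
    return (var_y - cov_xy) / (var_x + var_y - 2.0 * cov_xy)

# Simulated daily returns; with these made-up parameters the true alpha is
# (1.25 - 0.5) / (1.0 + 1.25 - 2 * 0.5) = 0.6.
rng = np.random.default_rng(1)
xy = rng.multivariate_normal([0.0, 0.0], [[1.0, 0.5], [0.5, 1.25]], size=1000)
print(f"alpha_hat = {alpha_hat(xy[:, 0], xy[:, 1]):.3f}")
```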
Example. Investing in two assets

Suppose we compute the estimate $\hat\alpha = 0.6$ using the samples $(x_1, y_1), \ldots, (x_n, y_n)$.

- How sure can we be of this value?
- If we resampled the observations, would we get a wildly different $\hat\alpha$?

In this thought experiment, we know the actual joint distribution $P(X, Y)$, so we can resample the n observations to our hearts' content.
Resampling the data from the true distribution

[Figure: four scatterplots of Y versus X, each showing a fresh sample of n observations drawn from the true distribution.]
Computing the standard error of $\hat\alpha$

For each resampling of the data,

$$(x_1^{(1)}, \ldots, x_n^{(1)})$$
$$(x_1^{(2)}, \ldots, x_n^{(2)})$$
$$\vdots$$

we can compute a value of the estimate: $\hat\alpha^{(1)}, \hat\alpha^{(2)}, \ldots$

The Standard Error of $\hat\alpha$ is approximated by the standard deviation of these values.
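In this idealized setting the thought experiment can be run directly: draw fresh datasets from the (assumed known) distribution, recompute $\hat\alpha$ on each, and take the SD of the results. The distribution parameters below are made up for illustration:

```python
import numpy as np

def alpha_hat(x, y):
    # Plug-in estimate: (var(Y) - cov) / (var(X) + var(Y) - 2 cov)
    c = np.cov(x, y)
    return (c[1, 1] - c[0, 1]) / (c[0, 0] + c[1, 1] - 2.0 * c[0, 1])

rng = np.random.default_rng(2)
mean, cov = [0.0, 0.0], [[1.0, 0.5], [0.5, 1.25]]  # made-up "true" distribution

# Repeatedly draw n = 100 fresh observations from the true P(X, Y).
estimates = []
for _ in range(1000):
    xy = rng.multivariate_normal(mean, cov, size=100)
    estimates.append(alpha_hat(xy[:, 0], xy[:, 1]))

# The SD across resamplings approximates SE(alpha_hat).
se_true = np.std(estimates, ddof=1)
print(f"SE of alpha_hat under the true distribution: {se_true:.3f}")
```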
In reality, we only have n samples

- However, these samples can be used to approximate the joint distribution of X and Y.
- The Bootstrap: Resample from the empirical distribution:

$$\hat P(X, Y) = \frac{1}{n} \sum_{i=1}^{n} \delta_{(x_i, y_i)}.$$

- Equivalently, resample the data by drawing n samples with replacement from the actual observations.

[Figure: scatterplot of the n observed (X, Y) pairs.]
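Sampling from the empirical distribution $\hat P$ amounts to drawing n indices with replacement. A minimal illustration, reusing the three observations from the schematic on the next slide:

```python
import numpy as np

rng = np.random.default_rng(3)
x = np.array([4.3, 2.1, 5.3])  # observed X values
y = np.array([2.4, 1.1, 2.8])  # observed Y values
n = len(x)

# One draw from P-hat: n row indices chosen uniformly with replacement.
idx = rng.integers(0, n, size=n)
x_star, y_star = x[idx], y[idx]
print(idx, x_star, y_star)
```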
A schematic of the Bootstrap
From the original data $Z$ (here n = 3 observations), draw B bootstrap samples $Z^{*1}, \ldots, Z^{*B}$ with replacement, and compute an estimate $\hat\alpha^{*b}$ from each.

Original data (Z):

  Obs   X     Y
  1     4.3   2.4
  2     2.1   1.1
  3     5.3   2.8

First bootstrap sample $Z^{*1}$ (rows drawn with replacement), giving $\hat\alpha^{*1}$:

  Obs   X     Y
  3     5.3   2.8
  1     4.3   2.4
  3     5.3   2.8

Second sample $Z^{*2}$, giving $\hat\alpha^{*2}$:

  Obs   X     Y
  2     2.1   1.1
  3     5.3   2.8
  1     4.3   2.4

..., up to $Z^{*B}$, giving $\hat\alpha^{*B}$:

  Obs   X     Y
  2     2.1   1.1
  2     2.1   1.1
  1     4.3   2.4
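The schematic above, as a sketch in code: draw B bootstrap samples of the observed data, compute $\hat\alpha^{*b}$ on each, and take the SD. The dataset is simulated here with made-up parameters, since only a 3-row toy example appears in the slides:

```python
import numpy as np

def alpha_hat(x, y):
    # Plug-in estimate: (var(Y) - cov) / (var(X) + var(Y) - 2 cov)
    c = np.cov(x, y)
    return (c[1, 1] - c[0, 1]) / (c[0, 0] + c[1, 1] - 2.0 * c[0, 1])

rng = np.random.default_rng(4)
xy = rng.multivariate_normal([0.0, 0.0], [[1.0, 0.5], [0.5, 1.25]], size=100)
x, y = xy[:, 0], xy[:, 1]
n, B = len(x), 1000

# Draw B bootstrap samples Z*1, ..., Z*B and compute alpha_hat on each.
boot = np.empty(B)
for b in range(B):
    idx = rng.integers(0, n, size=n)  # n observations with replacement
    boot[b] = alpha_hat(x[idx], y[idx])

# The SD of the bootstrap estimates approximates SE(alpha_hat).
se_boot = boot.std(ddof=1)
print(f"Bootstrap SE of alpha_hat: {se_boot:.3f}")
```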
Comparing Bootstrap resamplings to resamplings from the true distribution

[Figure: histograms of $\hat\alpha$ over resamplings from the true distribution (left) and over Bootstrap resamplings (center), both on the range 0.3 to 0.9, with side-by-side boxplots labeled "True" and "Bootstrap" (right).]
