The Jackknife: Patrick Breheny
The Jackknife: Patrick Breheny
The Jackknife
Patrick Breheny
September 9
Introduction
Thus,
θ̂jack = θ̂ − bjack ,
For example,
P consider the plug-in estimate of variance:
θ̂ = n−1 i (xi − x̄)2
The expected value of the jackknife estimate of bias is
θ
E(bjack ) =
n
= Bias(θ̂)
θ̂jack = s2 ,
Pseudo-values
Note that,
Pfor the special case of a linear statistic
θ̂ = µ + a(xi ), θ̃i = a(xi ) (i.e., for the mean, θ̃i = xi )
The idea behind pseudo-values is that they are supposed to
act like a sample of n independent data values – or at least, a
linear approximation to that ideal
Note that
1X
θ̃i = nθ̂ − (n − 1)θ̄
n
= θ̂ − bjack ,
s̃2
vjack =
n
Example: Mean
Homework
Inconsistency
Homework
The standardized test used by law schools is called the Law School
Admission Test (LSAT), and it has a reasonably high correlation
with undergraduate GPA. The course website contains data on the
average LSAT score and average undergraduate GPA for the 1973
incoming class of 15 law schools.
(a) Use the jackknife to obtain an estimate of the bias and
standard error of the correlation coefficient between GPA and
LSAT scores. Comment on whether the estimate is biased
upward or downward.
(b) If x and y are drawn from a bivariate normal distribution, then
P
nV(ρ̂) −→ (1 − ρ2 )2 . Use this to estimate the standard error
of ρ̂.
Homework (cont’d)
Homework (cont’d)
(d) For each data point (xi , yi ), make a plot of ρ̂ vs. the mass at
point i, as the point mass at i varies from 0 to 1/3 (and the
rest of the mass is spread evenly on the rest of the
observations).
(e) Comment on why some plots slope upwards and others slope
downwards.
(f) Extra credit: Comment on the how the shape of the curve
relates to the comparison between the delta method and
jackknife estimates of the variance
Hints