ECON 351 - NOTE 7 Interval Estimation in The Classical Normal Linear Regression Model
ECON 351 - NOTE 7 Interval Estimation in The Classical Normal Linear Regression Model
Abbott
This note outlines the basic elements of interval estimation in the Classical
Normal Linear Regression Model (the CNLRM). Interval estimation – i.e., the
construction of confidence intervals for unknown population parameters – is one of
the two alternative approaches to statistical inference; the other is hypothesis
testing.
1. Introduction
♦ Assume that we have computed the point estimates β̂0 , β$ 1 and σ$ 2 of the
unknown parameters for a given set of sample data (Yi, Xi), i = 1, ..., N.
We therefore begin with the following OLS sample regression equation (or
OLS-SRE):
Yi = βˆ 0 + βˆ 1X i + û i = Ŷi + û i (i = 1, …, N) (2)
where
N
∑ xy ∑ (X i − X )(Yi − Y )
βˆ 1 = i i 2 i = i =1
= OLS estimate of β1 ;
∑i x i N
∑ (X i − X ) 2
i =1
βˆ 0 = Y − βˆ 1X = OLS estimate of β0 ;
∑ i u$ i2 RSS
σ$ =
2
= = unbiased OLS estimate of σ 2 ;
(N − 2) (N − 2)
σˆ 2 σˆ 2
Vâr (βˆ 1 ) = = ;
∑ i x i2 ∑ i ( X i −X ) 2
1
ˆ ˆ ⎛ σˆ 2 ⎞ 2 σˆ
sê(β1 ) = Vâr (β1 ) = ⎜⎜ ⎟
2 ⎟
= ;
⎝ ∑i x i ⎠ ∑ i x i2
ˆ σˆ 2 ∑ i X i2 σˆ 2 ∑ i X i2
Vâr (β0 ) = = ;
N ∑ i x i2 N ∑ i ( X i −X ) 2
1
⎛ σˆ ∑ i X
2 2
⎞2
sê(βˆ 0 ) = Vâr (βˆ 0 ) = ⎜⎜ i
⎟⎟ .
⎝ N ∑i x
2
i ⎠
βˆ 1 − β1 βˆ − β
t (βˆ 1 ) = = 1 1 ~ t[N − 2] ;
Vâr (βˆ 1 ) sê(βˆ 1 )
βˆ 0 − β0 βˆ − β0
t (βˆ 0 ) = = 0 ~ t[ N − 2] .
ˆ
Vâr (β0 ) s ê (βˆ )
0
A confidence interval for the slope coefficient β1 takes the general form
where
α = the significance level (0 < α < 1),
1 − α = the confidence level (or confidence coefficient),
δ$ = a positively-valued sample statistic,
β̂1L = β̂1 − δ$ = the lower confidence limit,
β̂ = β̂ + δ$ = the upper confidence limit.
1U 1
The interval [ β̂1L , β̂1U ] = [ β̂1 − δ$ , β̂1 + δ$ ] is called the two-sided (1 − α)-level
confidence interval, or two-sided 100(1 − α) percent confidence interval, for the
slope coefficient β1.
• The confidence limits β̂1L = β̂1 − δ$ and β̂1U = β̂1 + δ$ are random
variables (or sample statistics) that vary in value from one sample to
another because the values of β̂1 and δ$ vary from sample to sample.
• But for any one sample of data of size N and the corresponding estimates of
β̂1 and δ$ , the confidence limits β̂1L = β̂1 − δ$ and β̂1U = β̂1 + δ$ are simply
fixed numbers, i.e., they take fixed values. Therefore, any one confidence
interval calculated for a particular sample of data is a fixed, meaning
nonrandom, interval.
• Suppose a very large number of random samples of the same size N (e.g., N
= 50 observations) are independently selected from a given population.
• The probability statement in (3) means that 100(1 − α) percent of all the
confidence intervals so constructed will contain the true (but unknown)
population value of β1.
• But note that any one confidence interval [ β̂1L , β̂1U ] based on one sample of
N observations may or may not contain the true value of β1.
The two-sided 100(1−α) percent confidence interval for the slope coefficient
β1 is defined by probability statement (3):
(1) Any one confidence interval for β1, based on one sample of data, may or
may not contain the true value of β1. Since the true value of β1 is unknown,
we do not know whether that value does or does not lie inside any one
confidence interval.
(2) The probability statement (3) is therefore a statement about the procedure
used to construct the confidence interval, not about any one confidence
interval estimate calculated for a particular sample of data.
A two-step derivation:
( )
Pr − t α / 2 [ N − 2] ≤ t (βˆ 1 ) ≤ t α / 2 [ N − 2] = 1 − α (4)
where
βˆ 1 − β1 βˆ − β
t (βˆ 1 ) = = 1 1. (5)
Vâr (βˆ 1 ) sê(βˆ 1 )
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
-5 -4 -3 -2 -1 0 1 2 3 4 5
− tα/2 tα/2
↑ ↑
Pr(t < − tα/2) = α/2 ↑ Pr(− tα/2 ≤ t ≤ tα/2) = 1−α ↑ Pr(t > tα/2) = α/2
STEP 2: Express the double inequality inside the brackets in probability statement
(4) in terms of β1 rather than t( β̂1 ).
(
Pr − t α / 2 [ N − 2] ≤ t (βˆ 1 ) ≤ t α / 2 [ N − 2] = 1 − α) (4)
− t α / 2 [ N − 2] ≤ t (βˆ 1 ) ≤ t α / 2 [ N − 2]
βˆ 1 − β1
− t α / 2 [ N − 2] ≤ ≤ t α / 2 [ N − 2] . (6.1)
sê(βˆ 1 )
(2) Multiply the double inequality (6.1) by the positive number sê(βˆ 1 ) > 0:
(4) Multiply all terms in inequality (6.3) by −1, remembering to reverse the
direction of the inequalities:
( )
Pr βˆ 1 − t α 2 [ N − 2] sê(βˆ 1 ) ≤ β1 ≤ βˆ 1 + t α 2 [ N − 2] sê(βˆ 1 ) = 1 − α . (7)
βˆ 1 − t α 2 [ N − 2] sê(βˆ 1 ) ≤ β1 ≤ βˆ 1 + t α 2 [ N − 2] sê(βˆ 1 )
or more compactly as
and
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
-5 -4 -3 -2 -1 0 1 2 3 4 5
↑ ↑ ↑
βˆ 1L = βˆ 1 − t α 2 [ N − 2] sê(βˆ 1 ) β̂1 βˆ 1U = βˆ 1 + t α 2 [ N − 2] sê(βˆ 1 )
The confidence interval (or interval estimator) for the intercept coefficient β0 is
derived, interpreted, and constructed in exactly the same way as the confidence
interval for the slope coefficient β1.
( )
Pr − t α / 2 [ N − 2] ≤ t (βˆ 0 ) ≤ t α / 2 [ N − 2] = 1 − α (8)
where
βˆ 0 − β0 βˆ − β0
t (βˆ 0 ) = = 0 . (9)
Vâr (βˆ 0 ) sê(βˆ 0 )
2. The double inequality inside the brackets in probability statement (8) can be
expressed in terms of β0 rather than t( β̂ 0 ), using a derivation analogous to that
used in deriving the confidence interval for β1.
( )
Pr βˆ 0 − t α 2 [ N − 2] sê(βˆ 0 ) ≤ β0 ≤ βˆ 0 + t α 2 [ N − 2] sê(βˆ 0 ) = 1 − α . (10)
βˆ 0 − t α 2 [ N − 2] sê(βˆ 0 ) ≤ β0 ≤ βˆ 0 + t α 2 [ N − 2] sê(βˆ 0 )
or more compactly as
and
Consider the problem of computing a confidence interval for the slope coefficient
β1. Recall that the two-sided (1 − α)-level confidence interval for β1 is given by
the double inequality
βˆ 1 − t α 2 [ N − 2] sê(βˆ 1 ) ≤ β1 ≤ βˆ 1 + t α 2 [ N − 2] sê(βˆ 1 ) .
Step 1: After estimating the PRE (1) by OLS, retrieve from the estimation results
the OLS estimate β̂1 of β1 and the estimated standard error sê(βˆ 1 ) .
Step 2: Select the value of the confidence level (1 − α), which amounts to
selecting the value of α. Although the choice of confidence level is
essentially arbitrary, the values most commonly used in practice are:
Step 3: Obtain the value of t α 2 [ N − 2] , the α/2 critical value of the t-distribution
with N−2 degrees of freedom, either from statistical tables of the t-
distribution or from a computer software program.
Step 4: Use the values of β̂1 , sê(βˆ 1 ) , and t α 2 [ N − 2] to compute the upper and
lower 100(1 − α) percent confidence limits for β1:
βˆ j − t α 2 [ N − 2] sê(βˆ j ) ≤ β j ≤ βˆ j + t α 2 [ N − 2] sê(βˆ j )
or
[βˆ − t
j α2 [ N − 2] sê(βˆ j ), βˆ j + t α 2 [ N − 2] sê(βˆ j ) ]
By inspection, it is apparent that the two-sided confidence interval for βj is wider
(1) the greater the value of sê(βˆ j ) , the estimated standard error of β̂ j , i.e., the less
precise is the estimate of β̂ j ;
(2) the greater the critical value t α 2 [N − 2] , i.e., the greater the chosen value of
the confidence level (1 − α) for the given sample size N.
pricei = β0 + β1weight i + u i
------------------------------------------------------------------------------
price | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
weight | 2.044063 .3768341 5.42 0.000 1.292857 2.795268
_cons | -6.707353 1174.43 -0.01 0.995 -2347.89 2334.475
------------------------------------------------------------------------------
. display _b[weight]
2.0440626
. display _se[weight]
.37683413
• Selected results from OLS estimation of the above linear regression model:
β̂1 = 2.0440626
sê(βˆ 1 ) = 0.37683413
• Result: The two-sided 95% confidence interval for β1 is: [1.293, 2.795]
where
• For each of these 1,000 independent random samples, compute the values of the
OLS slope coefficient estimator
∑ xy
βˆ 1 = i i 2 i
∑i x i
1
⎛ σˆ ⎞
2 2 σˆ
sê(βˆ 1 ) = Vâr (βˆ 1 ) = ⎜⎜ ⎟ =
2 ⎟
∑ x
⎝ i i ⎠ ∑ i x i2
where x i ≡ X i − X , y i ≡ Yi − Y , X = ∑ i X i N , and Y = ∑ i Yi N .
• Save the 1,000 values of β̂1 and the 1,000 values of sê(β̂1 ) .
• Use each of the 1,000 values of β̂1 and sê(β̂1 ) to compute the two-sided 95
percent confidence interval for the slope coefficient β1 , and then count the
number and percentage of these 1,000 confidence intervals that contain the
true population value of β1, which is 0.90.