CH 04
CH 04
Principles of
Econometrics
Fifth Edition
R. Carter Hill William E. Griffiths Guay C. Lim
Chapter Outline
4.1 Least Squares Prediction
that y0 and x0 are related to one another by the same regression model that
describes our sample of data, so that, in particular, SR1 holds for these
observations.
y0 β1 β 2 x0 e0
(4.1)
The least squares predictor of y0 comes from the fitted regression line
(4.2)
ŷ 0 b1 b 2 x 0
which means, on average, the forecast error is zero and is an unbiased predictor of y0.
However, unbiasedness does not necessarily imply that a particular forecast will be close
(4.6)
yˆ 0 tc se f
(4.7) yi β1 β 2 xi ei
unexplainable components.
(4.8)
yˆ i y eˆi 0
we get: (4.11)
y y yˆi y eˆi2
2 2
i
Eq. 4.11 decomposition of the ‘‘total sample variation’’ in y into explained and
unexplained components.
model:
2 SSR SSE
(4.12) R 1
SST SST
The closer R2 is to 1, the closer the sample values yi are to the fitted
regression equation.
Copyright ©2018 John Wiley & Son, Inc. 15
4.2 Measuring Goodness-of-Fit (6 of 6)
If R2 = 1, then all the sample data fall exactly on the fitted least
squares line, so SSE = 0, and the model fits the data ‘‘perfectly.’’
If the sample data for y and x are uncorrelated and show no linear
cov x, y σ xy
ρ xy
(4.13) σ xσ y
var x var y
x x N 1
2
sx i
y y N 1
2
sy i
The sample correlation coefficient rxy has a value between -1 and 1, and it
1. r2xy = R2
3. Changing the scale of y and x by the same factor: there will be no change in
the reported regression results for b2 , but the estimated intercept and
residuals will change.
t-statistics and R2 are unaffected.
The interpretation of the parameters is made relative to the new units of
measurement.
What does economics really say about the relation between food expenditure
1. In the log-log model both the dependent and independent variables are
transformed by the ‘‘natural’’ logarithm.
1. Choose a shape that is consistent with what economic theory tells us about the
relationship.
3. Choose a shape so that assumptions SR1–SR6 are satisfied, ensuring that the
least squares estimators have the desirable properties described in Chapters 2
and 3.
Copyright ©2018 John Wiley & Son, Inc. 31
4.3.4 Using Diagnostic Residual Plots
(1 of 5)
When specifying a regression model, we may inadvertently choose an
inadequate or incorrect functional form.
Homoskedasticity
Serial correlation
If an unusual observation is the result of a data error, then we should correct it.
Understanding how it came about, the story behind it, can be informative.
the model.
one sample.
If the studentized residual falls outside the 95% interval estimate interval,
Logarithmic transformations are often used for variables that are monetary
values.
These variables have the characteristic that they are positive and often
have distributions that are positively skewed, with a long tail to the right.
Both its slope and elasticity change at each point and are the same sign as β2.
β2 % change in y.
=b1+b2 x
Thus, the effect of the correction is always to increase the value of the
log-linear model, and the correction offsets the downward bias in large
samples.
useless.
The slopes of these curves change at every point, but the elasticity
decreasing rate.
(4.15)
2
R corr Q, Q̂ c
g 2
0.939 2 0.8817