Model Specification
Model Specification
Violation of the classical linear regression model (CLRM) that the regression model is “correctly”
specified
If not correctly specified leads to problem of model specification error/model specification bias
1. How does one go about finding the “correct” model? i.e criteria in choosing a model for
empirical analysis?
2. Types of model specification errors likely to be encountered in practice?
3. Consequences of specification errors?
4. Detection of specification error? i.e. diagnostic tools used?
5. Remedies?
6. Evaluation of performance of competing models?
𝑌 =𝛽 + 𝛽 𝑋 + 𝛽 𝑋 +𝛽 𝑋 +𝑢 (2.1)
X=Output
𝑌 =𝛼 + 𝛼 𝑋 + 𝛼 𝑋 +𝑢 (2.2)
Equation (2.2) is a specification error omitting a relevant variable (𝑋 .). Therefore, error term
in (2.2) is
𝑢 =𝑢 + 𝛽 𝑋 (2.3)
2.2.1 Consequences?
(a) If the omitted variable X3 is correlated with the included variable X2 (i.e. r23=0) and 𝛼 and
𝛼 are biased as well as inconsistent and the bias doesn’t disappears as the sample size gets
larger.
(e) Usual confidence interval and hypothesis testing procedures are likely to give misleading
conclusions about statistical significance of estimated parameters
(f) Forecasts based on the incorrect model and the forecast (confidence) intervals will be
unreliable
𝑌 =𝛽 + 𝛽 𝑋 + 𝛽 𝑋 +𝛽 𝑋 +𝛽 𝑋 +𝑢 (2.4)
𝑢 =𝑢 − 𝛽 𝑋 =𝑢 (2.5)
2.3.1 Consequences?
(a) OLS estimators of the parameters of the “incorrect” model are unbiased and consistent
(d) Estimated 𝛽s will be generally inefficient, i.e. variances larger than estimated 𝛽s in true
model
2.4 Wrong Functional Form
𝑙𝑛𝑌 = 𝛽 + 𝛽 𝑋 + 𝛽 𝑋 +𝛽 𝑋 +𝑢 (2.6)
Log-linear vs.linear
𝑌 ∗ = 𝛽 ∗ + 𝛽 ∗ 𝑋 ∗ + 𝛽 ∗ 𝑋 ∗ + 𝛽 ∗ 𝑋 ∗ + 𝑢∗ (2.7)
Where:
𝑌∗ = 𝑌 + 𝜀
𝑋∗ = 𝑋 + 𝑊
N.b. Equation (2.7)=measurement error where the true Yi and Xi use proxies 𝑌 ∗ and 𝑋 ∗ (which
may contain errors of measurement)
Given 𝑌 = 𝛽𝑋 𝑢 (2.8)
i.e. the stochastic error enters in multiplicatively. However, taking logs satisfies the CLRM
against the following model:
𝑌 = 𝛽𝑋 +𝑢 (2.9)
Where error term enters additively. If equation (2.8) is “correct”or holds ‘true” would the 𝛽 in
2.9 be an unbiased estimate of one in (2.8)? If not, then we have a specification error.