0% found this document useful (0 votes)

35 views22 pages

Chapter 6 Econometrics

Uploaded by

Anum Abdullah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

35 views22 pages

Chapter 6 Econometrics

Uploaded by

Anum Abdullah

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 22

Chapter 6

Model Specification: Choosing

the Independent Variables

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٤

Specifying an Econometric Equation and
Specification Error

• Before any equation can be estimated, it must be completely

specified
• Specifying an econometric equation consists of three parts,
namely choosing the correct:
– independent variables
– functional form
– form of the stochastic error term
• A specification error results when one of these choices is
made
incorrectly
• This chapter will deal with the first of these choices (the two other
choices will be discussed in subsequent chapters)

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٥

Omitted Variables

• Two reasons why an important explanatory variable

might have been left out:
– we forgot…
– it is not available in the dataset, we are examining
• Either way, this may lead to omitted variable bias
(or, more generally, specification bias)
• The reason for this is that when a variable is not
included, it cannot be held constant
• Omitting a relevant variable usually is evidence that the
entire equation is a suspect, because of the likely bias of
the coefficients.

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٦

The Consequences of an Omitted Variable

• Suppose the true regression model is:

(6.1)
Where is a classical error term
• If X2 is omitted, the equation becomes instead:
(6.2)

Where:
(6.3)
• Hence, the explanatory variables in the estimated regression (6.2) are not
independent of the error term (unless the omitted variable is
uncorrelated with all the included variables—something which is very
unlikely)

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٧

The Consequences of an Omitted Variable (cont.)

• What happens if we estimate Equation 6.2 when Equation 6.1 is the truth?
• We get bias!
• What this means is that:
(6.4)
• Instead of having an expected value equal to the true β1 the estimate will compensate
for the fact that X2 is missing from the equation.
• If X1 and X2 are correlated and X2 is omitted from the equation, then the OLS
estimation procedure will attribute to X1 variations in Y actually caused by X2, and a
biased estimate of β1 will result.

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨

The Consequences of an Omitted Variable (cont.)

• To see how a left-out variable can cause bias, picture a production function
that states that output depends on the amount of labor and capital used.
Y=f(K,L)
• What would happen if data on capital were unavailable for some reason and
K was omitted from the equation?
• In this case, we would be leaving out the impact of capital on output in our
model.
• This omission would almost surely bias the estimate of the coefficient of
labor because it is likely that capital and labor are positively correlated.
• As a result, the OLS program would attribute to labor the increase in output
actually caused by capital to the extent that labor and capital were
correlated.
• Thus the bias would be a function of the impact of capital on output and the
correlation between capital and labor.
© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨
The Consequences of an Omitted Variable (cont.)

• To generalize for a model with two independent variables, the expected value of the
coefficient of an included variable (X1) when a relevant variable (X2) is omitted from the
equation equals:

Where α1 is the slope coefficient of the secondary regression that relates X2 to X1:

Where ui is a classical error term. α1 can be expressed as a function of the correlation

between X1 and X2, the included and excluded variables, or f(r12).

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨

The Consequences of an Omitted Variable (cont.)

In a nutshell

The amount of bias is a function of the impact of the omitted variable on the
dependent variable times a function of the correlation between the included and
the omitted variable
• So, the bias exists unless:
1. the true coefficient equals zero, or
2. the included and omitted variables are uncorrelated

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨

An Example of Specification Bias

As an example of specification bias, let’s take a look at a simple model of the

annual consumption of chicken in the United States

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨

Correcting for an Omitted Variable

• In theory, the solution to a problem of specification bias seems easy:

add the omitted variable to the equation!
• Unfortunately, that’s easier said than done, for a couple of reasons
1. Omitted variable bias is hard to detect: the amount of bias introduced can
be small and not immediately detectable
2. Even if it has been decided that a given equation is suffering from omitted
variable bias, how to decide exactly which variable to include?
• Note here that dropping a variable is not a viable strategy to help cure
omitted variable bias:
– If anything you’ll just generate even more omitted variable bias on the
remaining coefficients!

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٩

Correcting for an Omitted Variable (cont.)

• What if:
– You have an unexpected result, which leads you to believe that you have
an omitted variable
– You have two or more theoretically sound explanatory variables as
potential “candidates” for inclusion as the omitted variable to the equation is
to use
• How do you choose between these variables?
• One possibility is expected bias analysis
– Expected bias: the likely bias that omitting a particular variable would
have caused in the estimated coefficient of one of the included variables

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٠

Correcting for an Omitted Variable (cont.)

• Expected bias can be estimated with Equation 6.7:

(6.7)
• When do we have a viable candidate?
– When the sign of the expected bias is the same as the
sign
of the unexpected result
• Similarly, when these signs differ, the variable is
extremely unlikely to have caused the unexpected
result

Irrelevant
Variables
• This refers to the case of including a variable in an equation when it
does not belong there
• This is the opposite of the omitted variables case—and so the impact
can be illustrated using the same model
• Assume that the true regression specification is:
(6.10)
• But the researcher for some reason includes an extra variable:
(6.11)
• The misspecified equation’s error term then becomes:
(6.12)

Irrelevant Variables
(cont.)
• So, the inclusion of an irrelevant variable will not cause bias
(since the true coefficient of the irrelevant variable is zero, and so
the second term will drop out of Equation 6.12)
• However, the inclusion of an irrelevant variable will:
– Increase the variance of the estimated coefficients, and this
increased variance will tend to decrease the absolute
magnitude of their t-scores
– Decrease the R2 (but not the R2)
• Table 6.1 summarizes the consequences of the omitted variable
and the included irrelevant variable cases (unless r12 = 0)

Table 6.1 Effect of Omitted Variables and Irrelevant Variables on
the Coefficient Estimates

Four Important Specification Criteria

• We can summarize the previous discussion into four criteria to help

decide whether a given variable belongs in the equation:
1. Theory: Is the variable’s place in the equation unambiguous and theoretically
sound?
2. t-Test: Is the variable’s estimated coefficient significant in the expected
direction?
3. R2: Does the overall fit of the equation (adjusted for degrees of freedom) improve
when the variable is added to the equation?
4. Bias: Do other variables’ coefficients change significantly when the variable is
added to the equation?
• If all these conditions hold, the variable belongs in the equation
• If none of them hold, it does not belong
• The tricky part is the intermediate cases: use sound judgment!

Specification
Searches
• Almost any result can be obtained from a given
dataset, by simply specifying different regressions until
estimates with the desired properties are obtained
• Hence, the integrity of all empirical work is open to
question
• To counter this, the following three points of Best
Practices in Specification Searches are suggested:
1. Rely on theory rather than statistical fit as much as possible when
choosing variables, functional forms, and the like
2. Minimize the number of equations estimated (except for
sensitivity analysis, to be discussed later in this
section)
3. Reveal, in a footnote or appendix, all alternative
specifications estimated
© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٦
Sequential Specification
Searches
• The sequential specification search technique allows a researcher to:
– Estimate an undisclosed number of regressions
– Subsequently present a final choice (which is based upon an unspecified
set of expectations about the signs and significance of the coefficients) as if
it were only a specification
• Such a method misstates the statistical validity of the regression
results for two reasons:
1. The statistical significance of the results is overestimated because the
estimations of the previous regressions are ignored
2. The expectations used by the researcher to choose between various
regression results rarely, if ever, are disclosed

Bias Caused by Relying on the
t-Test to Choose Variables

• Dropping variables solely based on low t-statistics may lead to two

different types of errors:
1. An irrelevant explanatory variable may sometimes be included in the
equation (i.e., when it does not belong there)
2. A relevant explanatory variables may sometimes be dropped from the
equation (i.e., when it does belong)
• In the first case, there is no bias but in the second case there is bias
• Hence, the estimated coefficients will be biased every time an excluded
variable belongs in the equation, and that excluded variable will be left out
every time its estimated coefficient is not statistically significantly different
from zero
• So, we will have systematic bias in our equation!

Sensitivity
Analysis
• Contrary to the advice of estimating as few equations as possible
(and based on theory, rather than fit!), sometimes we see journal article
authors listing results from five or more specifications
• What’s going on here:
• In almost every case, these authors have employed a technique called
sensitivity analysis
• This essentially consists of purposely running a number of alternative
specifications to determine whether particular results are robust (not
statistical flukes) to a change in specification
• Why is this useful? Because true specification isn’t known!

Data
Mining
• Data mining involves exploring a data set to try to uncover
empirical regularities that can inform economic theory
• That is, the role of data mining is opposite that of traditional
econometrics, which instead tests the economic theory on
a data set
• Be careful, however!
– a hypothesis developed using data mining techniques must be
tested on a different data set (or in a different context) than
the one used to develop the hypothesis
– Not doing so would be highly unethical: After all, the researcher
already knows ahead of time what the results will be!

Key Terms from
Chapter 6
• Omitted variable
• Irrelevant variable
• Specification bias
• Sequential specification search
• Specification error
• The four specification criteria
• Expected bias
• Sensitivity analysis

Model Specification
No ratings yet
Model Specification
2 pages
ch9 - Model Specification and Data Problems
No ratings yet
ch9 - Model Specification and Data Problems
79 pages
M06 StockWatson123635 03 Econ Ch06
No ratings yet
M06 StockWatson123635 03 Econ Ch06
50 pages
Lec12 Ecmt
No ratings yet
Lec12 Ecmt
30 pages
VariableSelectionAndModelBuilding IIT
No ratings yet
VariableSelectionAndModelBuilding IIT
22 pages
Introduction To Econometrics With R
No ratings yet
Introduction To Econometrics With R
18 pages
Variable Selection: Prof. Sharyn O'Halloran Sustainable Development U9611 Econometrics II
No ratings yet
Variable Selection: Prof. Sharyn O'Halloran Sustainable Development U9611 Econometrics II
79 pages
Lectures PowerPoints PDF
No ratings yet
Lectures PowerPoints PDF
459 pages
Class 5 Omitted Variable Bias OVB1 1
No ratings yet
Class 5 Omitted Variable Bias OVB1 1
23 pages
Model Specification in Multiple Regression Analysis
No ratings yet
Model Specification in Multiple Regression Analysis
45 pages
Regression Model
No ratings yet
Regression Model
16 pages
Wa0064.
No ratings yet
Wa0064.
16 pages
Econometrics Specification Data Issues
No ratings yet
Econometrics Specification Data Issues
22 pages
ECON 342 AE Model Specification and Data Problems 2021
No ratings yet
ECON 342 AE Model Specification and Data Problems 2021
43 pages
Specification Variable in Econometrics
No ratings yet
Specification Variable in Econometrics
15 pages
Violation of Assumptions2
No ratings yet
Violation of Assumptions2
21 pages
Econometric Modeling: Model Specification and Diagnostic Testing
No ratings yet
Econometric Modeling: Model Specification and Diagnostic Testing
11 pages
TCH442E Quantitative Methods For Finance: Last Lecture: Next
No ratings yet
TCH442E Quantitative Methods For Finance: Last Lecture: Next
13 pages
Specification: Choosing The Independent Variables: Slides by Niels-Hugo Blunch Washington and Lee University
No ratings yet
Specification: Choosing The Independent Variables: Slides by Niels-Hugo Blunch Washington and Lee University
16 pages
More On Specification and Data
No ratings yet
More On Specification and Data
22 pages
Lecture 8
No ratings yet
Lecture 8
10 pages
Chapter 6-Linear Regression With Multiple Regressors
No ratings yet
Chapter 6-Linear Regression With Multiple Regressors
68 pages
Chapter 9
No ratings yet
Chapter 9
38 pages
Solutions For Tutorial 2
No ratings yet
Solutions For Tutorial 2
14 pages
Unit 2
No ratings yet
Unit 2
15 pages
Endogeneity
No ratings yet
Endogeneity
10 pages
Lecture 7
No ratings yet
Lecture 7
14 pages
03 - Causality PDF
No ratings yet
03 - Causality PDF
80 pages
Statistical Modelling: Regression: Choosing The Independent Variables
No ratings yet
Statistical Modelling: Regression: Choosing The Independent Variables
14 pages
Ec226 24-25 Week7 Thursday
No ratings yet
Ec226 24-25 Week7 Thursday
13 pages
CH 07 Specification and Data Issues TQT
No ratings yet
CH 07 Specification and Data Issues TQT
45 pages
Mis-Specifications of Regression Model
No ratings yet
Mis-Specifications of Regression Model
18 pages
A Comprehensive Approach To Misspecification Testing in Linear Regression Models
No ratings yet
A Comprehensive Approach To Misspecification Testing in Linear Regression Models
6 pages
Lecture 09 Model Misspecification
No ratings yet
Lecture 09 Model Misspecification
5 pages
Econometric Modeling:: Model Specification and Diagnostic Testing
100% (1)
Econometric Modeling:: Model Specification and Diagnostic Testing
57 pages
Specification Error OR Misspecification in Statistical Models
No ratings yet
Specification Error OR Misspecification in Statistical Models
4 pages
Omitted Variable Bias: The Simple Case
No ratings yet
Omitted Variable Bias: The Simple Case
8 pages
Chapter 5
No ratings yet
Chapter 5
30 pages
Econometrics: Specification Errors
100% (2)
Econometrics: Specification Errors
13 pages
Violations of Gauss Markov Assumptions: Omitted Variable Bias
No ratings yet
Violations of Gauss Markov Assumptions: Omitted Variable Bias
10 pages
Specification Error
No ratings yet
Specification Error
12 pages
Variable Selection: Prof. Sharyn O'Halloran Sustainable Development U9611 Econometrics II
No ratings yet
Variable Selection: Prof. Sharyn O'Halloran Sustainable Development U9611 Econometrics II
79 pages
Econometrics
No ratings yet
Econometrics
13 pages
chp2 Econometric
No ratings yet
chp2 Econometric
54 pages
Unit 5. Model Selection: María José Olmo Jiménez
No ratings yet
Unit 5. Model Selection: María José Olmo Jiménez
15 pages
Specification Choosing Independent Variables
No ratings yet
Specification Choosing Independent Variables
7 pages
Econometrics: Specification Errors: Burcu Eke
No ratings yet
Econometrics: Specification Errors: Burcu Eke
35 pages
Basic Econometrics Revision - Econometric Modelling
No ratings yet
Basic Econometrics Revision - Econometric Modelling
65 pages
CMPSOmit PDF
No ratings yet
CMPSOmit PDF
12 pages
KTN Omitted Variables
No ratings yet
KTN Omitted Variables
6 pages
FIRSA AULIA RAHMAN/B200154011/R:) ) ) E (X - (X E ( ) ) ) E (X - (X) ) E ( - E ( ( ) X, Cov (
No ratings yet
FIRSA AULIA RAHMAN/B200154011/R:) ) ) E (X - (X E ( ) ) ) E (X - (X) ) E ( - E ( ( ) X, Cov (
6 pages
Items Upload-Slides SpatialEconomStata PDF
100% (2)
Items Upload-Slides SpatialEconomStata PDF
94 pages
Wooldridge 6e Ch09 SSM
No ratings yet
Wooldridge 6e Ch09 SSM
8 pages
Omitted Variable Bias
No ratings yet
Omitted Variable Bias
5 pages
Lampiran 1 Perhitungan Pembuatan Konsentrasi Ekstrak: Universitas Sumatera Utara
No ratings yet
Lampiran 1 Perhitungan Pembuatan Konsentrasi Ekstrak: Universitas Sumatera Utara
30 pages
Chapter11 Econometrics SpecificationerrorAnalysis
No ratings yet
Chapter11 Econometrics SpecificationerrorAnalysis
7 pages
Mastering Metrics Published
No ratings yet
Mastering Metrics Published
4 pages
Chapter 9 QA
No ratings yet
Chapter 9 QA
4 pages
Linear Regression in R - R Tutorial
100% (1)
Linear Regression in R - R Tutorial
33 pages
Dynamic Econometric Models
No ratings yet
Dynamic Econometric Models
18 pages
Cross-Validation and Model Selection
No ratings yet
Cross-Validation and Model Selection
46 pages
6 - Addi Topics in Reg.
No ratings yet
6 - Addi Topics in Reg.
1 page
Pengaruh Motivasi, Disiplin Kerja Dan Kepuasan Kerja Terhadap Kinerja Karyawan Pt. Jadi Abadi Corak Biscuit Surabaya
No ratings yet
Pengaruh Motivasi, Disiplin Kerja Dan Kepuasan Kerja Terhadap Kinerja Karyawan Pt. Jadi Abadi Corak Biscuit Surabaya
22 pages
Module 3 - Regression
No ratings yet
Module 3 - Regression
55 pages
Two-Phase Sampling in Estimation of Population Mean in The Presence of Non-Response
No ratings yet
Two-Phase Sampling in Estimation of Population Mean in The Presence of Non-Response
15 pages
Pricing Analytics Models and Advanced Quantitative Techniques For Product Pricing First Edition Paczkowski
100% (2)
Pricing Analytics Models and Advanced Quantitative Techniques For Product Pricing First Edition Paczkowski
65 pages
MBS 7e PPT 15
No ratings yet
MBS 7e PPT 15
51 pages
8 面板数据方法
No ratings yet
8 面板数据方法
63 pages
Exercise 2 Management Accounting S6
No ratings yet
Exercise 2 Management Accounting S6
19 pages
Chapter 2
No ratings yet
Chapter 2
41 pages
MA Economics CBCS 2023 24 With Objectives
No ratings yet
MA Economics CBCS 2023 24 With Objectives
34 pages
Relation CHRD and VET
No ratings yet
Relation CHRD and VET
268 pages
Econometric Analysis of Cross Section and Panel Data, 2e
No ratings yet
Econometric Analysis of Cross Section and Panel Data, 2e
72 pages
ch03 Forecasting
No ratings yet
ch03 Forecasting
43 pages
Linear Regression Model: Man - PN@VNP - Edu.vn
No ratings yet
Linear Regression Model: Man - PN@VNP - Edu.vn
77 pages
SPSS Binary Logistic Regression Demo 1 Terminate
No ratings yet
SPSS Binary Logistic Regression Demo 1 Terminate
22 pages
EViews
No ratings yet
EViews
9 pages
Regresion Lineal Simple PDF
100% (1)
Regresion Lineal Simple PDF
16 pages
Stats and Probabilty Reviewer 4th Quarter
No ratings yet
Stats and Probabilty Reviewer 4th Quarter
6 pages
Chapter9 Regression Multicollinearity
No ratings yet
Chapter9 Regression Multicollinearity
25 pages
Sinning in The Basement: What Are The Rules? The Ten Commandments of Applied Econometrics
No ratings yet
Sinning in The Basement: What Are The Rules? The Ten Commandments of Applied Econometrics
21 pages
SLR Solved Example
No ratings yet
SLR Solved Example
6 pages
Lab8 Hetero GLS and WLS
No ratings yet
Lab8 Hetero GLS and WLS
5 pages
The Glejser Test and The Median Regression: Marilena Furno
No ratings yet
The Glejser Test and The Median Regression: Marilena Furno
24 pages
Notes On The "Theoretical" Gravity Model of International Trade
No ratings yet
Notes On The "Theoretical" Gravity Model of International Trade
16 pages
Simple Classical Forecasting Methods:, ,, ,, and - ARIMA Models (Box-Jenkins Procedure)
No ratings yet
Simple Classical Forecasting Methods:, ,, ,, and - ARIMA Models (Box-Jenkins Procedure)
7 pages
PTCB Pharmacy Calculations Workbook: Master Alligations, Dilutions, IV Flow Rates, Dosages & Conversions with Over 350 Practice Questions with Detailed Explanations
From Everand
PTCB Pharmacy Calculations Workbook: Master Alligations, Dilutions, IV Flow Rates, Dosages & Conversions with Over 350 Practice Questions with Detailed Explanations
Stanley Lawrence Richardson
No ratings yet
Introductory Guide to Partial Differential Equations
From Everand
Introductory Guide to Partial Differential Equations
Sameer Kulkarni
No ratings yet
Gale Researcher Guide for: Econometric Models
From Everand
Gale Researcher Guide for: Econometric Models
Chupp
No ratings yet

Chapter 6 Econometrics

Uploaded by

Chapter 6 Econometrics

Uploaded by

Chapter 6

Model Specification: Choosing

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٤

• Before any equation can be estimated, it must be completely

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٥

• Two reasons why an important explanatory variable

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٦

• Suppose the true regression model is:

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٧

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨

Where ui is a classical error term. α1 can be expressed as a function of the correlation

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨

As an example of specification bias, let’s take a look at a simple model of the

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٨

• In theory, the solution to a problem of specification bias seems easy:

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٥٩

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٠

• Expected bias can be estimated with Equation 6.7:

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦١

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٢

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٣

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٤

• We can summarize the previous discussion into four criteria to help

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٥

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٧

• Dropping variables solely based on low t-statistics may lead to two

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٨

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٦٩

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٧٠

© 2011 Pearson Addison-Wesley. All rights reserved. 1-١٧١

You might also like