Econometrics Slides
Econometrics Slides
Chapter 1
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
What is econometrics?
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Steps in econometric analysis
2) Econometric model
Economic models
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Economic model of crime (Becker (1968))
Derives equation for criminal activity based on utility maximization
Hours spent in
criminal activities
Age
„Wage“ of cri-
minal activities Probability of Expected
Wage for legal
Other Probability of conviction if sentence
employment
income getting caught caught
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Model of job training and worker productivity
What is effect of additional training on worker productivity?
Formal economic theory not really needed to derive equation:
Hourly wage
Years of formal
education Weeks spent
Years of work- in job training
force experience
Other factors may be relevant, but these are the most important (?)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Econometric model of criminal activity
The functional form has to be specified
Variables may have to be approximated by other quantities
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Econometric model of job training and worker productivity
Unobserved deter-
minants of the wage
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Econometric analysis requires data
Cross-sectional data
Panel/Longitudinal data
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Cross-sectional data sets
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Cross-sectional data set on wages and other characteristics
Indicator variables
(1=yes, 0=no)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Cross-sectional data on growth rates and country characteristics
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Time series data
Observations of a variable or several variables over time
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Time series data on minimum wages and related variables
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Pooled cross sections
Two or more cross sections are combined in one data set
Example:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Pooled cross sections on housing prices Property tax
Size of house
in square feet
Number of bathrooms
Before reform
After reform
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Panel or longitudinal data
The same cross-sectional units are followed over time
Example:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Two-year panel data on city crime statistics
Number of
police in 1986
Number of
police in 1990
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Causality and the notion of ceteris paribus
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Causal effect of fertilizer on crop yield
„By how much will the production of soybeans increase if one
increases the amount of fertilizer applied to the ground“
Implicit assumption: all other factors that influence crop yield such
as quality of land, rainfall, presence of parasites etc. are held fixed
Experiment:
Choose several one-acre plots of land; randomly assign different
amounts of fertilizer to the different plots; compare yields
Experiment works because amount of fertilizer applied is unrelated
to other factors influencing crop yields
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Measuring the return to education
„If a person is chosen from the population and given another
year of education, by how much will his or her wage increase? “
Implicit assumption: all other factors that influence wages such as
experience, family background, intelligence etc. are held fixed
Experiment:
Choose a group of people; randomly assign different amounts of
eduction to them (infeasable!); compare wage outcomes
Problem without random assignment: amount of education is related
to other factors that influence wages (e.g. intelligence)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Effect of law enforcement on city crime level
„If a city is randomly chosen and given ten additional police officers,
by how much would its crime rate fall? “
Alternatively: „If two cities are the same in all respects, except that
city A has ten more police officers, by how much would the two cities
crime rates differ?“
Experiment:
Randomly assign number of police officers to a large number of cities
In reality, number of police officers will be determined by crime rate
(simultaneous determination of crime and number of police)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Effect of the minimum wage on unemployment
„By how much (if at all) will unemployment increase if the minimum
wage is increased by a certain amount (holding other things fixed)? “
Experiment:
Government randomly chooses minimum wage each year and
observes unemployment outcomes
Experiment will work because level of minimum wage is unrelated
to other factors determining unemployment
In reality, the level of the minimum wage will depend on political
and economic factors that also influence unemployment
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Nature of Econometrics
and Economic Data
Testing predictions of economic theories
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Chapter 2
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Definition of the simple linear regression model
Dependent variable,
explained variable, Error term,
Independent variable, disturbance,
response variable,… explanatory variable, unobservables,…
regressor,…
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Interpretation of the simple linear regression model
as long as
By how much does the dependent Interpretation only correct if all other
variable change if the independent things remain equal when the indepen-
variable is increased by one unit? dent variable is increased by one unit
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Example: Soybean yield and fertilizer
Rainfall,
land quality,
presence of parasites, …
Measures the effect of fertilizer on
yield, holding all other factors fixed
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
When is there a causal interpretation?
Conditional mean independence assumption
e.g. intelligence …
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Population regression function (PFR)
The conditional mean independence assumption implies that
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
In order to estimate the regression model one needs data
First observation
Second observation
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Fit as good as possible a regression line through the data points:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
What does „as good as possible“ mean?
Regression residuals
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
CEO Salary and return on equity
Fitted regression
Intercept
If the return on equity increases by 1 percent,
then salary is predicted to change by 18,501 $
Causal interpretation?
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Wage and education
Fitted regression
Intercept
In the sample, one more year of education was
associated with an increase in hourly wage by 0.54 $
Causal interpretation?
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Voting outcomes and campaign expenditures (two parties)
Fitted regression
Intercept
If candidate A‘s share of spending increases by one
percentage point, he or she receives 0.464 percen-
Causal interpretation? tage points more of the total vote
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Properties of OLS on any sample of data
Fitted values and residuals
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Goodness-of-Fit
„How well does the explanatory variable explain the dependent variable?“
Measures of Variation
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Decomposition of total variation
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
CEO Salary and return on equity
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Incorporating nonlinearities: Semi-logarithmic form
Regression of log wages on years of eduction
… if years of education
are increased by one year
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Fitted regression
For example:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Incorporating nonlinearities: Log-logarithmic form
CEO salary and firm sales
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
CEO salary and firm sales: fitted regression
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Expected values and variances of the OLS estimators
The estimated regression coefficients are random variables
because they are calculated from a random sample
Data is random and depends on particular sample that has been drawn
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Standard assumptions for the linear regression model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Discussion of random sampling: Wage and education
The population consists, for example, of all workers of country A
In the population, a linear relationship between wages (or log wages)
and years of education holds
Draw completely randomly a worker from the population
The wage and the years of education of the worker drawn are random
because one does not know beforehand which worker is drawn
Throw back worker into population and repeat random draw times
The wages and years of education of the sampled workers are used to
estimate the linear relationship between wages and education
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Assumptions for the linear regression model (cont.)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Theorem 2.1 (Unbiasedness of OLS)
Interpretation of unbiasedness
The estimated coefficients may be smaller or larger, depending on
the sample that is the result of a random draw
However, on average, they will be equal to the values that charac-
terize the true relationship between y and x in the population
„On average“ means if sampling was repeated, i.e. if drawing the
random sample und doing the estimation was repeated many times
In a given sample, estimates may differ considerably from true values
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Variances of the OLS estimators
Depending on the sample, the estimates will be nearer or farther
away from the true population values
How far can we expect our estimates to be away from the true
population values on average (= sampling variability)?
Sampling variability is measured by the estimator‘s variances
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Graphical illustration of homoskedasticity
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
An example for heteroskedasticity: Wage and education
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Theorem 2.2 (Variances of OLS estimators)
Conclusion:
The sampling variability of the estimated regression coefficients will be
the higher the larger the variability of the unobserved factors, and the
lower, the higher the variation in the explanatory variable
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Estimating the error variance
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
The Simple
Regression Model
Theorem 2.3 (Unbiasedness of the error variance)
Plug in for
the unknown
The estimated standard deviations of the regression coefficients are called „standard
errors“. They measure how precisely the regression coefficients are estimated.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Chapter 3
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Definition of the multiple linear regression model
Dependent variable,
explained variable, Error term,
Independent variables, disturbance,
response variable,… explanatory variables, unobservables,…
regressors,…
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Motivation for multiple regression
Incorporate more explanatory factors into the model
Explicitly hold fixed other factors that otherwise would be in
Allow for more flexible functional forms
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Example: Average test scores and per student spending
Other factors
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Example: Family income and family consumption
Other factors
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Example: CEO salary, sales and CEO tenure
Log of CEO salary Log sales Quadratic function of CEO tenure with firm
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
OLS Estimation of the multiple regression model
Random sample
Regression residuals
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Interpretation of the multiple regression model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Example: Determinants of college GPA
Grade point average at college High school grade point average Achievement test score
Interpretation
Holding ACT fixed, another point on high school grade point average
is associated with another .453 points college grade point average
Or: If we compare two students with the same ACT, but the hsGPA of
student A is one point higher, we predict student A to have a colGPA
that is .453 higher than that of student B
Holding high school grade point average fixed, another 10 points on
ACT are associated with less than one point on college GPA
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
„Partialling out“ interpretation of multiple regression
One can show that the estimated coefficient of an explanatory
variable in a multiple regression can be obtained in two steps:
1) Regress the explanatory variable on all other explanatory variables
2) Regress on the residuals from this regression
Why does this procedure work?
The residuals from the first regression is the part of the explanatory
variable that is uncorrelated with the other explanatory variables
The slope coefficient of the second regression therefore represents
the isolated effect of the explanatory variable on the dep. variable
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Properties of OLS on any sample of data
Fitted values and residuals
Deviations from regression Correlations between deviations Sample averages of y and of the
line sum up to zero and regressors are zero regressors lie on regression line
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Goodness-of-Fit
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Example: Explaining arrest records
Number of times Proportion prior arrests Months in prison 1986 Quarters employed 1986
arrested 1986 that led to conviction
Interpretation:
Proportion prior arrests +0.5 ! -.075 = -7.5 arrests per 100 men
Months in prison +12 ! -.034(12) = -0.408 arrests for given man
Quarters employed +1 ! -.104 = -10.4 arrests per 100 men
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Example: Explaining arrest records (cont.)
An additional explanatory variable is added:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Standard assumptions for the multiple regression model (cont.)
Remarks on MLR.3
The assumption only rules out perfect collinearity/correlation bet-
ween explanatory variables; imperfect correlation is allowed
If an explanatory variable is a perfect linear combination of other
explanatory variables it is superfluous and may be eliminated
Constant variables are also ruled out (collinear with intercept)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Example for perfect collinearity: small sample
In a small sample, avginc may accidentally be an exact multiple of expend; it will not
be possible to disentangle their separate effects because there is exact covariation
Either shareA or shareB will have to be dropped from the regression because there
is an exact linear relationship between them: shareA + shareB = 1
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Standard assumptions for the multiple regression model (cont.)
Assumption MLR.4 (Zero conditional mean)
If avginc was not included in the regression, it would end up in the error term;
it would then be hard to defend that expend is uncorrelated with the error
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Discussion of the zero mean conditional assumption
Explanatory variables that are correlated with the error term are
called endogenous; endogeneity is a violation of assumption MLR.4
Explanatory variables that are uncorrelated with the error term are
called exogenous; MLR.4 holds if all explanat. var. are exogenous
Exogeneity is the key assumption for a causal interpretation of the
regression, and for unbiasedness of the OLS estimators
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Including irrelevant variables in a regression model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Omitted variable bias
If x1 and x2 are correlated, assume a linear
regression relationship between them
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Example: Omitting ability in a wage equation
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Omitted variable bias: more general cases
If exper is approximately uncorrelated with educ and abil, then the direction
of the omitted variable bias can be as analyzed in the simple two variable case.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Standard assumptions for the multiple regression model (cont.)
Assumption MLR.5 (Homoscedasticity)
with
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Theorem 3.2 (Sampling variances of OLS slope estimators)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Components of OLS Variances:
1) The error variance
A high error variance increases the sampling variance because there is
more „noise“ in the equation
A large error variance necessarily makes estimates imprecise
The error variance does not decrease with sample size
2) The total sample variation in the explanatory variable
More sample variation leads to more precise estimates
Total sample variation automatically increases with the sample size
Increasing the sample size is thus a way to get more precise estimates
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
3) Linear relationships among the independent variables
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
An example for multicollinearity
The different expenditure categories will be strongly correlated because if a school has a lot
of resources it will spend a lot on everything.
It will be hard to estimate the differential effects of different expenditure categories because
all expenditures are either high or low. For precise estimates of the differential effects, one
would need information about situations where expenditure categories change differentially.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Discussion of the multicollinearity problem
In the above example, it would probably be better to lump all expen-
diture categories together because effects cannot be disentangled
In other cases, dropping some independent variables may reduce
multicollinearity (but this may lead to omitted variable bias)
Only the sampling variance of the variables involved in multicollinearity
will be inflated; the estimates of other effects may be very precise
Note that multicollinearity is not a violation of MLR.3 in the strict sense
Multicollinearity may be detected through „variance inflation factors“
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Variances in misspecified models
The choice of whether to include a particular variable in a regression
can be made by analyzing the tradeoff between bias and variance
Estimated model 1
Estimated model 2
It might be the case that the likely omitted variable bias in the
misspecified model 2 is overcompensated by a smaller variance
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Variances in misspecified models (cont.)
Case 2: Trade off bias and variance; Caution: bias will not vanish even in large samples
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Estimating the error variance
An unbiased estimate of the error variance can be obtained by substracting the number of
estimated regression coefficients from the number of observations. The number of obser-
vations minus the number of estimated parameters is also called the degrees of freedom.
The n estimated squared residuals in the sum are not completely independent but related
through the k+1 equations that define the first order conditions of the minimization problem.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Estimation of the sampling variances of the OLS estimators
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Efficiency of OLS: The Gauss-Markov Theorem
Under assumptions MLR.1 - MLR.5, OLS is unbiased
However, under these assumptions there may be many other
estimators that are unbiased
Which one is the unbiased estimator with the smallest variance?
In order to answer this question one usually limits oneself to linear
estimators, i.e. estimators linear in the dependent variable
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Estimation
Theorem 3.4 (Gauss-Markov Theorem)
Under assumptions MLR.1 - MLR.5, the OLS estimators are the best
linear unbiased estimators (BLUEs) of the regression coefficients, i.e.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Chapter 4
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Statistical inference in the regression model
Hypothesis tests about population parameters
Construction of confidence intervals
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Assumption MLR.6 (Normality of error terms)
independently of
It follows that:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Discussion of the normality assumption
The error term is the sum of „many“ different unobserved factors
Sums of independent factors are normally distributed (CLT)
Problems:
• How many different factors? Number large enough?
• Possibly very heterogenuous distributions of individual factors
• How independent are the different factors?
The normality of the error term is an empirical question
At least the error distribution should be „close“ to normal
In many cases, normality is questionable or impossible by definition
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Discussion of the normality assumption (cont.)
Examples where normality cannot hold:
• Wages (nonnegative; also: minimum wage)
• Number of arrests (takes on a small number of integer values)
• Unemployment (indicator variable, takes on only 1 or 0)
In some cases, normality can be achieved through transformations
of the dependent variable (e.g. use log(wage) instead of wage)
Under normality, OLS is the best (even nonlinear) unbiased estimator
Important: For the purposes of statistical inference, the assumption
of normality can be replaced by a large sample size
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Terminology
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Testing hypotheses about a single population parameter
Theorem 4.1 (t-distribution for standardized estimators)
Note: The t-distribution is close to the standard normal distribution if n-k-1 is large.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
t-statistic (or t-ratio)
The t-statistic will be used to test the above null hypothesis.
The farther the estimated coefficient is away from zero, the
less likely it is that the null hypothesis holds true. But what
does „far“ away from zero mean?
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Testing against one-sided alternatives (greater than zero)
Test against .
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Example: Wage equation
Test whether, after controlling for education and tenure, higher work
experience leads to higher hourly wages
Standard errors
Test against .
One would either expect a positive effect of experience on hourly wage or no effect at all.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Example: Wage equation (cont.)
t-statistic
Degrees of freedom;
here the standard normal
approximation applies
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Testing against one-sided alternatives (less than zero)
Test against .
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Example: Student performance and school size
Test whether smaller school size leads to better student performance
Percentage of students Average annual tea- Staff per one thou- School enrollment
passing maths test cher compensation sand students (= school size)
Test against .
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Example: Student performance and school size (cont.)
t-statistic
Degrees of freedom;
here the standard normal
approximation applies
One cannot reject the hypothesis that there is no effect of school size on
student performance (not even for a lax significance level of 15%).
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Example: Student performance and school size (cont.)
Alternative specification of functional form:
Test against .
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Example: Student performance and school size (cont.)
t-statistic
(small effect)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Testing against two-sided alternatives
Test against .
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Example: Determinants of college GPA Lectures missed per week
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
„Statistically significant“ variables in a regression
If a regression coefficient is different from zero in a two-sided test, the
corresponding variable is said to be „statistically significant“
If the number of degrees of freedom is large enough so that the nor-
mal approximation applies, the following rules of thumb apply:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Guidelines for discussing economic and statistical significance
If a variable is statistically significant, discuss the magnitude of the
coefficient to get an idea of its economic or practical importance
The fact that a coefficient is statistically significant does not necessa-
rily mean it is economically or practically significant!
If a variable is statistically and economically important but has the
„wrong“ sign, the regression model might be misspecified
If a variable is statistically insignificant at the usual levels (10%, 5%,
1%), one may think of dropping it from the regression
If the sample size is small, effects might be imprecisely estimated so
that the case for dropping insignificant variables is less strong
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Testing more general hypotheses about a regression coefficient
Null hypothesis
Hypothesized value of the coefficient
t-statistic
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Example: Campus crime and enrollment
An interesting hypothesis is whether crime increases by one percent
if enrollment is increased by one percent
The hypothesis is
rejected at the 5%
level
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Computing p-values for t-tests
If the significance level is made smaller and smaller, there will be a
point where the null hypothesis cannot be rejected anymore
The reason is that, by lowering the significance level, one wants to
avoid more and more to make the error of rejecting a correct H0
The smallest significance level at which the null hypothesis is still
rejected, is called the p-value of the hypothesis test
A small p-value is evidence against the null hypothesis because one
would reject the null hypothesis even at small significance levels
A large p-value is evidence in favor of the null hypothesis
P-values are more informative than tests at fixed significance levels
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
How the p-value is computed (here: two-sided test)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Critical value of
Confidence intervals two-sided test
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Confidence intervals for typical confidence levels
reject in favor of
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Example: Model of firms‘ R&D expenditures
The effect of sales on R&D is relatively precisely estimated This effect is imprecisely estimated as the in-
as the interval is narrow. Moreover, the effect is significantly terval is very wide. It is not even statistically
different from zero because zero is outside the interval. significant because zero lies in the interval.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Testing hypotheses about a linear combination of parameters
Example: Return to education at 2 year vs. at 4 year colleges
Years of education Years of education
at 2 year colleges at 4 year colleges
Test against .
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Impossible to compute with standard regression output because
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Total years of college
Estimation results
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Testing multiple linear restrictions: The F-test
Testing exclusion restrictions
Salary of major lea- Years in Average number of
gue base ball player the league games per year
Batting average Home runs per year Runs batted in per year
against
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Estimation of the unrestricted model
Idea: How would the model fit be if these variables were dropped from the regression?
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Estimation of the restricted model
The sum of squared residuals necessarily increases, but is the increase statistically significant?
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Rejection rule (Figure 4.7)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Test decision in example Number of restrictions to be tested
Degrees of freedom in
the unrestricted model
Discussion
The three variables are „jointly significant“
They were not significant when tested individually
The likely reason is multicollinearity between them
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Test of overall significance of a regression
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Testing general linear restrictions with the F-test
Example: Test whether house price assessments are rational
The assessed housing value Size of lot
Actual house price
(before the house was sold) (in feet)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Unrestricted regression
Test statistic
cannot be rejected
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Inference
Regression output for the unrestricted regression
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: OLS Asymptotics
Chapter 5
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: OLS Asymptotics
So far we focused on properties of OLS that hold for any sample
Properties of OLS that hold for any sample/sample size
Expected values/unbiasedness under MLR.1 – MLR.4
Variance formulas under MLR.1 – MLR.5
Gauss-Markov Theorem under MLR.1 – MLR.5
Exact sampling distributions/tests under MLR.1 – MLR.6
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: OLS Asymptotics
Consistency
Alternative notation:
The estimate converges in proba-
bility to the true population value
Interpretation:
Consistency means that the probability that the estimate is arbitrari-
ly close to the true population value can be made arbitrarily high by
increasing the sample size
Consistency is a minimum requirement for sensible estimators
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: OLS Asymptotics
Theorem 5.1 (Consistency of OLS)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: OLS Asymptotics
For consistency of OLS, only the weaker MLR.4‘ is needed
Asymptotic analog of omitted variable bias
True model
Misspecified
model
Bias
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: OLS Asymptotics
Asymptotic normality and large sample inference
In practice, the normality assumption MLR.6 is often questionable
If MLR.6 does not hold, the results of t- or F-tests may be wrong
Fortunately, F- and t-tests still work if the sample size is large enough
Also, OLS estimates are normal in large samples even without MLR.6
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: OLS Asymptotics
Practical consequences
In large samples, the t-distribution is close to the N(0,1) distribution
As a consequence, t-tests are valid in large samples without MLR.6
The same is true for confidence intervals and F-tests
Important: MLR.1 – MLR.5 are still necessary, esp. homoscedasticity
Converges to
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: OLS Asymptotics
Asymptotic analysis of the OLS sampling errors (cont.)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression
Analysis: Further Issues
Chapter 6
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
More on Functional Form
More on using logarithmic functional forms
Convenient percentage/elasticity interpretation
Slope coefficients of logged variables are invariant to rescalings
Taking logs often eliminates/mitigates problems with outliers
Taking logs often helps to secure normality and homoscedasticity
Variables measured in units such as years should not be logged
Variables measured in percentage points should also not be logged
Logs must not be used if variables take on zero or negative values
It is hard to reverse the log-operation when constructing predictions
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Using quadratic functional forms
Example: Wage equation Concave experience profile
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Wage maximum with respect to work experience
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues Nitrogen oxide in air, distance from em-
ployment centers, student/teacher ratio
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Calculation of the turnaround point
Turnaround point:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Other possibilities
Higher polynomials
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Models with interaction terms
Interaction term
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Reparametrization of interaction effects Population means; may be
replaced by sample means
Advantages of reparametrization
Easy interpretation of all parameters
Standard errors for partial effects at the mean values available
If necessary, interaction may be centered at other interesting values
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
More on goodness-of-fit and selection of regressors
General remarks on R-squared
A high R-squared does not imply that there is a causal interpretation
A low R-squared does not preclude precise estimation of partial effects
Adjusted R-squared
What is the ordinary R-squared supposed to measure?
is an estimate for
Population R-squared
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Correct degrees of freedom of
Adjusted R-squared (cont.) nominator and denominator
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Using adjusted R-squared to choose between nonnested models
Models are nonnested if neither model is a special case of the other
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Comparing models with different dependent variables
R-squared or adjusted R-squared must not be used to compare models
which differ in their definition of the dependent variable
Example: CEO compensation and firm performance
There is much
less variation
in log(salary)
that needs to
be explained
than in salary
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Controlling for too many factors in regression analysis
In some cases, certain variables should not be held fixed
In a regression of traffic fatalities on state beer taxes (and other
factors) one should not directly control for beer consumption
In a regression of family health expenditures on pesticide usage
among farmers one should not control for doctor visits
Different regressions may serve different purposes
In a regression of house prices on house characteristics, one would
only include price assessments if the purpose of the regression is to
study their validity; otherwise one would not include them
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Adding regressors to reduce the error variance
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Predicting y when log(y) is the dependent variable
Prediction for y
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Further Issues
Comparing R-squared of a logged and an unlogged specification
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis
with Qualitative Information
Chapter 7
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Qualitative Information
Examples: gender, race, industry, region, rating grade, …
A way to incorporate qualitative information is to use dummy variables
They may appear as the dependent or as independent variables
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Graphical Illustration
Intercept shift
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
This model cannot be estimated (perfect collinearity)
Dummy variable trap
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Estimated wage equation with intercept shift
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Comparing means of subpopulations described by dummies
Discussion
It can easily be tested whether difference in means is significant
The wage difference between men and women is larger if no other
things are controlled for; i.e. part of the difference is due to differ-
ences in education, experience and tenure between men and women
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Further example: Effects of training grants on hours of training
Hours training per employee Dummy indicating whether firm received training grant
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Using dummy explanatory variables in equations for log(y)
Dummy indicating
whether house is of
colonial style
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Using dummy variables for multiple categories
1) Define membership in each category by a dummy variable
2) Leave out one category (which becomes the base category)
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Incorporating ordinal information using dummy variables
Example: City credit ratings and municipal bond interest rates
This specification would probably not be appropriate as the credit rating only contains
ordinal information. A better way to incorporate this information is to define dummies:
Dummies indicating whether the particular rating applies, e.g. CR1=1 if CR=1 and CR1=0
otherwise. All effects are measured in comparison to the worst rating (= base category).
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Interactions involving dummy variables Interaction term
Allowing for different slopes
Interesting hypotheses
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Graphical illustration
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Estimated wage equation with interaction term
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Testing for differences in regression functions across groups
Unrestricted model (contains full set of interactions)
College grade point average Standardized aptitude test score High school rank percentile
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Null hypothesis All interaction effects are zero, i.e.
the same regression coefficients
apply to men and women
Tested individually,
the hypothesis that
the interaction effects
are zero cannot be
rejected
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Null hypothesis is rejected
Joint test with F-statistic
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
A Binary dependent variable: the linear probability model
Linear regression when the dependent variable is binary
Linear probability
model (LPM)
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Example: Labor force participation of married women
=1 if in labor force, =0 otherwise Non-wife income (in thousand dollars per year)
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Example: Female labor participation of married women (cont.)
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Disadvantages of the linear probability model
Predicted probabilities may be larger than one or smaller than zero
Marginal probability effects sometimes logically impossible
The linear probability model is necessarily heteroskedastic
Variance of Ber-
noulli variable
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
More on policy analysis and program evaluation
Example: Effect of job training grants on worker productivity
No apparent effect of
grant on productivity
Treatment group: grant reveivers, Control group: firms that received no grant
Grants were given on a first-come, first-served basis. This is not the same as giving them out
randomly. It might be the case that firms with less productive workers saw an opportunity to
improve productivity and applied first.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Self-selection into treatment as a source for endogeneity
In the given and in related examples, the treatment status is probably
related to other characteristics that also influence the outcome
The reason is that subjects self-select themselves into treatment
depending on their individual characteristics and prospects
Experimental evaluation
In experiments, assignment to treatment is random
In this case, causal effects can be inferred using a simple regression
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Qualitative Information
Further example of an endogenuous dummy regressor
Are nonwhite customers discriminated against?
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Heteroscedasticity
Chapter 8
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Consequences of heteroscedasticity for OLS
The usual F-tests and t-tests are not valid under heteroscedasticity
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Heteroscedasticity-robust inference after OLS
Formulas for OLS standard errors and related statistics have been
developed that are robust to heteroscedasticity of unknown form
All formulas are only valid in large samples
Formula for heteroscedasticity-robust OLS standard error
Also called White/Eicker standard errors. They involve
the squared residuals from the regression and from a
regression of xj on all other explanatory variables.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Example: Hourly wage equation
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Testing for heteroscedasticity
It may still be interesting whether there is heteroscedasticity because
then OLS may not be the most efficient linear estimator anymore
Under MLR.4
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Breusch-Pagan test for heteroscedasticity (cont.)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Example: Heteroscedasticity in housing price equations
Heteroscedasticity
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Regress squared residuals on all expla-
White test for heteroscedasticity natory variables, their squares, and in-
teractions (here: example for k=3)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Alternative form of the White test
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Weighted least squares estimation
Heteroscedasticity is known up to a multiplicative constant
Transformed model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Example: Savings and income
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
OLS in the transformed model is weighted least squares (WLS)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Example: Financial wealth equation
Net financial wealth
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Important special case of heteroscedasticity
If the observations are reported as averages at the city/county/state/-
country/firm level, they should be weighted by the size of the unit
If errors are homoscedastic at the employee level, WLS with weights equal to firm size mi should
be used. If the assumption of homoscedasticity at the employee level is not exactly right, one can
calculate robust standard errors after WLS (i.e. for the transformed model).
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Unknown heteroscedasticity function (feasible GLS)
Assumed general form
of heteroscedasticity;
exp-function is used to
ensure positivity
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Example: Demand for cigarettes
Smoking
Estimation by OLS restrictions in
restaurants
Cigarettes smoked per day Logged income and cigarette price
Reject homo-
scedasticity
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
Estimation by FGLS Now statistically significant
Discussion
The income elasticity is now statistically significant; other coefficients
are also more precisely estimated (without changing qualit. results)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
What if the assumed heteroscedasticity function is wrong?
If the heteroscedasticity function is misspecified, WLS is still consistent
under MLR.1 – MLR.4, but robust standard errors should be computed
WLS is consistent under MLR.4 but not necessarily under MLR.4‘
If OLS and WLS produce very different estimates, this typically indi-
cates that some other assumptions (e.g. MLR.4) are wrong
If there is strong heteroscedasticity, it is still often better to use a
wrong form of heteroscedasticity in order to increase efficiency
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Heteroscedasticity
WLS in the linear probability model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
More on Specification and
Data Issues
Chapter 9
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Tests for functional form misspecification
One can always test whether explanatory should appear as squares or
higher order terms by testing whether such terms can be excluded
Otherwise, one can use general specification tests such as RESET
Test for the exclusion of these terms. If they cannot be exluded, this is evidence for
omitted higher order terms and interactions, i.e. for misspecification of functional form.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Example: Housing price equation
Evidence for
misspecification
Model 2:
Define a general model that contains both models as subcases and test:
Discussion
Can always be done; however, a clear winner need not emerge
Cannot be used if the models differ in their definition of the dep. var.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Using proxy variables for unobserved explanatory variables
Example: Omitted ability in a wage equation Replace by proxy
In general, the estimates for the returns to education and experience will be biased because
one has omit the unobservable ability variable. Idea: find a proxy variable for ability which is
able to control for ability differences between individuals so that the coefficients of the other
variables will not be biased. A possible proxy for ability is the IQ score or similar test scores.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Assumptions necessary for the proxy variable method to work
The proxy is „just a proxy“ for the omitted variable, it does not belong
into the population regression, i.e. it is uncorrelated with its error
If the error and the proxy were correlated, the proxy
would actually have to be included in the population
regression function
The proxy variable is a „good“ proxy for the omitted variable, i.e. using
other variables in addition will not help to predict the omitted variable
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Under these assumptions, the proxy variable method works:
In this regression model, the error term is uncorrelated with all explanatory variables. As a
consequence, all coefficients will be correctly estimated using OLS. The coefficents for the
explanatory variables x1 and x2 will be correctly identified. The coefficient for the proxy va-
riable may also be of interest (it is a multiple of the coefficient of the omitted variable).
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Using lagged dependent variables as proxy variables
In many cases, omitted unobserved factors may be proxied by the
value of the dependent variable from an earlier time period
Including the past crime rate will at least partly control for the many
omitted factors that also determine the crime rate in a given year
Another way to interpret this equation is that one compares cities
which had the same crime rate last year; this avoids comparing cities
that differ very much in unobserved crime factors
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Models with random slopes (= random coefficient models)
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Properties of OLS under measurement error
Measurement error in the dependent variable
Population regression
Estimated regression
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Measurement error in an explanatory variable
Mismeasured value = True value + Measurement error
Population regression
Estimated regression
Error unrelated
Classical errors-in-variables assumption: to true value
The mismeasured
variable x1 is cor-
related with the
error term!
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Consequences of measurement error in an explanatory variable
Under the classical errors-in-variables assumption, OLS is biased and
inconsistent because the mismeasured variable is endogenous
One can show that the inconsistency is of the following form:
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Missing data and nonrandom samples
Missing data as sample selection
Missing data is a special case of sample selection (= nonrandom samp-
ling) as the observations with missing information cannot be used
If the sample selection is based on independent variables there is no
problem as a regression conditions on the independent variables
In general, sample selection is no problem if it is uncorrelated with the
error term of a regression (= exogenous sample selection)
Sample selection is a problem, if it is based on the dependent variable
or on the error term (= endogenous sample selection)
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Example for exogenous sample selection
If the sample was nonrandom in the way that certain age groups, income groups, or household sizes
were over- or undersampled, this is not a problem for the regression because it examines the savings
for subgroups defined by income, age, and hh-size. The distribution of subgroups does not matter.
If the sample is nonrandom in the way individuals refuse to take part in the sample survey if their
wealth is particularly high or low, this will bias the regression results because these individuals may
be systematically different from those who do not refuse to take part in the sample survey.
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Outliers and influential observations
Extreme values and outliers may be a particular problem for OLS
because the method is based on squaring deviations
If outliers are the result of mistakes that occured when keying in the
data, one should just discard the affected observations
If outliers are the result of the data generating process, the decision
whether to discard the outliers is not so easy
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Example: R&D intensity and firm size (cont.)
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Multiple Regression Analysis:
Specification and Data Issues
Least absolute deviations estimation (LAD)
The least absolute deviations estimator minimizes the sum of absolute
deviations (instead of the sum of squared deviations, i.e. OLS)
© 2012 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Basic Regression Analysis with
Time Series Data
Chapter 10
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
The nature of time series data
Temporal ordering of observations; may not be arbitrarily reordered
Typical features: serial correlation/nonindependence of observations
How should we think about the randomness in time series data?
• The outcome of economic variables (e.g. GNP, Dow Jones) is
uncertain; they should therefore be modeled as random variables
• Time series are sequences of r.v. (= stochastic processes)
• Randomness does not come from sampling from a population
• „Sample“ = the one realized path of the time series out of the
many possible paths the stochastic process could have taken
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Example: US inflation and unemployment rates 1948-2003
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Examples of time series regression models
Static models
In static time series models, the current value of one variable is
modeled as the result of the current values of explanatory variables
The current murderrate is determined by the current conviction rate, unemployment rate,
and fraction of young males in the population.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Finite distributed lag models
In finite distributed lag models, the explanatory variables are allowed
to influence the dependent variable with a time lag
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Interpretation of the effects in finite distributed lag models
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Graphical illustration of lagged effects
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Finite sample properties of OLS under classical assumptions
The time series involved obey a linear relationship. The stochastic processes yt, xt1,…,
xtk are observed, the error process ut is unobserved. The definition of the explanatory
variables is general, e.g. they may be lags or functions of other explanatory variables.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Notation
This matrix collects all the
information on the complete time
paths of all explanatory variables
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Discussion of assumption TS.3
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Theorem 10.1 (Unbiasedness of OLS)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Assumption TS.5 (No serial correlation)
Conditional on the explanatory variables, the un-
observed factors must not be correlated over time
The conditioning on the values of the explanatory variables is not easy to understand. It effectively
means that, in a finite sample, one ignores the sampling variability coming from the randomness of
the regressors. This kind of sampling variability will normally not be large (because of the sums).
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Theorem 10.4 (Gauss-Markov Theorem)
Under assumptions TS.1 – TS.5, the OLS estimators have the minimal
variance of all linear unbiased estimators of the regression coefficients
This holds conditional as well as unconditional on the regressors
independently of
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Example: Static Phillips curve
Contrary to theory, the estimated Phillips
Curve does not suggest a tradeoff between
inflation and unemployment
TS.1:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Discussion of CLM assumptions (cont.)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Example: Effects of inflation and deficits on interest rates
TS.1:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Discussion of CLM assumptions (cont.)
Easily violated
TS.3:
For example, past deficit spending may boost economic
activity, which in turn may lead to general interest rate rises
For example, unobserved demand shocks may increase
interest rates and lead to higher inflation in future periods
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Using dummy explanatory variables in time series
Children born per Tax exemption Dummy for World War Dummy for availabity of con-
1,000 women in year t in year t II years (1941-45) traceptive pill (1963-present)
Interpretation
During World War II, the fertility rate was temporarily lower
It has been permanently lower since the introduction of the pill in 1963
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Time series with trends
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Modelling a linear time trend
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Example for a time series with an exponential trend
Abstracting from
random deviations,
the time series has a
constant growth rate
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Using trending variables in regression analysis
If trending variables are regressed on each other, a spurious re-
lationship may arise if the variables are driven by a common trend
In this case, it is important to include a trend in the regression
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Example: Housing investment and prices (cont.)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
A Detrending interpretation of regressions with a time trend
It turns out that the OLS coefficients in a regression including a trend
are the same as the coefficients in a regression without a trend but
where all the variables have been detrended before the regression
This follows from the general interpretation of multiple regressions
Computing R-squared when the dependent variable is trending
Due to the trend, the variance of the dep. var. will be overstated
It is better to first detrend the dep. var. and then run the regression
on all the indep. variables (plus a trend if they are trending as well)
The R-squared of this regression is a more adequate measure of fit
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Basic Regression Analysis
Modelling seasonality in time series
A simple method is to include a set of seasonal dummies:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Further Issues Using OLS with
Time Series Data
Chapter 11
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
The assumptions used so far seem to be too restricitive
Strict exogeneity, homoscedasticity, and no serial correlation are very
demanding requirements, especially in the time series context
Statistical inference rests on the validity of the normality assumption
Much weaker assumptions are needed if the sample size is large
A key requirement for large sample analysis of time series is that
the time series in question are stationary and weakly dependent
Stationary time series
Loosely speaking, a time series is stationary if its stochastic properties
and its temporal dependence structure do not change over time
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Stationary stochastic processes
A stochastic process is stationary, if for every
collection of indices the joint distribution of
, is the same as that of
for all integers .
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Weakly dependent time series
A stochastic process is weakly dependent , if
is „almost independent“ of if grows to infinity (for all ).
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Examples for weakly dependent time series
Moving average process of order one (MA(1))
The process is weakly dependent because observations that are more than one
time period apart have nothing in common and are therefore uncorrelated.
If the stability condition holds, the process is weakly dependent because serial
correlation converges to zero as the distance between observations grows to infinity.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Asymptotic properties of OLS
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Theorem 11.1 (Consistency of OLS)
Important note: For consistency it would even suffice to assume that the explanatory
variables are merely contemporaneously uncorrelated with the error term.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Why do lagged dependent variables violate strict exogeneity?
Contemporanous exogeneity:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Assumption TS.4‘ (Homoscedasticity)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Example: Efficient Markets Hypothesis (EMH)
The EMH in a strict form states that information observable to the market prior to week t should
not help to predict the return during week t. A simplification assumes in addition that only past
returns are considered as relevant information to predict the return in week t.This implies that
A simple way to test the EMH is to specify an AR(1) model. Under the EMH assumption,TS.3‘ holds
so that an OLS regression can be used to test whether this week‘s returns depend on last week‘s.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Using trend-stationary series in regression analysis
Time series with deterministic time trends are nonstationary
If they are stationary around the trend and in addition weakly
dependent, they are called trend-stationary processes
Trend-stationary processes also satisfy assumption TS.1‘
Using highly persistent time series in regression analysis
Unfortunately many economic time series violate weak dependence
because they are highly persistent (= strongly dependent)
In this case OLS methods are generally invalid (unless the CLM hold)
In some cases transformations to weak dependence are possible
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Random walks
The random walk is called random walk because it wanders
from the previous position yt-1 by an i.i.d. random amount et
The value today is the accumulation of all past shocks plus an initial value. This is the reason why
the random walk is highly persistent: The effect of a shock will be contained in the series forever.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Examples for random walk realizations
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Three-month T-bill rate as a possible example for a random walk
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Random walks with drift
In addition to the usual random walk mechanism, there is
a deterministic increase/decrease (= drift) in each period
This leads to a linear time trend around which the series follows its random walk behaviour. As there
is no clear direction in which the random walk develops, it may also wander away from the trend.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Sample path of a random walk with drift
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Transformations on highly persistent time series
Order of integration
Weakly dependent time series are integrated of order zero (= I(0))
If a time series has to be differenced one time in order to obtain a
weakly dependent series, it is called integrated of order one (= I(1))
Examples for I(1) processes
After differencing, the
resulting series are weakly
dependent (because et is
weakly dependent).
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Deciding whether a time series is I(1)
There are statistical tests for testing whether a time series is I(1)
(= unit root tests); these will be covered in later chapters
Alternatively, look at the sample first order autocorrelation:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Example: Fertility equation
This equation could be estimated by OLS if the CLM assumptions hold. These may be questionable,
so that one would have to resort to large sample analysis. For large sample analysis, the fertility
series and the series of the personal tax exemption have to be stationary and weakly dependent.
This is questionable because the two series are highly persistent:
It is therefore better to estimate the equation in first differences. This makes sense because if the
equation holds in levels, it also has to hold in first differences:
Estimate of
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Include trend because both
Example: Wages and productivity series display clear trends.
It turns out that even after detrending, both series display sample autocorrelations
close to one so that estimating the equation in first differences seems more adequate:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Dynamically complete models
A model is said to be dynamically complete if enough lagged variab-
les have been included as explanatory variables so that further lags
do not help to explain the dependent variable:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Further Issues Using OLS
Sequential exogeneity
A set of explanatory variables is said to be sequentially exogenous if
„enough“ lagged explanatory variables have been included:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Serial Correlation and
Heteroscedasticity in
Time Series Regressions
Chapter 12
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Properties of OLS with serially correlated errors
OLS still unbiased and consistent if errors are serially correlated
Correctness of R-squared also does not depend on serial correlation
OLS standard errors and tests will be invalid if there is serial correlation
OLS will not be efficient anymore if there is serial correlation
Serial correlation and the presence of lagged dependent variables
Is OLS inconsistent if there are ser. corr. and lagged dep. variables?
No: Including enough lags so that TS.3‘ holds guarantees consistency
Including too few lags will cause an omitted variable problem and serial
correlation because some lagged dep. var. end up in the error term
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Testing for serial correlation
Testing for AR(1) serial correlation with strictly exog. regressors
Test in
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Durbin-Watson test under classical assumptions
Under assumptions TS.1 – TS.6, the Durbin-Watson test is an exact
test (whereas the previous t-test is only valid asymptotically).
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Testing for AR(1) serial correlation with general regressors
The t-test for autocorrelation can be easily generalized to allow for the
possibility that the explanatory variables are not strictly exogenous:
Test
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Correcting for serial correlation with strictly exog. regressors
Under the assumption of AR(1) errors, one can transform the model
so that it satisfies all GM-assumptions. For this model, OLS is BLUE.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Correcting for serial correlation (cont.)
Replacing the unknown by leads to a FGLS-estimator
There are two variants:
• Cochrane-Orcutt estimation omits the first observation
• Prais-Winsten estimation adds a transformed first observation
In smaller samples, Prais-Winsten estimation should be more efficient
Comparing OLS and FGLS with autocorrelation
For consistency of FGLS more than TS.3‘ is needed (e.g. TS.3) because
the transformed regressors include variables from different periods
If OLS and FGLS differ dramatically this might indicate violation of TS.3
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Serial correlation-robust inference after OLS
In the presence of serial correlation, OLS standard errors overstate
statistical significance because there is less independent variation
One can compute serial correlation-robust std. errors after OLS
This is useful because FGLS requires strict exogeneity and assumes a
very specific form of serial correlation (AR(1) or, generally, AR(q))
Serial correlation-robust standard errors:
The usual OLS standard errors are
normalized and then „inflated“ by
a correction factor.
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Correction factor for serial correlation (Newey-West formula)
g=3:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Discussion of serial correlation-robust standard errors
The formulas are also robust to heteroscedasticity; they are therefore
called „heteroscedasticity and autocorrelation consistent“ (=HAC)
For the integer g, values such as g=2 or g=3 are normally sufficient
(there are more involved rules of thumb for how to choose g)
Serial correlation-robust standard errors are only valid asymptotically;
they may be severely biased if the sample size is not large enough
The bias is the higher the more autocorrelation there is; if the series
are highly correlated, it might be a good idea to difference them first
Serial correlation-robust errors should be used if there is serial corr.
and strict exogeneity fails (e.g. in the presence of lagged dep. var.)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Heteroscedasticity in time series regressions
Heteroscedasticity usually receives less attention than serial correlation
Heteroscedasticity-robust standard errors also work for time series
Heteroscedasticity is automatically corrected for if one uses the serial
correlation-robust formulas for standard errors and test statistics
Testing for heteroscedasticity
The usual heteroscedasticity tests assume absence of serial correlation
Before testing for heteroscedasticity one should therefore test for serial
correlation first, using a heteroscedasticity-robust test if necessary
After ser. corr. has been corrected for, test for heteroscedasticity
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Example: Serial correlation and homoscedasticity in the EMH
Test equation for the EMH
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Autoregressive Conditional Heteroscedasticity (ARCH)
Even if there is no heteroscedasticity in the usual sense (the error variance depends
on the explanatory variables), there may be heteroscedasticity in the sense that the
variance depends on how volatile the time series was in previous periods:
ARCH(1) model
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Consequences of ARCH in dynamic models
In dynamic models, i.e. models including lagged dependent variables,
the homoscedasticity assumption TS.4 will necessarily be violated:
because
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
Example: Testing for ARCH-effects in stock returns
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Analyzing Time Series:
Serial Correl. and Heterosced.
A FGLS procedure for serial correlation and heteroscedasticity
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Pooling Cross Sections across
Time: Simple Panel Data Methods
Chapter 13
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Pooled Cross Sections and
Simple Panel Data Methods
Policy analysis with pooled cross sections
Two or more independently sampled cross sections can be used to
evaluate the impact of a certain event or policy change
Example: Effect of new garbage incinerator on housing prices
Examine the effect of the location of a house on its price before and
after the garbage incinerator was built:
After incinerator
was built
Before incinerator
was built
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Pooled Cross Sections and
Simple Panel Data Methods
Example: Garbage incinerator and housing prices (cont.)
It would be wrong to conclude from the regression after the incinerator
is there that being near the incinerator depresses prices so strongly
One has to compare with the situation before the incinerator was built:
Differential effect of being in the location and after the incinerator was built
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Pooled Cross Sections and
Simple Panel Data Methods
Policy evaluation using difference-in-differences
Compare the difference in outcomes of the units that are affected by the policy change (= treatment
group) and those who are not affected (= control group) before and after the policy was enacted.
For example, the level of unemployment benefits is cut but only for group A (= treatment group).
Group A normally has longer unemployment durations than group B (= control group). If the diffe-
rence in unemployment durations between group A and group B becomes smaller after the reform,
reducing unemployment benefits reduces unemployment duration for those affected.
Caution: Difference-in-differences only works if the difference in outcomes between the two groups
is not changed by other factors than the policy change (e.g. there must be no differential trends).
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Pooled Cross Sections and
Simple Panel Data Methods
Two-period panel data analysis
Example: Effect of unemployment on city crime rate
Assume that no other explanatory variables are available. Will it be
possible to estimate the causal effect of unemployment on crime?
Yes, if cities are observed for at least two periods and other factors
affecting crime stay approximately constant over those periods:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Pooled Cross Sections and
Simple Panel Data Methods
Example: Effect of unemployment on city crime rate (cont.)
Subtract:
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.
Pooled Cross Sections and
Simple Panel Data Methods
Discussion of first-differenced panel estimator
Further explanatory variables may be included in original equation
Note that there may be arbitrary correlation between the unobserved
time-invariant characteristics and the included explanatory variables
OLS in the original equation would therefore be inconsistent
The first-differenced panel estimator is thus a way to consistently
estimate causal effects in the presence of time-invariant endogeneity
For consistency, strict exogeneity has to hold in the original equation
First-differenced estimates will be imprecise if explanatory variables
vary only little over time (no estimate possible if time-invariant)
© 2013 Cengage Learning. All Rights Reserved. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part.