Econometrics Final Exam Study Guide PDF
Econometrics Final Exam Study Guide PDF
1. Assumptions
d) No perfect multicollinearity
percentage terms
change in Y
in Y
A. Interactions between two binary variables: Yi = B0 + B1D1 + B2D2 + B3D1D2 + ui
depend on D2
of X to depend on D
interaction
A. Internal Validity - The statistical inferences about causal effects are valid for the
population and setting studied to other populations and settings (where the
“setting” refers to the legal, policy, and physical environment and related salient
features)
a) A determinant of Y
2. Solutions:
b) If you have data on one or more controls and they are adequate
variables regression
2. Solutions
dependent variables)
C. Errors-in-variables bias
etc…
2. Solutions:
error)
1. Three cases: Data are missing at random, data are missing based on the
value of one or more X’s, and data are missing based in part on the value
of Y or u
selection” bias
a) Ex: low STR results in better test scores, but suppose districts with
political process also have a low STR; STR and u are correlated in
this case
2. Solutions:
interaction
*all of these threats imply that the mean for any given X does not equal 0 (conditional mean
1. Balanced Panel: no missing observations; all variables are observed for all
regression
II. Panel Data with Two Time Periods: Yit = B0 + B1Xit + B2Zi + uit
A. Z is a factor that does not change over time (at least during the years on which we
have data
1. Suppose Zi is not observed, so its omission could result in omitted variable
bias
III. Fixed Effects Regression (if T>2) : Yit = B0 + B1Xit + B2Zi + uit, i = 1,...,n, T = 1,...,T
OLS
usual
A. An omitted variable may vary over time but not across states
1. Let St denote the combined effect of variables which changes over time
c) B2 = 1 if t = year 2, = 0 otherwise
A. LS Assumptions for Panel Data: Yit = B1Xit + ai + uit, i = 1,...,n, t = 1,...,T
1. E(uit| Xi1,...,XiTai) = 0
a) uit has mean zero, given the entity fixed effect and the entire
2. (Xj1,..., XiT, ui1,...,uIT), i = 1,...,n, are i.i.d. draws from their joint
distribution
a) Does not require observations to be i.i.d over time for the same
entity
B. Advantages
C. Disadvantages
1. A LPM says that the change in the predicted probability for a given
change in X is the same for all values of X. but that doesn’t make sense;
probability models
B. Probit regression - models the probability that Y=1 using the cumulative standard
A. The R2 and R-bar2 don’t make sense here so two other specialized measures are
used:
1. The fraction correctly predicted = fraction of Y’s for which the predicted
simultaneous causality bias, and errors-in-variables bias; all three problems can
B. IV regression breaks X into two parts: a part that might be correlated with u, and a
1. This is done using an instrumental variable, Zi, which is correlated with Xi
with ui; we may not know the values but we have estimated
them
+pi-hat1Zi, i = 1,...,n
A. Terminology:
depends on the relation between the number of instruments (m) and the
B. Summary of Jargon:
control variables (included so that Zi is uncorrelated with ui, once the W’s
are included)
variables
C. IV Regression Assumptions
1. E(ui|Wi1,...,Wir) = 0
a) I.e. the exogenous regressors are exogenous
3. The X’s, W’s, Z’s, and Y have nonzero, finite 4th movements