Lecture 3 Differences in Differences
Fabian Waldinger
Topics Covered in Lecture
Waldinger (Warwick) 2 / 47
Differences-in-Differences: Card & Krueger (1994)
They surveyed about 400 fast food stores in both New Jersey (NJ) and
Pennsylvania (PA), before and after the minimum wage increase in NJ.
Differences-in-Differences Strategy
DD is a version of fixed effects estimation. To see this more formally:
$Y_{1ist}$: employment at restaurant $i$, state $s$, time $t$ with a high $w^{min}$.
$Y_{0ist}$: employment at restaurant $i$, state $s$, time $t$ with a low $w^{min}$.
In practice of course we only see one or the other.
We then assume that:
$E[Y_{0ist} \mid s, t] = \gamma_s + \lambda_t$
Here $\gamma_s$ is a state effect and $\lambda_t$ is a time effect; the minimum wage increase is assumed to shift expected employment by a constant $\delta$.
In New Jersey:
Employment in February is:
$E[Y_{ist} \mid s = NJ, t = Feb] = \gamma_{NJ} + \lambda_{Feb}$
Employment in November is:
$E[Y_{ist} \mid s = NJ, t = Nov] = \gamma_{NJ} + \lambda_{Nov} + \delta$
The difference between February and November is:
$E[Y_{ist} \mid s = NJ, t = Nov] - E[Y_{ist} \mid s = NJ, t = Feb] = \lambda_{Nov} - \lambda_{Feb} + \delta$
In Pennsylvania:
Employment in February is:
$E[Y_{ist} \mid s = PA, t = Feb] = \gamma_{PA} + \lambda_{Feb}$
Employment in November is:
$E[Y_{ist} \mid s = PA, t = Nov] = \gamma_{PA} + \lambda_{Nov}$
The difference between February and November is:
$E[Y_{ist} \mid s = PA, t = Nov] - E[Y_{ist} \mid s = PA, t = Feb] = \lambda_{Nov} - \lambda_{Feb}$
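Subtracting the Pennsylvania difference from the New Jersey difference removes the common time effect and leaves the treatment effect. As a worked step, writing the state effects as $\gamma_s$, the time effects as $\lambda_t$ and the constant treatment effect as $\delta$ (these symbol names are an assumption of this sketch):

```latex
\begin{align*}
& \big(E[Y_{ist} \mid s = NJ, t = Nov] - E[Y_{ist} \mid s = NJ, t = Feb]\big) \\
& \quad - \big(E[Y_{ist} \mid s = PA, t = Nov] - E[Y_{ist} \mid s = PA, t = Feb]\big) \\
& = (\lambda_{Nov} - \lambda_{Feb} + \delta) - (\lambda_{Nov} - \lambda_{Feb}) \\
& = \delta
\end{align*}
```

The state effects drop out within each state's difference, and the time effects drop out when the two differences are subtracted.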
Differences-in-Differences Table
Regression DD - Card & Krueger
In the Card & Krueger case the equivalent regression model would be:
$Y_{ist} = \alpha + \gamma NJ_s + \lambda d_t + \delta (NJ_s \cdot d_t) + \varepsilon_{ist}$
$NJ_s$ is a dummy which is equal to 1 if the observation is from NJ.
$d_t$ is a dummy which is equal to 1 if the observation is from November (post).
This equation takes the following values:
PA Pre: $\alpha$
PA Post: $\alpha + \lambda$
NJ Pre: $\alpha + \gamma$
NJ Post: $\alpha + \gamma + \lambda + \delta$
Differences-in-Differences estimate: (NJ Post - NJ Pre) - (PA Post - PA Pre) = $\delta$
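The mapping between the four cells and the regression coefficients can be checked numerically. The following is a minimal sketch, not Card & Krueger's data: the employment numbers are made up, and the coefficient names alpha, gamma, lambda and delta are assumed labels for the intercept, the NJ dummy, the post dummy and the interaction. Because the regression with two dummies and their interaction is saturated, OLS reproduces the four cell means exactly, so the coefficients can be read off the means.

```python
from statistics import mean

# Hypothetical employment observations per restaurant, keyed by (state, period).
# These numbers are invented for illustration only.
employment = {
    ("PA", "pre"): [23.0, 24.0, 22.5],
    ("PA", "post"): [21.0, 21.5, 21.0],
    ("NJ", "pre"): [20.0, 21.0, 20.5],
    ("NJ", "post"): [21.0, 21.0, 21.0],
}


def dd_coefficients(cells):
    """Recover the coefficients of the saturated DD regression from cell means."""
    m = {key: mean(values) for key, values in cells.items()}
    alpha = m[("PA", "pre")]                 # PA Pre
    lam = m[("PA", "post")] - alpha          # PA Post - PA Pre
    gamma = m[("NJ", "pre")] - alpha         # NJ Pre - PA Pre
    # (NJ Post - NJ Pre) - (PA Post - PA Pre)
    delta = (m[("NJ", "post")] - m[("NJ", "pre")]) - lam
    return {"alpha": alpha, "gamma": gamma, "lambda": lam, "delta": delta}


coef = dd_coefficients(employment)
print(coef)
```

In this made-up example employment falls in PA and rises slightly in NJ, so the DD estimate is larger than the simple before-after change in NJ.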
Graph - Observed Data
Graph - DD
Key Assumption of Any DD Strategy: Common Trends
The key assumption for any DD strategy is that the outcome in the
treatment and control groups would follow the same time trend in the
absence of the treatment.
This does not mean that they have to have the same mean of the
outcome!
The common trends assumption is difficult to verify, but one often uses
pre-treatment data to show that the trends are the same.
Even if pre-trends are the same, one still has to worry about other
policies changing at the same time.
Regression DD Including Leads and Lags
Study Including Leads and Lags - Autor (2003)
Results
Standard Errors in DD Strategies
Many papers using a DD strategy use data from many years (not only
1 pre and 1 post period).
The variables of interest in many of these setups only vary at a group
level (say state) and outcome variables are often serially correlated.
In the Card and Krueger study for example, it is very likely that
employment in each state is not only correlated within the state but
also serially correlated.
As Bertrand, Duflo, and Mullainathan (2004) point out, conventional
standard errors often severely understate the standard deviation of the
estimators.
Standard Errors in DD Strategies - Practical Solutions
Bertrand, Duflo, and Mullainathan propose the following solutions:
1 Block bootstrapping standard errors (if you analyze states, the block
should be the state, and you would sample whole states with
replacement for the bootstrapping).
2 Clustering standard errors at the group level (in Stata one would
simply add cl(state) to the regression command if one analyzes
state-level variation).
3 Aggregating the data into one pre and one post period.
This works literally only if there is a single treatment date. With staggered
treatment dates one should adopt the following procedure:
Regress $Y_{st}$ on state FE, year FE, and relevant covariates.
Obtain residuals from the treatment states only and divide them into 2
groups: pre and post treatment.
Then regress the two groups of residuals on a post dummy.
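The residual-based procedure for staggered treatment dates can be sketched in a few lines. This is a minimal sketch under stated assumptions: a balanced state-by-year panel with no covariates, in which case the residuals from a regression on state and year fixed effects equal y_st minus the state mean minus the year mean plus the grand mean. Collapsing each treated state's residuals to one pre mean and one post mean is one reading of the "two groups" step. The panel, the treatment dates and the function names are hypothetical.

```python
from statistics import mean

# Hypothetical balanced panel: outcome[state][year]; values are made up.
outcome = {
    "A": {2000: 10.0, 2001: 11.0, 2002: 14.0, 2003: 15.0},
    "B": {2000: 8.0, 2001: 9.0, 2002: 10.0, 2003: 13.0},
    "C": {2000: 12.0, 2001: 13.0, 2002: 14.0, 2003: 15.0},
}
# First treated year of each treated state (staggered); "C" is never treated.
treatment_start = {"A": 2002, "B": 2003}


def two_way_fe_residuals(panel):
    """Residuals from regressing y_st on state and year dummies (balanced panel)."""
    states = list(panel)
    years = list(next(iter(panel.values())))
    grand = mean(panel[s][t] for s in states for t in years)
    state_mean = {s: mean(panel[s].values()) for s in states}
    year_mean = {t: mean(panel[s][t] for s in states) for t in years}
    return {
        s: {t: panel[s][t] - state_mean[s] - year_mean[t] + grand for t in years}
        for s in states
    }


def aggregated_effect(panel, start):
    """Regress treated states' pre/post mean residuals on a post dummy."""
    resid = two_way_fe_residuals(panel)
    pre, post = [], []
    for s, first_year in start.items():
        pre.append(mean(r for t, r in resid[s].items() if t < first_year))
        post.append(mean(r for t, r in resid[s].items() if t >= first_year))
    # OLS on a single dummy equals the difference in group means.
    return mean(post) - mean(pre)


effect = aggregated_effect(outcome, treatment_start)
print(round(effect, 3))
```

With covariates or an unbalanced panel the first step would need an actual regression rather than the demeaning shortcut used here.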
Correct treatment of standard errors sometimes makes the number of
groups very small: in the Card and Krueger study the number of
groups is only 2.
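Block bootstrapping by state, the first solution listed above, can be illustrated as follows. This is a minimal sketch, assuming a hypothetical panel of six states and a simple DD estimator based on group means; the data, state labels and function names are invented. Whole states are resampled with replacement, so each state's own time series stays intact in every bootstrap sample. With this few states the resulting standard error is only illustrative.

```python
import random
from statistics import mean, stdev

# Hypothetical panel: treatment status plus pre and post outcomes for each state.
states = {
    "s1": {"treated": True, "pre": [10.0, 10.5], "post": [12.0, 12.5]},
    "s2": {"treated": True, "pre": [9.0, 9.5], "post": [10.5, 11.5]},
    "s3": {"treated": True, "pre": [11.0, 11.0], "post": [13.5, 13.0]},
    "s4": {"treated": False, "pre": [10.0, 10.0], "post": [10.5, 11.0]},
    "s5": {"treated": False, "pre": [8.0, 8.5], "post": [9.0, 9.0]},
    "s6": {"treated": False, "pre": [12.0, 12.5], "post": [12.5, 13.5]},
}


def dd_estimate(sample):
    """DD estimate from a list of state records: treated change minus control change."""
    change = {True: [], False: []}
    for rec in sample:
        change[rec["treated"]].append(mean(rec["post"]) - mean(rec["pre"]))
    return mean(change[True]) - mean(change[False])


def block_bootstrap_se(panel, reps=500, seed=0):
    """Standard error of the DD estimate, resampling whole states with replacement."""
    rng = random.Random(seed)
    records = list(panel.values())
    draws = []
    while len(draws) < reps:
        sample = rng.choices(records, k=len(records))
        # Skip draws without both treated and control states: DD is undefined there.
        if len({rec["treated"] for rec in sample}) < 2:
            continue
        draws.append(dd_estimate(sample))
    return stdev(draws)


point = dd_estimate(list(states.values()))
se = block_bootstrap_se(states)
print(point, se)
```

Resampling individual state-period observations instead of whole states would break the within-state serial correlation that the block bootstrap is meant to preserve.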
Synthetic Control Methods
Abadie & Gardeazabal (2003) - The Effect of Terrorism on Growth
The Basque Country is Different from the Rest of Spain
The Synthetic Control Method
The Synthetic Control Method - Details
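The weights W (nonnegative and summing to one) are chosen to minimize:
$(X_1 - X_0 W)' V (X_1 - X_0 W)$
where $X_1$ holds the pre-terrorism growth predictors for the Basque
Country and $X_0$ holds the same predictors for the other regions.
They choose the matrix V such that the real per capita GDP path for
the Basque Country during the 1960s (pre terrorism) is best
reproduced by the resulting synthetic Basque Country.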
The optimal weights they get are: Catalonia: 0.8508, Madrid: 0.1492,
and all other regions: 0.
Alternatively they could have just chosen the weights to reproduce
only the pre-terrorism growth path for the Basque Country (and not
the growth predictors as well). In that case they would have minimized:
$(Z_1 - Z_0 W)'(Z_1 - Z_0 W)$
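The weight choice can be illustrated with a small numerical sketch. It assumes hypothetical predictor values for one treated region and three donor regions, a fixed diagonal V, and a coarse grid search over nonnegative weights that sum to one. Abadie & Gardeazabal additionally choose V to match the pre-terrorism GDP path, which is not done here; all names and numbers below are invented for illustration.

```python
# Hypothetical pre-treatment predictors for the treated region (x1) and for
# three donor regions (x0, one row per predictor); all numbers are made up.
x1 = [10.0, 0.30, 5.0]
x0 = [
    [12.0, 8.0, 9.0],    # predictor 1 for donors 0, 1, 2
    [0.35, 0.20, 0.28],  # predictor 2
    [6.0, 3.0, 4.5],     # predictor 3
]
v = [1.0, 100.0, 1.0]  # assumed diagonal of V (relative importance of predictors)


def loss(weights):
    """(X1 - X0 W)' V (X1 - X0 W) for a diagonal V."""
    total = 0.0
    for target, row, v_k in zip(x1, x0, v):
        synthetic = sum(w * x for w, x in zip(weights, row))
        total += v_k * (target - synthetic) ** 2
    return total


def grid_search(steps=100):
    """Search weights w >= 0 with sum(w) = 1 on a grid with spacing 1/steps."""
    best_w, best_loss = None, float("inf")
    for a in range(steps + 1):
        for b in range(steps + 1 - a):
            w = (a / steps, b / steps, (steps - a - b) / steps)
            current = loss(w)
            if current < best_loss:
                best_w, best_loss = w, current
    return best_w, best_loss


weights, value = grid_search()
print(weights, value)
```

A grid search is used only because it is easy to verify by hand; with more donor regions a constrained quadratic optimizer would be the natural replacement. Setting V to the identity and replacing the predictors with pre-treatment outcomes gives the alternative criterion in Z.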
The Synthetic Basque Country Looks Similar
Constructing the Counterfactual Using the Weights
Growth in the Basque Country with and without Terrorism
Terrorist Activity and Estimated GDP Gap
Combining DD and IV
Historical Background
Dismissed Professors Across German Universities
Dismissed Professors Across German Universities II
Effect of Dismissals on Department Size
Effect of Dismissals on Faculty Quality
Panel Data on PhD graduates from German Universities
Reduced Form Graphical Analysis - Publishing Dissertation
Reduced Form Graphical Analysis - Full Professor
Reduced Form Graphical Analysis - Lifetime Citations
Reduced Form Estimates
Common Robustness Check for Parallel Trend Assumption
Only Look at Pre-Period Data and Move Placebo Treatment some Years Back
Use Dismissal as IV
To test for weak instruments one cannot simply look at the first-stage
F-statistics because here we have 2 endogenous regressors and 2 IVs.
Instead, use the Cragg-Donald eigenvalue (EV) statistic; here the critical value is 7.03.
OLS and IV