
Economics 131

Section Notes
GSI: David Albouy

Program Evaluation and the Difference in Difference Estimator

1 Program Evaluation
1.1 Notation
We wish to evaluate the impact of a program or treatment on an outcome Y over a population of individuals.
Suppose that there are two groups indexed by treatment status T = 0, 1 where 0 indicates individuals who
do not receive treatment, i.e. the control group, and 1 indicates individuals who do receive treatment, i.e.
the treatment group. Assume that we observe individuals in two time periods, t = 0, 1 where 0 indicates
a time period before the treatment group receives treatment, i.e. pre-treatment, and 1 indicates a time
period after the treatment group receives treatment, i.e. post-treatment. Every observation is indexed
by the letter i = 1, ..., N; individuals will typically have two observations each, one pre-treatment and one
post-treatment. For the sake of notation let \( \bar{Y}_0^T \) and \( \bar{Y}_1^T \) be the sample averages of the outcome for the
treatment group before and after treatment, respectively, and let \( \bar{Y}_0^C \) and \( \bar{Y}_1^C \) be the corresponding sample
averages of the outcome for the control group. Subscripts correspond to time period and superscripts to
the treatment status.
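As a concrete illustration of this notation, the sketch below computes the four sample averages from a handful of made-up observations. The tuple layout (T, t, Y), the data values, and the helper name `group_mean` are assumptions for illustration and do not come from the notes.

```python
from statistics import mean

# Hypothetical observations stored as (T, t, Y):
# T = treatment status (1 = treatment group), t = period (1 = post-treatment).
observations = [
    (1, 0, 12.0), (1, 0, 14.0),   # treatment group, pre-treatment
    (1, 1, 15.0), (1, 1, 16.0),   # treatment group, post-treatment
    (0, 0, 9.0),  (0, 0, 11.0),   # control group, pre-treatment
    (0, 1, 10.0), (0, 1, 12.0),   # control group, post-treatment
]

def group_mean(obs, T, t):
    """Sample average of Y over observations with status T in period t."""
    return mean(y for (Ti, ti, y) in obs if Ti == T and ti == t)

ybar_0T = group_mean(observations, T=1, t=0)   # treatment, pre
ybar_1T = group_mean(observations, T=1, t=1)   # treatment, post
ybar_0C = group_mean(observations, T=0, t=0)   # control, pre
ybar_1C = group_mean(observations, T=0, t=1)   # control, post
print(ybar_0T, ybar_1T, ybar_0C, ybar_1C)
```

Subscripts in the variable names follow the convention above: the digit is the time period and the trailing letter is the group.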

1.2 Modeling the Outcome

The outcome \( Y_i \) is modeled by the following equation

\[ Y_i = \alpha + \beta T_i + \gamma t_i + \delta (T_i \cdot t_i) + \varepsilon_i \qquad \text{(Outcome)} \]

where the coefficients given by the Greek letters α, β, γ, δ are all unknown parameters and \( \varepsilon_i \) is a random,
unobserved "error" term which contains all determinants of \( Y_i \) which our model omits. By inspecting the
equation you should be able to see that the coefficients have the following interpretation:
α = constant term
β = treatment group specific effect (to account for average
permanent differences between treatment and control)
γ = time trend common to control and treatment groups
δ = true effect of treatment

The purpose of the program evaluation is to find a "good" estimate of δ, δ̂, given the data that we have
available.
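To make the roles of the four coefficients tangible, here is a minimal simulation sketch that draws observations from equation (Outcome). The function name `simulate_outcome`, the parameter values, and the choice of normally distributed errors are all assumptions made for illustration; the notes do not specify an error distribution.

```python
import random

def simulate_outcome(n_per_cell, alpha, beta, gamma, delta, noise_sd=1.0, seed=0):
    """Draw (T, t, Y) tuples from Y = alpha + beta*T + gamma*t + delta*T*t + eps."""
    rng = random.Random(seed)
    rows = []
    for T in (0, 1):              # 0 = control group, 1 = treatment group
        for t in (0, 1):          # 0 = pre-treatment, 1 = post-treatment
            for _ in range(n_per_cell):
                eps = rng.gauss(0.0, noise_sd)   # mean-zero error (assumed normal)
                y = alpha + beta * T + gamma * t + delta * T * t + eps
                rows.append((T, t, y))
    return rows

# Hypothetical parameter values; delta = 0.5 plays the role of the true effect.
sample = simulate_outcome(n_per_cell=500, alpha=10.0, beta=2.0,
                          gamma=1.0, delta=0.5, noise_sd=1.0)
print(len(sample))
```

Because δ is chosen by the user here, a simulated sample like this can be used to check whether a given estimator recovers it.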
Example 1 Card and Krueger (1994, AER) in "Minimum Wages and Employment: A Case Study of the
Fast-Food Industry in New Jersey and Pennsylvania" try to evaluate the effect of the minimum wage (the
treatment) on employment (the outcome). On April 1, 1992, New Jersey’s minimum wage rose from $4.25
to $5.05 per hour. To evaluate the impact of the law, the authors surveyed 410 fast-food restaurants in New
Jersey (the treatment group) and eastern Pennsylvania (the control group) before and after the rise. \( Y_i \) is
the employment of a fast-food restaurant, \( T_i \) is an indicator of whether or not a restaurant is in New Jersey,
and \( t_i \) is an indicator of whether the observation is from before or after the minimum wage hike.

1.3 Assumptions for an Unbiased Estimator

A reasonable criterion for a good estimator is that it be unbiased, which means that "on average" the
estimate will be correct, or mathematically that the expected value of the estimator equals the true value:

\[ E[\hat{\delta}] = \delta \]

The assumptions we need for the difference in difference estimator to be correct are given by the following

1. The model in equation (Outcome) is correctly specified. For example, the additive structure imposed
is correct.
2. The error term is on average zero: \( E[\varepsilon_i] = 0 \). This is not a demanding assumption once the constant
term α is included.
3. The error term is uncorrelated with the other variables in the equation:

\[
\begin{aligned}
\operatorname{cov}(\varepsilon_i, T_i) &= 0 \\
\operatorname{cov}(\varepsilon_i, t_i) &= 0 \\
\operatorname{cov}(\varepsilon_i, T_i \cdot t_i) &= 0
\end{aligned}
\]

The last of these assumptions, also known as the parallel-trend assumption, is the most critical.

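Under these assumptions we can use equation (Outcome) to determine that the expected values of the average
outcomes are given by

\[
\begin{aligned}
E[\bar{Y}_0^T] &= \alpha + \beta \\
E[\bar{Y}_1^T] &= \alpha + \beta + \gamma + \delta \\
E[\bar{Y}_0^C] &= \alpha \\
E[\bar{Y}_1^C] &= \alpha + \gamma
\end{aligned}
\]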

These equations will prove helpful below.
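To see where these expressions come from, one can substitute the group and period indicators into equation (Outcome) and take expectations. The step below works through the treatment group in the post-treatment period. It relies on the error term having mean zero within that cell, which follows from assumptions 2 and 3 because \( T_i \) and \( t_i \) are binary indicators.

```latex
\begin{aligned}
E[\bar{Y}_1^T] &= E[Y_i \mid T_i = 1, t_i = 1] \\
               &= \alpha + \beta \cdot 1 + \gamma \cdot 1 + \delta (1 \cdot 1)
                  + E[\varepsilon_i \mid T_i = 1, t_i = 1] \\
               &= \alpha + \beta + \gamma + \delta
\end{aligned}
```

The other three cases follow the same way by setting \( T_i \), \( t_i \), or both to zero.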

2 The Difference in Difference Estimator

Before explaining the difference in difference estimator it is best to review the two simple difference estimators
and understand what can go wrong with these. Understanding what is wrong with an estimator is as
important as understanding what is right about it.

2.1 Simple Pre versus Post Estimator

Consider first an estimator based on comparing the average difference in outcome \( Y_i \) before and after treatment
in the treatment group alone.¹

\[ \hat{\delta}_1 = \bar{Y}_1^T - \bar{Y}_0^T \qquad \text{(D1)} \]

Taking the expectation of this estimator we get

\[
\begin{aligned}
E[\hat{\delta}_1] &= E[\bar{Y}_1^T] - E[\bar{Y}_0^T] \\
                  &= [\alpha + \beta + \gamma + \delta] - [\alpha + \beta] \\
                  &= \gamma + \delta
\end{aligned}
\]

which means that this estimator will be biased so long as γ ≠ 0, i.e. if a time trend exists in the outcome \( Y_i \)
then we will mistake the time trend for part of the treatment effect.
¹ This would be the estimate one would get from an OLS estimate on a regression equation of the form
\( Y_i = \alpha_1 + \delta_1 t_i + \varepsilon_i \)
on the sample from the treatment group only.
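The bias of the pre-versus-post estimator can be checked numerically. The sketch below uses noise-free expected cell means under equation (Outcome), with hypothetical parameter values and a helper `cell_mean` introduced only for this illustration.

```python
def cell_mean(alpha, beta, gamma, delta, T, t):
    """Expected outcome for group T in period t under equation (Outcome)."""
    return alpha + beta * T + gamma * t + delta * T * t

# Hypothetical parameter values; the true treatment effect is delta = 0.5.
alpha, beta, gamma, delta = 10.0, 2.0, 1.0, 0.5

y0T = cell_mean(alpha, beta, gamma, delta, T=1, t=0)   # treatment group, pre
y1T = cell_mean(alpha, beta, gamma, delta, T=1, t=1)   # treatment group, post

delta_hat_1 = y1T - y0T        # pre-versus-post estimator (D1)
bias_1 = delta_hat_1 - delta   # what (D1) picks up beyond the true effect
print(delta_hat_1, bias_1)
```

With these values the estimator returns γ + δ, so the whole time trend γ is attributed to the treatment.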

2.2 Simple Treatment versus Control Estimator
Next consider the estimator based on comparing the average difference in outcome \( Y_i \) post-treatment, between
the treatment and control groups, ignoring pre-treatment outcomes.²

\[ \hat{\delta}_2 = \bar{Y}_1^T - \bar{Y}_1^C \qquad \text{(D2)} \]

Taking the expectation of this estimator

\[
\begin{aligned}
E[\hat{\delta}_2] &= E[\bar{Y}_1^T] - E[\bar{Y}_1^C] \\
                  &= [\alpha + \beta + \gamma + \delta] - [\alpha + \gamma] \\
                  &= \beta + \delta
\end{aligned}
\]

and so this estimator is biased so long as β ≠ 0, i.e. if there exist permanent average differences in outcome \( Y_i \)
between the treatment and control groups. The true treatment effect will be confounded by permanent differences
between the treatment and control groups that existed prior to any treatment. Note that in a randomized experiment,
where subjects are randomly selected into treatment and control groups, β should be zero as both groups
should be nearly identical. In that case this estimator may perform well, but such a controlled experimental
setting is typically unavailable in the program evaluation problems seen in economics.
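The same kind of numerical check works for the treatment-versus-control estimator. The sketch below again uses noise-free expected cell means with hypothetical parameter values; the helper `cell_mean` is an illustrative name, not something defined in the notes.

```python
def cell_mean(alpha, beta, gamma, delta, T, t):
    """Expected outcome for group T in period t under equation (Outcome)."""
    return alpha + beta * T + gamma * t + delta * T * t

# Hypothetical parameter values; the true treatment effect is delta = 0.5.
alpha, beta, gamma, delta = 10.0, 2.0, 1.0, 0.5

y1T = cell_mean(alpha, beta, gamma, delta, T=1, t=1)   # treatment group, post
y1C = cell_mean(alpha, beta, gamma, delta, T=0, t=1)   # control group, post

delta_hat_2 = y1T - y1C        # treatment-versus-control estimator (D2)
bias_2 = delta_hat_2 - delta   # equals the permanent group difference beta

# With beta = 0 (as randomization would suggest) the bias disappears.
y1T_rand = cell_mean(alpha, 0.0, gamma, delta, T=1, t=1)
y1C_rand = cell_mean(alpha, 0.0, gamma, delta, T=0, t=1)
delta_hat_2_rand = y1T_rand - y1C_rand
print(delta_hat_2, bias_2, delta_hat_2_rand)
```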

2.3 The Difference in Difference Estimator

The difference in difference (or "double difference") estimator is defined as the difference in average
outcome in the treatment group before and after treatment minus the difference in average outcome in the
control group before and after treatment³: it is literally a "difference of differences."

\[ \hat{\delta}_{DD} = (\bar{Y}_1^T - \bar{Y}_0^T) - (\bar{Y}_1^C - \bar{Y}_0^C) \qquad \text{(DD)} \]

Taking the expectation of this estimator we will see that it is unbiased

\[
\begin{aligned}
E[\hat{\delta}_{DD}] &= (E[\bar{Y}_1^T] - E[\bar{Y}_0^T]) - (E[\bar{Y}_1^C] - E[\bar{Y}_0^C]) \\
                     &= [(\alpha + \beta + \gamma + \delta) - (\alpha + \beta)] - [(\alpha + \gamma) - \alpha] \\
                     &= (\gamma + \delta) - \gamma \\
                     &= \delta
\end{aligned}
\]

This estimator can be seen as taking the difference between two pre-versus-post estimators of the kind seen above in
(D1), subtracting the control group’s estimator, which captures the time trend γ, from the treatment group’s
estimator to get δ. We can also rearrange terms in equation (DD) to get

\[ \hat{\delta}_{DD} = (\bar{Y}_1^T - \bar{Y}_1^C) - (\bar{Y}_0^T - \bar{Y}_0^C) \]

which can be interpreted as taking the difference of two estimators of the simple treatment versus control
type seen in equation (D2). The difference estimator for the pre-period is used to estimate the permanent
difference β, which is then subtracted away from the post-period estimator to get δ.
Another interpretation of the difference in difference estimator is that it is a simple difference estimator
between the actual \( \bar{Y}_1^T \) and the \( \bar{Y}_1^T \) that would occur in the post-treatment period to the treatment group
had there been no treatment, \( \bar{Y}_{cf}^T = \bar{Y}_0^T + (\bar{Y}_1^C - \bar{Y}_0^C) \), where the subscript "cf" refers to the term
"counterfactual," so that \( \hat{\delta}_{DD} = \bar{Y}_1^T - \bar{Y}_{cf}^T \). This observation \( \bar{Y}_{cf}^T \), which has expectation
\( E[\bar{Y}_{cf}^T] = \alpha + \beta + \gamma \), does not exist: it is literally "contrary to fact" since there actually was a treatment.
However, if our assumptions are correct we can construct a legitimate estimate of \( \bar{Y}_{cf}^T \), taking the
pre-treatment average \( \bar{Y}_0^T \) and adding our estimate of γ from the pre versus post difference for the control group.
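The three readings of the estimator described above (difference of pre-versus-post changes, difference of treatment-versus-control gaps, and actual minus counterfactual) give the same number. The sketch below checks this on hypothetical group averages; the function names are illustrative assumptions.

```python
def diff_in_diff(y0T, y1T, y0C, y1C):
    """Equation (DD): treatment group's change minus control group's change."""
    return (y1T - y0T) - (y1C - y0C)

def counterfactual_treated_post(y0T, y0C, y1C):
    """Treatment group's pre-period mean plus the control group's change."""
    return y0T + (y1C - y0C)

# Hypothetical group averages (treatment pre/post, control pre/post).
y0T, y1T, y0C, y1C = 12.0, 13.5, 10.0, 11.0

dd = diff_in_diff(y0T, y1T, y0C, y1C)
dd_rearranged = (y1T - y1C) - (y0T - y0C)          # post gap minus pre gap
y_cf = counterfactual_treated_post(y0T, y0C, y1C)  # estimated "no treatment" mean
dd_counterfactual = y1T - y_cf                     # actual minus counterfactual
print(dd, dd_rearranged, dd_counterfactual)
```

All three quantities are algebraically identical, so any difference between them in real data would point to a computation error rather than a modeling issue.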
² This would be the estimate one would get from an OLS estimate on a regression equation of the form
\( Y_i = \alpha_2 + \delta_2 T_i + \varepsilon_i \)
on the post-treatment samples only.
³ This would be the estimate one would get from an OLS estimate of a regression equation of the form given by (Outcome)
on the entire sample.
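The equivalence between OLS on equation (Outcome) and the difference in difference of group averages can be verified on simulated data. This is a minimal sketch assuming NumPy is available; the balanced design, parameter values, and normal errors are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Balanced hypothetical design: 50 observations in each (T, t) cell.
T = np.repeat([0, 0, 1, 1], 50)
t = np.repeat([0, 1, 0, 1], 50)
y = 10.0 + 2.0 * T + 1.0 * t + 0.5 * T * t + rng.normal(0.0, 1.0, size=T.size)

# OLS of Y on a constant, T, t and the interaction T*t.
X = np.column_stack([np.ones(T.size), T, t, T * t])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
delta_ols = coef[3]            # coefficient on the interaction term

def cell_mean(T_val, t_val):
    """Sample average of y within one group-period cell."""
    return y[(T == T_val) & (t == t_val)].mean()

# Difference in difference computed directly from the four sample averages.
delta_dd = (cell_mean(1, 1) - cell_mean(1, 0)) - (cell_mean(0, 1) - cell_mean(0, 0))
print(delta_ols, delta_dd)
```

The two numbers agree up to floating-point error because the regression has one parameter per group-period cell, so its fitted values are exactly the cell averages.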

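It is common to find difference in difference estimators presented in a table of the following form.

                 Pre                                 Post                                Post-Pre Difference

Treatment        \( \bar{Y}_0^T \)                   \( \bar{Y}_1^T \)                   \( \bar{Y}_1^T - \bar{Y}_0^T \)
Control          \( \bar{Y}_0^C \)                   \( \bar{Y}_1^C \)                   \( \bar{Y}_1^C - \bar{Y}_0^C \)
T-C Difference   \( \bar{Y}_0^T - \bar{Y}_0^C \)     \( \bar{Y}_1^T - \bar{Y}_1^C \)     \( (\bar{Y}_1^T - \bar{Y}_1^C) - (\bar{Y}_0^T - \bar{Y}_0^C) \)

Notice that the first row ends with the estimate \( \hat{\delta}_1 \), the second column ends with the estimate \( \hat{\delta}_2 \), and the lower
right hand corner entry gives the estimate \( \hat{\delta}_{DD} \).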

Example 2 According to Card and Krueger (1994), comparisons of employment growth at
stores in New Jersey and Pennsylvania (where the minimum wage was constant) provide simple estimates
of the effect of the higher minimum wage. Some of the results from their Table 3 are shown below: average
employment in the fast-food restaurants, with standard errors in parentheses.

                 Before Increase    After Increase    Difference

New Jersey       20.44              21.03             0.59
(Treatment)      (0.51)             (0.52)            (0.54)
Pennsylvania     23.33              21.17             −2.16
(Control)        (1.35)             (0.94)            (1.25)
Difference       −2.89              −0.14             2.76
                 (1.44)             (1.07)            (1.36)

The difference in difference estimator shows a small increase in employment in New Jersey, where the
minimum wage increased. This came as quite a shock to most economists, who thought employment would fall.
Notice that prior to the increase in the minimum wage Pennsylvania had higher employment
than New Jersey, and that it may have been bound to fall to a lower level. This may be a failure of the parallel trend
assumption. However, the small, albeit insignificant, increase in employment in New Jersey makes it hard to
accept the hypothesis that employment actually decreased in New Jersey over this time. Although still
somewhat controversial, this study helped change the common presupposition that a small change in the minimum
wage from a low level was bound to cause a significant decrease in employment.
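The arithmetic behind the table can be reproduced from the four reported averages. The sketch below uses only the numbers shown above; the variable names are illustrative. Recomputing from the rounded averages gives 2.75 rather than the reported 2.76, a gap that presumably reflects rounding in the published figures (an assumption, since the unrounded averages are not given here).

```python
# Average employment per restaurant, as reported in the table above.
nj_before, nj_after = 20.44, 21.03    # New Jersey (treatment)
pa_before, pa_after = 23.33, 21.17    # Pennsylvania (control)

nj_change = round(nj_after - nj_before, 2)    # pre-versus-post change, treatment
pa_change = round(pa_after - pa_before, 2)    # pre-versus-post change, control
gap_before = round(nj_before - pa_before, 2)  # treatment-control gap, pre
gap_after = round(nj_after - pa_after, 2)     # treatment-control gap, post

dd_by_rows = round(nj_change - pa_change, 2)      # difference of changes
dd_by_columns = round(gap_after - gap_before, 2)  # difference of gaps

# Ratio of the New Jersey change to its reported standard error (0.54).
nj_ratio = round(0.59 / 0.54, 2)
print(nj_change, pa_change, gap_before, gap_after, dd_by_rows, dd_by_columns, nj_ratio)
```

The ratio of about 1.09 for the New Jersey change is below the conventional 1.96 threshold, which is consistent with the text describing that increase as insignificant.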

2.4 Problems with Difference in Difference Estimators

If any of the assumptions listed above do not hold then we have no guarantee that the estimator \( \hat{\delta}_{DD} \) is
unbiased. Unfortunately, it is often difficult and sometimes impossible to check the assumptions in the model
as they are made about unobservable quantities. Keep in mind that small deviations from the assumptions
may not matter much, as the biases they introduce may be rather small: biases are a matter of degree. It is
also possible, however, that the biases are so large that the estimates we get are completely wrong,
even of the opposite sign of the true treatment effect.
One of the most common problems with difference in difference estimates is the failure of the parallel
trend assumption. Suppose that \( \operatorname{cov}(\varepsilon_i, T_i \cdot t_i) = E[\varepsilon_i (T_i \cdot t_i)] = \Delta \), so that Y follows a different trend for
the treatment and control group. The control group has a time trend of \( \gamma^C = \gamma \), while the treatment group
has a trend of \( \gamma^T = \gamma + \Delta \). In this case the difference in difference estimator will be biased as

\[ E[\hat{\delta}_{DD}] = (\gamma^T + \delta) - \gamma^C = \gamma + \Delta + \delta - \gamma = \delta + \Delta \]

The failure of the parallel trend assumption may in fact be a relatively common problem in many program
evaluation studies, causing many difference in difference estimators to be biased.
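The effect of a trend difference Δ on the estimator can be illustrated with noise-free group averages. In the sketch below the function name and all parameter values are hypothetical; the second call shows a case where the bias is large enough to flip the sign of the estimate.

```python
def dd_with_group_trends(alpha, beta, gamma, delta, Delta):
    """Difference in difference when the treatment group's trend is gamma + Delta."""
    y0C = alpha                                    # control, pre
    y1C = alpha + gamma                            # control, post
    y0T = alpha + beta                             # treatment, pre
    y1T = alpha + beta + (gamma + Delta) + delta   # treatment, post
    return (y1T - y0T) - (y1C - y0C)

# Hypothetical parameter values; the true treatment effect is delta = 0.5.
alpha, beta, gamma, delta = 10.0, 2.0, 1.0, 0.5

dd_parallel = dd_with_group_trends(alpha, beta, gamma, delta, Delta=0.0)
dd_steeper = dd_with_group_trends(alpha, beta, gamma, delta, Delta=0.75)
dd_sign_flip = dd_with_group_trends(alpha, beta, gamma, delta, Delta=-1.0)
print(dd_parallel, dd_steeper, dd_sign_flip)
```

Only the first case, with Δ = 0, returns the true effect; the other two return δ + Δ.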
One way to help avoid these problems is to get more data on other time periods before and after treatment
to see if there are any other pre-existing differences in trends. It may also be possible to find other control
groups which can provide additional information on underlying trends. There is a huge literature on this subject;
a good place to start is Meyer (1995).
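One simple way to use an extra pre-treatment period is a "placebo" difference in difference computed entirely before treatment, where the true effect is zero by construction. The sketch below is an illustration under stated assumptions: the earlier period is labeled t = -1, which extends the notes' two-period notation, and the group averages are made up.

```python
def diff_in_diff(treat_early, treat_late, control_early, control_late):
    """Treatment group's change minus control group's change between two periods."""
    return (treat_late - treat_early) - (control_late - control_early)

# Hypothetical group averages for two pre-treatment periods (t = -1 and t = 0).
treat_means = {-1: 11.0, 0: 12.0}
control_means = {-1: 9.5, 0: 10.0}

# No treatment occurs between t = -1 and t = 0, so under parallel trends
# this placebo estimate should be close to zero.
placebo_dd = diff_in_diff(treat_means[-1], treat_means[0],
                          control_means[-1], control_means[0])
print(placebo_dd)
```

A placebo estimate clearly different from zero, as in this made-up example, would suggest that the two groups were already on different trends before treatment.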
