0% found this document useful (0 votes)

25 views36 pages

Did Functional - Form

This document discusses the Difference-in-Differences (DiD) method for causal inference, emphasizing the importance of the parallel trends assumption and its sensitivity to functional form. It explores how parallel trends can be maintained across different transformations of outcomes, such as levels and logs, and presents a framework for testing the validity of these assumptions in empirical studies. The lecture also includes an empirical illustration using state-level minimum wage changes to demonstrate the application of these concepts.

Uploaded by

kanspurchase2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views36 pages

Did Functional - Form

Uploaded by

kanspurchase2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 36

Causal Inference using Difference-in-Differences

Lecture 4: Parallel Trends and Functional Form

Pedro H. C. Sant’Anna
Emory University

January 2025
Introduction
Introduction

■ Difference-in-differences (DiD) is one of the most popular strategies for estimating

causal effects in non-experimental contexts.

■ The reliability of DiD methods depends on the parallel trends assumption.

■ Random assignment of treatment (unconfoundedness) is not necessary for parallel

trends to hold.

What does parallel trends impose if treatment is not randomly assigned?

1
Introduction

■ There are potentially many ways of tackling this question.

■ A natural one focuses on the extent to which the validity of DiD depends on
functional form restrictions.

■ Following Athey and Imbens (2006), we will say parallel trends is insensitive to
functional form if when it holds for potential outcomes Y(∞), it also holds for
potential outcomes s(Y(∞)) for any strictly monotonic s.

■ Intuitively, this says that parallel trends holds regardless of the units in which one
measures the outcome.

2
Why study sensitivity to functional form

■ Studying sensitivity to functional form helps clarify the different ways that a
researcher can justify the validity of a DiD design:

▶ Can verify conditions that ensure PT holds for all functional forms.

▶ If sensitive to functional form, can justify the particular choice.

3
WWhy study sensitivity to functional form

■ It often may not be clear from subject-specific knowledge what is the “right”
transformation for PT to hold.

■ Example: different labor market studies have measured earnings in levels, logs, or
percentiles relative to national wage distribution.

■ The choice of transformation may be motivated by which ATT is “most relevant”, but
not always obvious that policy variation will generate PT for the same transformation

■ Moreover, we might want to use the same policy variation to study the ATT for
multiple transformations of the same outcome.

■ We will use Meyer, Viscusi and Durbin (1995) as a running example in the next slides:
interested in studying whether changes in weekly benefit amounts affected the
duration of time out of work in Michigan and Kentucky.
4
Parallel Trends in levels

■ Parallel trends assumption (in levels):

E [Yi,t=2 (∞)|Gi = 2] − E [Yi,t=1 (∞)|Gi = 2] = E [Yi,t=2 (∞)|Gi = ∞] − E [Yi,t=1 (∞)|Gi = ∞]

■ If Y is the duration of claims measured in weeks, and treatment is an increase of cap
(PT in levels)
▶ PT would suggest that the average untreated claims’ duration among workers who are
affected by the increased cap would evolve the same as the average untreated claims’
duration among workers who are not affected by the change in the cap.
▶ If the average change in untreated claims’ duration among workers who are not affected
by the change in the cap is 0.05 weeks, these would serve as counterfactual changes for
the average untreated claims’ duration among workers who are affected by the change
in cap
▶ ATT would provide the average treatment effect (in weeks) among workers who are
affected by the change in cap.
5
Parallel Trends in logs

■ Parallel trends assumption (in logs):

E ln Yi,t=2 (∞) | Gi = 2 − E ln Yi,t=1 (∞) | Gi = 2 = E ln Yi,t=2 (∞) | Gi = ∞ − E ln Yi,t=1 (∞) | Gi = ∞

E ln Yi,t=2 (∞) − ln Yi,t=1 (∞) | Gi = 2 = E ln Yi,t=2 (∞) − ln Yi,t=1 (∞) | Gi = ∞

Y (∞) Y (∞)
E ln i,t=2 Gi = 2 = E ln i,t=2 Gi = ∞
Y i, t = 1 ( ∞ ) Y i, t = 1 ( ∞ )

■ Under parallel trends (in logs), the ATT would take the format:

Yi,t=2 (2)
ATT = E [ln Yi,t=2 (2) − ln Yi,t=2 (∞) | G = 2] = E ln G=2 .
Yi,t=2 (∞)

■ ATT is measured in relative terms when you have PT in logs.

6
Parallel Trends in logs

■ Parallel trends assumption (in logs):

Yi,t=2 (∞) Yi,t=2 (∞)
E ln Gi = 2 = E ln Gi = ∞
Y i, t = 1 ( ∞ ) Y i, t = 1 ( ∞ )

■ If Y is the duration of claims measured in weeks, and treatment is an increase of cap

(PT in levels)
▶ PT would suggest that the average log relative growth of untreated claims’ duration
among workers who are treated would be the same as the average log relative growth of
untreated claims’ duration among workers who are not treated
▶ If the average log relative growth of untreated claims’ duration among workers who are
not treated is 0.008, these would serve as counterfactual changes for the average log
relative growth of untreated claims’ duration for workers that are treated.
▶ ATT would provide the average treatment effect (in relative terms) among workers who
are treated.
7
Which PT should we pick?

What if we take other transformations?

8
The rest of the lecture will build on

Roth and Sant’Anna (2023), but with

different notation.

9
Setup
Model setup

■ We consider the 2x2 DiD setup:

▶ 2 time periods: t = 1 (before treatment) and t = 2 (after treatment);

▶ 2 groups: G = 2 (treated at period 2) and G = ∞ (untreated by period 2);

■ Potential outcomes: Yi,t (2), Yi,t (∞). Observe Yi,t = 1{Gi =1} Yi,t (2) + 1{Gi =∞} Yi,t (∞).

■ Let’s assume No-anticipation: Yi,t=1 (2) = Yi,t=1 (∞).

■ Target parameter is the ATT in period t = 2,

ATT = E [Yi,t=2 (2) − Yi,t=2 (∞) | G = 2] .

10
More general models

■ We consider a 2-period, 2-group model for expositional simplicity.

■ More recent papers have considered settings with multiple periods and staggered
adoption.
▶ Typically impose a version of the 2-group, 2-period parallel trends assumption for many
periods/groups
(de Chaisemartin and D’Haultfœuille, 2020; Callaway and Sant’Anna, 2021; Sun and Abraham, 2021;
Borusyak, Jaravel and Spiess, 2024; Wooldridge, 2021).
▶ Thus, 2x2 results have immediate implications for the generalized PT assumption in the
staggered case.

■ The following results remain valid if all probability statements are implicitly
conditional on X, as when one assumes conditional parallel trends
(Heckman, Ichimura and Todd, 1997; Abadie, 2005; Sant’Anna and Zhao, 2020; Callaway and Sant’Anna,
11
2021).
Parallel Trends for all transformations of Y(∞)
Parallel Trends and Insensitivity to Functional Form

■ Following the definition in Athey and Imbens (2006), we say parallel trends is
insensitive to functional form (a.k.a. invariant to transformations) if

E s(Yi,t=2 (∞)) | Gi = 2 − E s(Yi,t=1 (∞)) | Gi = 2
=

E s(Yi,t=2 (∞)) | Gi = ∞ − E s(Yi,t=1 (∞)) | Gi = ∞

for all strictly monotonic s such that the expectations exist and are finite.

▶ s could be levels, logs, percentiles of a reference distribution, etc.

12
Insensitivity of Parallel Trends

Roth and Sant’Anna (2023) established the following characterization relating PT and
functional form.

Proposition (PT and functional form)

Parallel trends is insensitive to functional form if and only if parallel trends of CDFs is
satisfied, i.e.

FYi,t=2 (∞)|Gi =2 (y) − FYi,t=1 (∞)|Gi =2 (y) = FYi,t=2 (∞)|Gi =∞ (y) − FYi,t=1 (∞)|Gi =∞ (y), for all y ∈ R (1)
| {z } | {z }
Change in CDF for treated group Change in CDF for comparison group

where FYi,t (∞)|Gi =g is the cumulative distribution function of Yi,t (∞) | Gi = g.

Note that if Y(∞) is continuous (discrete), this is equivalent to parallel trends of PDFs (PMFs).
13
What Generates PT of CDFs?
What Generates PT of CDFs?

■ Under minor regularity conditions, Roth and Sant’Anna (2023) shows that parallel
trends of CDFs holds if and only if

FYi,t (∞)|Gi =g (y) = θJt (y) + (1 − θ )Hg (y) for all y ∈ R and g × t ∈ {2, ∞} × {1, 2}. (2)

for some θ ∈ [0, 1] and CDFs Jt (y) and Hg (y) depending only on time and group,
respectively.

■ This says that the distribution of Y(∞) for group g in period t is a mixture of a
time-dependent distribution (not depending on g) and a group-dependent
distribution (not depending on t).

14
Cases

This implies that PT is insensitive to funct form iff we are in the following three cases:
■ Case 1: (As-If) Randomized Treatment (θ = 1). The distribution of Yi,t (∞)|G = g is
the same for both groups (g = 2, ∞)

■ Case 2: Stationary Y(∞) (θ = 0). For each group, the distribution of Yi,t (∞)|G = g
doesn’t depend on t.

■ Case 3: A hybrid. (θ ∈ (0, 1)).

θ fraction of the population is as-if randomized btwn treatment and control
1 − θ fraction of the population is non-randomized in treatment and control but
have stationary Y(∞) distributions (conditional on group)
▶ Perhaps plausible if there is effectively an experiment among a sub-population with
time trends (e.g. younger workers), and endogenous selection into treatment among
sub-populations with stable earnings over time (e.g. older workers).
15
Numerical Illustration of Case 3

■ θ = 21 (e.g. share of younger workers)

Jt ∼ lognormal(1 + t, 1) (e.g. wages of younger workers in period t)
Hg ∼ lognormal(3 + 1{g=2} , 1) (e.g. wages of older workers in state g)
Yi,t (∞)|Gi = g ∼ θJt + (1 − θ )Hg

16
Can we test PT in CDFs?
Testable Implications

■ The parallel trends of CDFs condition implies that

FYi,t=2 (∞)|Gi =2 (y) = FYi,t=1 (∞)|Gi =2 (y) + FYi,t=2 (∞)|Gi =∞ (y) − FYi,t=1 (∞)|Gi =∞ (y) for all y ∈ R
| {z } | {z }
Counterfactual Identified
(3)

■ A (sharp) testable implication of PT of CDFs is that the RHS is monotonically

increasing.

■ If the RHS is non-monotonic, then there is no possible counterfactual distribution

Yi,t=2 (∞)|Gi = 2 such that parallel trends is insensitive to functional form!

■ Roth and Sant’Anna (2023) show that we can use this to test for cases where it is
clear from data we need to justify the particular choice of functional form
17
Testing in Practice

■ Consider the case where Y(∞) has finite support.

■ Then, testing that the implied CDF is increasing is equivalent to testing that the
implied mass is non-negative at all support points, i.e.
fYi,t=1 |Gi =2 (y) + fYi,t=2 |Gi =∞ (y) − fYi,t=2 |Gi =∞ (y) ≥ 0 for all y,
where fYi,t |Gi =g (y) is the probability mass function of Yi,t |Gi = g.

■ To test, we can merely replace the mass functions with sample analogs and apply
tools from the moment inequality literature to test that

E [fYi,t=1 |Gi =2 (y) + fYi,t=2 |Gi =∞ (y) − fYi,t=2 |Gi =∞ (y)] ≥ 0 for all y.

■ With continuous support, can likewise use methods for testing a continuum of
inequalities (e.g. Andrews and Shi (2013)). 18
Caveats

■ These tests may be useful for detecting when parallel trends is sensitive to
functional form.

■ But failure to reject does not mean that we don’t need to worry about functional
form!

■ PT of CDFs is falsifiable but not verifiable:

▶ Null is that there is some possible distribution for Yi,t=2 (∞)|Gi = 2 such that it holds.

■ Like tests of pre-trends, such pre-tests may be underpowered, and relying on them
can introduce distortions from pre-testing (Roth, 2022).

19
Empirical Illustration
Empirical Illustration

■ Stylized analysis of the impact of state-level minimum wage changes on wage

distribution

■ Testing PT of CDFs is interesting both because it determines whether PT is sensitive

to functional form and because DiD has been used to estimate distributional
impacts in this context.

■ Set-up:
▶ The pre-period is either 2007 or 2010. Post-period is 2015

▶ Treatment is whether the state raised MW between Pre and Post.

20
Empirical Illustration

■ Panel data from Cengiz, Dube, Lindner and Zipperer (2019) with state-level MW
changes and employment-to-population ratios for 25c wage-bins (in 2016 dollars) at
state-level

■ If Wi is person i’s wage if employed and 0 otherwise, then employment-to-pop ratio

at wage w is density of Wi at w.

■ Estimate counterfactual employment-to-population ratio in bin w under PT of CDFs

as:
f̂post,D=1 (w) = f̂pre,D=1 (w) + f̂post,D=0 (w) − f̂pre,D=0 (w),
weighting states by population size

■ Conduct moment inequality tests by comparing the minimum studentized moment

to “least-favorable” critical values (assuming all moments have mean zero) 21
Results: Pre = 2007, Post = 2015

22
Results: Pre = 2007, Post = 2015

■ Implied density is negative for wages $̃5-7.

■ Intuitively, employment declines in control states are larger than initial levels in
treatment states (likely b/c of differential effects of change in federal MW)

23
Results: Pre = 2010, Post = 2015

24
R package

■ Jon Roth and I have prepared the R package didFF to help you use these tests.

■ The package covers a variety of setups:

▶ Multiple time periods;

▶ Staggered treatment adoption;

▶ PT plausible only after conditioning on covariates.

■ Please check it out at https://github.com/pedrohcgs/didFF

25
References
Abadie, Alberto, “Semiparametric Difference-in-Differences Estimators,” The Review of
Economic Studies, 2005, 72 (1), 1–19.
Andrews, Donald W. K. and Xiaoxia Shi, “Inference Based on Conditional Moment
Inequalities,” Econometrica, 2013, 81 (2), 609–666.
Athey, Susan and Guido Imbens, “Identification and Inference in Nonlinear
Difference-in-Differences Models,” Econometrica, 2006, 74 (2), 431–497.
Borusyak, Kirill, Xavier Jaravel, and Jann Spiess, “Revisiting Event Study Designs: Robust
and Efficient Estimation,” Review of Economic Studies, 2024, Forthcoming.
Callaway, Brantly and Pedro H. C. Sant’Anna, “Difference-in-Differences with Multiple
Time Periods,” Journal of Econometrics, 2021, 225 (2), 200–230.
Cengiz, Doruk, Arindrajit Dube, Attila Lindner, and Ben Zipperer, “The Effect of Minimum
Wages on Low-Wage Jobs,” The Quarterly Journal of Economics, August 2019, 134 (3),
1405–1454.
de Chaisemartin, Clément and Xavier D’Haultfœuille, “Two-Way Fixed Effects Estimators
with Heterogeneous Treatment Effects,” American Economic Review, 2020, 110 (9),
2964–2996.
Heckman, James J., Hidehiko Ichimura, and Petra E. Todd, “Matching As An Econometric
Evaluation Estimator: Evidence from Evaluating a Job Training Programme,” The Review
of Economic Studies, October 1997, 64 (4), 605–654.
Meyer, Bruce D., W. Kip Viscusi, and David L. Durbin, “Workers’ Compensation and Injury
Duration: Evidence from a Natural Experiment,” The American Economic Review, 1995,
85 (3), 322–340.
Roth, Jonathan, “Pre-test with Caution: Event-study Estimates After Testing for Parallel
Trends,” American Economic Review: Insights, 2022, Forthcoming.
and Pedro H. C. Sant’Anna, “When Is Parallel Trends Sensitive to Functional Form?,”
Econometrica, 2023, 91 (2), 737–747.
Sant’Anna, Pedro H. C. and Jun Zhao, “Doubly robust difference-in-differences estimators,”
Journal of Econometrics, November 2020, 219 (1), 101–122.
Sun, Liyan and Sarah Abraham, “Estimating Dynamic Treatment Effects in Event Studies
with Heterogeneous Treatment Effects,” Journal of Econometrics, 2021, 225 (2).
Wooldridge, Jeffrey M, “Two-Way Fixed Effects, the Two-Way Mundlak Regression, and
Difference-in-Differences Estimators,” Working Paper, 2021, pp. 1–89.

Xero
80% (15)
Xero
18 pages
Breaker Blocks
100% (5)
Breaker Blocks
16 pages
Order of The Mass-2
100% (2)
Order of The Mass-2
2 pages
AACVPR Guidelines For AACVPR Guidelines For Pulmonary Rehabilitation Programs (4 Edition)
No ratings yet
AACVPR Guidelines For AACVPR Guidelines For Pulmonary Rehabilitation Programs (4 Edition)
37 pages
MESOPOTAMIA
No ratings yet
MESOPOTAMIA
22 pages
SSRN 4487202
No ratings yet
SSRN 4487202
382 pages
Working With Change Systems Approaches To Public Sector Challenges
No ratings yet
Working With Change Systems Approaches To Public Sector Challenges
122 pages
Dynamic DiD Regression Li Strezhnev June 25 2024
No ratings yet
Dynamic DiD Regression Li Strezhnev June 25 2024
112 pages
NA DeiselShip Latest
No ratings yet
NA DeiselShip Latest
105 pages
05 Covariates
No ratings yet
05 Covariates
104 pages
Att#11 - A - Painting Procedure
No ratings yet
Att#11 - A - Painting Procedure
14 pages
Slides 1 Arnold Ventures 2024
No ratings yet
Slides 1 Arnold Ventures 2024
68 pages
Differences in Differences
No ratings yet
Differences in Differences
78 pages
SSRN 4487202
No ratings yet
SSRN 4487202
380 pages
The Orthodox Christian Mission
No ratings yet
The Orthodox Christian Mission
3 pages
Astm A641
No ratings yet
Astm A641
5 pages
Selection and Parallel Trends
No ratings yet
Selection and Parallel Trends
51 pages
DID Paper 2
No ratings yet
DID Paper 2
51 pages
Difference-In-Differences With Unequal Baseline Treatment Status
No ratings yet
Difference-In-Differences With Unequal Baseline Treatment Status
49 pages
13 Dind
No ratings yet
13 Dind
58 pages
DP 16202
No ratings yet
DP 16202
51 pages
Non-Stationary Time Series and Unit Root Tests: Deterministic Trend
No ratings yet
Non-Stationary Time Series and Unit Root Tests: Deterministic Trend
13 pages
Event Studies Slides
No ratings yet
Event Studies Slides
39 pages
Did Staggered
No ratings yet
Did Staggered
37 pages
Did, Iv
No ratings yet
Did, Iv
42 pages
DiD Review Paper
No ratings yet
DiD Review Paper
54 pages
DID Topics
No ratings yet
DID Topics
86 pages
Science 5 - Q2 - M12
No ratings yet
Science 5 - Q2 - M12
16 pages
Lecture4 Chapter1 - Binary - Gray, and ASCII Codes
No ratings yet
Lecture4 Chapter1 - Binary - Gray, and ASCII Codes
36 pages
2023 Roth Santanna Bilinski
No ratings yet
2023 Roth Santanna Bilinski
58 pages
Chaisemartind'Haultfoeuille (2023) EconometricsJournal
No ratings yet
Chaisemartind'Haultfoeuille (2023) EconometricsJournal
30 pages
Difference-In-Differences Estimation Under Non-Parallel Trends
No ratings yet
Difference-In-Differences Estimation Under Non-Parallel Trends
33 pages
Panel3 DID
No ratings yet
Panel3 DID
36 pages
Takehome - Exam DiD and RDD
No ratings yet
Takehome - Exam DiD and RDD
36 pages
SSRN 3555463
No ratings yet
SSRN 3555463
64 pages
Econ721 Lecture On Linear Trend Model
No ratings yet
Econ721 Lecture On Linear Trend Model
24 pages
MPRA Paper 119367
No ratings yet
MPRA Paper 119367
17 pages
Slides 2 Arnold Ventures 2024
No ratings yet
Slides 2 Arnold Ventures 2024
57 pages
Causality
No ratings yet
Causality
89 pages
STAT 443 Project
No ratings yet
STAT 443 Project
19 pages
Isthescience
No ratings yet
Isthescience
34 pages
Xu GeneralizedSyntheticControl 2017
No ratings yet
Xu GeneralizedSyntheticControl 2017
21 pages
01 Introduction
No ratings yet
01 Introduction
53 pages
DID Princeton
No ratings yet
DID Princeton
38 pages
Arkhangelsky SyntheticDifferenceinDifferences 2021
No ratings yet
Arkhangelsky SyntheticDifferenceinDifferences 2021
32 pages
Distribution Regression Difference-in-Differences
No ratings yet
Distribution Regression Difference-in-Differences
49 pages
Now Trending Coping With Non-Parallel Trends in Difference-In-Differences Analysis
No ratings yet
Now Trending Coping With Non-Parallel Trends in Difference-In-Differences Analysis
15 pages
Generalized Synthetic Control Method
No ratings yet
Generalized Synthetic Control Method
33 pages
Causal Review w31942
No ratings yet
Causal Review w31942
65 pages
HCIA-HarmonyOS Device Developer V1.0 学员用书
No ratings yet
HCIA-HarmonyOS Device Developer V1.0 学员用书
166 pages
Causal Inference: Miguel A. Hernán, James M. Robins May 19, 2017
No ratings yet
Causal Inference: Miguel A. Hernán, James M. Robins May 19, 2017
26 pages
Lecture4 Panelt-Smodels 12-04-2017 Corrections
No ratings yet
Lecture4 Panelt-Smodels 12-04-2017 Corrections
65 pages
Research Paper - Econometrics - TWFE
No ratings yet
Research Paper - Econometrics - TWFE
35 pages
Lecture Note 11 Panel Analysis
No ratings yet
Lecture Note 11 Panel Analysis
11 pages
Lee Wooldridge 20230720
No ratings yet
Lee Wooldridge 20230720
45 pages
01 Introduction
No ratings yet
01 Introduction
28 pages
Utad 016
No ratings yet
Utad 016
36 pages
Interpreting Event-Studies From Recent Difference-in-Differences Methods
No ratings yet
Interpreting Event-Studies From Recent Difference-in-Differences Methods
9 pages
Research Paper - Econometrics - DiD Event Studies
No ratings yet
Research Paper - Econometrics - DiD Event Studies
9 pages
Journal - Generalized Synthetic Control Method - Causal Inference With Interactive Fixed Effects Models
No ratings yet
Journal - Generalized Synthetic Control Method - Causal Inference With Interactive Fixed Effects Models
20 pages
w29691 PDF
No ratings yet
w29691 PDF
30 pages
Chapter 13
No ratings yet
Chapter 13
14 pages
DID SC Counterfactuals
No ratings yet
DID SC Counterfactuals
7 pages
Balancing, Regression, Difference-In-Difference and Synthetic Control Methods
No ratings yet
Balancing, Regression, Difference-In-Difference and Synthetic Control Methods
38 pages
Handout 6 Causality
No ratings yet
Handout 6 Causality
16 pages
Callaway & SantAnna
No ratings yet
Callaway & SantAnna
31 pages
Alkaline Earth Metals and Their Compounds
100% (1)
Alkaline Earth Metals and Their Compounds
9 pages
Unit Root Testing: ° Physica-Verlag 0, ISSN 0002-6018
No ratings yet
Unit Root Testing: ° Physica-Verlag 0, ISSN 0002-6018
16 pages
C - 16922312 - Shafa Raisa Hazet - P-3.4 - 1
No ratings yet
C - 16922312 - Shafa Raisa Hazet - P-3.4 - 1
10 pages
Solutions 5
No ratings yet
Solutions 5
6 pages
2017 Fall ME501 06 VectorCalculus
No ratings yet
2017 Fall ME501 06 VectorCalculus
95 pages
Lecture 2 - Stationarity and Endogeneity
No ratings yet
Lecture 2 - Stationarity and Endogeneity
24 pages
DDD Analysis
No ratings yet
DDD Analysis
21 pages
4a - Training
No ratings yet
4a - Training
38 pages
Diff - Simplifying The Estimation of Difference-In-difference Treatment Effects
No ratings yet
Diff - Simplifying The Estimation of Difference-In-difference Treatment Effects
20 pages
Nokia Solutions and Networks Jaipur (Raj.) : Seminar Report ON Industrial Training AT
No ratings yet
Nokia Solutions and Networks Jaipur (Raj.) : Seminar Report ON Industrial Training AT
51 pages
Noun. (1) The French Indirect Object Pronouns Are
No ratings yet
Noun. (1) The French Indirect Object Pronouns Are
4 pages
CLASS 12 - Chemistry - Notes - ch02 - Electrochemistry
No ratings yet
CLASS 12 - Chemistry - Notes - ch02 - Electrochemistry
8 pages
Aroon Kumar: "Award Winning Global Marketer and Digital Business Leader"
No ratings yet
Aroon Kumar: "Award Winning Global Marketer and Digital Business Leader"
6 pages
Anas Enterprise
No ratings yet
Anas Enterprise
6 pages
Forest Monitoring System Using Wireless Sensor Network: Prof. Sagar Pradhan
No ratings yet
Forest Monitoring System Using Wireless Sensor Network: Prof. Sagar Pradhan
8 pages
General Ledger of Journal 1
No ratings yet
General Ledger of Journal 1
8 pages
Bol BPP
No ratings yet
Bol BPP
2 pages
4ME Brochure Update V2657
No ratings yet
4ME Brochure Update V2657
12 pages
ITI Newsletter July 2024
No ratings yet
ITI Newsletter July 2024
3 pages
Least Squares & Pseudo Inverse
No ratings yet
Least Squares & Pseudo Inverse
12 pages
Guidelines For Filling-Up The Post Graduate Application Form
No ratings yet
Guidelines For Filling-Up The Post Graduate Application Form
2 pages
Sampling and Data Collection
No ratings yet
Sampling and Data Collection
7 pages
Multiple Integrals, A Collection of Solved Problems
From Everand
Multiple Integrals, A Collection of Solved Problems
Steven Tan
No ratings yet
Geometric functions in computer aided geometric design
From Everand
Geometric functions in computer aided geometric design
Oscar Ruiz
No ratings yet

Did Functional - Form

Uploaded by

Did Functional - Form

Uploaded by

Causal Inference using Difference-in-Differences

Lecture 4: Parallel Trends and Functional Form

■ Difference-in-differences (DiD) is one of the most popular strategies for estimating

■ The reliability of DiD methods depends on the parallel trends assumption.

■ Random assignment of treatment (unconfoundedness) is not necessary for parallel

What does parallel trends impose if treatment is not randomly assigned?

■ There are potentially many ways of tackling this question.

▶ If sensitive to functional form, can justify the particular choice.

■ Parallel trends assumption (in levels):

E [Yi,t=2 (∞)|Gi = 2] − E [Yi,t=1 (∞)|Gi = 2] = E [Yi,t=2 (∞)|Gi = ∞] − E [Yi,t=1 (∞)|Gi = ∞]

■ Parallel trends assumption (in logs):

■ ATT is measured in relative terms when you have PT in logs.

■ Parallel trends assumption (in logs):

■ If Y is the duration of claims measured in weeks, and treatment is an increase of cap

What if we take other transformations?

Roth and Sant’Anna (2023), but with

■ We consider the 2x2 DiD setup:

▶ 2 time periods: t = 1 (before treatment) and t = 2 (after treatment);

▶ 2 groups: G = 2 (treated at period 2) and G = ∞ (untreated by period 2);

■ Let’s assume No-anticipation: Yi,t=1 (2) = Yi,t=1 (∞).

■ Target parameter is the ATT in period t = 2,

ATT = E [Yi,t=2 (2) − Yi,t=2 (∞) | G = 2] .

■ We consider a 2-period, 2-group model for expositional simplicity.

▶ s could be levels, logs, percentiles of a reference distribution, etc.

Proposition (PT and functional form)

where FYi,t (∞)|Gi =g is the cumulative distribution function of Yi,t (∞) | Gi = g.

■ Case 3: A hybrid. (θ ∈ (0, 1)).

■ θ = 21 (e.g. share of younger workers)

■ The parallel trends of CDFs condition implies that

■ A (sharp) testable implication of PT of CDFs is that the RHS is monotonically

■ If the RHS is non-monotonic, then there is no possible counterfactual distribution

■ Consider the case where Y(∞) has finite support.

■ PT of CDFs is falsifiable but not verifiable:

■ Stylized analysis of the impact of state-level minimum wage changes on wage

■ Testing PT of CDFs is interesting both because it determines whether PT is sensitive

▶ Treatment is whether the state raised MW between Pre and Post.

■ If Wi is person i’s wage if employed and 0 otherwise, then employment-to-pop ratio

■ Estimate counterfactual employment-to-population ratio in bin w under PT of CDFs

■ Conduct moment inequality tests by comparing the minimum studentized moment

■ Implied density is negative for wages $̃5-7.

■ The package covers a variety of setups:

▶ Multiple time periods;

▶ Staggered treatment adoption;

▶ PT plausible only after conditioning on covariates.

■ Please check it out at https://github.com/pedrohcgs/didFF

You might also like