1-Linear SSIV
1-Linear SSIV
Introductions
Me and This Course
(Linear) SSIV
Shock Exogeneity
Motivation
Borusyak et al. (2022)
Share Exogeneity
Motivation
Goldsmith-Pinkham et al. (2020)
Key question: under what assumptions does this SSIV strategy “work”?
SSIV Examples
X shares shocks
Instrument z` = s`n gn for model y` = βx` + γ 0 w` + ε`
n
X shares shocks
Instrument z` = s`n gn for model y` = βx` + γ 0 w` + ε`
n
X shares shocks
Instrument z` = s`n gn for model y` = βx` + γ 0 w` + ε`
n
X shares shocks
Instrument z` = s`n gn for model y` = βx` + γ 0 w` + ε`
n
X shares shocks
Instrument z` = s`n gn for model y` = βx` + γ 0 w` + ε`
n
Of course, we can always run IV with such z` ... but what does the
corresponding estimand identify?
What Do We Do With This?
Of course, we can always run IV with such z` ... but what does the
corresponding estimand identify?
Of course, we can always run IV with such z` ... but what does the
corresponding estimand identify?
Introductions
Me and This Course
(Linear) SSIV
Shock Exogeneity
Motivation
Borusyak et al. (2022)
Share Exogeneity
Motivation
Goldsmith-Pinkham et al. (2020)
• Relevance: both ∆IPnt and ∆IP Ont are driven by the same
Chinese productivity shocks
• Validity: local productivity/demand shocks in the US are
uncorrelated with those of other countries (entering ∆IP Ont )
Identification from a Natural Experiment
ADH Solution: specify the outcome equation for local labor markets
• Works if local economies are isolated “islands”
(simple model in Adao-Kolesar-Morales 2019; richer structure of
spatial spillovers in Adao-Arkolakis-Esposito 2020)
Problem 2: GE Spillovers
ADH Solution: specify the outcome equation for local labor markets
• Works if local economies are isolated “islands”
(simple model in Adao-Kolesar-Morales 2019; richer structure of
spatial spillovers in Adao-Arkolakis-Esposito 2020)
• Reduced-form allowed: x` = z`
• Reduced-form allowed: x` = z`
First step: note that by the FWL thm., the estimator can be written
⊥ ⊥
P P P
` z ` y` ` n s`n gn y`
β̂ = P ⊥
= P P ⊥
` z` x` ` n s`n gn x`
1
s`n gn y`⊥
P P
β̂ = L
1 P` Pn ⊥
=
L ` n s`n gn x`
Leveraging gn
BHJ Numerical Equivalence
1 ⊥ ⊥
P P P P 1
L ` n s`n gn y` n gn ` L s`n y`
β̂ = 1 P P ⊥
= P P 1 ⊥
=
L ` n s`n gn x` n gn ` L s`n x`
Leveraging gn
BHJ Numerical Equivalence
1 ⊥ ⊥
P P P P 1 P ⊥
L ` n s`n gn y` n gn ` L s`n y` n sn gn ȳn
β̂ = 1 P P ⊥
= P P 1 ⊥
= P ⊥
,
L ` n s`n gn x` n gn ` L s`n x` n sn gn x̄n
where sn = 1
are weights capturing the average importance of
P
L ` s`n
P
s v
shock n, and v̄n = P` s`n`n ` is an exposure-weighted average of v`
`
Leveraging gn
BHJ Numerical Equivalence
sn gn ȳn⊥
P
β̂ = Pn ⊥
n sn gn x̄n
sn gn ȳn⊥
P
β̂ = Pn ⊥
n sn gn x̄n
for g̃`n = `0 6=` ω`0 n g`0 n (akin to JIVE solution to many-IV bias)
P
BHJ Extensions (cont.)
Panel Data: Have (y`t , x`t , s`nt , gnt ) across ` = 1, . . . , L, t = 1, . . . , T
BHJ show that A1/A2 are not enough for validity of z` in this case
X X
z` = s`n (µ + (gn − µ)) = µS` + s`n (gn − µ)
n n
| {z }
Clean Shock Variation
X X
z` = s`n (µ + (gn − µ)) = µS` + s`n (gn − µ)
n n
| {z }
Clean Shock Variation
BHJ use similar logic to show robust/clustered SEs can be valid when
β̂ is given by estimating the ‘industry-level’ regression
ȳn⊥ = α + β x̄⊥ 0 ⊥
n + qn τ + ε̄n ,
n by gn and weighting by sn
instrumenting x̄⊥
Practical Consideration 2: Exposure Clustering
The Solution
BHJ use similar logic to show robust/clustered SEs can be valid when
β̂ is given by estimating the ‘industry-level’ regression
ȳn⊥ = α + β x̄⊥ 0 ⊥
n + qn τ + ε̄n ,
n by gn and weighting by sn
instrumenting x̄⊥
BHJ use similar logic to show robust/clustered SEs can be valid when
β̂ is given by estimating the ‘industry-level’ regression
ȳn⊥ = α + β x̄⊥ 0 ⊥
n + qn τ + ε̄n ,
n by gn and weighting by sn
instrumenting x̄⊥
Download at https://github.com/kylebutts/ssaggregate
Application: “The China Shock”
Introductions
Me and This Course
(Linear) SSIV
Shock Exogeneity
Motivation
Borusyak et al. (2022)
Share Exogeneity
Motivation
Goldsmith-Pinkham et al. (2020)
E[ε` | s`n ] = 0, ∀n
Goldsmith-Pinkham, Sorkin, and Swift (GPSS; 2020)
In other words, GPSS show that the SSIV estimator can be seen as
pooling many Boatlift-style diff-in-diff IVs, one for each industry
Rotemberg Weights
How does SSIV pool different diff-in-diffs?
⊥ gn ` s`n x⊥
P P
` s`n y`
X
β̂ = α̂n β̂n , where β̂n = P ⊥
and α̂n = P P ` ⊥
s x
`n ` g
n0 n 0 s`n0 x`
n | {z` } | {z ` }
n-specific IV estimate Rotemberg weight
Rotemberg Weights
How does SSIV pool different diff-in-diffs?
⊥ gn ` s`n x⊥
P P
` s`n y`
X
β̂ = α̂n β̂n , where β̂n = P ⊥
and α̂n = P P ` ⊥
s x
`n ` g
n0 n 0 s`n0 x`
n | {z` } | {z ` }
n-specific IV estimate Rotemberg weight
• Balance/pre-trend tests
• Balance/pre-trend tests
Introductions
Me and This Course
(Linear) SSIV
Shock Exogeneity
Motivation
Borusyak et al. (2022)
Share Exogeneity
Motivation
Goldsmith-Pinkham et al. (2020)
• See also Card (2009), where national immiration rates are estimated
A Taxonomy of SSIV Settings
Case 1 the IV is based on a set of shocks which can be thought of as an
instrument (i.e. many, plausibly quasi-randomly assigned)
• See also Card (2009), where national immiration rates are estimated
• Identification may (or may not) instead follow from share exogeneity
Ex Ante vs. Ex Post Validity
BHJ emphasize that the decision to pursue a “shocks” vs. “shares”
identification strategy must be made ex ante
• Undesirable to base identifying assumptions on ex post tests,
though balance/pre-trend tests can be used to falsify assumptions
• Tailored shares have a diff-in-diff feel; don’t even need the shocks,
except to possibly improve power or avoid many-IV bias