
"A Course in Applied Econometrics"

Lecture 1
Estimation of Average Treatment Effects Under Unconfoundedness, Part I

Guido Imbens
IRP Lectures, UW Madison, August 2008

Outline
1. Introduction
2. Potential Outcomes
3. Estimands and Identification
4. Estimation and Inference

1. Introduction

We are interested in estimating the average effect of a program or treatment, allowing for heterogeneous effects, assuming that selection can be taken care of by adjusting for differences in observed covariates.

This setting is of great applied interest.

Long literature, in both statistics and economics. Influential economics/econometrics papers include Ashenfelter and Card (1985), Barnow, Cain and Goldberger (1980), Card and Sullivan (1988), Dehejia and Wahba (1999), Hahn (1998), Heckman and Hotz (1989), Heckman and Robb (1985), Lalonde (1986). In the statistics literature, work by Rubin (1974, 1978) and Rosenbaum and Rubin (1983).

Unusual case with many proposed (semi-parametric) estimators (matching, regression, propensity score, or combinations), many of which are actually used in practice.

We discuss implementation, and assessment of the critical assumptions (even if they are not testable).

In practice, concern with overlap in covariate distributions tends to be important.

Once overlap issues are addressed, the choice of estimator is less important. Estimators combining matching and regression, or weighting and regression, are recommended for robustness reasons.

Key role for analysis of the joint distribution of treatment indicator and covariates prior to using outcome data.
2. Potential Outcomes (Rubin, 1974)

We observe N units, indexed by i = 1, ..., N, viewed as drawn randomly from a large population.

We postulate the existence for each unit of a pair of potential outcomes:
Yi(0) for the outcome under the control treatment, and
Yi(1) for the outcome under the active treatment.
Yi(1) − Yi(0) is the unit-level causal effect.
Covariates Xi (not affected by the treatment).

Each unit is exposed to a single treatment; Wi = 0 if unit i receives the control treatment and Wi = 1 if unit i receives the active treatment. We observe for each unit the triple (Wi, Yi, Xi), where Yi is the realized outcome:

Yi ≡ Yi(Wi) = Yi(0) if Wi = 0, and Yi(1) if Wi = 1.

Several additional pieces of notation.

First, the propensity score (Rosenbaum and Rubin, 1983) is defined as the conditional probability of receiving the treatment,

e(x) = Pr(Wi = 1 | Xi = x) = E[Wi | Xi = x].

Also the two conditional regression and variance functions:

μw(x) = E[Yi(w) | Xi = x],   σw²(x) = V(Yi(w) | Xi = x).
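For concreteness, a minimal numpy sketch of this setup; the data are simulated, and all names, functional forms, and parameter values are illustrative rather than taken from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)
N = 1_000

# Single covariate X_i and potential outcomes (illustrative functional forms).
X = rng.normal(size=N)
Y0 = 1.0 + 0.5 * X + rng.normal(size=N)   # Y_i(0), conditional variance 1
Y1 = Y0 + 2.0 + 0.3 * X                   # Y_i(1); heterogeneous effect 2 + 0.3 X

# Treatment depends on X only, so unconfoundedness holds by construction.
e = 1.0 / (1.0 + np.exp(-X))              # true propensity score e(X)
W = rng.binomial(1, e)

# Realized outcome: Y_i = Y_i(W_i); only one potential outcome is observed.
Y = np.where(W == 1, Y1, Y0)
```

Later blocks reuse these arrays (X, W, Y, e) to illustrate the estimators.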

3. Estimands and Identification

Population average treatment effects:

τP = E[Yi(1) − Yi(0)],   τP,T = E[Yi(1) − Yi(0) | Wi = 1].

Most of the discussion in these notes will focus on τP, with extensions to τP,T available in the references.

We will also look at the sample average treatment effect (SATE):

τS = (1/N) Σi (Yi(1) − Yi(0)).

τP versus τS does not matter for estimation, but matters for the variance.

4. Estimation and Inference

Assumption 1 (Unconfoundedness, Rosenbaum and Rubin, 1983a)

(Yi(0), Yi(1)) ⊥⊥ Wi | Xi.

Also called the "conditional independence assumption" or "selection on observables"; in the missing data literature, "missing at random."

To see the link with standard exogeneity assumptions, assume a constant effect and a linear regression:

Yi(0) = α + Xi′β + εi   ⟹   Yi = α + τ·Wi + Xi′β + εi,

with εi ⊥⊥ Xi. Given the constant treatment effect assumption, unconfoundedness is equivalent to independence of Wi and εi conditional on Xi, which would also capture the idea that Wi is exogenous.
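Continuing the simulated example (all numbers hypothetical), the two estimands and the bias of a raw comparison of means:

```python
# Sample and population average treatment effects in the simulation.
tau_S = np.mean(Y1 - Y0)   # SATE: feasible only because both outcomes were simulated
tau_P = 2.0                # ATE: E[2 + 0.3 X] = 2 since E[X] = 0

# A raw difference in means is biased here: treated units have higher X on average.
naive = Y[W == 1].mean() - Y[W == 0].mean()
```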
Motivation for Unconfoundedness Assumption (I)

The first is a statistical, data-descriptive motivation.

A natural starting point in the evaluation of any program is a comparison of average outcomes for treated and control units.

A logical next step is to adjust any difference in average outcomes for differences in exogenous background characteristics (exogenous in the sense of not being affected by the treatment).

Such an analysis may not lead to the final word on the efficacy of the treatment, but the absence of such an analysis would seem difficult to rationalize in a serious attempt to understand the evidence regarding the effect of the treatment.

Motivation for Unconfoundedness Assumption (II)

A second argument is that almost any evaluation of a treatment involves comparisons of units who received the treatment with units who did not.

The question is typically not whether such a comparison should be made, but rather which units should be compared, that is, which units best represent the treated units had they not been treated.

It is clear that settings where some of the necessary covariates are not observed will require strong assumptions to allow for identification, e.g., instrumental variables settings. Absent those assumptions, typically only bounds can be identified (e.g., Manski, 1990, 1995).

Motivation for Unconfoundedness Assumption (III)

Example of a model that is consistent with unconfoundedness: suppose we are interested in estimating the average effect of a binary input on a firm's output, Yi = g(Wi, εi).

Suppose that profits are output minus costs,

Wi = argmax_w E[πi(w) | ci] = argmax_w E[g(w, εi) − ci·w | ci],

implying

Wi = 1{E[g(1, εi) − g(0, εi) | ci] ≥ ci} = h(ci).

If unobserved marginal costs ci differ between firms, and these marginal costs are independent of the errors εi in the firms' forecast of output given inputs, then unconfoundedness will hold, as

(g(0, εi), g(1, εi)) ⊥⊥ ci.

Overlap

Second assumption on the joint distribution of treatments and covariates:

Assumption 2 (Overlap)

0 < Pr(Wi = 1 | Xi) < 1.

Rosenbaum and Rubin (1983a) refer to the combination of the two assumptions as "strongly ignorable treatment assignment."
Identification

Given Assumptions 1 and 2,

τ(x) ≡ E[Yi(1) − Yi(0) | Xi = x] = E[Yi(1) | Xi = x] − E[Yi(0) | Xi = x]
  = E[Yi(1) | Xi = x, Wi = 1] − E[Yi(0) | Xi = x, Wi = 0]
  = E[Yi | Xi = x, Wi = 1] − E[Yi | Xi = x, Wi = 0].

To make this feasible, one needs to be able to estimate the expectations E[Yi | Xi = x, Wi = w] for all values of w and x in the support of these variables. This is where overlap is important.

Given identification of τ(x),

τP = E[τ(Xi)].

Alternative Assumptions

E[Yi(w) | Wi, Xi] = E[Yi(w) | Xi],

for w = 0, 1. Although this assumption is unquestionably weaker, in practice it is rare that a convincing case can be made for the weaker assumption without the case being equally strong for the stronger assumption.

The reason is that the weaker assumption is intrinsically tied to functional form assumptions, and as a result one cannot identify average effects on transformations of the original outcome (e.g., logarithms) without the stronger assumption.

If we are interested in τP,T it is sufficient to assume

Yi(0) ⊥⊥ Wi | Xi.
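A minimal sketch of this identification argument on the simulated data: estimate τ(x) by a raw difference in means within coarse strata of X, then average over the strata. The five-quintile stratification is an arbitrary illustrative choice and assumes every stratum contains both treated and control units:

```python
# Stratify on quintiles of X, estimate tau(x) within each stratum, then average.
edges = np.quantile(X, [0.2, 0.4, 0.6, 0.8])
stratum = np.digitize(X, edges)               # stratum index 0..4 for each unit

tau_hat = 0.0
for b in range(5):
    m = stratum == b
    tau_b = Y[m & (W == 1)].mean() - Y[m & (W == 0)].mean()  # tau(x) in stratum b
    tau_hat += tau_b * m.mean()               # weight by empirical Pr(stratum b)
```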

Propensity Score

Result 1. Suppose that Assumption 1 holds. Then:

(Yi(0), Yi(1)) ⊥⊥ Wi | e(Xi).

Only need to condition on a scalar function of the covariates, which would be much easier in practice if Xi is high-dimensional.

(Problem is that the propensity score e(x) is almost never known.)

Efficiency Bound

Hahn (1998): for any regular estimator for τP, denoted by τ̂, with

√N · (τ̂ − τP) →d N(0, V),

the variance must satisfy:

V ≥ E[ σ1²(Xi)/e(Xi) + σ0²(Xi)/(1 − e(Xi)) + (τ(Xi) − τP)² ].   (1)

Estimators exist that achieve this bound.
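In the simulated example the bound (1) can be evaluated by its sample analog; by construction σ0²(x) = σ1²(x) = 1 and τ(x) = 2 + 0.3x there, so:

```python
# Sample analog of the Hahn (1998) variance bound in the simulation.
tau_x = 2.0 + 0.3 * X
V_bound = np.mean(1.0 / e + 1.0 / (1.0 - e) + (tau_x - tau_P) ** 2)
# An estimator with asymptotic variance V_bound / N would be efficient here.
```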
Estimators

A. Regression Estimators
B. Matching
C. Propensity Score Estimators
D. Mixed Estimators (recommended)

A. Regression Estimators

Estimate μw(x) consistently and estimate τP or τS as

τ̂reg = (1/N) Σi (μ̂1(Xi) − μ̂0(Xi)).

Simple implementations include

μw(x) = β′x + τ·w,

in which case the average treatment effect is equal to τ. In this case one can estimate τ simply by least squares estimation using the regression function

Yi = α + β′Xi + τ·Wi + εi.

More generally, one can specify separate regression functions for the two regimes, μw(x) = βw′x.
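A minimal sketch of τ̂reg with separate linear regressions per regime, fit by least squares on the simulated data (the linear specification is an assumption of the example, not of the method):

```python
# Fit mu_1 and mu_0 by OLS on the treated and control subsamples, then average.
Z = np.column_stack([np.ones(N), X])                        # regressors (1, X)
b1, *_ = np.linalg.lstsq(Z[W == 1], Y[W == 1], rcond=None)  # mu_1(x) = b1'(1, x)
b0, *_ = np.linalg.lstsq(Z[W == 0], Y[W == 0], rcond=None)  # mu_0(x) = b0'(1, x)
tau_reg = np.mean(Z @ b1 - Z @ b0)
```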

These simple regression estimators can be sensitive to differences in the covariate distributions for treated and control units. The reason is that in that case the regression estimators rely heavily on extrapolation.

Note that μ0(x) is used to predict the missing outcomes for the treated. Hence on average one wishes to predict the control outcome at X̄T = Σi Wi·Xi/NT, the average covariate value for the treated. With a linear regression function, the average prediction can be written as ȲC + β̂′(X̄T − X̄C).

If X̄T and X̄C are close, the precise specification of the regression function will not matter much for the average prediction. With the two averages very different, the prediction based on a linear regression function can be sensitive to changes in the specification.

[Figure: illustration of the extrapolation problem when the treated and control covariate distributions differ.]
B. Matching

Let jm(i) be the index of the mth closest match, that is, the index l that satisfies Wl ≠ Wi and

Σ{j: Wj ≠ Wi} 1{‖Xj − Xi‖ ≤ ‖Xl − Xi‖} = m.

Let JM(i) denote the set of the first M matches. Then

Ŷi(0) = Yi if Wi = 0, and (1/M) Σ{j ∈ JM(i)} Yj if Wi = 1;
Ŷi(1) = (1/M) Σ{j ∈ JM(i)} Yj if Wi = 0, and Yi if Wi = 1.

The simple matching estimator is

τ̂M^sm = (1/N) Σi (Ŷi(1) − Ŷi(0)).   (2)

Issues with Matching

Bias is of order O(N^(−1/K)), where K is the dimension of the covariates. It is important in large samples if K ≥ 2 (and dominates the variance asymptotically if K ≥ 3).

Not efficient (but the efficiency loss is small).

Easy to implement, robust.
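A minimal sketch of the simple matching estimator (2) with M = 1, a scalar covariate, and matching with replacement, continuing the simulated data:

```python
def simple_matching(X, W, Y):
    """Single-nearest-neighbor matching estimator of the ATE (M = 1)."""
    Y1_hat = Y.astype(float).copy()
    Y0_hat = Y.astype(float).copy()
    for i in range(len(Y)):
        opp = np.flatnonzero(W != W[i])            # candidates in the other arm
        j = opp[np.argmin(np.abs(X[opp] - X[i]))]  # closest match j1(i)
        if W[i] == 1:
            Y0_hat[i] = Y[j]                       # impute missing Y_i(0)
        else:
            Y1_hat[i] = Y[j]                       # impute missing Y_i(1)
    return np.mean(Y1_hat - Y0_hat)

tau_sm = simple_matching(X, W, Y)
```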

C.1 Propensity Score Estimators: Weighting

E[ WY/e(X) ] = E[ E[ W·Yi(1)/e(X) | X ] ] = E[ e(X)·E[Yi(1) | X]/e(X) ] = E[Yi(1)],

and similarly

E[ (1 − W)Y/(1 − e(X)) ] = E[Yi(0)],

implying

τP = E[ W·Y/e(X) − (1 − W)·Y/(1 − e(X)) ].

With the propensity score known, one can directly implement this estimator as

τ̃ = (1/N) Σi ( Wi·Yi/e(Xi) − (1 − Wi)·Yi/(1 − e(Xi)) ).   (3)

Implementation of Horvitz-Thompson Estimator

Estimate e(x) flexibly (Hirano, Imbens and Ridder, 2003):

τ̂weight = [ Σi Wi·Yi/ê(Xi) ] / [ Σi Wi/ê(Xi) ] − [ Σi (1 − Wi)·Yi/(1 − ê(Xi)) ] / [ Σi (1 − Wi)/(1 − ê(Xi)) ].

Is efficient given a nonparametric estimator for e(x).

Potentially sensitive to the estimator for the propensity score.
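A minimal sketch of the normalized weighting estimator τ̂weight on the simulated data. The logistic fit stands in for the flexible estimation of e(x) the slide calls for; scikit-learn is an added dependency of this example, not part of the slides:

```python
from sklearn.linear_model import LogisticRegression

# Estimate the propensity score, then form the normalized (ratio) weights.
e_hat = (LogisticRegression()
         .fit(X.reshape(-1, 1), W)
         .predict_proba(X.reshape(-1, 1))[:, 1])

w1 = W / e_hat
w0 = (1 - W) / (1 - e_hat)
tau_weight = np.sum(w1 * Y) / np.sum(w1) - np.sum(w0 * Y) / np.sum(w0)
```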
Matching or Regression on the Propensity Score

Not clear what the advantages are.

Large sample properties not known.

Simulation results not encouraging.

D.1 Mixed Estimators: Weighting and Regression

Interpret the Horvitz-Thompson estimator as a weighted regression estimator:

Yi = α + τ·Wi + εi,   with weights λi = Wi/e(Xi) + (1 − Wi)/(1 − e(Xi)).

This weighted-least-squares representation suggests that one may add covariates to the regression function to improve precision, for example as

Yi = α + β′Xi + τ·Wi + εi,

with the same weights λi. Such an estimator is consistent as long as either the regression model or the propensity score (and thus the weights) is specified correctly. That is, in the Robins-Ritov terminology, the estimator is doubly robust.
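A minimal sketch of this weighted regression, implemented as weighted least squares via the usual √λ rescaling and reusing ê(Xi) from the weighting example above:

```python
# Weighted least squares of Y on (1, X, W) with weights lambda_i.
lam = W / e_hat + (1 - W) / (1 - e_hat)
A = np.column_stack([np.ones(N), X, W]) * np.sqrt(lam)[:, None]
b = Y * np.sqrt(lam)
coef, *_ = np.linalg.lstsq(A, b, rcond=None)
tau_dr = coef[2]    # coefficient on W: the doubly robust estimate
```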

Matching and Regression

First match observations. With m(i) the index of the match for unit i, define

X̂i(0) = Xi if Wi = 0, and Xm(i) if Wi = 1;
X̂i(1) = Xm(i) if Wi = 0, and Xi if Wi = 1.

Then adjust the within-pair difference for the within-pair difference in covariates, X̂i(1) − X̂i(0):

τ̂M^adj = (1/N) Σi ( Ŷi(1) − Ŷi(0) − β̂′(X̂i(1) − X̂i(0)) ),

using a regression estimate for β.

Can eliminate the bias of the matching estimator given a flexible specification of the regression function.

Estimation of the Variance

For the efficient estimator of τP:

VP = E[ σ1²(Xi)/e(Xi) + σ0²(Xi)/(1 − e(Xi)) + (μ1(Xi) − μ0(Xi) − τ)² ].

Estimate all components nonparametrically, and plug in.

Alternatively, use the bootstrap. (Does not work for the matching estimator.)
Estimation of the Variance

For all estimators of τS, for some known λi(X, W):

τ̂ = Σi λi(X, W)·Yi,

V(τ̂ | X, W) = Σi λi(X, W)²·σWi²(Xi).

To estimate σWi²(Xi) one uses the closest match within the set of units with the same treatment indicator. Let v(i) be the closest unit to i with the same treatment indicator.

The sample variance of the outcome variable for these two units can then be used to estimate σWi²(Xi):

σ̂Wi²(Xi) = (Yi − Yv(i))² / 2.
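A minimal sketch of this variance estimator for the scalar-covariate simulation, finding v(i) by brute-force nearest neighbor within the same treatment arm:

```python
def sigma2_by_matching(X, W, Y):
    """Estimate sigma^2_{W_i}(X_i) from the closest same-treatment match v(i)."""
    n = len(Y)
    s2 = np.empty(n)
    for i in range(n):
        same = np.flatnonzero(W == W[i])
        same = same[same != i]                       # exclude unit i itself
        v = same[np.argmin(np.abs(X[same] - X[i]))]  # closest same-arm unit v(i)
        s2[i] = (Y[i] - Y[v]) ** 2 / 2.0
    return s2

# Plug into V(tau_hat | X, W) = sum_i lambda_i^2 * sigma^2_{W_i}(X_i),
# with the weights lambda_i implied by whichever estimator was used.
s2_hat = sigma2_by_matching(X, W, Y)
```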
