Week04 LectureSlidesECO372
Week04 LectureSlidesECO372
¨ Selection on observables
¤ Hint at Chapter 2 Selection on observables (i.e., introducing control variables)
Selim
Robin
SDO = 2
SDO = 2 Y0
Y0
Selim
Robin
SDO =1
SDO = 3 Y0
Y0
Selim
Robin
SDO = 1
Het = 0.5(1 − 2) SDO = 2
Y0 Y0
¨ Both start at the same Y0
¨ Selim has 𝛿 = 1 and Robin has 𝛿 = 2 (𝛿!"#$% − 𝛿&'($) = 1)
o so ATE is 1.5
¨ Suppose we treat Selim and not Robin: what is the SDO?
*
SDO = 1 = 1.5 + (𝐴𝑇𝑇!"#$% − 𝐴𝑇𝑈&'($) ) +0
+
¨ Suppose we treat Robin and not Selim: what is the SDO?
*
SDO = 3 = 1.5 + (𝐴𝑇𝑇&'($) − 𝐴𝑇𝑈!"#$% ) +0
+
ECO372
¤ Possibility for Selection Bias: recipe for selection bias is two yes’s:
n Do these two types (𝜀) of people tend to have a different likelihood of choosing the treatment: D?
n Do these two types (𝜀) of people tend to have different Y under the scenario where both are not treated (Y0i)?
¤ Possibility for Heterogenous Treatment Effect Bias: recipe for het bias is two yes’s:
n Do these two types (𝜀) of people tend to have a different likelihood of choosing the treatment: D?
n Do these two types (𝜀) of people tend to have different effects (𝛿, ) of treatment (D)?
ECO372
¨ Reminder
¤ Selection bias is about baseline differences in outcomes among the two groups: Y0
¤ Heterogenous Effect bias is about the difference in the effect of the treatment between the two
groups: 𝛿
¤ Two yes’s means failure of the zero conditional mean assumption, leading to omitted variable bias
n It can be for either type of bias (selection or heterogeneous effect);
n The standard regression framework doesn’t break this down.
ECO372
¨ So why bother?
¤ Because sometimes we can identify exogenous sources of variation in D, but which only affect
certain types of people (counties, classrooms, cities, etc.).
¤ In this case we can only identify the ATT (the average effect for these types on the margin of being
affected by D), which may be different than ATE (the average effect for the population).
ECO372
What about in reality? Can we get a look at these factors in our data?
¨ It can take some practice to use the potential outcomes framework effectively:
¤ We are going to start with some practice on our created data
¤ Then next week we will extend this to other examples
ECO372
Multi-tasking Data
What is the estimated SDO?
ECO372
¤ Focused task survey: the focused work (D=0) required typing the phrase:
n “ECONOMETRICS IS THE BEST” followed by: “1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21”
¤ Task-switching survey: the task-switching work (D=1) required typing out the same set of characters but
switching between each sequence:
n “1E2C3O4N5O6M7E8T9R10I11C12S13I14S15T16H17E18B19E20S21T”
ECO372
45 students: 45 unhurried
randomized to students:
focused assigned to focused
Selection →0 Selection →?
(ATT-ATU) →0 (ATT-ATU) →?
SDO → ATE SDO → ATE?
¨ These provide nice labels to the data so you know what things mean
69% answer
91% answer
82% answer
80% answer
ECO372
Survey Non-Response
Group 1: treatment to task 179 Students Group 2: treatment to task
switching vs focused was switching vs focused was
randomly determined assigned based on ”speediness”
45 students:
37 students: 45 31
unhurried
unhurried
randomized to students:
focused assigned to focused
(8 missing) (14 missing)
45 students:
36 students: speedy
44 40 students:
speedy students:
randomized to task assigned to task
switching switching
(9 missing) (4 missing)
ECO372
¨ What is the correlation between D (the treatment) and Alphabet as a first writing
language in the ASSIGNED group?
¨ What is the correlation between D (the treatment) and Alphabet as a first writing
language in the RANDOMIZED group?
¨ Why do you suppose the treatment variable D is correlated with Alphabet as a first
writing language in the ASSIGNED group but not in the RANDOMIZED group?
¨ Why do you suppose the treatment variable D is correlated with "totalmins" but NOT
correlated Alphabet as a first writing language in the RANDOMIZED group?
ECO372
Assigned Randomized
Correlation Correlation
correct -0.08 correct -0.34
between D between D
and alphabet and alphabet
speed 0.22 -0.08 speed -0.26 0.12
assigned Random
english 0.03 0.18 0.12 english -0.12 -0.01 0.01
0.76
0.57
0.57
0.23
0.23
-0.12
alphabet 0.05 0.03 0.11 0.66 -0.12 alphabet -0.15 -0.01 0.02 0.53
-0.47
-0.47
-0.82
-0.82
touchscreen 0.19 0.02 -0.12 -0.18 -0.09 touchscreen 0.02 0.02 0.03 -0.24 -0.08
Dtaskswitch 0.47 -0.22 0.77 0.19 0.17 -0.05 Dtaskswitch 0.63 -0.15 -0.12 -0.01 -0.01 0.00
s
ct
et
en
ct
et
en
in
ee
is
in
ee
is
rre
ab
rre
ab
re
m
gl
re
m
gl
sp
sp
ph
co
sc
en
l
ph
co
sc
en
l
ta
ta
ch
al
ch
to
al
to
u
u
to
to
Outcomes: Y Predetermined Variables: X Outcomes: Y Predetermined Variables: X
ECO372
This is a scatter
Total time to complete tasks
of totalmins on 4 4
0 0
1 2 3 4 5 1 2 3 4 5
Accuracy-speed to answer prereq qs (corr num/10 min) Accuracy-speed to answer prereq qs (corr num/10 min)
n = 31 RMSE = .7291345 n = 40 RMSE = 1.101876
ECO372
Assigned group
Randomized group
ECO372
¨ Create an interaction variable between the treatment and the randomization type
Dtaskswitch*assigned.
¨ For the whole sample, run a regression of totalmins on Assigned, Dtaskswitch, and
the interaction.
¤ What is the meaning of the coefficient on the constant?
¤ What is the meaning of the coefficient on Assigned?
¤ What is the meaning of the coefficient on Dtaskswitch?
¤ What is the meaning of the coefficient on the interaction?
¤ Can you relate these 4 coefficients numerically to the point estimates in the regressions from
question 3 and 4?
ECO372
¤ For assigned=0
𝑌 = 𝛼 + 𝛽! 𝐷 + ε
¤ For assigned =1
𝑌 = 𝛼 + 𝛽! 𝐷 + 𝛽" 1 + 𝛽# 𝐷 ∗ 1 + ε
𝑌 = 𝛼 + 𝛽" + (𝛽! + 𝛽# )𝐷 + ε
In this case:
This is the difference in the
SDO for the assigned vs
randomized sample.
(difference-in the difference)
In this case:
This is the time difference for
those doing the focused
survey in the assigned vs
randomized sample (those
assigned are slower on the
focused survey)
1.33 + 0.20 = 1.52
ECO372
∆.
n SDO if A=0 = 𝛽0
∆/
∆.
n SDO if A=1 = 𝛽0 + 𝛽1
∆/
∆"
¤ Difference in SDO by A? = 𝛽%
∆&∆#
¤ SDO is a difference in means, so the interaction is the difference in the difference in means.
ECO372
Q5: Interacted Model (in THIS particular context it has this particular meaning)
¨ If the randomized model SDO is providing us the average treatment effect (ATE), the interaction is
giving us the bias from assignment by speed in the assigned group.
¨ How so?
¨ A=1: assigned means in the population we still have this (no basis to cancel of the last terms):
SDO = ATE + (1−π)(ATT-ATU) + E[Y0i| Di = 1] – E[Y0i| Di = 0]
¨ Take the difference between them. Our interaction is giving us a sample estimate of this:
(1−π)(ATT-ATU) + E[Y0i| Di = 1] – E[Y0i| Di = 0]
ECO372
Tutorial