YD Slides6 Panel
YD Slides6 Panel
STAT-S-301
11
FatalityRatei1988 FatalityRatei1982 =
(n = 48)
(n = 48)
16
17
Example:
Suppose we have n = 3 states: California, Texas, Massachusetts.
Yit = 0 + 1Xit + 2Zi + ui, i =1,,n, T = 1,,T
Population regression for California (that is, i = CA):
YCA,t = 0 + 1XCA,t + 2ZCA + uCA,t
= (0 + 2ZCA) + 1XCA,t + uCA,t
= CA + 1XCA,t + uCA,t
CA = 0 + 2ZCA doesnt change over time.
CA is the intercept for CA, and 1 is the slope.
The intercept is unique to CA, but the slope is the same in all
the states: parallel lines.
18
19
Y
CA
Y = TX + 1X
CA
Y = MA+ 1X
TX
TX
MA
MA
X
21
where
(1)
etc.
T t 1
T t 1
T t 1
24
1 T
1 T
1 T
or
Yit = 1 X it + uit
1 T
1 T
where Yit = Yit Yit and X it = Xit X it
T t 1
T t 1
Yit = 1 X it + uit
(2)
1 T
where Yit = Yit Yit , etc.
T t 1
First construct the demeaned variables Yit and X it .
Then estimate (2) by regressing Yit on X it using OLS.
Inference (hypothesis tests, confidence intervals) is as usual
(using heteroskedasticity-robust standard errors).
This is like the changes approach, but instead Yit is deviated
from the state average instead of Yi1.
This can be done in a single command in STATA.
26
Number of obs
F( 1,
287)
Prob > F
R-squared
Adj R-squared
Root MSE
=
=
=
=
=
=
336
10.41
0.0014
0.9050
0.8891
.18986
-----------------------------------------------------------------------------|
Robust
vfrall |
Coef.
Std. Err.
t
P>|t|
[95% Conf. Interval]
-------------+---------------------------------------------------------------beertax | -.6558736
.2032797
-3.23
0.001
-1.055982
-.2557655
_cons |
2.377075
.1051515
22.61
0.000
2.170109
2.584041
-------------+---------------------------------------------------------------state |
absorbed
(48 categories)
For n = 48, T = 7:
FatalityRate_est = 0.66BeerTax + State fixed effects
(0.20)
Should you report the intercept?
How many binary regressors would you include to
estimate this using the binary regressor method?
Compare slope, standard error to the estimate for the
1988 v. 1982 changes specification (T = 2, n = 48):
FR1988-FR1982 = 0.072 1.04(BeerTax1988BeerTax1982).
(0.065) (0.36)
28
1982 = 0 + 3S1982
Similarly,
Yi,1983 = 1983 + 1Xi,1983 + ui,1983,
1983 = 0 + 3S1983
etc.
30
Number of obs =
336
F( 7,
281) =
3.70
Prob > F
= 0.0008
R-squared
= 0.9089
Adj R-squared = 0.8914
Root MSE
= .18788
-----------------------------------------------------------------------------|
Robust
vfrall |
Coef.
Std. Err.
t
P>|t|
[95% Conf. Interval]
-------------+---------------------------------------------------------------beertax | -.6399799
.2547149
-2.51
0.013
-1.141371
-.1385884
y83 | -.0799029
.0502708
-1.59
0.113
-.1788579
.0190522
y84 | -.0724206
.0452466
-1.60
0.111
-.161486
.0166448
y85 | -.1239763
.0460017
-2.70
0.007
-.214528
-.0334246
y86 | -.0378645
.0486527
-0.78
0.437
-.1336344
.0579055
y87 | -.0509021
.0516113
-0.99
0.325
-.1524958
.0506917
y88 | -.0518038
.05387
-0.96
0.337
-.1578438
.0542361
_cons |
2.42847
.1468565
16.54
0.000
2.139392
2.717549
-------------+---------------------------------------------------------------state |
absorbed
(48 categories)
Assumption #1:
E(uit|Xi1,,XiT,i) = 0
uit has mean zero, given the state fixed effect and the
entire history of the Xs for that state.
This is an extension of the previous multiple regression
Assumption #1.
This means there are no omitted lagged effects (any
lagged effects of X must enter explicitly).
Also, there is not feedback from u to future X:
o Whether a state has a particularly high fatality rate this
year doesnt subsequently affect whether it increases
the beer tax.
o Well return to this when we take up time series data.
37
Assumption #2:
(Xi1,,XiT,Yi1,,YiT), i =1,,n, are i.i.d.
This is an extension of Assumption #2 for multiple
regression with cross-section data.
This is satisfied if entities (states, individuals) are
randomly sampled from their population by simple
random sampling, then data for those entities are collected
over time.
This does not require observations to be i.i.d. over time for
the same entity that would be unrealistic (whether a state
has a mandatory DWI sentencing law this year is strongly
related to whether it will have that law next year).
38
Assumption #5:
corr(uit,uis|Xit,Xis,i) = 0 for t s
This is new.
This says that (given X), the error terms are uncorrelated
over time within a state.
For example, uCA,1982 and uCA,1983 are uncorrelated..
Is this plausible? What enters the error term?
o Especially snowy winter.
o Opening major new divided highway.
o Fluctuations in traffic density from local economic
conditions.
Assumption #5 requires these omitted factors entering uit
to be uncorrelated over time, within a state.
39
40
45
46
48
49
50