Microeconometrie Chapitre4 BinaryChoicePanelDataModels
Microeconometrie Chapitre4 BinaryChoicePanelDataModels
Théophile T. Azomahou
University Clermont Auvergne, CNRS, CERDI
Maastricht University, School of Business and Economics
Email: [email protected]
Pooled Estimator
Random Effects Models
Fixed Effects Models
Dynamic Binary Choice Models
Théophile T. Azomahou (CERDI) Février 20-28, 2020 1 / 14
Introduction
1. Introduction
The structural model for a possibly unbalanced panel of data would be written
to indicate a variable that equals one when the condition in parentheses is true
and zero when it is not.
A more promising approach is an effects
model,
The pooled estimator results if we simply ignore the heterogeneity, ui and fit the
model as if the cross-section specification applies.
If the fixed effects model is appropriate, then all the results for omitted
variables apply. The pooled MLE that ignores fixed effects will be
inconsistent. Observe that since the estimator is ML, not least squares,
converting the data to deviations from group means is not a
solution—converting the binary dependent variable to deviations will
produce a continuous variable with unknown properties.
The random effects case is more benign. From (17-39), the marginal
probability implied by the model is
= F (x0it δ).
The implication, which is absent in the linear case is that ignoring the
random effects in a pooled model produces an attenuated (inconsistent —
downward biased) estimate of β; the scale factor that produces δ is
1/(1 + σu2 )1/2 which is between zero and one.
and X indicates all the exogenous data in the sample, xit for all i and t. Then,
E [εit | X] = 0,
Var[εit | X] = σv2 + σu2 = 1 + σu2 ,
and σu2
Corr[εit , εis | X] = ρ = .
1 + σu2
The new free parameter is σu2 = ρ/(1 − ρ).
Théophile T. Azomahou (CERDI) Février 20-28, 2020 6 / 14
The Fixed Effects Models
where dit is a dummy variable that takes the value one for individual i and zero
otherwise. For convenience, we have redefined xit to be the nonconstant variables
in the model.
Estimation
What follows can be extended to any index function model, but for the present,
we’ll confine our attention to symmetric distributions such as the normal and
logistic, so that the probability can be conveniently written as
Prob(Yit = yit | xit ) = P[qit (αi + x0it β)]. It will be convenient to let
zit = αi + x0it β so Prob(Yit = yit | xit ) = P(qit zit ).
The problems with the fixed effects estimator are statistical, not practical.
The estimator relies on Ti increasing for the constant terms to be
consistent—in essence, each αi is estimated with Ti observations. But, in
this setting, not only is Ti fixed, it is likely to be quite small.
Hsiao (1986) found that for Ti = 2, the bias in the MLE of β is 100
percent, which is extremely pessimistic. Heckman and MaCurdy found in a
Monte Carlo study that in samples of n = 100 and T = 8, the bias appeared
to be on the order of 10 percent, which is substantive, but certainly less
severe than Hsiao’s results suggest.
A random or fixed effects model that explicitly allows for lagged effects would be
Lagged effects, or persistence, in a binary choice setting can arise from three
sources:
A related problem is that with a relatively short panel, the initial conditions, yi0 ,
have a crucial impact on the entire path of outcomes. Modeling dynamic effects
and initial conditions in binary choice models is more complex than in the linear
model.
The correlation between αi and yi,t−1 in the dynamic binary choice model makes
yi,t−1 endogenous. Thus, the estimators we have examined thus far will not be
consistent. Two familiar alternative approaches that have appeared in recent
applications are due to Heckman (1981) and Wooldridge (2005), both of which
build on the random effects specification.
Heckman’s approach
where zi is a set of “instruments” observed at the first period that are not
contained in xit .
where ζ = x0i1 β
+ γyi,t−1 In this form, the model can be viewed as a random
parameters (random constant term) model in which there is heteroscedasticity in
the random part of the constant term.
Wooldridge’s approach
Wooldridge’s approach suggests a model for the random effect conditioned on the
initial value. Thus,