Limited Dependent Variables Models PDF
The probability that yi = 1 is not constant over the data. Via the chain
rule, we see that the effect of an increase in Xj on the probability is the
product of two factors: the effect of Xj on the latent variable and the
derivative of the CDF evaluated at yi∗. The latter term, ψ(·), is the
probability density function (PDF) of the distribution.
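The chain-rule statement above can be written out explicitly. The notation here is an assumption, chosen to match the ψ used above: Ψ denotes the CDF, X_i β the index that determines the latent variable, and β_j the coefficient on Xj.

```latex
\frac{\partial \Pr(y_i = 1 \mid X_i)}{\partial X_{ij}}
  = \frac{\partial \Psi(X_i \beta)}{\partial (X_i \beta)}
    \cdot \frac{\partial (X_i \beta)}{\partial X_{ij}}
  = \psi(X_i \beta)\, \beta_j
```

Because ψ(·) is evaluated at each observation's own index, the effect differs across observations even though β_j is a single fixed number.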
Christopher F Baum (BC / DIW) Limited Dependent Variables BBS 2013 12 / 47
Limited dependent variables Marginal effects and predictions
. margins, dydx(_all)
Average marginal effects Number of obs = 2000
Model VCE : OIM
Expression : Pr(work), predict()
dy/dx w.r.t. : age married children education
Delta-method
dy/dx Std. Err. z P>|z| [95% Conf. Interval]
The marginal effects imply that married women have a 12.5 percentage
point higher probability of labor force participation, while an additional
child is associated with a 13 percentage point increase in participation.
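To make the idea of an average marginal effect concrete, here is a minimal Python sketch of the calculation for a probit-type model: compute ψ(index) times the coefficient for every observation, then average. The coefficients and the four data rows are hypothetical placeholders, not the womenwk estimates, and every regressor is treated as continuous.

```python
import math

def normal_pdf(z):
    """Standard normal density, playing the role of psi(.) in a probit model."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def average_marginal_effect(rows, beta, const, j):
    """Average over observations of psi(index_i) * beta[j]."""
    total = 0.0
    for x in rows:
        index = const + sum(b * v for b, v in zip(beta, x))
        total += normal_pdf(index) * beta[j]
    return total / len(rows)

# Hypothetical coefficients and data, NOT the estimates from the slides.
beta = [0.03, 0.4]          # illustrative coefficients on (age, married)
const = -1.5                # illustrative intercept
rows = [(25, 0), (35, 1), (45, 1), (55, 0)]

ame_age = average_marginal_effect(rows, beta, const, 0)
print(round(ame_age, 4))
```

The average is always smaller in magnitude than the raw coefficient, because the standard normal density never exceeds about 0.4.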
We can define the logit, or log of the odds ratio, in terms of grouped
data (averages of microdata). For instance, in the 2004 U.S.
presidential election, the ex post probability of a Massachusetts
resident voting for John Kerry was 0.62, with a logit of
log(0.62/(1 − 0.62)) = 0.4895. The probability of that person voting
for George Bush was 0.37, with a logit of −0.5322. Say that we had
such data for all 50 states. It would be inappropriate to use linear
regression on the probabilities voteKerry and voteBush, just as it would
be inappropriate to run a regression on individual voters’ voteKerry and
voteBush indicator variables.
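The two logits quoted above can be checked directly. This short Python sketch covers only the transformation, not a grouped-data estimator; the function names are illustrative.

```python
import math

def logit(p):
    """Log of the odds p / (1 - p), for a probability strictly between 0 and 1."""
    return math.log(p / (1.0 - p))

def inverse_logit(z):
    """Map a logit back to a probability."""
    return 1.0 / (1.0 + math.exp(-z))

kerry = logit(0.62)
bush = logit(0.37)
print(round(kerry, 4), round(bush, 4))   # 0.4895 -0.5322
```

The logit is unbounded in both directions, while the probabilities themselves are confined to the unit interval.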
Truncation
We turn now to a context where the response variable is neither binary nor
necessarily integer, but subject to truncation. This is a bit trickier, since
a truncated or censored response variable may not be obviously so.
We must fully understand the context in which the data were
generated. Nevertheless, it is quite important that we identify situations
of truncated or censored response variables. Using such a variable
as the dependent variable in a regression equation without
accounting for truncation or censoring will produce misleading estimates.
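A small simulation illustrates why ignoring censoring is misleading. This Python sketch rests on assumptions of its own choosing: a latent outcome with a true slope of 1.0, standard normal errors, and censoring from below at zero. It uses simulated data, not the womenwk data.

```python
import random

def ols_slope(x, y):
    """Slope from a simple least-squares regression of y on x with an intercept."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

random.seed(12345)           # fixed seed so the run is reproducible
n = 5000
true_slope = 1.0             # assumed slope of the latent relationship

x = [random.uniform(-2.0, 2.0) for _ in range(n)]
latent = [true_slope * xi + random.gauss(0.0, 1.0) for xi in x]
censored = [max(0.0, yi) for yi in latent]   # values below zero recorded as zero

slope_latent = ols_slope(x, latent)
slope_censored = ols_slope(x, censored)
print(round(slope_latent, 3), round(slope_censored, 3))
```

In this design the regression on the censored outcome recovers a slope well below the true value, while the regression on the (normally unobservable) latent outcome stays close to it.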
. use womenwk,clear
. regress lwf age married children education
      Source |       SS       df       MS              Number of obs =    2000
-------------+------------------------------           F(  4,  1995) =  134.21
       Model |  937.873188     4  234.468297           Prob > F      =  0.0000
    Residual |  3485.34135  1995  1.74703827           R-squared     =  0.2120
-------------+------------------------------           Adj R-squared =  0.2105
       Total |  4423.21454  1999  2.21271363           Root MSE      =  1.3218
The tobit estimates of lwf show positive, significant effects for age,
marital status, the number of children and the number of years of
education. Each of these factors is expected to increase both the
probability that a woman will work and her wage
conditional on employed status.
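The two channels mentioned above (probability of working, and wage conditional on working) can be separated in a standard tobit with censoring from below at zero. The Python sketch below assumes that setup, with normal errors; the values of xb, sigma, and beta_j are hypothetical, not estimates from the slides. It also checks numerically that the effect on the observed mean equals beta_j times the probability of being uncensored.

```python
import math

def normal_pdf(z):
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)

def normal_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def tobit_pieces(xb, sigma):
    """Return Pr(y > 0), E[y | y > 0], and E[y] for y = max(0, xb + sigma * e)."""
    z = xb / sigma
    prob_uncensored = normal_cdf(z)
    mean_if_uncensored = xb + sigma * normal_pdf(z) / prob_uncensored
    mean_observed = prob_uncensored * mean_if_uncensored
    return prob_uncensored, mean_if_uncensored, mean_observed

def marginal_effect_on_observed(beta_j, xb, sigma):
    """Effect of x_j on E[y]: the coefficient scaled by Pr(y > 0)."""
    return beta_j * normal_cdf(xb / sigma)

# Hypothetical values for illustration only.
xb, sigma, beta_j = 0.5, 1.2, 0.1

prob, cond_mean, obs_mean = tobit_pieces(xb, sigma)
analytic = marginal_effect_on_observed(beta_j, xb, sigma)

# Numerical check: perturb the index by beta_j * h and difference E[y].
h = 1e-6
numeric = (tobit_pieces(xb + beta_j * h, sigma)[2] - obs_mean) / h
print(round(prob, 4), round(analytic, 4), round(numeric, 4))
```

Since the scaling factor is a probability, the effect on the observed outcome is never larger in magnitude than the latent-variable coefficient itself.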
Delta-method
dy/dx Std. Err. z P>|z| [95% Conf. Interval]
Note, for instance, the much smaller marginal effects associated with
number of children and level of education in tobit vs. regress.