Tobit Regression 1
Tobit Regression 1
Econ 674
Purdue University
Like the probit and ordered probit, the tobit model can be given a latent
variable interpretation. We write this as follows:
Case #1:
Suppose we seek to model expenditures on automobiles during the
calendar year. We apply a tobit to model this data. How would you
interpret your model in terms of this specific application?
Case #2:
Suppose that you seek to model expenditures on tobacco products during
the calendar year. The observed variable yi represents the fraction of
income spent on such products during the calendar year. The data is likely
characterized by lots of zeros.
In this case,
1 It is quite likely to see yi values very close to zero, given its
construction.
2 Perhaps negative values of zi make more sense in the context of this
application. Specifically, people may contribute to anti-smoking
campaigns, which we might interpret as a type of negative
expenditure.
That is, there may be some minimum level of expenditure that is possible.
For this reason, we might consider a variant of the tobit with an unknown
censoring point:
yi = max{0, zi }.
To derive the log likelihood in the tobit, (though it is not necessary to do
so), we first consider the c.d.f. :
Pr(Yi ≤ c|X ).
It is convenient to express this probability in the following way:
Pn
In the above n1 = i=1 Di , or the number of uncensored observations.
That is, the score vector can be obtained, as can the Hessian matrix.
Then we obtain
n1 1 X
L(δ, θ; y ) = − log(2π) + n1 log δ − (δyi − xi θ)2
2 2
i:yi >0
X
+ log[1 − Φ(xi θ)].
i:yi =0
With a bit of work, the components of the Hessian matrix can also be
obtained:
X φ(xi θ) φ(xi θ) X
Lθθ0 = xi θ − xi0 xi − xi0 xi .
1 − Φ(xi θ) 1 − Φ(xi θ)
i:yi =0 i:yi >0
Let
0 0 X0
γ = [θ δ] , X = , y = [y0 y1 ]0 ,
X1
where X0 consists of the X observations with yi = 0 (and similarly for X1 ,
etc.). That is, we first arrange the data with the yi = 0 outcomes
appearing first, followed by those with yi = 1.
With this notation in hand, one can show that the Hessian can be written
as:
This is indeed true, but in order to prove it, we must digress a little bit.
That is, we keep the shape of the marginal density, chop off the tail, and
scale it up to make sure it integrates to unity. Thus,
or
Note that this is exactly the term we needed to prove was negative in
order to verify that the Hessian is negative semidefinite.
σ̂ = δ̂ −1 , β̂ = θ̂/δ̂.
The Delta method can be used to obtain large sample standard errors.
Thus, the conditional mean function, given that positive values occur, is
not simply the population conditional mean xi β. As such, OLS results will
be biased and inconsistent.
The last term, again, is the mean of a truncated normal random variable,
though in this case the variance of u is σ 2 rather than unity. It follows by
similar reasoning that
Therefore,
Using the female labor supply data on the course website, we fit a
tobit model to account for the censoring at zero weeks of work.
MATLAB STATA
Variable Pt. Est Std. Err Marg Eff Pt. Est Stderr
Constant 31.93 3.83 —- 31.93 3.66
Ability .061 .0221 .056 .061 .022
SpouseInc. -.123 .0254 -.114 -.123 .025
Kids -13.52 1.22 -12.55 -13.52 1.16
Education .932 .292 .865 .932 .291
σ 23.24 .383 —- 23.24 .383