
The Tobit Model

Econ 674

Purdue University

Justin L. Tobias


Estimation

In this lecture, we address estimation and application of the tobit model.

The tobit model is a useful specification to account for mass points in a dependent variable that is otherwise continuous.

For example, our outcome may be characterized by lots of zeros, and we want our model to speak to this incidence of zeros.


The tobit

Like the probit and ordered probit, the tobit model can be given a latent
variable interpretation. We write this as follows:

z_i = x_i β + u_i,   u_i | x_i ~ iid N(0, σ²),
y_i = max{0, z_i}.

We observe data on (x_i, y_i) but not on z_i directly. Note that z_i is
partially observed: whenever y_i > 0, we know that z_i = y_i.

Note that, unlike the probit and ordered probit, the scale parameter σ is not
fixed at unity. (Why?)

In some cases, application of the tobit is, perhaps, not ideal, while in
others it can be applied more credibly. Two examples illustrate.



The tobit

Case #1:
Suppose we seek to model expenditures on automobiles during the
calendar year. We apply a tobit to model this data. How would you
interpret your model in terms of this specific application?

Many would give z_i an interpretation like desired expenditure. If this is
positive, then the person buys a car and spends the desired amount. If this is
negative or zero, then we simply see that the person did not buy a car.

Are there any problems here?




The tobit

Case #2:
Suppose that you seek to model expenditures on tobacco products during
the calendar year. The observed variable yi represents the fraction of
income spent on such products during the calendar year. The data is likely
characterized by lots of zeros.

In this case:

1. It is quite likely to see y_i values very close to zero, given its
construction.

2. Perhaps negative values of z_i make more sense in the context of this
application. Specifically, people may contribute to anti-smoking campaigns,
which we might interpret as a type of negative expenditure.



The tobit

An important and often overlooked point is that, although it might seem
natural to assert that the "censoring" point is at zero, it may, in fact, be
something different from zero [Zuehlke (2003)].

That is, there may be some minimum level of expenditure that is possible.

For this reason, we might consider a variant of the tobit in which the
censoring point is not zero but some constant c, where c is to be estimated
from the data.



Estimation in the (Standard) Tobit
z_i = x_i β + u_i,   u_i | x_i ~ iid N(0, σ²),
y_i = max{0, z_i}.

To derive the log likelihood in the tobit (though it is not necessary to do
so), we first consider the c.d.f.:

Pr(Y_i ≤ c | x_i).

It is convenient to express this probability in the following way:

Pr(Y_i ≤ c | x_i) = Pr(Y_i ≤ c | x_i, D_i = 1) Pr(D_i = 1 | x_i)
                  + Pr(Y_i ≤ c | x_i, D_i = 0) Pr(D_i = 0 | x_i),

where D_i can be any binary variable, yet it is convenient to define it here as

D_i = I(z_i > 0) = I(y_i > 0).


Estimation in the (Standard) Tobit

Pr(Y_i ≤ c | x_i) = Pr(Y_i ≤ c | x_i, D_i = 1) Pr(D_i = 1 | x_i)
                  + Pr(Y_i ≤ c | x_i, D_i = 0) Pr(D_i = 0 | x_i).

With respect to the components of the above, some of these are
straightforward:

Pr(D_i = 1 | x_i) = Pr(z_i > 0 | x_i) = Pr(u_i > −x_i β | x_i) = Φ(x_i β/σ),

and hence, Pr(D_i = 0 | x_i) = 1 − Φ(x_i β/σ). What about
Pr(Y_i ≤ c | x_i, D_i = 0)? Intuitively, since Y_i = 0 whenever D_i = 0,

Pr(Y_i ≤ c | x_i, D_i = 0) = 1 for any c ≥ 0.



Estimation in the (Standard) Tobit

As for the remaining conditional density, note for c > 0:

Pr(Y_i ≤ c | x_i, D_i = 1) = Pr(0 < z_i ≤ c | x_i) / Pr(z_i > 0 | x_i)
                           = [Φ([c − x_i β]/σ) − Φ(−x_i β/σ)] / Φ(x_i β/σ).

Combining the pieces above, Pr(Y_i ≤ c | x_i) = Φ([c − x_i β]/σ) for c > 0;
differentiating with respect to c gives the density of Y_i over its positive
support.



Estimation in the (Standard) Tobit

Thus, we obtain the following "density" function for Y_i:

f(y_i | x_i) = (1/σ) φ([y_i − x_i β]/σ) I(y_i > 0) + I(y_i = 0) [1 − Φ(x_i β/σ)].

From here, it is not hard to get to the log likelihood:

log L(β, σ; y) = −(n_1/2) log(2πσ²) − [1/(2σ²)] Σ_{i: y_i > 0} (y_i − x_i β)²
               + Σ_{i: y_i = 0} log[1 − Φ(x_i β/σ)].

In the above, n_1 = Σ_{i=1}^n D_i, or the number of uncensored observations.
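For concreteness, a minimal sketch of this log likelihood in code (Python/SciPy here; the function name and interface are illustrative, not from the slides):

```python
import numpy as np
from scipy.stats import norm

def tobit_loglik(beta, sigma, y, X):
    """Tobit log likelihood in the (beta, sigma) parameterization."""
    # y: (n,) outcomes censored below at zero; X: (n, k) covariates (with constant)
    xb = X @ beta
    pos = y > 0                               # uncensored observations (D_i = 1)
    ll_pos = norm.logpdf((y[pos] - xb[pos]) / sigma) - np.log(sigma)
    ll_zero = norm.logcdf(-xb[~pos] / sigma)  # log[1 - Phi(x_i beta / sigma)]
    return ll_pos.sum() + ll_zero.sum()
```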



Estimation in the (Standard) Tobit

From here, a standard tobit analysis can be carried out.

That is, the score vector can be obtained, as can the Hessian matrix.

However, these are quite messy, particularly the Hessian.

Moreover, it turns out that a reparameterization of the problem simplifies
these expressions considerably and, furthermore, that we can prove global
concavity for the reparameterized model.



Estimation in the (Standard) Tobit
We employ the reparameterization suggested by Olsen (1978). Specifically,
we let

δ = 1/σ,   θ = β/σ.

Then we obtain

L(δ, θ; y) = −(n_1/2) log(2π) + n_1 log δ − (1/2) Σ_{i: y_i > 0} (δ y_i − x_i θ)²
           + Σ_{i: y_i = 0} log[1 − Φ(x_i θ)].

From this, we obtain the score:

∂L/∂θ = Σ_{i: y_i > 0} (δ y_i − x_i θ) x_i′ − Σ_{i: y_i = 0} [φ(x_i θ)/(1 − Φ(x_i θ))] x_i′,

∂L/∂δ = n_1/δ − Σ_{i: y_i > 0} (δ y_i − x_i θ) y_i.



Estimation in the (Standard) Tobit

With a bit of work, the components of the Hessian matrix can also be
obtained:

L_θθ′ = Σ_{i: y_i = 0} [φ(x_i θ)/(1 − Φ(x_i θ))] [x_i θ − φ(x_i θ)/(1 − Φ(x_i θ))] x_i′ x_i
      − Σ_{i: y_i > 0} x_i′ x_i.
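The slide displays only the θθ′ block; for completeness, differentiating the score above once more gives the remaining blocks. This is a routine calculation written here as a supplement, not taken from the slides:

$$
L_{\theta\delta} = \sum_{i:\,y_i>0} y_i\, x_i', \qquad
L_{\delta\delta} = -\frac{n_1}{\delta^2} - \sum_{i:\,y_i>0} y_i^2 .
$$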



Estimation in the (Standard) Tobit

Let

γ = [θ′  δ]′,   X = [X₀′  X₁′]′,   y = [y₀′  y₁′]′,

where X₀ consists of the rows of X with y_i = 0 (and similarly for X₁, etc.).
That is, we first arrange the data with the y_i = 0 outcomes appearing first,
followed by those with y_i > 0.

With this notation in hand, one can show that the Hessian can be written as:

H(γ) = [ −X₀′ D X₀ − X₁′ X₁        X₁′ y₁
              y₁′ X₁           −n₁/δ² − y₁′ y₁ ]

     = −[X₁  −y₁]′ [X₁  −y₁] − [ X₀′ D X₀     0
                                     0       n₁/δ² ].



Estimation in the (Standard) Tobit

In the last slide, D is an n₀ × n₀ diagonal matrix with typical diagonal
element

− [φ(x_i θ)/(1 − Φ(x_i θ))] [x_i θ − φ(x_i θ)/(1 − Φ(x_i θ))].

Furthermore, one can show that the Hessian is always negative semidefinite
(and thus the log likelihood is globally concave) provided the elements of D
are positive. (Why?)

Given the form of these elements above, this is true iff

x_i θ − φ(x_i θ)/(1 − Φ(x_i θ)) < 0.

This is indeed true, but in order to prove it, we must digress a little bit.
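Before the formal argument, a quick numerical sanity check of this inequality is easy to run (illustrative only, not a proof; the grid endpoints are arbitrary):

```python
import numpy as np
from scipy.stats import norm

# Check that x - phi(x) / (1 - Phi(x)) < 0 over a wide grid of x values.
x = np.linspace(-10, 10, 2001)
vals = x - norm.pdf(x) / norm.sf(x)   # norm.sf(x) = 1 - Phi(x)
print(vals.max() < 0)                 # True on this grid
```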



Mean of Truncated Normal
To obtain the density function for any truncated random variable w, we apply
the formula:

f(w | w > c) = f(w) / Pr(w > c),   for w > c.

That is, we keep the shape of the marginal density, chop off the tail, and
scale it up to make sure it integrates to unity. Thus,

E(w | w > c) = ∫_c^∞ w f(w) dw / Pr(w > c).

For the case of a standard normal random variable w, with c = x_i θ, we get:

E(w | w > c) = ∫_c^∞ w φ(w) dw / [1 − Φ(c)] = φ(c)/[1 − Φ(c)] = φ(x_i θ)/[1 − Φ(x_i θ)].
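The integral in the last line has a closed form because w φ(w) is an exact derivative; a one-line verification:

$$
\int_c^{\infty} w\,\phi(w)\,dw
= \int_c^{\infty} \frac{w}{\sqrt{2\pi}}\, e^{-w^{2}/2}\,dw
= \left[-\frac{1}{\sqrt{2\pi}}\, e^{-w^{2}/2}\right]_{c}^{\infty}
= \phi(c).
$$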



Mean of Truncated Normal

Now, clearly, it must be the case that

E(w | w > c) > c.

In the case of a standard normal random variable w, then, we have

φ(c)/[1 − Φ(c)] > c,

or, setting c = x_i θ,

x_i θ − φ(x_i θ)/[1 − Φ(x_i θ)] < 0.

Note that this is exactly the term we needed to prove was negative in order to
verify that the Hessian is negative semidefinite.



Mean of Truncated Normal

This result motivates use of the reparameterization in practice.

An iterative maximization routine should converge quickly to the maximum
given the uniqueness of this maximum.

Invariance can be applied to estimate β and σ. Specifically,

σ̂ = δ̂⁻¹,   β̂ = θ̂/δ̂.

The Delta method can be used to obtain large sample standard errors.
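As a sketch of that delta-method step (the stacking order γ = [θ′ δ]′ follows the earlier slide; the use of the inverse of the negative Hessian as the large-sample variance of γ̂ is the usual maximum likelihood result):

$$
(\beta, \sigma) = g(\theta, \delta) = (\theta/\delta,\; 1/\delta), \qquad
J = \frac{\partial g}{\partial \gamma'} =
\begin{pmatrix}
\delta^{-1} I_k & -\theta/\delta^{2} \\
\mathbf{0}' & -\delta^{-2}
\end{pmatrix},
$$
$$
\widehat{\mathrm{Var}}(\hat\beta, \hat\sigma) \;\approx\; \hat{J}\,\bigl[-H(\hat\gamma)\bigr]^{-1}\hat{J}' .
$$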



A note on discarding the zeros

It is somewhat common, though unfortunate, practice in the applied literature
to simply discard the zero responses when estimating the tobit. Of course,
this is not a valid procedure, since

E(y_i | x_i, y_i > 0) = x_i β + E(u_i | u_i > −x_i β, x_i)
                      = x_i β + σ φ(x_i β/σ)/Φ(x_i β/σ).

Thus, the conditional mean function, given that positive values occur, is not
simply the population conditional mean x_i β. As such, OLS applied to the
positive observations will be biased and inconsistent.
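A small simulation sketch makes the point concrete (the design and parameter values below are arbitrary assumptions, not the course data):

```python
import numpy as np

rng = np.random.default_rng(0)
n, beta, sigma = 100_000, np.array([1.0, 2.0]), 3.0

X = np.column_stack([np.ones(n), rng.normal(size=n)])
z = X @ beta + sigma * rng.normal(size=n)   # latent z_i
y = np.maximum(0.0, z)                      # observed y_i = max{0, z_i}

keep = y > 0                                # "discard the zeros"
ols = np.linalg.lstsq(X[keep], y[keep], rcond=None)[0]
print(ols)   # differs noticeably from (1.0, 2.0); the slope is attenuated
```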



Marginal Effects

We now describe a method for calculating marginal effects in the tobit.
Though several of these have been discussed, we focus our attention on effects
with respect to the mean of the observed y outcome. First, note (similar to
our previous discussion):

E(y | x) = E(y | x, z > 0) Pr(z > 0 | x) + E(y | x, z ≤ 0) Pr(z ≤ 0 | x)
         = E(y | x, z > 0) Pr(z > 0 | x).

(Why?) Hence, we have:

∂E(y | x)/∂x_j = [∂E(y | x, z > 0)/∂x_j] Pr(z > 0 | x)
               + E(y | x, z > 0) [∂Pr(z > 0 | x)/∂x_j].



Marginal Effects
To make things a bit simpler notationally, let φ ≡ φ(xi β/σ) and define Φ
analogously.
To put together all of the pieces of the marginal effect expression, we first
note:

E(y | x, z > 0) = E(z | x, z > 0)
                = xβ + E(u | x, z > 0)
                = xβ + E(u | u > −xβ, x).

The last term, again, is the mean of a truncated normal random variable,
though in this case the variance of u is σ² rather than unity. It follows by
similar reasoning that

E(y | x, z > 0) = xβ + σ φ/Φ.



Marginal Effects

In order to completely characterize the marginal effect, we must
differentiate the normal density function. That is, we seek:

∂φ/∂x_j = ∂[(2π)^{−1/2} exp(−[1/2](xβ/σ)²)]/∂x_j
        = (2π)^{−1/2} exp(−[1/2](xβ/σ)²) (−xβ/σ)(β_j/σ)
        = φ [−xβ/σ](β_j/σ).

Therefore, since ∂Φ/∂x_j = φ (β_j/σ),

∂E(y | x, z > 0)/∂x_j = β_j + σ [−φ Φ (xβ/σ)(β_j/σ) − φ² (β_j/σ)] / Φ².



Marginal Effects
Putting this together with the other pieces comprising our marginal effect,
we obtain:

∂E(y | x)/∂x_j = { β_j + σ [−φ Φ (xβ/σ)(β_j/σ) − φ² (β_j/σ)] / Φ² } Φ
               + [xβ + σ φ/Φ] φ (β_j/σ).

Rather conveniently, terms cancel to produce:

∂E(y | x)/∂x_j = Φ β_j = Φ(xβ/σ) β_j.

Any intuition here?

As Φ → 1, the probability associated with the mass point at zero approaches
zero. In this limiting case, we are essentially back in the linear regression
framework, whence the marginal effect reduces to β_j.
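A short code sketch of this final expression. The evaluation point is an assumption: the slides do not state where the marginal effects reported in the results table are evaluated, though the sample means of x are a common choice.

```python
import numpy as np
from scipy.stats import norm

def tobit_marginal_effects(beta, sigma, x):
    """Marginal effects dE(y|x)/dx_j = Phi(x'beta/sigma) * beta_j at a chosen x."""
    return norm.cdf(x @ beta / sigma) * beta
```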



Tobit: Application

Using the female labor supply data on the course website, we fit a
tobit model to account for the censoring at zero weeks of work.

We work in the (δ, θ) parameterization.

The fsolve command is used in MATLAB, so the score vector is programmed into
the maximization routine.

The following slide gives results from this exercise.
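A minimal sketch of the analogous approach in Python, with scipy.optimize.root playing the role of MATLAB's fsolve; the variable names and starting values are illustrative, and the course data set is not reproduced here:

```python
import numpy as np
from scipy.stats import norm
from scipy.optimize import root

def tobit_score(params, y, X):
    """Score of Olsen's reparameterized log likelihood; params = [theta, delta]."""
    theta, delta = params[:-1], params[-1]
    pos = y > 0
    r = delta * y[pos] - X[pos] @ theta                   # delta*y_i - x_i*theta, uncensored obs
    mills = norm.pdf(X[~pos] @ theta) / norm.sf(X[~pos] @ theta)
    d_theta = X[pos].T @ r - X[~pos].T @ mills            # dL/dtheta
    d_delta = pos.sum() / delta - r @ y[pos]              # dL/ddelta
    return np.append(d_theta, d_delta)

# y, X = ...  # load the labor supply data (weeks worked; covariates with a constant)
# start = np.append(np.zeros(X.shape[1]), 1.0)
# sol = root(tobit_score, start, args=(y, X))
# theta_hat, delta_hat = sol.x[:-1], sol.x[-1]
# beta_hat, sigma_hat = theta_hat / delta_hat, 1.0 / delta_hat   # invariance
```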



Tobit: Application

                    ---------- MATLAB ----------       ----- STATA -----
Variable        Pt. Est   Std. Err   Marg. Eff       Pt. Est   Std. Err
Constant          31.93     3.83        --             31.93     3.66
Ability            .061     .0221       .056            .061     .022
Spouse Inc.       -.123     .0254      -.114           -.123     .025
Kids            -13.52      1.22     -12.55           -13.52     1.16
Education          .932     .292        .865            .932     .291
σ                 23.24     .383        --              23.24     .383



References

Tobin, J. (1958). "Estimation of Relationships for Limited Dependent
Variables." Econometrica.

Olsen, R. J. (1978). "A Note on the Uniqueness of the Maximum Likelihood
Estimator for the Tobit Model." Econometrica.

Zuehlke, T. (2003). "Estimation of a Tobit Model with Unknown Censoring
Threshold." Applied Economics.
