0% found this document useful (0 votes)

30 views24 pages

Lecture 7 Probit

This document discusses probit regression models for binary dependent variables. It introduces the probit model, which uses the cumulative standard normal distribution to model the probability of a binary outcome as a nonlinear function of explanatory variables. This allows the predicted probabilities to always be between 0 and 1. The document provides an example using STATA to estimate a probit model using HMDA data and discusses interpreting the results, including computing predicted probabilities and marginal effects.

Uploaded by

Richa Jha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

30 views24 pages

Lecture 7 Probit

Uploaded by

Richa Jha

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

Lecture 7 Limited Dependent Variable -2

ECMT-7302
Econometrics II
MA Eco. 2022, Fall 2023
Instructor: Sunaina Dhingra
Lectures: Wednesday, (11.20-12.50pM) & Thursday (9-40-11.10 am)
Lecture Meeting Mode: In person (Classroom: T4-F99)
Office Hours: Wednesday 1-2.30 pm & by appointment in FOB, Office No.1B in south on 7th Floor)
Email-id: [email protected]
Lecture Material: Slides and textbooks
Credits: 4.5
• Assume that in the two-variable model Yi = β1 + β2 Xi + ui the Yi are normally
and independently distributed with mean = β1 + β2 Xi and variance = σ 2.
• The joint probability density function of Y1, Y2, ... , Yn , given the preceding
mean and variance, can be written as

• But in view of the independence of the Y’s, this joint probability density
function can be written as a product of n individual density functions as

• Where

• which is the density function of a normally distributed variable with the given mean and 1-2
variance.
• Substituting Equation (2) for each Yi into Equation (1) gives

• If Y1, Y2, . . . , Yn are known or given, but β1, β2, and σ2 are not known, the function in
Equation (3) is called a likelihood function, denoted by LF(β1, β2, σ2), and written as

• MLE Method consists in estimating the unknown parameters (β1, β2, and σ2 )in such a manner
that the probability of observing the given Y’s is as high (or maximum) as possible.
• Therefore, we find the maximum of the function in Equation (4) using differential calculus.
• For differentiation it is easier to express Equation (4) in the log term as follows.
(Note: ln = natural log.)
• Differentiating Equation (5) partially with respect to β1, β2, and σ2, we obtain

1-4
• After simplifying, Eqs. (9) and (10) yield

• which are precisely the normal equations of the least-squares theory obtained by OLS

1-5
• the ML estimator of σ2 is biased. The magnitude of this bias can be easily determined
as follows.

1-6
Limited Dependent Variable Models
• Logit and Probit models for binary response

• Disadvantages of the LPM for binary dependent variables

• Predictions sometimes lie outside the unit interval
• Partial effects of explanatory variables are constant

• Nonlinear models for binary response

• Response probability is a nonlinear function of explanatory variables

7
Limited Dependent Variable Models
• Choices for the link function

• Latent variable formulation of the Logit and Probit models

8
Limited Dependent Variable Models
• Interpretation of coefficients in Logit and Probit models

• Partial effects are nonlinear and depend on the level of x.

9
Limited Dependent Variable Models
• Maximum likelihood estimation of Logit and Probit models

• Properties of maximum likelihood estimators

• Maximum likelihood estimators are consistent, asymptotically normal, and asymptotically efficient if the
distributional assumptions hold.

10
Probit and Logit Regression

• The problem with the linear probability model is that it

models the probability of Y=1 as being linear:

Pr(Y = 1|X) = β0 + β1X

• Instead, we want:

• Pr(Y = 1|X) to be increasing in X for β1>0, and

• 0 ≤ Pr(Y = 1|X) ≤ 1 for all X

• This requires using a nonlinear functional form for the

probability. How about an “S-curve” (like a CDF from earlier classes)
• The probit model satisfies these conditions:
I. Pr(Y = 1|X) to be increasing in X for β1>0, and
II. 0 ≤ Pr(Y = 1|X) ≤ 1 for all X
Probit Regression
• Probit regression models the probability that Y=1 using
the cumulative standard normal distribution function,
Φ(z), evaluated at z = β0 + β1X. The probit regression
model is,
• Pr(Y = 1|X) = Φ(β0 + β1X)
• where Φ is the cumulative normal distribution function
and z = β0 + β1X is the “z-value” or “z-index” of the
probit model.
• Example: Suppose β0 = -2, β1= 3, X = .4, so
• Pr(Y = 1|X=.4) = Φ(-2 + 3×.4) = Φ(-0.8)
• Pr(Y = 1|X=.4) = area under the standard normal density
to left of z = -.8, which is…
Pr(z ≤ -0.8) = .2119
Probit regression, ctd.
• Why use the cumulative normal probability distribution?
• The “S-shape” gives us what we want:
• Pr(Y = 1|X) is increasing in X for β1>0
• 0 ≤ Pr(Y = 1|X) ≤ 1 for all X
• Easy to use – the probabilities are tabulated in the cumulative
normal tables (and easily using regression software)
• Relatively straightforward interpretation:
• β0 + β1X = z-value
• ̂0+ ̂1X is the predicted z-value, given X
• β1 is the change in the z-value for a unit change in X
• The probit model satisfies these conditions:
I. Pr(Y = 1|X) to be increasing in X for β1>0, and
II. 0 ≤ Pr(Y = 1|X) ≤ 1 for all X
The probit model uses the cumulative normal distribution function to
model the probability of denial given the payment-to income ratio or,
more generally, to model Pr(Y = 1| X). Unlike the linear probability model,
the probit conditional probabilities are always between 0 and 1.

1-17
STATA Example: HMDA data
. probit deny p_irat, r;
Iteration 0: log likelihood = -872.0853 We’ll discuss this later
Iteration 1: log likelihood = -835.6633
Iteration 2: log likelihood = -831.80534
Iteration 3: log likelihood = -831.79234
Probit estimates Number of obs = 2380
Wald chi2(1) = 40.68
Prob > chi2 = 0.0000
Log likelihood = -831.79234 Pseudo R2 = 0.0462
------------------------------------------------------------------------------
| Robust
deny | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
p_irat | 2.967908 .4653114 6.38 0.000 2.055914 3.879901
_cons | -2.194159 .1649721 -13.30 0.000 -2.517499 -1.87082
------------------------------------------------------------------------------
Pr (deny = 1|P / Iratio) = Φ(-2.19 + 2.97×P/I ratio)
(.16) (.47)
STATA Example: HMDA data, ctd.
Pr (deny = 1|P / Iratio) = Φ(-2.19 + 2.97×P/I ratio)
(.16) (.47)
• Positive coefficient: Does this make sense?
• Standard errors have the usual interpretation
• Predicted probabilities:
Pr (deny = 1|P / Iratio = .3) = Φ (-2.19+2.97×.3)
= Φ (-1.30) = .097
• Effect of change in P/I ratio from .3 to .4:
Pr (deny = 1|P / Iratio = .4) = Φ (-2.19+2.97×.4)
= Φ (-1.00) = .159
• Predicted probability of denial rises from .097 to .159
• increase in the probability of denial of 6.2 percentage points,
from 9.7% to 15.9%
• Because the probit regression function is nonlinear, the effect of
a change in X depends on the starting value of X.

• For example, if P/I ratio = 0.5, the estimated denial probability

based on Equation is (-2.19 + 2.97 * 0.5) = (-0.71) = 0.239.

• Thus the change in the predicted probability when P/I ratio

increases from 0.4 to 0.5 is 0.239 - 0.159, or 8.0 percentage
points,

• Which larger than the increase of 6.2 percentage points when

P/I ratio increases from 0.3 to 0.4.

1-20
Probit regression with multiple regressors
Pr(Y = 1|X1, X2) = Φ (β0 + β1X1 + β2X2)
• The model is best interpreted by computing predicted probabilities and the
effect of a change in a regressor.
• Φ is the cumulative normal distribution function.
• The predicted probability that Y = 1, given values of X1, X2 is calculated by
computing the z-value, z = β0 + β1X1 + β2X2 and then looking up this z-value
in the normal distribution table (Appendix Table 1).
• z = β0 + β1X1 + β2X2 is the “z-value” or “z-index” of the probit model.
• β1 is the effect on the z-score of a unit change in X1, holding constant X2
• The effect on the predicted probability of a change in a regressor is
computed by
• (1) computing the predicted probability for the initial value of the regressors,
• (2) computing the predicted probability for the new or changed value of the
regressors, and
• (3) taking their difference.
STATA Example: HMDA data
. probit deny p_irat black, r;
Iteration 0: log likelihood = -872.0853
Iteration 1: log likelihood = -800.88504
Iteration 2: log likelihood = -797.1478
Iteration 3: log likelihood = -797.13604
Probit estimates Number of obs = 2380
Wald chi2(2) = 118.18
Prob > chi2 = 0.0000
Log likelihood = -797.13604 Pseudo R2 = 0.0859
------------------------------------------------------------------------------
| Robust
deny | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
p_irat | 2.741637 .4441633 6.17 0.000 1.871092 3.612181
black | .7081579 .0831877 8.51 0.000 .545113 .8712028
_cons | -2.258738 .1588168 -14.22 0.000 -2.570013 -1.947463
------------------------------------------------------------------------------
We’ll go through the estimation details later…
STATA Example, ctd.: Predicted probit probabilities
. probit deny p_irat black, r;
Probit estimates Number of obs = 2380
Wald chi2(2) = 118.18
Prob > chi2 = 0.0000
Log likelihood = -797.13604 Pseudo R2 = 0.0859
------------------------------------------------------------------------------
| Robust
deny | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
p_irat | 2.741637 .4441633 6.17 0.000 1.871092 3.612181
black | .7081579 .0831877 8.51 0.000 .545113 .8712028
_cons | -2.258738 .1588168 -14.22 0.000 -2.570013 -1.947463
------------------------------------------------------------------------------
. sca z1 = _b[_cons]+_b[p_irat]*.3+_b[black]*0;
. display "Pred prob, p_irat=.3, white: " normprob(z1);
Pred prob, p_irat=.3, white: .07546603
NOTE
_b[_cons] is the estimated intercept (-2.258738)
_b[p_irat] is the coefficient on p_irat (2.741637)
sca creates a new scalar which is the result of a calculation
display prints the indicated information to the screen
STATA Example, ctd.
Pr (deny = 1|P/I, black)
= Φ(-2.26 + 2.74×P/I ratio + .71×black)
(.16) (.44) (.08)
• Is the coefficient on black statistically significant?
• Estimated effect of race for P/I ratio = .3:
Pr (deny = 1|.3,1)= Φ(-2.26+2.74×.3+.71×1) = .233

Pr (deny = 1|.3,0)= Φ(-2.26+2.74×.3+.71×0) = .075

• Difference in rejection probabilities = .158 (15.8 pp)

• Still plenty of room for omitted variable bias!

Practice Aspen Plus User Certification Exam
100% (6)
Practice Aspen Plus User Certification Exam
9 pages
Counting Atoms - Worksheet - Docx WK1
100% (2)
Counting Atoms - Worksheet - Docx WK1
2 pages
Regresi Logistik
No ratings yet
Regresi Logistik
34 pages
Piper Pitch Trim Service Manual
83% (18)
Piper Pitch Trim Service Manual
206 pages
Regression With A Binary Dependent Variable
No ratings yet
Regression With A Binary Dependent Variable
55 pages
Introduction To Econometrics - Stock & Watson - CH 9 Slides
100% (1)
Introduction To Econometrics - Stock & Watson - CH 9 Slides
69 pages
Regression With A Binary Dependent Variable
No ratings yet
Regression With A Binary Dependent Variable
63 pages
Econometrics Chapter 11 PPT Slides
No ratings yet
Econometrics Chapter 11 PPT Slides
46 pages
Lecture15 Binary Dependent Variables
No ratings yet
Lecture15 Binary Dependent Variables
38 pages
ECON3002 2013 Final Merged Answer
No ratings yet
ECON3002 2013 Final Merged Answer
23 pages
Notes 13
No ratings yet
Notes 13
18 pages
Assignment On Probit Model
No ratings yet
Assignment On Probit Model
17 pages
Msfe Week9
No ratings yet
Msfe Week9
5 pages
Section 9 Limited Dependent Variables
No ratings yet
Section 9 Limited Dependent Variables
17 pages
Regression With A Binary Dependent Variable: Michael Ash
No ratings yet
Regression With A Binary Dependent Variable: Michael Ash
18 pages
Limited Dependent Variables - Binary Dependent Variables
No ratings yet
Limited Dependent Variables - Binary Dependent Variables
24 pages
Lecture 8
No ratings yet
Lecture 8
39 pages
Seminar Econometrie
No ratings yet
Seminar Econometrie
15 pages
An Introduction To Logistic Regression
No ratings yet
An Introduction To Logistic Regression
48 pages
Probit Model
No ratings yet
Probit Model
29 pages
CH 5. Discrete Choice Model
No ratings yet
CH 5. Discrete Choice Model
38 pages
09-Limited Dependent Variable Models
No ratings yet
09-Limited Dependent Variable Models
71 pages
Probit and Logit-Madesh
No ratings yet
Probit and Logit-Madesh
22 pages
Unitb - II - Linear Probability, Logit and Probit
No ratings yet
Unitb - II - Linear Probability, Logit and Probit
34 pages
Chapter 5 MGT
No ratings yet
Chapter 5 MGT
60 pages
Chapter 5-LDVM-2024
No ratings yet
Chapter 5-LDVM-2024
27 pages
An Introduction To Logistic Regression: Johnwhitehead Department of Economics East Carolina University
No ratings yet
An Introduction To Logistic Regression: Johnwhitehead Department of Economics East Carolina University
48 pages
Qualitative Response Models
No ratings yet
Qualitative Response Models
35 pages
EC212: Introduction To Econometrics Multiple Regression: Inference (Wooldridge, Ch. 4)
No ratings yet
EC212: Introduction To Econometrics Multiple Regression: Inference (Wooldridge, Ch. 4)
89 pages
Cap1 Slides
No ratings yet
Cap1 Slides
30 pages
2101 F 17 Assignment 1
No ratings yet
2101 F 17 Assignment 1
8 pages
Logistic Regression: Continued Psy 524 Ainsworth
0% (1)
Logistic Regression: Continued Psy 524 Ainsworth
29 pages
Section and Solution
No ratings yet
Section and Solution
4 pages
Hypothesis Testing
No ratings yet
Hypothesis Testing
23 pages
Binary Logistic Regression - 6.2
No ratings yet
Binary Logistic Regression - 6.2
34 pages
X400004 20220215 Solutions
No ratings yet
X400004 20220215 Solutions
8 pages
Logit
No ratings yet
Logit
48 pages
3 Classification
No ratings yet
3 Classification
26 pages
Regn Lect 5
No ratings yet
Regn Lect 5
9 pages
Regression 101
No ratings yet
Regression 101
18 pages
STAT511Q2Q4
No ratings yet
STAT511Q2Q4
11 pages
Econometric Lec7
No ratings yet
Econometric Lec7
26 pages
Econometrics Eviews 6
No ratings yet
Econometrics Eviews 6
12 pages
Cheat Sheet
No ratings yet
Cheat Sheet
4 pages
Econometrics - Qualitative Response Models
No ratings yet
Econometrics - Qualitative Response Models
17 pages
Nu - Edu.kz Econometrics-I Assignment 4 Answer Key
No ratings yet
Nu - Edu.kz Econometrics-I Assignment 4 Answer Key
4 pages
Homework 8
100% (1)
Homework 8
6 pages
Econometrics
No ratings yet
Econometrics
37 pages
Econometrics
No ratings yet
Econometrics
40 pages
MLE Lecture Note For Econometrician
No ratings yet
MLE Lecture Note For Econometrician
13 pages
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
100% (1)
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
24 pages
Logit and Probit: Models With Discrete Dependent Variables
No ratings yet
Logit and Probit: Models With Discrete Dependent Variables
30 pages
2-13 Limited Dependent Variables - Probit and Logit
No ratings yet
2-13 Limited Dependent Variables - Probit and Logit
21 pages
Logit and Probit Models
No ratings yet
Logit and Probit Models
44 pages
Seu Ds610 Mod03
No ratings yet
Seu Ds610 Mod03
45 pages
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
No ratings yet
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
27 pages
4 - HY577 - Hypothesis Testing Basics
No ratings yet
4 - HY577 - Hypothesis Testing Basics
57 pages
Section 11 PDF
No ratings yet
Section 11 PDF
7 pages
Homework 02 Key Answer STAT 4444
No ratings yet
Homework 02 Key Answer STAT 4444
5 pages
Calculus: Maths of the Gods
From Everand
Calculus: Maths of the Gods
Bill Todorovich
No ratings yet
Shortcuts to College Calculus Refreshment Kit
From Everand
Shortcuts to College Calculus Refreshment Kit
Juan Acevedo
No ratings yet
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
From Everand
Digital Signal and Image Processing using MATLAB, Volume 3: Advances and Applications, The Stochastic Case
Gérard Blanchet
3/5 (1)
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
From Everand
Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition
Stuart A. Klugman
4/5 (1)
Iec TC 114 Marine Stds Brochure June2022
No ratings yet
Iec TC 114 Marine Stds Brochure June2022
11 pages
CCATMMS006
No ratings yet
CCATMMS006
44 pages
Y5 Y6 Probability Q&A
No ratings yet
Y5 Y6 Probability Q&A
2 pages
Jominy End Quench Test - Nas
No ratings yet
Jominy End Quench Test - Nas
3 pages
Magnetic Filed of Earth
67% (3)
Magnetic Filed of Earth
25 pages
June 2016 Regents Exam Explained
No ratings yet
June 2016 Regents Exam Explained
188 pages
Che 10
No ratings yet
Che 10
10 pages
EMI PMC 2024 Abstracts 122724
No ratings yet
EMI PMC 2024 Abstracts 122724
1,199 pages
m1 Rws Sept
No ratings yet
m1 Rws Sept
2 pages
INPhO 2006 09
No ratings yet
INPhO 2006 09
98 pages
GR Xi Eng PR L11 Sup L5 QB
No ratings yet
GR Xi Eng PR L11 Sup L5 QB
26 pages
Dynamic Equilibrium Among Erosion, River Incision, and Coastal Uplift in The Nothern and Central Apennines, Italy - Columbu A. Et Alii - 2008
No ratings yet
Dynamic Equilibrium Among Erosion, River Incision, and Coastal Uplift in The Nothern and Central Apennines, Italy - Columbu A. Et Alii - 2008
4 pages
PH User Guide
No ratings yet
PH User Guide
11 pages
Cellier, F.E. (1991), Continuous System Modelling Incomplete
100% (1)
Cellier, F.E. (1991), Continuous System Modelling Incomplete
549 pages
Electricity MCQ
No ratings yet
Electricity MCQ
14 pages
Tutorial - Air Comfort
No ratings yet
Tutorial - Air Comfort
13 pages
MET 3145 ASSIGN 2 - 2025 - Solutions
No ratings yet
MET 3145 ASSIGN 2 - 2025 - Solutions
6 pages
Electrical and Electronic Measurements and Instrumentation
No ratings yet
Electrical and Electronic Measurements and Instrumentation
19 pages
Örnek Sorular 2
No ratings yet
Örnek Sorular 2
17 pages
Engineering Structures: Sciencedirect
No ratings yet
Engineering Structures: Sciencedirect
16 pages
Regr I Ration
No ratings yet
Regr I Ration
2 pages
ThermodyncamicFormalism&BowenFormula Iommi2008
No ratings yet
ThermodyncamicFormalism&BowenFormula Iommi2008
43 pages
Projectile Launched at An Angle
100% (1)
Projectile Launched at An Angle
47 pages
Differences Between UV and EB
No ratings yet
Differences Between UV and EB
2 pages
1811016-Experiment 3
No ratings yet
1811016-Experiment 3
21 pages
Teri W. Odom
No ratings yet
Teri W. Odom
5 pages
Why It Is Important To Understand: The Theory of Matrices and Determinants
No ratings yet
Why It Is Important To Understand: The Theory of Matrices and Determinants
8 pages

Lecture 7 Probit

Uploaded by

Lecture 7 Probit

Uploaded by

Lecture 7 Limited Dependent Variable -2

• Disadvantages of the LPM for binary dependent variables

• Nonlinear models for binary response

• Latent variable formulation of the Logit and Probit models

• Partial effects are nonlinear and depend on the level of x.

• Properties of maximum likelihood estimators

• The problem with the linear probability model is that it

Pr(Y = 1|X) = β0 + β1X

• Pr(Y = 1|X) to be increasing in X for β1>0, and

• 0 ≤ Pr(Y = 1|X) ≤ 1 for all X

• This requires using a nonlinear functional form for the

• For example, if P/I ratio = 0.5, the estimated denial probability

• Thus the change in the predicted probability when P/I ratio

• Which larger than the increase of 6.2 percentage points when

Pr (deny = 1|.3,0)= Φ(-2.26+2.74×.3+.71×0) = .075

• Difference in rejection probabilities = .158 (15.8 pp)

You might also like