Chapter 5 MGT
Chapter 5 MGT
modeling.
OLS.
Zenegnaw Abiy Hailu (PhD) 5
The Linear Probability Model
• Suppose, for example, that we wanted to model the probability
Size
themil. of
US dollars
• For any firm whose asset is less than $25m, the model-predicted
– Are we really certain that very small firms will definitely never
pay a dividend and that large firms will always make a payout?
Zenegnaw Abiy Hailu (PhD) 9
Disadvantages of the Linear
Probability Model
• The LPM also suffers from more standard econometric problems
that we have examined in the previous chapters.
– Since the dependent variable only takes one or two values, for
given (fixed in repeated samples) values of the explanatory
variables, the disturbance term will also only take on one of
two values – normality is violated
• F(Y) = Y′ = Xb + e
0 CDF
0
• With the logistic model, 0 and 1 are asymptotes to the function and thus
the probabilities will never actually fall to exactly zero or rise to one,
although they may come infinitesimally close.
real line
coefficients will be, and then determines the direction and size
independent variables?
significantly).
Xβ
e
P(Y 1| X) Xβ
1 e
– Model Chi-Square
– Pseudo-R2
McFadden's-R2 = 1 - [LL(,)/LL()]
model.
– Level of satisfaction
– Level of motivation
------------------------------------------------------------------------------
envspend | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .0419784 .0108409 3.87 0.000 .0207307 .0632261
income | .0023984 .0057545 0.42 0.677 -.0088802 .013677
female | .2753095 .0591542 4.65 0.000 .1593693 .3912496
age | -.012762 .0017667 -7.22 0.000 -.0162247 -.0092994
dblack | .2898025 .0930178 3.12 0.002 .1074911 .472114
class | -.0719344 .0485173 -1.48 0.138 -.1670266 .0231578
city | .227895 .080983 2.81 0.005 .0691711 .3866188
suburb | .0752643 .0695921 1.08 0.279 -.0611337 .2116624
attendchurch | -.086372 .0109998 -7.85 0.000 -.1079312 -.0648128
-------------+----------------------------------------------------------------
/cut1 | -2.872315 .1930206 -3.250628 -2.494001
/cut2 | -.8156047 .1867621 -1.181652 -.4495577
------------------------------------------------------------------------------
envspend | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
educ | .0419784 .0108409 3.87 0.000 .0207307 .0632261
income | .0023984 .0057545 0.42 0.677 -.0088802 .013677
female | .2753095 .0591542 4.65 0.000 .1593693 .3912496
age | -.012762 .0017667 -7.22 0.000 -.0162247 -.0092994
dblack | .2898025 .0930178 3.12 0.002 .1074911 .472114
class | -.0719344 .0485173 -1.48 0.138 -.1670266 .0231578
city | .227895 .080983 2.81 0.005 .0691711 .3866188
suburb | .0752643 .0695921 1.08 0.279 -.0611337 .2116624
attendchurch | -.086372 .0109998 -7.85 0.000 -.1079312 -.0648128
-------------+----------------------------------------------------------------
/cut1 | -2.872315 .1930206 -3.250628 -2.494001
/cut2 | -.8156047 .1867621 -1.181652 -.4495577
------------------------------------------------------------------------------
envspend | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+-------------------------------------------------------
---------
educ | .012701 .0032069 3.96 0.000 .0064141 .0189878
income | .0006037 .0016821 0.36 0.720 -.002694 .0039013
female | .0900251 .0173081 5.20 0.000 .0560938 .1239563
age | -.0038736 .0005258 -7.37 0.000 -.0049044 -.0028428
dblack | .0726494 .0261632 2.78 0.006 .0213585 .1239403
class | -.0165553 .0142495 -1.16 0.245 -.0444904 .0113797
city | .0555329 .0229917 2.42 0.016 .0104594 .1006065
suburb | .031217 .0205407 1.52 0.129 -.0090515 .0714855
attendchur | -.0243782 .0032213 -7.57 0.000 -.0306934 -.0180631
_cons | 2.618234 .0547459 47.83 0.000 2.510909 2.72556
In this case, OLS produced similar results to ordered logit. But, that doesn’t
always happen… and you won’t know if you don’t check.
Zenegnaw Abiy Hailu (PhD) 54
Multinomial Logistic Regression
• What if you want have a dependent variable has several non-
ordinal outcomes?
------------------------------------------------------------------------------
mode | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
Bus |
income | .0311874 .0141811 2.20 0.028 .0033929 .0589818
family size | -.6731862 .3312153 -2.03 0.042 -1.322356 -.0240161
_cons | -.5659882 .580605 -0.97 0.330 -1.703953 .5719767
-------------+----------------------------------------------------------------
Car |
income | .057199 .0125151 4.57 0.000 .0326698 .0817282
family size | .1978772 .1989113 0.99 0.320 -.1919817 .5877361
_cons | -2.272809 .5201972 -4.37 0.000 -3.292377 -1.253241
------------------------------------------------------------------------------
(mode==Train is the base outcome)
Here, the pattern is clearer: Wealthy & large families use cars
Zenegnaw Abiy Hailu (PhD) 59
End