Chapter Four:
Discrete Choice Models
(Non-linear Regression Analysis)
By: Teklebirhan A. 1
Outline
1. Introduction
2. Linear Probability Model (LPM)
3. Non-linear Regression Models
3.1. The Logit and Probit Models
3.2. Multinomial Logit and Probit Models
3.3. Ordered Logit and Probit Models
1. Introduction
Determinants of employee satisfaction at AAU.
Factors affecting loan repayment at AAU.
Factors affecting turnover intention at CBE.
Determinants of women’s willingness to practice
family planning in AA town.
Determinants of households’ saving behavior in AA
town.
In the above examples, the response (dependent)
variable is qualitative (categorical/discrete).
Models for such responses are called discrete choice
models; when the response has only two outcomes, they
are called binary choice models.
Technically, it is possible to estimate a binary
choice model using OLS.
If OLS is used to estimate a model with a binary response
variable, the resulting model is called the linear
probability model (LPM).
Are you satisfied with your job?
Value of Yi    Ui       Probability of Yi    Probability of Ui
1              1-p      p                    p
0              -p       1-p                  1-p
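The two-point distribution of the disturbance explains why the LPM error is heteroskedastic. The sketch below is only a numerical check, assuming that Ui equals 1 - p when Yi = 1 and -p when Yi = 0 (so that E(Yi) = p); the function name and the probabilities tried are illustrative and do not come from the slides. It suggests that the mean of Ui is zero while its variance is p(1 - p), which changes with p and therefore with Xi.

```python
def lpm_error_moments(p):
    """Mean and variance of the LPM disturbance when P(Y = 1) = p."""
    # Assumption: U takes the value 1 - p with probability p,
    # and the value -p with probability 1 - p.
    values = (1.0 - p, -p)
    probs = (p, 1.0 - p)
    mean = sum(v * q for v, q in zip(values, probs))
    variance = sum(q * (v - mean) ** 2 for v, q in zip(values, probs))
    return mean, variance


for p in (0.1, 0.5, 0.9):  # illustrative probabilities, not from the slides
    mean, variance = lpm_error_moments(p)
    # The last column is p(1 - p) for comparison with the variance.
    print(p, round(mean, 10), round(variance, 10), round(p * (1 - p), 10))
```

Because the variance depends on p, it cannot be constant across observations unless every observation has the same p.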
The key difference between the linear regression
model (LRM) and the binary regression model
(BRM) is what is being predicted.
In the LRM, regression analysis deals
with the prediction of the average value of the
response variable from the given values of the
explanatory variables.
This is because the response variable is
continuous/quantitative in the LRM.
In the BRM, the response variable is binary, so the
model predicts the probability that the event occurs.
2. Linear Probability Model
(Not Always)
Violation of the axioms of probability (fitted values can fall below 0 or above 1)
regress grade gpa pc ase
predict yhat
scatter grade yhat || lfit grade yhat
Violation of the assumption of normality
predict r, resid
pnorm r
qnorm r
mvtest norm r
kdensity r, normal
histogram r, kdensity normal
The coefficient of determination (R2), which is 0.42 here, is not
a dependable measure of fit.
3. Nonlinear Regression Models
In the LPM, the heteroskedasticity problem is less
worrying because it can be handled easily (for example, with
robust standard errors).
We need to resort to other methods to address
the other shortcomings.
In particular, we need a model whose predicted probabilities satisfy
$0 \le P_i \le 1$
Non-linear regression models include:
a) The Logit Model
b) The Probit Model
c) Multinomial Logit and Probit Models (MNL &
MNP)
d) Ordered Logit and Probit Models
3.1. The Logit and Probit Models
Non-linearity
[Figure: outcome categories Satisfied and Not-Satisfied]
Take the ratio of the probability that an event
occurs (Pi) to the probability that it does not
occur (1-Pi). The resulting ratio, Pi/(1-Pi), is called
the odds (often loosely termed the odds ratio).
Take the natural log of the above odds, ln(Pi/(1-Pi)); the
resulting expression is called the logit.
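The chain from probability to odds to logit can be sketched numerically. The helper names and the probabilities below are illustrative assumptions, not part of the slides; the point is only that the logit maps a probability in (0, 1) onto the whole real line, and that the logistic function maps it back.

```python
import math


def odds(p):
    """Odds in favour of the event: P / (1 - P)."""
    return p / (1.0 - p)


def logit(p):
    """Natural log of the odds."""
    return math.log(odds(p))


def inverse_logit(z):
    """Logistic function: recovers the probability from the logit."""
    return 1.0 / (1.0 + math.exp(-z))


for p in (0.1, 0.5, 0.9):  # illustrative probabilities
    z = logit(p)
    print(p, round(odds(p), 4), round(z, 4), round(inverse_logit(z), 4))
```

A probability of 0.5 corresponds to odds of 1 and a logit of 0; probabilities above 0.5 give positive logits and probabilities below 0.5 give negative ones.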
[Figure: plot of Pi, with the upper bound Pi = 1 marked]
Example on Logit and Probit
Suppose that we want to examine the effect of routine
weekly exercises on the performance of students.
To this end, suppose we gave routine exercises to
MBA students and, at the end of the semester,
recorded the average score in exercises (ASE) for each
student.
A) Log-Odds (Logit) Interpretation of the Logit Model
logit grade gpa ase pc
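Interpretation:
As GPA increases by one point, the log of the odds of getting an ‘A’
increases by 2.8, holding the other variables constant, and the effect is
statistically significant.
For a student who owns a PC, the log of the odds of getting an ‘A’
is higher by 2.4 than for a student who does not, and the effect is
statistically significant.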
B) Odds Ratio Interpretation of Logit Model
logit grade gpa ase pc, or
Interpretation:
A one-point increase in GPA multiplies the odds of getting an ‘A’
(rather than another grade: B, C, D or F) by 16.87.
For a student who owns a PC, the odds of getting an ‘A’ are 10.8
times the odds for a student who does not own one.
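The log-odds and odds-ratio outputs describe the same estimates on two scales: an odds ratio is the exponential of the corresponding logit coefficient. As a quick consistency check, the sketch below takes the two odds ratios reported on the slides (16.87 for GPA and 10.8 for PC) and recovers the log-odds coefficients quoted earlier; the variable names are illustrative.

```python
import math

# Odds ratios as reported in the odds-ratio interpretation on the slides.
odds_ratio_gpa = 16.87
odds_ratio_pc = 10.8

# The log-odds coefficient is the natural log of the odds ratio.
log_odds_gpa = math.log(odds_ratio_gpa)
log_odds_pc = math.log(odds_ratio_pc)

print(round(log_odds_gpa, 1), round(log_odds_pc, 1))
```

Rounded to one decimal place, these agree with the 2.8 and 2.4 used in the log-odds interpretation.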
C) Probability (mfx) interpretation of the logit model
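Interpretation:
As GPA increases by one point, the probability of getting
grade ‘A’ increases by about 53 percentage points, evaluated at the means
of the explanatory variables (the mfx default).
For a student who owns a PC, the probability of getting grade
‘A’ is higher by about 45.6 percentage points.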
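Unlike the log-odds coefficient, the marginal effect in a logit model is not constant. For a continuous regressor it equals the coefficient times P(1 - P), so it depends on the point at which it is evaluated. The sketch below illustrates this with the GPA coefficient implied by the reported odds ratio; the probabilities are hypothetical evaluation points, not values from the slides, and the function name is an assumption.

```python
import math


def logit_marginal_effect(beta, p):
    """Marginal effect of a continuous regressor in a logit model: beta * P * (1 - P)."""
    return beta * p * (1.0 - p)


beta_gpa = math.log(16.87)  # log-odds coefficient implied by the reported odds ratio

for p in (0.1, 0.25, 0.5, 0.9):  # hypothetical evaluation points
    print(p, round(logit_marginal_effect(beta_gpa, p), 3))
```

The effect is largest at P = 0.5 and shrinks toward zero as P approaches 0 or 1, which is why a reported marginal effect should always state where it was evaluated. For a dummy variable such as PC, the effect is usually computed as the discrete change in probability instead.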
D) Probit Estimation
probit grade gpa ase pc
Logit/Probit Model Diagnostic Tests
Multicollinearity Test
vif
Heteroscedasticity Test
hettest
Model specification/omitted variable Test
linktest
NB: The vif command works only after regress. For any of the discrete choice
models, we therefore first run a linear regression with the same variables
and then test for multicollinearity using the vif command.
Multicollinearity Test
. vif
Heteroscedasticity Test
. hettest
chi2(1) = 2.53
Prob > chi2 = 0.1117
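The reported p-value can be reproduced from the test statistic. For a chi-square variable with one degree of freedom, the upper-tail probability equals erfc(sqrt(x/2)). The sketch below applies this to the reported chi2(1) = 2.53; the 5% significance level is an assumed convention, not something stated on the slides.

```python
import math

chi2_stat = 2.53  # chi2(1) reported by hettest on the slide

# Upper-tail probability of a chi-square(1) variable: P(Z^2 > x) = erfc(sqrt(x / 2)).
p_value = math.erfc(math.sqrt(chi2_stat / 2.0))

alpha = 0.05  # assumed significance level
print(round(p_value, 4), "reject H0" if p_value < alpha else "fail to reject H0")
```

Since the p-value is above 0.05, the null hypothesis of constant variance is not rejected at that level.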
Link Test/Model specification test
In other words, you take each of the M-1 log odds
you computed and exponentiate it.
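The step described above can be sketched as follows: exponentiate each of the M-1 log odds (measured against the base category), then divide by one plus their sum to obtain probabilities that add up to one. The function name and the two log-odds values are hypothetical and serve only to illustrate the arithmetic.

```python
import math


def mnl_probabilities(log_odds):
    """Turn M-1 log odds, ln(P_m / P_base), into M category probabilities.

    The first element of the result is the base category.
    """
    relative = [math.exp(v) for v in log_odds]
    denom = 1.0 + sum(relative)  # the 1 is exp(0) for the base category
    return [1.0 / denom] + [r / denom for r in relative]


probs = mnl_probabilities([0.5, -1.0])  # hypothetical log odds for M = 3
print([round(p, 4) for p in probs], round(sum(probs), 4))
```

Each non-base probability divided by the base probability returns the exponentiated log odds, which is the quantity the relative risk ratios are built from.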
Multinomial Example
Suppose that we want to study the determinants of
rural households’ choice of income diversification strategy:
On-farm,
Local off-farm &
Migration
Data on diversification, education, gender and age
were collected from a total of 500 households in
Kebele X using the simple random sampling technique.
Basic Commands for using the MNL model:
describe
summarize
tabulate divers
mlogit divers edu age sex exp, baseoutcome(1)
mlogit divers edu sex age exp, baseoutcome(1) rrr
mfx, predict(outcome(1))
mfx, predict(outcome(2))
mfx, predict(outcome(3))
predict plogit1 plogit2 plogit3, pr
summarize plogit1 plogit2 plogit3
. summarize
. tabulate divers
income diversification |  Freq.  Percent  Cum.
Multinomial logistic regression                   Number of obs   =        500
                                                  LR chi2(8)      =     241.43
                                                  Prob > chi2     =     0.0000
Log likelihood = -372.34821                       Pseudo R2       =     0.2448

        divers |      Coef.   Std. Err.       z    P>|z|     [95% Conf. Interval]
---------------+-----------------------------------------------------------------
local_off_farm |
           edu |  -.5263692   .0975398    -5.40   0.000    -.7175437   -.3351948
           age |   .0144084   .0146985     0.98   0.327    -.0144001    .0432169
           sex |  -12.47745   527.1158    -0.02   0.981    -1045.605    1020.651
           exp |   .8035401   .2313668     3.47   0.001     .3500694    1.257011
         _cons |   18.60435   527.1174     0.04   0.972    -1014.527    1051.736
---------------+-----------------------------------------------------------------
migration      |
           edu |  -.1060083   .0926298    -1.14   0.252    -.2875594    .0755427
           age |  -.0015199   .0157332    -0.10   0.923    -.0323565    .0293166
           sex |  -15.74113   527.1155    -0.03   0.976    -1048.869    1017.386
           exp |    1.40202   .2312467     6.06   0.000     .9487852    1.855256
         _cons |   16.90534   527.1172     0.03   0.974    -1016.225    1050.036
As education increases by one year, the log of the
ratio of the two probabilities, P(off-farm=2)/P(on-farm=1),
decreases by 0.53, and the log of the
ratio of the two probabilities, P(migration=3)/P(on-farm=1),
decreases by 0.11 (the latter is not statistically significant, p = 0.252).
Multinomial logistic regression                   Number of obs   =        500
                                                  LR chi2(8)      =     241.43
                                                  Prob > chi2     =     0.0000
Log likelihood = -372.34821                       Pseudo R2       =     0.2448

        divers |        RRR   Std. Err.       z    P>|z|     [95% Conf. Interval]
---------------+-----------------------------------------------------------------
local_off_farm |
           edu |   .5907459   .0576212    -5.40   0.000     .4879493    .7151988
           sex |   3.81e-06   .0020092    -0.02   0.981            0           .
           age |   1.014513   .0149118     0.98   0.327     .9857031    1.044164
           exp |   2.233434   .5167425     3.47   0.001     1.419166    3.514899
         _cons |   1.20e+08   6.33e+10     0.04   0.972            0           .
---------------+-----------------------------------------------------------------
migration      |
           edu |   .8994171   .0833128    -1.14   0.252      .750092    1.078469
           sex |   1.46e-07   .0000768    -0.03   0.976            0           .
           age |   .9984812   .0157093    -0.10   0.923     .9681614    1.029751
           exp |   4.063402   .9396483     6.06   0.000      2.58257    6.393333
         _cons |   2.20e+07   1.16e+10     0.03   0.974            0           .
For a one-year increase in education, the relative risk of
choosing local off-farm rather than on-farm income diversification
is multiplied by 0.591, i.e. it falls by about 41%.
As age increases by one year, the relative risk of choosing
migration rather than on-farm diversification is multiplied by 0.998,
which is not statistically significant (p = 0.923).
For a dichotomous dummy explanatory variable such as male,
the relative risk ratio of choosing migration (3) over on-farm
diversification (1) for males as compared to females is
0.000000146.
Equivalently, the log of the ratio of the two probabilities,
P(migration=3)/P(on-farm=1), is lower by
15.74 for males than for females.
Thus, males appear less likely to migrate, although this estimate is not
statistically significant (p = 0.976) and its very large standard error calls for caution.
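The relative risk ratios are simply the exponentiated coefficients of the multinomial logit. As a check, the sketch below exponentiates a few coefficients copied from the coefficient table and prints them next to the values reported in the rrr table; the dictionary layout and variable names are illustrative.

```python
import math

# (equation, variable): (coefficient, reported RRR), copied from the two output tables.
estimates = {
    ("local_off_farm", "edu"): (-0.5263692, 0.5907459),
    ("local_off_farm", "exp"): (0.8035401, 2.233434),
    ("local_off_farm", "sex"): (-12.47745, 3.81e-06),
    ("migration", "edu"): (-0.1060083, 0.8994171),
    ("migration", "exp"): (1.40202, 4.063402),
    ("migration", "sex"): (-15.74113, 1.46e-07),
}

for (equation, variable), (coef, reported_rrr) in estimates.items():
    rrr = math.exp(coef)  # RRR = exp(coefficient)
    print(equation, variable, f"{rrr:.6g}", reported_rrr)
```

Note that the value 3.81e-06 belongs to the local off-farm equation, while the relative risk ratio for sex in the migration equation is 1.46e-07.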
Marginal Effect Result
Categories | ME after MNL for on-farm diversification | ME after MNL for local off-farm diversification | ME after MNL for migration
ME after MNL for on-farm income diversification
As the above table shows, males are about 8 percentage points more likely than
females to participate in on-farm income diversification, and the effect is
statistically significant.
ME after MNL for local off-farm income diversification
A more educated person is less likely (by about 10 percentage points per
additional year of education) to participate in local off-farm income
diversification, and the effect is statistically significant.
Males are about 41 percentage points more likely than females to participate in
local off-farm income diversification, and the effect is statistically significant.
Thus, males and less educated persons are more likely
to participate in local off-farm income diversification.
ME after MNL for migration
We can also determine the probability that an
individual chooses each alternative using the
following command.
predict plogit1 plogit2 plogit3, pr
Finally, we can determine the average predicted probability
of each category using the following Stata command.
summarize plogit1 plogit2 plogit3
The above summary statistics of the predicted probabilities of
choosing on-farm diversification, local off-farm
diversification and migration
show that about half of the rural households have at
least one migrant family member; that is, rural
households use migration as one type of income
diversification.
3.3. Ordered Logit Model
The ordered logit model is also known as the proportional
odds model.
The terms parallel lines model and parallel regressions
model are also sometimes used.
For example, when the number of categories is three
(M=3)
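For the three-category case, a minimal sketch of the standard textbook formulation may help; it is not recovered from the slides. It assumes a latent index xb and two cutpoints, cut1 < cut2 (all names and numbers are hypothetical): the outcome is low if the latent variable falls below cut1, medium if it falls between the cutpoints, and high if it exceeds cut2, with a logistic disturbance.

```python
import math


def logistic_cdf(z):
    """Cumulative distribution function of the standard logistic distribution."""
    return 1.0 / (1.0 + math.exp(-z))


def ordered_logit_probs(xb, cut1, cut2):
    """Probabilities of the low, medium and high categories (M = 3)."""
    p_low = logistic_cdf(cut1 - xb)
    p_medium = logistic_cdf(cut2 - xb) - p_low
    p_high = 1.0 - logistic_cdf(cut2 - xb)
    return p_low, p_medium, p_high


# Hypothetical index and cutpoints, for illustration only.
probs = ordered_logit_probs(xb=0.4, cut1=-1.0, cut2=1.5)
print([round(p, 4) for p in probs], round(sum(probs), 4))
```

Raising xb shifts probability away from the low category and toward the high category, while the direction of the change in the middle category depends on where the index sits relative to the cutpoints.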
Note that there is a random disturbance term, which,
in this case, has a standard logistic distribution (mean
of 0 and variance of π²/3 ≈ 3.29).
The disturbance reflects the fact that relevant variables may
be left out of the equation, or that variables may not
be perfectly measured.
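The variance quoted for the standard logistic distribution is pi squared divided by three. A one-line check, with illustrative variable names:

```python
import math

# Variance and standard deviation of the standard logistic distribution.
variance = math.pi ** 2 / 3
std_dev = math.sqrt(variance)

print(round(variance, 2), round(std_dev, 2))
```

This is larger than the variance of 1 assumed for the probit disturbance, which is one reason logit coefficients tend to be larger in magnitude than probit coefficients for the same data.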
The ordered logit model estimates part of the above:
Example on Ordered Logit
Suppose that we want to study the level and determinants
of service satisfaction among the employees of AAU.
Basic Commands
summarize
tabulate LEVEL
ologit LEVEL EDUC MALE AGE EXPR
ologit LEVEL EDUC MALE AGE EXPR, or
mfx, predict(outcome(1))
mfx, predict(outcome(2))
mfx, predict(outcome(3))
predict plogit1 plogit2 plogit3, pr
summarize plogit1 plogit2 plogit3
linktest
. describe
. sum
. tabulate LEVEL
LEVEL OF SATISFACTION |  Freq.  Percent  Cum.
. ologit LEVEL EDUC MALE AGE EXPR
Ordered logistic regression                       Number of obs   =        498
                                                  LR chi2(4)      =     185.97
                                                  Prob > chi2     =     0.0000
Log likelihood = -398.38453                       Pseudo R2       =     0.1892
Ordered Logit Interpretation
The level of satisfaction tends to be higher (moving from low to medium
to high) for staff with more education and experience,
for female staff, and for younger staff.
Odds Ratio Interpretation of the ordered logit model
ologit LEVEL EDUC MALE AGE EXPR, or
For gender, the odds of being in a higher level of
satisfaction for male staff are 0.0330 times those for female
staff.
Probability Interpretation of ordered logit
A one-year increase in education is associated with being 0.6 percentage points
less likely to be in the low level of satisfaction, 1.4 percentage points
less likely to be in the medium level of satisfaction and
2 percentage points more likely to be in the high level of satisfaction.
That is, as years of schooling (education) increase,
the probability of being in the highest level of
satisfaction increases.
NB: For each explanatory variable, the marginal effects across the
categories sum to zero.
Similarly, a one-year increase in experience is
associated with being 4.8 percentage points less likely to be in the low level
of satisfaction, 12.6 percentage points less likely to be in the medium
level of satisfaction and 17.4 percentage points more likely to be in
the high level of satisfaction.
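Because the three category probabilities always add up to one, the marginal effects of any regressor across the categories must cancel out. The sketch below checks this for the education and experience effects quoted in the text (in percentage points); the dictionary layout is illustrative.

```python
# Marginal effects in percentage points (low, medium, high), as quoted in the text.
marginal_effects = {
    "education": (-0.6, -1.4, 2.0),
    "experience": (-4.8, -12.6, 17.4),
}

for variable, effects in marginal_effects.items():
    total = sum(effects)
    # A small tolerance absorbs floating-point rounding.
    print(variable, effects, "sum is zero:", abs(total) < 1e-9)
```

A sum that differs noticeably from zero would point to a rounding or transcription error in the reported effects.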
predict plogit1 plogit2 plogit3, pr
As you can see, the predicted probability of being in
the lowest level of satisfaction is 13.3%, and those of
the middle and highest levels of satisfaction are 37.6%
and 49.2%, respectively.
Link test of model specification
The test result shows no evidence of specification error, i.e. the squared
prediction term (_hatsq) is not statistically significant.
Measures of fit
The conventional R2 does not make sense for logit and probit models;
instead we use the pseudo-R2.
The pseudo-R2 (McFadden’s, as reported by Stata) measures the fit using the
likelihood function: it measures the improvement in the value of
the log likelihood relative to a model with no explanatory
variables (Xi).
Suppose that pseudo-R2 = 0.189.
This suggests that the log likelihood improves (moves toward zero) by
about 18.9% with the introduction of the set of regressors.
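The pseudo-R2 can be reproduced from two numbers in the Stata header. Assuming McFadden's definition, 1 - lnL/lnL0, and using the fact that the LR chi-square equals 2(lnL - lnL0), the sketch below recovers the intercept-only log likelihood and the pseudo-R2 for the ordered logit and multinomial logit outputs shown earlier. The function names are illustrative.

```python
def null_log_likelihood(ll_model, lr_chi2):
    """Intercept-only log likelihood, from LR chi2 = 2 * (lnL - lnL0)."""
    return ll_model - lr_chi2 / 2.0


def mcfadden_pseudo_r2(ll_model, ll_null):
    """McFadden's pseudo-R2: 1 - lnL / lnL0."""
    return 1.0 - ll_model / ll_null


# (log likelihood, LR chi2) as reported in the output headers.
reported = {
    "ologit": (-398.38453, 185.97),
    "mlogit": (-372.34821, 241.43),
}

for model, (ll_model, lr_chi2) in reported.items():
    ll_null = null_log_likelihood(ll_model, lr_chi2)
    print(model, round(ll_null, 3), round(mcfadden_pseudo_r2(ll_model, ll_null), 4))
```

The results agree with the Pseudo R2 values printed in the two headers (0.1892 and 0.2448).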
End!
Thank you!