0% found this document useful (0 votes)

19 views93 pages

CH-4-Discrete Choice Models-PG (Compatibility Mode)

The document discusses discrete choice models in applied econometrics, specifically focusing on non-linear regression analysis. It covers various models including linear probability models, logit and probit models, and multinomial logit and probit models, along with their applications in analyzing qualitative response variables. The document also includes diagnostic tests for model validation and examples of practical applications in understanding factors affecting employee satisfaction and household behaviors.

Uploaded by

NATNAEL MENGISTU

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views93 pages

CH-4-Discrete Choice Models-PG (Compatibility Mode)

Uploaded by

NATNAEL MENGISTU

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 93

Applied econometrics for

ManageMent (MgMt MSC 5411)

Chapter FOUr:
Discrete choice MoDels
(non-linear regression analysis)

Teklebirhan Alemnew (Assistant Professor)

[email protected]
AAU, 2024

By: Teklebirhan A. 1
Outline
1. Introduction
2. Linear Regression Model (LPM)
3. Non-linear Regression models
3.1. The logit and Probit models
3.2. Multinomial logit and Probit models
3.3. Ordered logit and Probit models

By: Teklebirhan A. 2
1. Introduction

By: Teklebirhan A. 3
Cont…
Determinates of employees satisfaction in AAU.
Factors affecting loan repayment in AAU.
Factors affecting Turnover Intention in CBE.
Determinants of women’s willingness to practice
family planning utilities in the case of AA town.
Determinants of Household’s Saving behavior in AA
town.

By: Teklebirhan A. 4
Cont…
In the above examples, response (dependent)
variable is qualitative/ categorical/discrete.
Models dealing with such kind of binary responses
are called binary/ Discrete choice models.
Technically, it is possible to estimate the binary
choices using OLS.
If OLS is used to estimate qualitative response
variable, the resulting model is called linear
probability model (LPM).
By: Teklebirhan A. 5
Cont…
Are you satisfied with your job?

By: Teklebirhan A. 6
Cont…
Value of Yi Ui Probability of Yi Probability of Ui

1 p
0 1-p 1-p

Both Yi and Ui take only two values with

respective probabilities and such variables are
called Bernoulli variables and follow Bernoulli
distributions.
But, OLS assumes that both Yi and Ui are
normally distributed.
By: Teklebirhan A. 7
Cont…

By: Teklebirhan A. 8
Cont…

9
By: Teklebirhan A.
Cont…
The very difference between linear regression
model (LRM) and binary regression model
(BRM) is that
In the case of LRM, regression analysis deals
with the prediction of the average value of the
response variable from the given values of
explanatory variables.
 This is because, the response variable is
continuous/quantitative in LRM.

By: Teklebirhan A. 10
Cont…

By: Teklebirhan A. 11
2. Linear Probability Model

By: Teklebirhan A. 12
Cont…

By: Teklebirhan A. 13
Cont…
(Not Always)

e) The true relationship between a binary outcome and

a continuous explanatory variable is inherently
nonlinear
 This means the functional form of the LPM is
generally not correctly specified, which can lead to
biased estimates of some parameters of interest.
By: Teklebirhan A. 14
Cont…
 Due to the above five basic limitations of using OLS on
qualitative response variable model, non-linear regression
models were developed.
 Example:

By: Teklebirhan A. 15
Cont…
Violation of the axiom of Probability
reg grade gpa pc ase
predict yhat
scatter grade yhat||lfit grade yhat

Violation of the assumption of Heteroscedasticity

rvfplot, yline(0)
hettest

By: Teklebirhan A. 16
Cont…
Violation of the assumption of normality
predict r, resid
pnorm r
qnorm r
mvtest norm r
kdensity r, normal
histogram r, kdensity normal
The coefficient of determination is not
dependable which is 0.4 2
By: Teklebirhan A. 17
3. Nonlinear Regression Models
In the LPM, the heteroskedasticity problem is less
worrying as it can be easily handled.
We need to resort to other methods to account for
the other shortcomings.
In particular, we need a model which satisfies
0

By: Teklebirhan A. 18
Cont…
Non-linear regression model includes
a) The Logit Model
b) The Probit Model
c) Multinomial Logit and Probit Model (MNL &
MNP)
d) Ordered Logit and Probit Model.

By: Teklebirhan A. 19
3.1. The Logit and Probit Models

Non-linearity
By: Teklebirhan A. 20
Cont…

By: Teklebirhan A. 21
Cont…

By: Teklebirhan A. 22
Cont…

Satisfied
Not-Satisfied

By: Teklebirhan A. 23
Cont…
Take the ratio of the probability of an event
occurring (Pi) to the probability of an event not
happening (1-Pi) and the resulting ratio is called
odds ratio.

By: Teklebirhan A. 24
Cont…
Take the natural log of the above odds ratio and the
resulting equation is called Logit.

Where, Li is called Logit which is linearly related

with Xi is explanatory variables

By: Teklebirhan A. 25
Cont…
Pi

Cumulative Normal Distribution Function

Pi =1

Logistic Distribution Function

By: Teklebirhan A. 26
Cont…

By: Teklebirhan A. 27
Cont…
Example on Logit and Probit
Suppose that we want to examine the effect of routine
weekly exercises on the performance of students.
To this end, suppose we gave routine exercises to
MBA student and at the end of the semester, we
found average scores in exercise (ASE) for each
student.

By: Teklebirhan A. 28
Cont…

By: Teklebirhan A. 29
Cont…
A) Logit Interpretation of Logit Model
logit grade gpa ase pc

By: Teklebirhan A. 30
Cont…
Interpretation:
As GPA increases by one point, the log of the odds ratio
increases by 2.8 and statistically significant.
A student who owned PC, the log of the odds ratio
increases by 2.4 and statistically significant.

By: Teklebirhan A. 31
Cont…
B) Odds Ratio Interpretation of Logit Model
logit grade gpa ase pc, or

By: Teklebirhan A. 32
Cont…
Interpretation:
As GPA increases by one point, the odds of getting ‘A’ is
16.87 times the odds of getting other grades (B, C, D, F)
A student who owned PC, the odds of getting ‘A’ is 10.8
times the odds of getting other grades (B, C, D, F)

By: Teklebirhan A. 33
Cont…
C) Probability (mfx) interpretation of the logit model

By: Teklebirhan A. 34
Cont…
Interpretation:
As GPA increases by one point, the probability of getting
grade ‘A’ increases by 53%.
A student who owned PC, the probability of getting grade
‘A’ increases by 45.6%.

By: Teklebirhan A. 35
Cont…
D) Probit Estimation
probit grade gpa ase pc

NB: the interpretation is similar with Logit estimation

By: Teklebirhan A. 36
Cont…
E) Probability Interpretation of Probit Model

By: Teklebirhan A. 37
Cont…
Logit/Probit Model Diagnostic Tests
Multicollinearity Test
 vif
 Heteroscedasticity Test
 hettest
Model specification/omitted variable Test
 linktest
NB: In any of the discrete choice models, we first run a linear
regression to test for multicollinearity using VIF command,
and then test for multicollinearity via the vif command.
By: Teklebirhan A. 38
Cont…
Multicollinearity Test
. vif

Variable VIF 1/VIF

ase 1.19 0.840735

gpa 1.18 0.850226
pc 1.01 0.987262

Mean VIF 1.13

Heteroscedasticity Test
. hettest

Breusch-Pagan / Cook-Weisberg test for heteroskedasticity

Ho: Constant variance
Variables: fitted values of grade

chi2(1) = 2.53
Prob > chi2 = 0.1117

39
By: Teklebirhan A.
Cont…
Link Test/Model specification test

grade Coef. Std. Err. z P>|z| [95% Conf. Interval]

_hat .9551764 .383456 2.49 0.013 .2036165 1.706736

_hatsq -.0453861 .1881828 -0.24 0.809 -.4142177 .3234455
_cons .0817277 .6074585 0.13 0.893 -1.108869 1.272324

 The insignificant hat square shows that the model

has no error on its formula and no omission of
significant variable.
40
By: Teklebirhan A.
3.2 Multinomial Logit and Probit
Model
The probability that a particular consumer will choose a
particular alternative is given by the probability that the utility
of that alternative to that consumer is greater than the utility
to that consumer of all other alternatives.

Then the consumer picks the alternative that maximizes his or

her utility.

MNL model is a simple extension to the logit model

when the dependent variable can take more than two
categorical values.
By: Teklebirhan A. 41
Cont…
For instance, in Addis Ababa, a person may have the
following choice of means of transportation to go to
work place.
 Car, Bus, Train
A person may have three voting options:
 Labor party
 Conservative party
 liberal democrat party
A respondent is provided with more than two
alternatives and s/he is expected to choose one.
By: Teklebirhan A. 42
Cont…
There is no order within the categories of Y (any of a
chosen categories can be the baseline for comparison).

In Multinomial Logit Model (MLM), a response variable

with K categories will generate K-1 equations.

That means, in multinomial logit model we have K-1

equations instead of one equation.

That is why Multinomial logit models are called multi-

equation models.
By: Teklebirhan A. 43
Cont…
Each of these K-1 equations is a binary logistic
regression comparing a group with the reference group.

The choice of reference /base category is arbitrary.

Example: if our dependent variable Y= 1, 2, 3 then

our reference category is 1, then we will have two logit
equations,
 First equation : Y=2 versus Y=1
 Second equation: Y=3 versus Y=1
By: Teklebirhan A. 44
Cont…
The probabilities for all the categories of Y(all the possible
outcomes for our dependent variable) add to 1 or 100%.
 That means, P1 + P2 + P3=1

The multinomial logit is equivalent to running a series of

separate binary logit models to find the coefficients, but these
would not give us a single overall measures of our model.

Multinomial logistic regression simultaneously estimates the

K-1 logit coefficients through MLE.
Hence, if the first category is the reference, then for m= 2,
…, M.
By: Teklebirhan A. 45
Cont…

Hence, for each case, there will be M-1 predicted log

odds, one for each category relative to the reference
category.

When there are more than 2 groups, computing

probabilities is a little more complicated than it was
in logistic regression. For j = 2, …, M,
By: Teklebirhan A. 46
Cont…

By: Teklebirhan A. 47
Cont…
In other words, you take each of the M-1 log odds
you computed and exponentiate it.

Once you have done that the calculation of the

probabilities is straightforward.

Note that, when M=2, the mlogit and logistic

regression models (and for that matter the ordered
logit model) become one and the same.

By: Teklebirhan A. 48
Cont…

By: Teklebirhan A. 49
Cont…
Multinomial Example
Suppose that we want to study the determinants of
rural households income diversification:
On-farm,
Local off farm &
Migration
Data on diversification, education, gender and age
were collected from a total of 500 households from
Kebele X using simple random sampling technique.

By: Teklebirhan A. 50
Cont…

By: Teklebirhan A. 51
Cont…
Basic Commands for using MNL model:
 describe
 summarize
 tabulate divers
 mlogit divers edu age sex exp, baseoutcome(1)
 mlogit divers edu sex age exp, baseoutcome(1), rrr
 mfx, predict(outcome (1))
 mfx, predict(outcome (2))
 mfx, predict(outcome (3))
 predict plogit1 plogit2 plogit3
 summarize plogit1 plogit2 plogit3
By: Teklebirhan A. 52
Cont…

. summarize

Variable Obs Mean Std. Dev. Min Max

divers 500 2.358 .7062816 1 3

edu 500 11.652 1.644477 8 17
sex 500 .89 .3132031 0 1
age 500 45.1 10.84275 22 88
exp 500 1.378 1.656685 0 7

By: Teklebirhan A. 53
Cont…

. tabulate divers

income
diversificatio
n Freq. Percent Cum.

on farm 67 13.40 13.40

local off farm 187 37.40 50.80
migration 246 49.20 100.00

Total 500 100.00

By: Teklebirhan A. 54
Cont…
Multinomial logistic regression Number of obs = 500
LR chi2(8) = 241.43
Prob > chi2 = 0.0000
Log likelihood = -372.34821 Pseudo R2 = 0.2448

divers Coef. Std. Err. z P>|z| [95% Conf. Interval]

on_farm (base outcome)

local_off_farm
edu -.5263692 .0975398 -5.40 0.000 -.7175437 -.3351948
age .0144084 .0146985 0.98 0.327 -.0144001 .0432169
sex -12.47745 527.1158 -0.02 0.981 -1045.605 1020.651
exp .8035401 .2313668 3.47 0.001 .3500694 1.257011
_cons 18.60435 527.1174 0.04 0.972 -1014.527 1051.736

migration
edu -.1060083 .0926298 -1.14 0.252 -.2875594 .0755427
age -.0015199 .0157332 -0.10 0.923 -.0323565 .0293166
sex -15.74113 527.1155 -0.03 0.976 -1048.869 1017.386
exp 1.40202 .2312467 6.06 0.000 .9487852 1.855256
_cons 16.90534 527.1172 0.03 0.974 -1016.225 1050.036

By: Teklebirhan A. 55
Cont…
As education increases by one year, the log of the
ratio of the two probabilities, P(off farm=2)/P(on-
farm=1) will decrease by 0.52, and the log of the
ratio of the two probabilities P(migration=3)/P(on-
farm=1) will decrease by 0.11.

Therefore, the more the level of education the more

households will choose on-farm income
diversification than migration and local off farm
participation.
By: Teklebirhan A. 56
Cont…
The ratio of the probability of choosing one category
over the probability of choosing the reference category
is often referred to as relative risk ratio (Odds
Ratio).
So another way of interpreting the multinomial
regression results is in terms of relative risk ratio
(odds ratio) ==see the following slide.

By: Teklebirhan A. 57
Cont…
Multinomial logistic regression Number of obs = 500
LR chi2(8) = 241.43
Prob > chi2 = 0.0000
Log likelihood = -372.34821 Pseudo R2 = 0.2448

divers RRR Std. Err. z P>|z| [95% Conf. Interval]

on_farm (base outcome)

local_off_farm
edu .5907459 .0576212 -5.40 0.000 .4879493 .7151988
sex 3.81e-06 .0020092 -0.02 0.981 0 .
age 1.014513 .0149118 0.98 0.327 .9857031 1.044164
exp 2.233434 .5167425 3.47 0.001 1.419166 3.514899
_cons 1.20e+08 6.33e+10 0.04 0.972 0 .

migration
edu .8994171 .0833128 -1.14 0.252 .750092 1.078469
sex 1.46e-07 .0000768 -0.03 0.976 0 .
age .9984812 .0157093 -0.10 0.923 .9681614 1.029751
exp 4.063402 .9396483 6.06 0.000 2.58257 6.393333
_cons 2.20e+07 1.16e+10 0.03 0.974 0 .

Note: _cons estimates baseline relative risk for each outcome.

By: Teklebirhan A. 58
Cont…
For one year increase in level of education, the probability of
choosing local off farm is 0.591 times the probability of on-
farm income diversification.
So, as age increases by one year, the probability of choosing
migration is 0.998 times the probability of on-farm
diversification.
For a dichotomous dummy explanatory variable such as male,
the ratio of the relative risks of choosing migration (2) over on
farm diversification(1) for male as compare to female is
0.00000381.
The log of the ratio of the two probabilities,
(migration=2)/P(on-farm=1), for male will be lower by
15.74 than female.
59
Thus, male is less probable to migrate. By: Teklebirhan A.
Cont…
Marginal Effect Result

ME after MNL
ME after MNL for local off ME after MNL for
Categories for on-farm
farm diversification Migration
diversification

EDUC 0.0046787 -0.1012054*** 0.0965267

SEX 0 .0792842 *** 0.4155105 *** -0.4947947 ***
Age -0.0000766 0.0040408 -0.0039642

Exp -0.0198351 -0.1342864 0.1541215

By: Teklebirhan A. 60
Cont…
ME after MNL for on-farm income diversification
As the above table shows, male are more probable (8%) to
participate in on farm income diversification than female, and
statistically significant.
ME after MNL for local off-farm income diversification
More educated person is less probable (10%) than less
educated person to participate in local off farm income
diversification, and statistically significant.
Male is more probable (41%) to participate in local off farm
income diversification than female, & statistically significant.
 Thus, it is more probable for male and less educated person
to participate in local off farm income diversification.
61
By: Teklebirhan A.
Cont…
ME after MNL for migration

Male are 49.5% less probable than female to

participate in migration

By: Teklebirhan A. 62
Cont…
We can also determine the probability that an
individual chooses each alternatives using the
following command.
predict plogit1 plogit2 plogit3, pr
Finally, we can also determine the average probability
for each category using the following Stata command.
summarize plogit1 plogit2 plogit3

By: Teklebirhan A. 63
Cont…

. predict plogit1 plogit2 plogit3, pr

. summarize plogit1 plogit2 plogit3

Variable Obs Mean Std. Dev. Min Max

plogit1 500 .1342627 .1382494 1.55e-10 .6204082

plogit2 500 .3740613 .2271436 .0028397 .8532475
plogit3 500 .491676 .292386 .0880502 .9971603

By: Teklebirhan A. 64
Cont…
The above summary statistics of the probability of
choosing on-farm diversification, local off farm
diversification and migration of rural households
showed that about half of the rural households have at
least one migrant family member and therefore, rural
households used migration as one type of income
diversification.

By: Teklebirhan A. 65
3.3 Ordered Logit Model
The ordered logit model is also known as the proportional
odds model.
The terms parallel lines model and parallel regressions
model are also sometimes used.

In ordered logit model, there is observed ordinal variable, Y

which in turn, is a function of another variable, Y*, that is not
measured.

That means, in ordered logit model, there is a continuous,

unmeasured latent variable Y*, whose values determine the
value of ordinal variable Y.
By: Teklebirhan A. 66
Cont…
The continuous latent variable Y* has various
threshold points (cut points).

Your value on the observed variable Y depends on

whether or not you have crossed a particular
threshold.

These cut points (thresholds) are represented by k

which is the Greek small letter kappa.

By: Teklebirhan A. 67
Cont…
For example, when the number of categories are three
(M=3)

For example, it might be that if your score on the

unobserved latent variable Y* was 10 utils or less, your
score on Y would be low(1); if your Y* score was b/n
10 and 25 utils, Y would be medium (2); and if your
score on latent variable (Y*) was above 25, Y would be
high(3).
By: Teklebirhan A. 68
Cont…
Put another way, you can think of Y as being a
collapsed version of Y*.
Example Y* can take on an infinite range of values
which might then be collapsed into 3 categories of Y.
So, what does Y* equal? How do you estimate this
model?
In the population, the continuous latent variable Y* is
equal to

By: Teklebirhan A. 69
Cont…
Note that there is a random disturbance term, which,
in this case, has a standard logistic distribution (mean
of 0 and variance of 3.29).
This reflects the fact that relevant variables may
be left out of the equation, or variables may not
be perfectly measured.
The ordered logit model estimates part of the above:

By: Teklebirhan A. 70
Cont…

By: Teklebirhan A. 71
Cont…

By: Teklebirhan A. 72
Cont…

By: Teklebirhan A. 73
Cont…
Example on Ordered Logit
Suppose that we want to study the level and determinants
of the service satisfaction of the employees of AAU.

To this end, data on the level of satisfaction (low, medium

and high), years of schooling, gender, age and experience
were collected from 500 respondents.

In this example, the response variable, level of satisfaction

has ordered categorical variables (low, medium & high) and
therefore, we can use Ordered Logit/Probit Model.
By: Teklebirhan A. 74
Cont…
Three of the explanatory variables are quantitative
(experience, age and education) while the other one is
qualitative or categorical variable (gender).

By: Teklebirhan A. 75
Cont…
Basic Commands
 summarize
 tabulate LEVEL
 ologit LEVEL EDUC MALE AGE EXPR
 ologit LEVEL EDUC MALE AGE EXPR, or
 mfx, predict(outcome (1))
 mfx, predict(outcome (2))
 mfx, predict(outcome (3))
 predict plogit1 plogit2 plogit3, pr
 summarize plogit1 plogit2 plogit3
 linktest
By: Teklebirhan A. 76
Cont…
. describe

Contains data from D:\2010\Econometrics\Stata Training\Stata14\Ordered Logit.dta

obs: 500
vars: 5 29 Mar 2019 02:35
size: 2,500

storage display value

variable name type format label variable label

LEVEL byte %10.0g LEVEL LEVEL OF SATISFACTION

EDUC byte %10.0g EDUC
MALE byte %10.0g MALE
AGE byte %10.0g AGE
EXPR byte %10.0g EXPR

By: Teklebirhan A. 77
Cont…
. sum

Variable Obs Mean Std. Dev. Min Max

LEVEL 498 2.35743 .7069325 1 3

EDUC 500 11.652 1.644477 8 17
MALE 500 .89 .3132031 0 1
AGE 500 45.1 10.84275 22 88
EXPR 500 1.378 1.656685 0 7

By: Teklebirhan A. 78
Cont…
. tabulate LEVEL

LEVEL OF
SATISFACTIO
N Freq. Percent Cum.

Low 67 13.45 13.45

Moderate 186 37.35 50.80
High 245 49.20 100.00

Total 498 100.00

By: Teklebirhan A. 79
Cont…
ologit LEVEL EDUC MALE AGE EXPR
Ordered logistic regression Number of obs = 498
LR chi2(4) = 185.97
Prob > chi2 = 0.0000
Log likelihood = -398.38453 Pseudo R2 = 0.1892

LEVEL Coef. Std. Err. z P>|z| [95% Conf. Interval]

EDUC .0798993 .0600279 1.33 0.183 -.0377533 .1975519

MALE -3.409955 .6115636 -5.58 0.000 -4.608597 -2.211312
AGE -.0082687 .0090129 -0.92 0.359 -.0259336 .0093962
EXPR .699957 .0748069 9.36 0.000 .5533381 .8465758

/cut1 -4.024487 1.017069 -6.017906 -2.031069

/cut2 -1.644877 1.013087 -3.630491 .340738

By: Teklebirhan A. 80
Cont…
Ordered Logit Interpretation
The level of satisfaction is better (from low to medium
to high) with higher level of education and experience,
female and lower age.

Both gender and experience are statistically

significant; education and age are not.
For experience, a one year increases in experience, we
expect a 0.69957 increase in the log odds of being in a
higher level of satisfaction.
By: Teklebirhan A. 81
Cont…
The coefficient of male is the ordered log-odds
estimate of comparing male to female on expected
level of satisfaction.

The ordered logit for male being in a higher level of

satisfaction category is 3. 40 less than female.

By: Teklebirhan A. 82
Cont…
Odds Ratio Interpretation of the ordered logit model
ologit LEVEL EDUC MALE AGE EXPR, or

By: Teklebirhan A. 83
Cont…
For gender, the odds of being in the higher level of
satisfaction of male is 0.0330 times that of female
staff.

This means that, there is greater probability for

female to be in the higher level of satisfaction or
female are more satisfied than male.

By: Teklebirhan A. 84
Cont…
Probability Interpretation of ordered logit

By: Teklebirhan A. 85
Cont…
A year increase in education is associated with 0.6%
less likely to be in the low level of satisfaction, 1.4%
less likely to be in the medium level of satisfaction and
2% more likely to be in the high level of satisfaction.
As years of schooling (education) increases by one year,
the probability of being in the higher level of
satisfaction increases.
NB: Sum of the probability of each category is Zero for each
explanatory variable.

By: Teklebirhan A. 86
Cont…
Similarly, one year increases in experience is
associated with 4.8% less likely to be in the low level
of satisfaction, 12.6% less likely to be in the medium
level of satisfaction and 17.4% more likely to be in
the high level of satisfaction.

Male is 10.2% more likely, 41.8% more likely and

52% less likely to be in the low, medium and high
level of satisfaction, respectively.

By: Teklebirhan A. 87
Cont…
Predict plogit1 plogit2 plogit3

By: Teklebirhan A. 88
Cont…
As you can see, the predicted probability of being in
the lowest level of satisfaction is 13.3% and that of
the middle and highest level of satisfaction are 37.6%
and 49.2%, respectively.

We can infer that about half of the employees of the

university are highly satisfied with the services and the
main determinants of the level of satisfaction are
experience and gender.

By: Teklebirhan A. 89
Cont…
Link test of model specification
The test result showed that there is no problem.

By: Teklebirhan A. 90
Cont…
Measures of fit
The R2 don’t make sense for logit and probit model
rather we use pseudo-R2.
The pseudo-R2 measure the fit using the likelihood
function and measures the improvement in the value of
the log likelihood, relative to having no explanatory
variables (Xi).
Suppose that if pseudo-R2 = 0.189
This suggests that the log-likelihood value increases by
about 18.9% with the introduction of the set of regressors.
By: Teklebirhan A. 91
Cont…
Note:

In any of the discrete choice models, we first run a

linear regression to test for multicollinearity using
VIF command, and then test for multicollinearity via
the vif command.

By: Teklebirhan A. 92
End!

Thank you!
By: Teklebirhan A. 93

CH-4-Discrete Choice Models-Short
No ratings yet
CH-4-Discrete Choice Models-Short
58 pages
Logistic Regression
100% (3)
Logistic Regression
41 pages
Probit Model
No ratings yet
Probit Model
29 pages
Chapter 3 - Logit and Probit Models
No ratings yet
Chapter 3 - Logit and Probit Models
34 pages
Logit & Probit Model
No ratings yet
Logit & Probit Model
51 pages
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
No ratings yet
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
27 pages
Logistic Regression
0% (1)
Logistic Regression
71 pages
Qualitative Response Regression Model - Probabilistic Models
No ratings yet
Qualitative Response Regression Model - Probabilistic Models
34 pages
Econometrics II CH 1
No ratings yet
Econometrics II CH 1
48 pages
5.1) Binary Logistic Regression
No ratings yet
5.1) Binary Logistic Regression
32 pages
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
100% (1)
Logit, Probit and Multinomial Logit Models in R: Oscar Torres-Reyna
24 pages
Logit Probit
No ratings yet
Logit Probit
20 pages
LPM, Logit and Probit Models
No ratings yet
LPM, Logit and Probit Models
21 pages
Multinomial Logit or Probit Model 2
No ratings yet
Multinomial Logit or Probit Model 2
13 pages
Logistic Regression Analysis
No ratings yet
Logistic Regression Analysis
16 pages
09-Limited Dependent Variable Models
No ratings yet
09-Limited Dependent Variable Models
71 pages
Unitb - II - Linear Probability, Logit and Probit
No ratings yet
Unitb - II - Linear Probability, Logit and Probit
34 pages
Msfe Week9
No ratings yet
Msfe Week9
5 pages
Metrikaq
No ratings yet
Metrikaq
11 pages
Bgpev2 LDV
No ratings yet
Bgpev2 LDV
53 pages
Cap1 Slides
No ratings yet
Cap1 Slides
30 pages
Newsletter 23 - Logit, Probit, Tobit (2P)
No ratings yet
Newsletter 23 - Logit, Probit, Tobit (2P)
2 pages
Econometrics - Qualitative Response Models
No ratings yet
Econometrics - Qualitative Response Models
17 pages
Econometrics Eviews 6
No ratings yet
Econometrics Eviews 6
12 pages
Chapter 4
No ratings yet
Chapter 4
11 pages
Logit R101
No ratings yet
Logit R101
27 pages
Chapter 7. Software Application
No ratings yet
Chapter 7. Software Application
43 pages
Logit and Probit: Models With Discrete Dependent Variables
No ratings yet
Logit and Probit: Models With Discrete Dependent Variables
30 pages
Chapter 15 Qualitative Response Regression Models Part 2
No ratings yet
Chapter 15 Qualitative Response Regression Models Part 2
31 pages
26GeneralizedLinearModelBernoulliAnnotated PDF
No ratings yet
26GeneralizedLinearModelBernoulliAnnotated PDF
46 pages
Multiple Choice Models Part I - MNL, Nested Logit
No ratings yet
Multiple Choice Models Part I - MNL, Nested Logit
33 pages
Chapter 5-LDVM-2024
No ratings yet
Chapter 5-LDVM-2024
27 pages
Econometria Avanzada: Generalized Linear Models
No ratings yet
Econometria Avanzada: Generalized Linear Models
30 pages
Section 11 PDF
No ratings yet
Section 11 PDF
7 pages
Econometrics CH 4
No ratings yet
Econometrics CH 4
14 pages
Bio2 Module 5 - Logistic Regression
No ratings yet
Bio2 Module 5 - Logistic Regression
19 pages
L9 Logistical Regression Models Updated
No ratings yet
L9 Logistical Regression Models Updated
10 pages
Classification With Logistic Regression: DR Sandipan Karmakar Mnit Jaipur
No ratings yet
Classification With Logistic Regression: DR Sandipan Karmakar Mnit Jaipur
54 pages
Binaryresponsemf IMP
No ratings yet
Binaryresponsemf IMP
11 pages
CH 5. Discrete Choice Model
No ratings yet
CH 5. Discrete Choice Model
38 pages
SOC6078 SOC6078 Advanced Statistics: 4. Models For Categorical Dependent Variables II Extending The Logit and Probit Models
No ratings yet
SOC6078 SOC6078 Advanced Statistics: 4. Models For Categorical Dependent Variables II Extending The Logit and Probit Models
15 pages
Limited Dependent Variables - Binary Dependent Variables
No ratings yet
Limited Dependent Variables - Binary Dependent Variables
24 pages
Econometric Lec7
No ratings yet
Econometric Lec7
26 pages
Chapter 5 MGT
No ratings yet
Chapter 5 MGT
60 pages
DS535 Note 4 (With Marks)
No ratings yet
DS535 Note 4 (With Marks)
18 pages
Qualitative Response Models
No ratings yet
Qualitative Response Models
35 pages
Seminar Econometrie
No ratings yet
Seminar Econometrie
15 pages
Lecture 7 - Binary
No ratings yet
Lecture 7 - Binary
45 pages
Section 9 Limited Dependent Variables
No ratings yet
Section 9 Limited Dependent Variables
17 pages
3 Classification
No ratings yet
3 Classification
26 pages
Lecture15 Binary Dependent Variables
No ratings yet
Lecture15 Binary Dependent Variables
38 pages
Assignment 2
No ratings yet
Assignment 2
11 pages
Binary Logistic Regression - 6.2
No ratings yet
Binary Logistic Regression - 6.2
34 pages
Qualitative Methods
No ratings yet
Qualitative Methods
7 pages
7 Binaryresponsemf
No ratings yet
7 Binaryresponsemf
11 pages
Assignment On Probit Model
No ratings yet
Assignment On Probit Model
17 pages
BSC Intermediate Econometrics: Please Do Not Distribute
No ratings yet
BSC Intermediate Econometrics: Please Do Not Distribute
25 pages
TransCAD Demo Guide Compactado Páginas 39 69
No ratings yet
TransCAD Demo Guide Compactado Páginas 39 69
31 pages
NLOGIT Manual
No ratings yet
NLOGIT Manual
173 pages
Chapter 1 Econometrics
No ratings yet
Chapter 1 Econometrics
21 pages
Experimental Auctions Methods and Applications in Economic and Marketing Research
100% (1)
Experimental Auctions Methods and Applications in Economic and Marketing Research
315 pages
A Course in Environmental Economics Theory Policy and Practice 1
No ratings yet
A Course in Environmental Economics Theory Policy and Practice 1
370 pages
Cee561 L2
No ratings yet
Cee561 L2
25 pages
TR 631 LT 2.4 Mode Choice Modelling
No ratings yet
TR 631 LT 2.4 Mode Choice Modelling
67 pages
15 - Isaac Wachira Mwangi - Final Paper
No ratings yet
15 - Isaac Wachira Mwangi - Final Paper
21 pages
Nested Logit Transpo
No ratings yet
Nested Logit Transpo
9 pages
Introduction To Discrete Choice Models
No ratings yet
Introduction To Discrete Choice Models
6 pages
Subject: Logistics and Retail Information Credits: 4
No ratings yet
Subject: Logistics and Retail Information Credits: 4
135 pages
Discrete Choice Analysis I: Moshe Ben-Akiva
No ratings yet
Discrete Choice Analysis I: Moshe Ben-Akiva
38 pages
Pricing Analytics Models and Advanced Quantitative Techniques For Product Pricing First Edition Paczkowski
100% (2)
Pricing Analytics Models and Advanced Quantitative Techniques For Product Pricing First Edition Paczkowski
65 pages
Urban and Regional Transportation Modeling Essays in Honor of David Boyce New Dimensions in Networks Der-Horng Lee
100% (11)
Urban and Regional Transportation Modeling Essays in Honor of David Boyce New Dimensions in Networks Der-Horng Lee
82 pages
Toward Understanding Nurses' Decisions Whether To Miss Care - A Discrete Choice Experiment
No ratings yet
Toward Understanding Nurses' Decisions Whether To Miss Care - A Discrete Choice Experiment
11 pages
Chapter 3 An Illustrative Example of Case 1 Best-Worst Scaling - Non-Market Valuation With R
No ratings yet
Chapter 3 An Illustrative Example of Case 1 Best-Worst Scaling - Non-Market Valuation With R
41 pages
Syllabus EC995 Xia
No ratings yet
Syllabus EC995 Xia
3 pages
Airport Access System: Passenger Terminal Design and Access
No ratings yet
Airport Access System: Passenger Terminal Design and Access
25 pages
Ec1 18 PDF
No ratings yet
Ec1 18 PDF
32 pages
PHD Thesis - Charlotte Watteyn
No ratings yet
PHD Thesis - Charlotte Watteyn
284 pages
Rose & Bliemer 2009 - Constructing Efficient Stated Choice Experimental Designs
No ratings yet
Rose & Bliemer 2009 - Constructing Efficient Stated Choice Experimental Designs
32 pages
Public Benefits of Undeveloped Lands On Urban Outskirts PDF
No ratings yet
Public Benefits of Undeveloped Lands On Urban Outskirts PDF
51 pages
2009 - Christie - An Economic Assessment of The Amenity Benefits Associated With Alternative Coastal Defence Options
No ratings yet
2009 - Christie - An Economic Assessment of The Amenity Benefits Associated With Alternative Coastal Defence Options
20 pages
Stated Benefits of Teleworking in Mexico City A Discrete Choice Experiment On Office Workers
No ratings yet
Stated Benefits of Teleworking in Mexico City A Discrete Choice Experiment On Office Workers
65 pages
Dynamic Discrete Choice Models:: An Application To Vehicle Holding Decisions
No ratings yet
Dynamic Discrete Choice Models:: An Application To Vehicle Holding Decisions
56 pages
Goodwill and Dynamic Advertising Strateg
No ratings yet
Goodwill and Dynamic Advertising Strateg
38 pages
ASDM
No ratings yet
ASDM
87 pages
An Application of RP-SP Data For Joint Estimation of Mode Choice Models
No ratings yet
An Application of RP-SP Data For Joint Estimation of Mode Choice Models
22 pages
Seminar Synopsis
No ratings yet
Seminar Synopsis
17 pages

CH-4-Discrete Choice Models-PG (Compatibility Mode)

Uploaded by

CH-4-Discrete Choice Models-PG (Compatibility Mode)

Uploaded by

Applied econometrics for

ManageMent (MgMt MSC 5411)

Teklebirhan Alemnew (Assistant Professor)

Both Yi and Ui take only two values with

e) The true relationship between a binary outcome and

Violation of the assumption of Heteroscedasticity

Where, Li is called Logit which is linearly related

Cumulative Normal Distribution Function

Logistic Distribution Function

NB: the interpretation is similar with Logit estimation

Variable VIF 1/VIF

ase 1.19 0.840735

Mean VIF 1.13

Breusch-Pagan / Cook-Weisberg test for heteroskedasticity

grade Coef. Std. Err. z P>|z| [95% Conf. Interval]

_hat .9551764 .383456 2.49 0.013 .2036165 1.706736

 The insignificant hat square shows that the model

Then the consumer picks the alternative that maximizes his or

MNL model is a simple extension to the logit model

In Multinomial Logit Model (MLM), a response variable

That means, in multinomial logit model we have K-1

That is why Multinomial logit models are called multi-

The choice of reference /base category is arbitrary.

Example: if our dependent variable Y= 1, 2, 3 then

The multinomial logit is equivalent to running a series of

Multinomial logistic regression simultaneously estimates the

Hence, for each case, there will be M-1 predicted log

When there are more than 2 groups, computing

Once you have done that the calculation of the

Note that, when M=2, the mlogit and logistic

Variable Obs Mean Std. Dev. Min Max

divers 500 2.358 .7062816 1 3

on farm 67 13.40 13.40

Total 500 100.00

divers Coef. Std. Err. z P>|z| [95% Conf. Interval]

on_farm (base outcome)

Therefore, the more the level of education the more

divers RRR Std. Err. z P>|z| [95% Conf. Interval]

on_farm (base outcome)

Note: _cons estimates baseline relative risk for each outcome.

EDUC 0.0046787 -0.1012054*** 0.0965267

Exp -0.0198351 -0.1342864 0.1541215

Male are 49.5% less probable than female to

. predict plogit1 plogit2 plogit3, pr

. summarize plogit1 plogit2 plogit3

Variable Obs Mean Std. Dev. Min Max

plogit1 500 .1342627 .1382494 1.55e-10 .6204082

In ordered logit model, there is observed ordinal variable, Y

That means, in ordered logit model, there is a continuous,

Your value on the observed variable Y depends on

These cut points (thresholds) are represented by k

For example, it might be that if your score on the

To this end, data on the level of satisfaction (low, medium

In this example, the response variable, level of satisfaction

Contains data from D:\2010\Econometrics\Stata Training\Stata14\Ordered Logit.dta

storage display value

LEVEL byte %10.0g LEVEL LEVEL OF SATISFACTION

Variable Obs Mean Std. Dev. Min Max

LEVEL 498 2.35743 .7069325 1 3

Low 67 13.45 13.45

Total 498 100.00

LEVEL Coef. Std. Err. z P>|z| [95% Conf. Interval]

EDUC .0798993 .0600279 1.33 0.183 -.0377533 .1975519

/cut1 -4.024487 1.017069 -6.017906 -2.031069

Both gender and experience are statistically

The ordered logit for male being in a higher level of

This means that, there is greater probability for

Male is 10.2% more likely, 41.8% more likely and

We can infer that about half of the employees of the

In any of the discrete choice models, we first run a

You might also like