0% found this document useful (0 votes)

303 views5 pages

Multinomial Logistic Regression Models: Newsom Psy 525/625 Categorical Data Analysis, Spring 2021 1

Multinomial logistic regression models estimate the association between predictors and a multicategory nominal outcome variable. The model compares each outcome category to a reference category using logits. Predicted probabilities are estimated for each category based on maximum likelihood estimation. The model assumes independence of irrelevant alternatives, meaning adding a new category would equally impact existing categories. Multinomial probit models are an alternative that do not assume independence of irrelevant alternatives but are more difficult to estimate. Multinomial logistic regression can be estimated in SPSS, R, and SAS.

Uploaded by

Deo Tureme

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

303 views5 pages

Multinomial Logistic Regression Models: Newsom Psy 525/625 Categorical Data Analysis, Spring 2021 1

Uploaded by

Deo Tureme

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

Newsom

Psy 525/625 Categorical Data Analysis, Spring 2021 1

Multinomial Logistic Regression Models

Multinomial logistic regression models estimate the association between a set of predictors and a
multicategory nominal (unordered) outcome. Examples of such an outcome might include “yes,” “no,”
and “don’t know”; “Apple iPhone,” “Android,” and “Samsung Galaxy”; or “walk,” “bike,” “car,” “public
transit.” The most common form of the model is a logistic model that is a generalization of the binary
outcome of standard logistic regression involving comparisons of each category of the outcome to a
referent category. There are J total categories of the outcome, indexed by the subscript j, and the
number of comparisons is then J – 1. The equation for the model is written in terms of the logit of the
outcome, which is a comparison of a particular category to the referent category, both denoted πj here.

π j 
ln  = αj + βjX
 π 
 j

The natural log of the ratio of the two proportions is the same as the logit in standard logistic regression,
where ln(πj/πj) replaces ln[π/(1-π)] , and is sometimes referred to as the generalized logit. The binary
logistic model is therefore a special case of the multinomial model. In generalized linear modeling terms,
the link function is the generalized logit and the random component is the multinomial distribution. The
model differs from the standard logistic model in that the comparisons are all estimated simultaneously
within the same model. The j subscript on both the intercept, αj, and slope, βj, indicate that there is an
intercept and a slope for the comparison of each category to the referent category. Note that in the
ordinal logistic model, there is only one slope coefficient for each predictor. Odds ratios for each
coefficient (for predicting the difference of one category response from the referent) are computed as
usual, with OR = e β , and represent the odds increase (or decrease) for category j compared with the
j

referent category for each unit increase in X.

The predicted probabilities can be computed from the model parameters for a specific value of X. For
the standard logistic regression, we used the logistic transformation to find the probability according to
the logistic cumulative distribution function (cdf; see the “Logistic Regression” handout). For a simple
logistic regression with one predictor, we used
1
π=
1 + eα + β X

Entering in a specific value of X and the model estimates of α and β and using the exponential function,
the estimate of the expected probability can be computed for the specific value of X. (For additional
predictors, the values for X and β for those variables are added to the exponent in the denominator.)
Predicted probabilities (y-axis) are then often plotted with a separate line for each comparison as a
function of the X variable values (x-axis).

The cdf transformation for the multinomial distribution must add the exponent functions of the intercepts
and the coefficients for each of the comparisons to the referent category. 1 For a single predictor, the
predicted probability can be computed by generalizing the above equation for standard logistic, using the
following equation with as many additional J – 1 terms in the denominator for every comparison to the
referent category:

1 1
πj =
1+ ∑ e j j
α j +β j X α j +β j X α J −1 + β J −1 X α +β X
1+ e +e ... e

1
The referent category in the logistic or the multinomial logistic have e0, which is 1. This is why 1 appears in the numerator and denominator.
Newsom
Psy 525/625 Categorical Data Analysis, Spring 2021 2

The result is the estimated proportion for the referent category relative to the total of the proportions of all
categories combined (1.0), given a specific value of X and the intercept and slope coefficient(s).
Maximum likelihood is the most common estimation used for multinomial logistic regression. And, as with
logistic regression, model fit tests, such as the likelihood ratio test with degrees of freedom equal to J –
1, 2 are used to determine whether together all of the comparisons to the referent are significant.

The multinomial logistic models assume that there is independence of irrelevant alternatives (IIA). The
assumption is that if an additional category was to be added to the outcome, the proportions for the
original categories would be equally affected by adding the new category (e.g., adding a third party
candidate would equally impact votes for the two major party candidates). 3 As this example suggests,
the IIA assumption is not particularly realistic in many situations, even though it is needed for truly
unbiased estimates of the observed and predicted proportions. Although tests have been suggested to
investigate violation of the assumption, they do not appear to perform well (e.g., Cheng & Long, 2005;
Fry & Harris, 1996).

Multinomial Probit Models

Multinomial probit models analogous to the binary probit model are also possible, and have been
considered as one potential solution that would be free of the IIA assumption. For the multinomial probit
model, the probit link is used with multivariate normal distribution random component. The use of the
multivariate normal distribution instead of the logistic distribution allows correlations among possible
alternatives. The multinomial probit model, however, has other difficulties that make it a less than optimal
alternative. For the multinomial probit model to have a mathematically identified solution, the predictors
must be included that are associated with each specific alternative. Restrictions on the correlations
among errors are also necessary to obtain estimates, and there may not be a theoretical basis for
making particular restrictions. Multinomial probit is not available in the current version of SPSS but can
be estimated in Limdep (maximum likelihood) or MNP (Bayesian) package in R, PROC MDC in SAS
(maximum likelihood).

Discrete Choice Models

The term discrete choice model is very generally applied to binary and multiple-category (ordinal or
nominal) outcomes. No particular estimation approach or statistical model is implied by the use of the
term discrete choice, but most often the reference to discrete choice is to multinomial models (most
typically multinomial logistic). The terminology is applied in consumer choice studies in marketing
(McFadden, 1974), transportation research, and economics among other fields. The goal is simply to
predict one decision over another using a set of explanatory factors. Model results are often described in
terms of the economic theory of utility (systematic function of the predictors and random error) of one
choice over another, representing the amount of gain to an individual by selecting one choice over
another.

Multinomial Logistic Example

SPSS
Multinomial logistic models can be estimated in SPSS using the nomreg procedure and in R using the
mlogit package or the nnet package and the multinom function.
GET FILE='c:\jason\spsswin\cdaclass\multinomial.sav'.

nomreg work (base = 1) with age srh married

/print = paramter summary cps mfi.

2
SAS prints the score and Wald test for the model as well.
3
The widely used example is the assumption that the original transportation choices of car and red bus would be equally affected if the choice
was between a car, red bus, or blue bus.
Newsom
Psy 525/625 Categorical Data Analysis, Spring 2021 3

R
> #use lessR routine for listwise deletion
> library(lessR)
> mydata <-Subset(work!='NA' & age!='NA' & srh!='NA' & married!='NA')
> #make sure dv is a factor
> d$work <- factor(d$work)
>
> library(nnet)
> d$work <- relevel(d$work, ref = 1)
> model <- multinom(work ~ age + srh + married, data = d)
# weights: 20 (12 variable)
initial value 709.782713
iter 10 value 325.755760
iter 20 value 297.417808
iter 30 value 297.133699
iter 40 value 297.125872
final value 297.125651
converged
Newsom
Psy 525/625 Categorical Data Analysis, Spring 2021 4

> summary(model)
Call:
multinom(formula = work ~ age + srh + married, data = d)
Coefficients:
(Intercept) age srh married
2 -3.311567 0.07788846 -0.6846427 0.02574789
3 -2.291723 0.05759751 -0.6545843 -0.56346491
4 -7.577511 0.17462753 -0.7710237 -0.18024777
Std. Errors:
(Intercept) age srh married
2 4.090534 0.05608322 0.2636701 0.5429907
3 4.752491 0.06532525 0.3052976 0.6251271
4 3.438072 0.04744140 0.2216583 0.4359652
Residual Deviance: 594.2513
AIC: 618.2513
>
> #obtain odds ratios
> exp(cbind(OR=coef(model), confint(model)))
(Intercept) age srh married
2 0.0364589851 1.081002 0.5042704 1.0260822 0.00001202087
3 0.1010921469 1.059289 0.5196580 0.5692333 0.96847498115
4 0.0005118338 1.190803 0.4625393 0.8350633 0.30076485647

SAS
proc logistic data=one ;
model work (ref=first) = age srh married / link=glogit;
run;

The LOGISTIC Procedure

Model Information

Data Set WORK.ONE

Response Variable work work status
Number of Response Levels 4
Model generalized logit
Optimization Technique Newton-Raphson

Number of Observations Read 566

Number of Observations Used 512

Response Profile

Ordered Total
Value work Frequency

1 full 29
2 part 38
3 retired 426
4 unemployed 19

Logits modeled use work='full' as the reference category.

NOTE: 54 observations were deleted due to missing values for the response or explanatory variables.

Model Fit Statistics

Intercept
Intercept and
Criterion Only Covariates
AIC 652.014 618.251
SC 664.729 669.111
-2 Log L 646.014 594.251
Newsom
Psy 525/625 Categorical Data Analysis, Spring 2021 5

Testing Global Null Hypothesis: BETA=0

Test Chi-Square DF Pr > ChiSq

Likelihood Ratio 51.7624 9 <.0001

Score 45.5293 9 <.0001
Wald 40.2975 9 <.0001

Type 3 Analysis of Effects

Wald
Effect DF Chi-Square Pr > ChiSq

age 3 25.1953 <.0001

srh 3 12.2197 0.0067
married 3 1.2012 0.7527

Analysis of Maximum Likelihood Estimates

Standard Wald
Parameter work DF Estimate Error Chi-Square Pr > ChiSq

Intercept part 1 -3.3093 4.0902 0.6546 0.4185

Intercept retired 1 -7.5753 3.4377 4.8558 0.0276
Intercept unemployed 1 -2.2894 4.7522 0.2321 0.6300
age part 1 0.0779 0.0561 1.9274 0.1650
age retired 1 0.1746 0.0474 13.5471 0.0002
age unemployed 1 0.0576 0.0653 0.7766 0.3782
srh part 1 -0.6846 0.2637 6.7422 0.0094
srh retired 1 -0.7710 0.2216 12.0998 0.0005
srh unemployed 1 -0.6546 0.3053 4.5970 0.0320
married part 1 0.0257 0.5430 0.0022 0.9622
married retired 1 -0.1803 0.4359 0.1710 0.6792
married unemployed 1 -0.5635 0.6251 0.8125 0.3674

Odds Ratio Estimates

Point 95% Wald
Effect work Estimate Confidence Limits

age part 1.081 0.968 1.207

age retired 1.191 1.085 1.307
age unemployed 1.059 0.932 1.204
srh part 0.504 0.301 0.845
srh retired 0.463 0.300 0.714
srh unemployed 0.520 0.286 0.945
married part 1.026 0.354 2.974
married retired 0.835 0.355 1.962
married unemployed 0.569 0.167 1.938

References and Further Reading

Ben-Akiva, M., & Lerman, S. (1985). Discrete choice models. London: MIT Press.
Cheng, S., & Long, J.S. (2007). Testing for IIA in the Multinomial Logit Model. Sociological
Methods & Research, 35, 583-600.
Fry, T. R. L. F., & Harris, M.N. (1996). A Monte Carlo study of tests for the independence of irrelevant alternatives property. Transportation
Research Part B: Methodological 30,: 19-30.
Long, J.S. (2012). Regression models for nominal and ordinal outcomes. In Best and Wolf (Eds), Regression Models. Thousand Oaks, CA:
Sage Publications
McFadden, D. (1974). Conditional Logit Analysis of Qualitative Choice Behavior. In P. Zarembka (Ed.), Frontiers in Econometrics. New York:
Academic Press.

CH-4-Discrete Choice Models-Short
No ratings yet
CH-4-Discrete Choice Models-Short
58 pages
CH-4-Discrete Choice Models-PG (Compatibility Mode)
No ratings yet
CH-4-Discrete Choice Models-PG (Compatibility Mode)
93 pages
Probit Model
No ratings yet
Probit Model
29 pages
Multinomial Logistic Regression Basic Relationships
No ratings yet
Multinomial Logistic Regression Basic Relationships
73 pages
Lecture 8
No ratings yet
Lecture 8
22 pages
Part III - Analysis With NonLinear Models
No ratings yet
Part III - Analysis With NonLinear Models
68 pages
Multinomial Logistic Regression Basic Relationships
No ratings yet
Multinomial Logistic Regression Basic Relationships
73 pages
Multinomial & Ordinal LR Possion1
No ratings yet
Multinomial & Ordinal LR Possion1
63 pages
Bivariate Logistic Regression
100% (1)
Bivariate Logistic Regression
12 pages
Int Statistical Rev - 2009 - Gertheiss - Penalized Regression With Ordinal Predictors
No ratings yet
Int Statistical Rev - 2009 - Gertheiss - Penalized Regression With Ordinal Predictors
21 pages
The Independence of Irrelevant Alternatives - 230919 - 191757
No ratings yet
The Independence of Irrelevant Alternatives - 230919 - 191757
26 pages
Multinomial Logistic Regression
No ratings yet
Multinomial Logistic Regression
17 pages
Multinomial Regression Models
No ratings yet
Multinomial Regression Models
35 pages
5.2) Multinomial Logistic Regression
No ratings yet
5.2) Multinomial Logistic Regression
34 pages
Multinomial Logistic Regression Basic Relationships
No ratings yet
Multinomial Logistic Regression Basic Relationships
73 pages
Multinomial Logistic Regression-3
No ratings yet
Multinomial Logistic Regression-3
19 pages
Multinomial LR
No ratings yet
Multinomial LR
24 pages
Chapter 5
No ratings yet
Chapter 5
25 pages
Multinomial and Ordinal Logistic Regress
No ratings yet
Multinomial and Ordinal Logistic Regress
12 pages
Sarang Ke Liye
No ratings yet
Sarang Ke Liye
14 pages
An Applicationon Multinomial Logistic Regression Model
No ratings yet
An Applicationon Multinomial Logistic Regression Model
22 pages
Dummy Variable CH 5
No ratings yet
Dummy Variable CH 5
22 pages
XSTK
No ratings yet
XSTK
8 pages
Chapter 5 MGT
No ratings yet
Chapter 5 MGT
60 pages
Q.1 Explain The Underlying Ideas Behind The Log It Model. Explain On What Grounds Log It Model Is An Improvement Over Linear Probability Model. Ans
No ratings yet
Q.1 Explain The Underlying Ideas Behind The Log It Model. Explain On What Grounds Log It Model Is An Improvement Over Linear Probability Model. Ans
17 pages
Listcoef
No ratings yet
Listcoef
76 pages
Econometrics II CH 1
No ratings yet
Econometrics II CH 1
48 pages
Exam - Time Series Analysis
No ratings yet
Exam - Time Series Analysis
8 pages
Logistic Regression
100% (1)
Logistic Regression
34 pages
Discriminant Analysis
No ratings yet
Discriminant Analysis
3 pages
Logistic - Poly Regression
No ratings yet
Logistic - Poly Regression
13 pages
An Application On Multinomial Logistic Regression Model
No ratings yet
An Application On Multinomial Logistic Regression Model
22 pages
Multiple Choice Models Part I - MNL, Nested Logit
No ratings yet
Multiple Choice Models Part I - MNL, Nested Logit
33 pages
Garch Model
100% (2)
Garch Model
92 pages
Handbook of Applied Econometrics & Statistical Inference
No ratings yet
Handbook of Applied Econometrics & Statistical Inference
718 pages
Probit Logit Indiana
No ratings yet
Probit Logit Indiana
62 pages
SPSS Data Analysis Examples - Multinomial Logistic Regression
No ratings yet
SPSS Data Analysis Examples - Multinomial Logistic Regression
5 pages
Chapter 5-LDVM-2024
No ratings yet
Chapter 5-LDVM-2024
27 pages
Chapter 2
No ratings yet
Chapter 2
11 pages
Chapter 18
No ratings yet
Chapter 18
9 pages
Polytomous Logistic Regression
No ratings yet
Polytomous Logistic Regression
2 pages
The Best Olr Method
No ratings yet
The Best Olr Method
12 pages
Regression With Categorical Variables
No ratings yet
Regression With Categorical Variables
28 pages
SOC6078 SOC6078 Advanced Statistics: 4. Models For Categorical Dependent Variables II Extending The Logit and Probit Models
No ratings yet
SOC6078 SOC6078 Advanced Statistics: 4. Models For Categorical Dependent Variables II Extending The Logit and Probit Models
15 pages
Assignment On Probit Model
No ratings yet
Assignment On Probit Model
17 pages
Chapter Three 3.0 Methodology 3.1 Source of Data
No ratings yet
Chapter Three 3.0 Methodology 3.1 Source of Data
10 pages
Sta 3010 Quizes
No ratings yet
Sta 3010 Quizes
10 pages
CH 5. Discrete Choice Model
No ratings yet
CH 5. Discrete Choice Model
38 pages
Multinomial Logit or Probit Model 2
No ratings yet
Multinomial Logit or Probit Model 2
13 pages
Multiple Regression: Curve Estimation
100% (2)
Multiple Regression: Curve Estimation
23 pages
Logistic Regression Tutorial
No ratings yet
Logistic Regression Tutorial
25 pages
Newsletter 23 - Logit, Probit, Tobit (2P)
No ratings yet
Newsletter 23 - Logit, Probit, Tobit (2P)
2 pages
Multinomial Logistic Regression Basic Relationships
No ratings yet
Multinomial Logistic Regression Basic Relationships
73 pages
Exploring Marginal Treatment Effects Flexible Estimation Using Stata
100% (1)
Exploring Marginal Treatment Effects Flexible Estimation Using Stata
37 pages
Logistic Ordinal Regression
No ratings yet
Logistic Ordinal Regression
10 pages
Lecture 24: Ordinal Logistic Regression
No ratings yet
Lecture 24: Ordinal Logistic Regression
4 pages
Group Assignment For Econometrics
75% (4)
Group Assignment For Econometrics
2 pages
Cointegration Copenhagen
100% (1)
Cointegration Copenhagen
51 pages
Chapter 0 - Multiple Regression Models
No ratings yet
Chapter 0 - Multiple Regression Models
34 pages
Multinomial Logistic Regression - Spss Data Analysis Examples
No ratings yet
Multinomial Logistic Regression - Spss Data Analysis Examples
1 page
Study Material - Econometrics
No ratings yet
Study Material - Econometrics
20 pages
Section 9 Limited Dependent Variables
No ratings yet
Section 9 Limited Dependent Variables
17 pages
Cheat Sheet Statistics
No ratings yet
Cheat Sheet Statistics
3 pages
Managerial Economics
No ratings yet
Managerial Economics
29 pages
Multiple Choice Test Bank Questions No Feedback - Chapter 3
100% (1)
Multiple Choice Test Bank Questions No Feedback - Chapter 3
5 pages
STATA Training Session 3
No ratings yet
STATA Training Session 3
53 pages
Multinomial Logistic Regression - R Data Analysis Examples - IDRE Stats
No ratings yet
Multinomial Logistic Regression - R Data Analysis Examples - IDRE Stats
8 pages
Article: An Introduction Tos Logistic Regression Analysis and Reporting
No ratings yet
Article: An Introduction Tos Logistic Regression Analysis and Reporting
5 pages
CH 12
No ratings yet
CH 12
30 pages
SRM Notes
No ratings yet
SRM Notes
96 pages
Chapter3 Solutions
No ratings yet
Chapter3 Solutions
5 pages
Problem Set 2 Answer PDF
No ratings yet
Problem Set 2 Answer PDF
5 pages
3.linear Regression
No ratings yet
3.linear Regression
33 pages
A Time-Space Dynamic Panel Data Model With Spatial Moving Average Errors PDF
No ratings yet
A Time-Space Dynamic Panel Data Model With Spatial Moving Average Errors PDF
53 pages
7 - 8. Hanke, John E. - Wichern, Dean W. - Business Forecasting
No ratings yet
7 - 8. Hanke, John E. - Wichern, Dean W. - Business Forecasting
60 pages
Syed Irfan Haider: ID # 2836 Course: QTIA Submitted To: Sir Imtiaz Arif
No ratings yet
Syed Irfan Haider: ID # 2836 Course: QTIA Submitted To: Sir Imtiaz Arif
17 pages
ECN224 Exe 1
No ratings yet
ECN224 Exe 1
2 pages
Lecture13 PDF
No ratings yet
Lecture13 PDF
48 pages
Cost Estimation Using Regression Analysis
No ratings yet
Cost Estimation Using Regression Analysis
9 pages
Mtcars: Choosing The Most Related Variable (S) To The Response
No ratings yet
Mtcars: Choosing The Most Related Variable (S) To The Response
13 pages
Ardl Modeling Using R Software
No ratings yet
Ardl Modeling Using R Software
5 pages
Andre Silvius Battu Ok
No ratings yet
Andre Silvius Battu Ok
17 pages
Suest - Seemingly Unrelated Estimation: 20 Estimation and Postestimation Commands
No ratings yet
Suest - Seemingly Unrelated Estimation: 20 Estimation and Postestimation Commands
19 pages
BMJ h2622 Full
No ratings yet
BMJ h2622 Full
2 pages
Corporate Social Responsibility
No ratings yet
Corporate Social Responsibility
31 pages
Instrumental Variables: 2-Stage and 3-Stage Least Squares Regression of A Linear Systems of Equations
No ratings yet
Instrumental Variables: 2-Stage and 3-Stage Least Squares Regression of A Linear Systems of Equations
22 pages
结构方程模型输出
No ratings yet
结构方程模型输出
2 pages
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
From Everand
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
SUJAUL CHOWDHURY
No ratings yet
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
From Everand
Multi-dimensional Monte Carlo Integrations Utilizing Mathematica
SUJAUL CHOWDHURY
No ratings yet
K Nearest Neighbor Algorithm: Fundamentals and Applications
From Everand
K Nearest Neighbor Algorithm: Fundamentals and Applications
Fouad Sabry
No ratings yet