0% found this document useful (0 votes)
64 views

Multinomial Logistic Regression-3

Multinomial logistic regression is used when the dependent variable has more than two nominal categories. This document outlines a study examining factors that influence different types of diabetes. It discusses the assumptions, types, and interpretation of multinomial logistic regression. The example analyzes 145 patients and three independent variables to predict whether they have overt, chemical, or normal diabetes. It finds the variables are significant predictors and 66.7-76.7% of the variation is explained by the model.

Uploaded by

Ali Hassan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
64 views

Multinomial Logistic Regression-3

Multinomial logistic regression is used when the dependent variable has more than two nominal categories. This document outlines a study examining factors that influence different types of diabetes. It discusses the assumptions, types, and interpretation of multinomial logistic regression. The example analyzes 145 patients and three independent variables to predict whether they have overt, chemical, or normal diabetes. It finds the variables are significant predictors and 66.7-76.7% of the variation is explained by the model.

Uploaded by

Ali Hassan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 19

Multinomial Logistic Regression

Supervisor: Sir. Rizwan


Presented by: Yameen
University Roll No. 8406
Course: Seminar
M.Sc. Statistics
(Session 2015 to 2017)
Outline
o Definition (MLR)
o Introduction
o Assumption
o Types
o SPSS
o Interpretation
o Practical life example
Definition
Multinomial Logistic Regression is the linear
regression analysis to conduct when the dependent
variable is nominal with more than two levels. Thus
it is an extension of logistic regression, which
analyzes dichotomous (binary) dependents.
Introduction
Multiple logistic regression when the dependent variable is nominal
(categorical; meaning that it falls into any one of a set of categories
which can't be ordered in any meaning way) and for which there are
more than two categories.
Example:
Consider a study of choice of a health plan from among three plans
offered to the employees of a large corporation .the outcome variable
has three level of indicating which plan, A.B.C.is chosen possible
covariate might include Gender, Age ,Income Family size another
 
Types

 Binary logistic regression


 Multiple logistic regression
 Multinomial logistic regression
Detail Of Type
Binary variables are variables.
which only take two values. For
example, Male or Female, True or False and Yes or No. While many
variables and questions are naturally binary, it is often useful to
construct binary variables from other types of data.
multiple logistic regression.
when you have one nominal and two
or more measurement variables.
multinomial logistic regression.
when you have nominal variable
two or more measurement variables.
Model:
Log(=+++···+

Where

is the intercept j=1,2,…,k-1

,,…., are regression coefficients


, ,…., are explanatory variables
Assumption
 Dependent variable should be Nominal
 independent variables that are continuous, nominal
Example:
We want to study about the factors that have influence on
diabetes.
Our response variable is diabetes and its categories may be :
Chemical diabetes 2) Overt diabetes 3) normal diabetes
Many variables have influence on diabetes but we take only
three of them :
Insulin response (IR) 2) steady- state plasma glucose (SSPG)
3) relative weight (RW).
These measurements were taken on 145 volunteers who were
subjected to same regimen.
z
Solution:
N: shows the number of observation.
Marginal percentage - The marginal percentage lists
the proportion of valid observations found in each of
the outcome variable's groups. This can be calculated
by dividing the N.
Valid - This indicates the number of observations in the
dataset where the outcome variable and all predictor
variables are non-missing.
Missing - This indicates the number of observations in
the dataset where data are missing from the outcome
variable or any of the predictor variables.
Total -This indicates the total number of observations
in the dataset--the sum of the number of observations in
which data are missing and the number of observations
with valid data.
Results interpretation- Out of 145 people 33 have
overt diabetes, 36 have chemical diabetes and 76 have
normal diabetes. Thus marginal percentage
(33/145)*100 = 22.8%
Sig: The p = 0.000 which is less than 0.05 indicate that the
regression coefficients are not equal to zero. i.e. all the
variables are significant.
This table contain the chi-square statistic. This
statistic intended to test whether the observed data are
consistent with the fitted model.
Sig: Shows the p = 1.000 value is greater than 0.05
so we conclude that observed data are consistent with
the fitted model.
There three R2 that tells the variation in the data.
Cox and Snell: is based on the log likelihood for the model compare to
the log likelihood for intercept model. Its value is less than one.
Nagelkerke: it adjusts the Cox & Snell’s so that the range of possible
values extends to one.
McFadden: its value depends on the estimated likelihood. Its range is
0 to 1 but never reach 1.
Hence the value of R2 = 0.667 , 0.767 and 0.539 which indicate that
66.7% , 76.7% and 53.9% of the variation in response variable is due
to explanatory variables.
Interpretation:
= -1.903 and -7.611 are the average values of response variable
when the effect of explanatory variables are zero.
= 0.046 it means that the average increase in response variable is
0.046 due to unit increase in SSPG when all the other variables
held constant.
Sig: The “P” value of SSPG is 0.000 which is less than our
significance level set at 5% so we reject our null hypothesis. So
we conclude that SSPG has a significant effect on diabetes.
95% Confidence Interval: This is the Confidence Interval for
an individual regression coefficient given in the model. For a
given predictor with a level of 95% confidence, we would say
that we are 95% confident that the "true" population regression
coefficient lies in between the lower and upper limit of the
Practical example
you could use multinomial logistic regression to
understand which type of drink consumers prefer based
on location in the UK and age the dependent variable
would be type of drink and the four categories – Coffee,
Soft Drink, Tea and Water – and your 
Application
multinomial logistic regression every field apply when
the response variable are categorical form. Then we used
multinomial logistic regression .( Field,
medical,sports,stduents etc)
Reference:
 www.research.gate.net
Regression Analysis by Example 5th edition by
Samprit Chatterjee and Ali S. Hadi
statistics.laerd.com

You might also like