Multinomial logistic regression is used when the dependent variable has more than two nominal categories. This document outlines a study examining factors that influence different types of diabetes. It discusses the assumptions, types, and interpretation of multinomial logistic regression. The example analyzes 145 patients and three independent variables to predict whether they have overt, chemical, or normal diabetes. It finds the variables are significant predictors and 66.7-76.7% of the variation is explained by the model.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
64 views
Multinomial Logistic Regression-3
Multinomial logistic regression is used when the dependent variable has more than two nominal categories. This document outlines a study examining factors that influence different types of diabetes. It discusses the assumptions, types, and interpretation of multinomial logistic regression. The example analyzes 145 patients and three independent variables to predict whether they have overt, chemical, or normal diabetes. It finds the variables are significant predictors and 66.7-76.7% of the variation is explained by the model.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 19
Multinomial Logistic Regression
Supervisor: Sir. Rizwan
Presented by: Yameen University Roll No. 8406 Course: Seminar M.Sc. Statistics (Session 2015 to 2017) Outline o Definition (MLR) o Introduction o Assumption o Types o SPSS o Interpretation o Practical life example Definition Multinomial Logistic Regression is the linear regression analysis to conduct when the dependent variable is nominal with more than two levels. Thus it is an extension of logistic regression, which analyzes dichotomous (binary) dependents. Introduction Multiple logistic regression when the dependent variable is nominal (categorical; meaning that it falls into any one of a set of categories which can't be ordered in any meaning way) and for which there are more than two categories. Example: Consider a study of choice of a health plan from among three plans offered to the employees of a large corporation .the outcome variable has three level of indicating which plan, A.B.C.is chosen possible covariate might include Gender, Age ,Income Family size another
Types
Binary logistic regression
Multiple logistic regression Multinomial logistic regression Detail Of Type Binary variables are variables. which only take two values. For example, Male or Female, True or False and Yes or No. While many variables and questions are naturally binary, it is often useful to construct binary variables from other types of data. multiple logistic regression. when you have one nominal and two or more measurement variables. multinomial logistic regression. when you have nominal variable two or more measurement variables. Model: Log(=+++···+
Where
is the intercept j=1,2,…,k-1
,,…., are regression coefficients
, ,…., are explanatory variables Assumption Dependent variable should be Nominal independent variables that are continuous, nominal Example: We want to study about the factors that have influence on diabetes. Our response variable is diabetes and its categories may be : Chemical diabetes 2) Overt diabetes 3) normal diabetes Many variables have influence on diabetes but we take only three of them : Insulin response (IR) 2) steady- state plasma glucose (SSPG) 3) relative weight (RW). These measurements were taken on 145 volunteers who were subjected to same regimen. z Solution: N: shows the number of observation. Marginal percentage - The marginal percentage lists the proportion of valid observations found in each of the outcome variable's groups. This can be calculated by dividing the N. Valid - This indicates the number of observations in the dataset where the outcome variable and all predictor variables are non-missing. Missing - This indicates the number of observations in the dataset where data are missing from the outcome variable or any of the predictor variables. Total -This indicates the total number of observations in the dataset--the sum of the number of observations in which data are missing and the number of observations with valid data. Results interpretation- Out of 145 people 33 have overt diabetes, 36 have chemical diabetes and 76 have normal diabetes. Thus marginal percentage (33/145)*100 = 22.8% Sig: The p = 0.000 which is less than 0.05 indicate that the regression coefficients are not equal to zero. i.e. all the variables are significant. This table contain the chi-square statistic. This statistic intended to test whether the observed data are consistent with the fitted model. Sig: Shows the p = 1.000 value is greater than 0.05 so we conclude that observed data are consistent with the fitted model. There three R2 that tells the variation in the data. Cox and Snell: is based on the log likelihood for the model compare to the log likelihood for intercept model. Its value is less than one. Nagelkerke: it adjusts the Cox & Snell’s so that the range of possible values extends to one. McFadden: its value depends on the estimated likelihood. Its range is 0 to 1 but never reach 1. Hence the value of R2 = 0.667 , 0.767 and 0.539 which indicate that 66.7% , 76.7% and 53.9% of the variation in response variable is due to explanatory variables. Interpretation: = -1.903 and -7.611 are the average values of response variable when the effect of explanatory variables are zero. = 0.046 it means that the average increase in response variable is 0.046 due to unit increase in SSPG when all the other variables held constant. Sig: The “P” value of SSPG is 0.000 which is less than our significance level set at 5% so we reject our null hypothesis. So we conclude that SSPG has a significant effect on diabetes. 95% Confidence Interval: This is the Confidence Interval for an individual regression coefficient given in the model. For a given predictor with a level of 95% confidence, we would say that we are 95% confident that the "true" population regression coefficient lies in between the lower and upper limit of the Practical example you could use multinomial logistic regression to understand which type of drink consumers prefer based on location in the UK and age the dependent variable would be type of drink and the four categories – Coffee, Soft Drink, Tea and Water – and your Application multinomial logistic regression every field apply when the response variable are categorical form. Then we used multinomial logistic regression .( Field, medical,sports,stduents etc) Reference: www.research.gate.net Regression Analysis by Example 5th edition by Samprit Chatterjee and Ali S. Hadi statistics.laerd.com
(Ebook) Linear Regression Analysis, 2nd edition (Wiley Series in Probability and Statistics) by George A. F. Seber, Alan J. Lee ISBN 9780471415404, 0471415405 - Download the ebook now to never miss important content
(ICSA Book Series in Statistics) Jeffrey R. Wilson, Kent a. Lorenz, Lori P. Selby - Modeling Binary Correlated Responses_ Using SAS, SPSS, R and STATA-Springer (2024)