0% found this document useful (0 votes)

31 views27 pages

Logistic Regression-1

This document discusses logistic regression, which is a statistical model used for binary dependent variables. It covers the rationale behind the logistic model, odds ratios, and how to interpret logistic regression coefficients. Examples are provided to illustrate key concepts like binary and continuous independent variables, as well as multivariate logistic regression.

Uploaded by

Neha Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views27 pages

Logistic Regression-1

Uploaded by

Neha Singh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 27

Logistic Regression

Dr. Gajendra K. Vishwakarma

Associate Scientist-Clinical Research
Lupin Research Park, Pune
LOGISTIC REGRESSION:

E.g.1 Y = Cure / no cure

X= Therapy, Other Pt. Variables [COHORT / RCT]
E.g. 2. Y = Case / Control (cancer / non-cancer)
X = Risk factors [Age, Sex, Smoking, Occupation]
[CASE / CONTROL]
E.g. 3. Y = MI / No MI
X = Risk factors [Age, Sex, family history ....]
[COHORT]
OUTCOME Y IS BINARY
Logistic Model

1 if “success”
Dependent Variable Y = (event)
0 if “failure”
(no event)

Examples: Dead / Alive

Case / Control
Exposed / Non exposed
LOGISTIC MODEL (contd.)

e a  bx
Pr Y  1 | X  
1  e a  bx

where ‘X’ is an independent

variable
RATIONALE
e a  bx
P  Pr Y  1 | X  
1  e a  bx

e a  bx
1
1  P  Pr Y  0 | X   1  
1 e a  bx
1 e a  bx

P
 e a  bx  ODDS
1 P

P
ln  a  bx  LOG ODDS  LOGIT
1 P
ln (P/1-P)
P

0 a X

b = “Slope” a = “Location”
ODDS RATIOS: BINARY X

1 = Exposed
E.g. X =
0 = Unexposed
FOR EXPOSED:
P1
ln  a  b x1  a  b
1 - P1

FOR UNEXPOSED:
P0
ln  a bx0 a
1 - P0
Therefore,
P1 P0
ln - ln  (a  b) - a  b
1 - P1 1 - P0
P1
1 - P1
ln  b
P0
1 - P0
Therefore,

b = ln(Odds Ratio)
or
Odds Ratio = eb
CONTINUOUS X:

E.g. X = Packs per day

b = ln(Odds Ratio) associated with unit increase in X

E.g. 4 Vs 3 packs per day

MULTIPLE LOGISTIC MODEL
For an individual with independent variable values X1,
X2, ....Xk

e a  b1 x1  b2 x 2  ......... bk x k
Pr Y  1 | X 1 , X 2 ,..... X k  
1  e a  b1 x1  b2 x 2  ......... bk x k
P
ln  a  b1 x1  b 2 x2  .......b k xk
1 P

b1 = ln(OR) for X1, adjusted for X2, X3, .....Xk

b2 = ln(OR) for X2, adjusted for X1, X3, .....Xk etc.

INTERPRETATION SIMILAR TO LINEAR REGRESSION,

BUT ON LOGIT SCALE
Age and Coronary Heart Disease Status
(CHD) of 100 subjects
ID AGRP AGE CHD
1 1 20 0
2 1 23 0
3 1 24 1
4 1 25 0
5 1 25 0
. . . .
. . . .
. . . .
97 8 64 0
98 8 64 1
99 8 65 1
100 8 65 1
1.2

1.0
Coronary Heart Disease (CHD)

0.0

-.2
10 20 30 40 50 60 70

AGE
EFFECT OF DATA GROUPING

Frequency table of Age group by CHD

Age group n CHD Mean
Absent Present (Proportion)
20 - 29 10 9 1 0.10
30 - 34 15 13 2 0.13
35 - 39 12 9 3 0.25
40 - 44 15 10 5 0.33
45 - 49 13 7 6 0.46
50 - 54 8 3 5 0.63
55 - 59 17 4 13 0.76
60 - 69 10 2 8 0.80
Total 100 57 43 0.43
MAXIMUM LIKELIHOOD ESTIMATION (MLE)
Maximum Likelihood Estimation (MLE)

A method of estimation (finding the values) for the unknown

parameters () in such a way that it maximizes the probability
of obtaining the observed data set.

e constant   age 
PCHD  yes | Age  
1  e constant   age 
Likelihood Function:
Probability of the observed data is expressed as a function
of unknown parameters. That is,
P(y=1 | X=Age) =  (x) =  (Age)
P(y=0 | X=Age) = 1 -  (x) = 1 -  (Age)

e constant   age  40 
 10 Age  40  
1  e constant   age  40 

Likelihood function for the 10th individual is

10(age = 40)chd-1(1- 10(age = 40)chd-0)

Likelihood Ratio:

Significance: If the predicted values are better or more

accurate than when the variable is not in the model.
That is, the likelihood estimate is directly proportional to the
difference between observed minus expected observation
(O-E).

As you add a variable in the model the likelihood estimate

will go up.
Therefore, to assess the significance of a variable we need to
have likelihood estimate with and without the variable.

G = -2 (loglikelihood for the model without the variable -

loglikelihood for the model with the variable)
G follows Chi-square distribution

The null hypothesis of 1 = 0, slope coefficient can be tested

using Chi-square statistic.A
Example:
The null hypothesis for the table is that the age is not associated
with CHD.
The model is e   1 age
PCHD  1 | age  
0

1  e 0   1 age

The Null hypothesis imply that 1 = 0.

Therefore, the likelihood without age is - 68.322
the likelihood with age is - 53.677
G = -2 (-53.677 - (68.322)) = 29.32
G = 29.31 is chi-square value with 1 d.f.
P (X2 (1) > 29.31) < 0.001
This imply that age is significantly (P<0.001) associated with CHD.
Multivariate Model:

Consider 4 independent variables, age, weight at last menstrual

period (LWT), race and number of first trimester physician visits
(FTV). The dependent variable is low birth weight (LBW).

The 4 independent variables can be represented as follows.

X’ = (age, LWT, Race, FTV)

Where,
Age - Continuous
LWT - Continuous
FTV - Discrete
Race - Polychotomous
White
Black
Other
Therefore, the design variables for RACE are

Design variable
D1 D2
White 0 0
Black 1 0
Other 0 1
Testing for Significance
Significance of the Model:
Assessing the significance of the model means that the test
for overall significance of the 4 variables in the model.
However, one or more variables individually may not
be significant.

The likelihood estimate for 4 variables + constant = -222.583

The likelihood estimate for constant = -234.673
G = {(-222.583) - (-234.673} = 12.09
The P value for the Chi-square test P[24 > 12.1] = 0.033
Significance of the Variables:

Variables in the Equation

B S.E. Wald df Sig. Exp(B)

Step
a
AGE -.023 .034 .483 1 .487 .977
1 LWT -.014 .007 4.751 1 .029 .986
RACE 4.425 2 .109
RACE(1) 1.005 .498 4.078 1 .043 2.732
RACE(2) .434 .362 1.434 1 .231 1.543
FTV -.049 .167 .087 1 .768 .952
Constant 1.287 1.070 1.447 1 .229 3.623
a. Variable(s) entered on step 1: AGE, LWT, RACE, FTV.

Log-likelihood = -222.583
Wald statistic = /SE()
We conclude that the variables LWT and possibly Race are
significant at P < 0.05.
USE OF LR COEFFICIENTS FOR
GENERAL COMPARISIONS OF RISK

General: Compare individuals with X = a to X = b

ln(OR) = 1 (a-b)

 1 a  b 
OR  e
Example:
Risk of birth defect to mothers, Ages 35 +
1 = 0.182 (Age in years)
 = e0.182 = 1.2 (change of 1 Year)
For change of 5 years (E.g. 40 vs 45)

5 1 = 0.91 e0.91 = 2.5

For change of 10 years (E.g. 45 vs 35)

10 1 = 1.82 e1.82 = 6.2

NOTES: Non - linear effect on OR

Linear effect on 1 (ln(OR))

White Topping Report
73% (11)
White Topping Report
21 pages
Multiple Logistic Regression
No ratings yet
Multiple Logistic Regression
71 pages
Sample Certificate of Non-Claim (Car Insurance Claim)
71% (7)
Sample Certificate of Non-Claim (Car Insurance Claim)
1 page
Logistic Regression
100% (1)
Logistic Regression
34 pages
Thesis Using Logistic Regression
100% (2)
Thesis Using Logistic Regression
7 pages
Logistic Regression
100% (2)
Logistic Regression
32 pages
Logistic Regression & Practice
100% (1)
Logistic Regression & Practice
51 pages
L5 Logistic Regression (2011)
100% (1)
L5 Logistic Regression (2011)
55 pages
Regression Logistic Regression
100% (1)
Regression Logistic Regression
37 pages
5.1) Binary Logistic Regression
No ratings yet
5.1) Binary Logistic Regression
32 pages
Logistic Regression
100% (1)
Logistic Regression
21 pages
Logistic Regression
100% (1)
Logistic Regression
37 pages
ODI Interview Questions and Answers
88% (8)
ODI Interview Questions and Answers
13 pages
Binary Logistic Regression Concept
No ratings yet
Binary Logistic Regression Concept
10 pages
Problem Set 1 - Simple Interest
50% (2)
Problem Set 1 - Simple Interest
2 pages
Lect7 Math231
No ratings yet
Lect7 Math231
29 pages
Logistics Regression
No ratings yet
Logistics Regression
30 pages
Lecture 5. Part 1 - Regression Analysis
No ratings yet
Lecture 5. Part 1 - Regression Analysis
28 pages
Standard Requirements For Tourist Land, Water &
100% (1)
Standard Requirements For Tourist Land, Water &
29 pages
Heart Disease App With Code
No ratings yet
Heart Disease App With Code
22 pages
Lecture13 PDF
No ratings yet
Lecture13 PDF
48 pages
IMA2023109 - Imagine Invoice 132432 - Thecaratshop
No ratings yet
IMA2023109 - Imagine Invoice 132432 - Thecaratshop
1 page
Logistic Regression (2022)
No ratings yet
Logistic Regression (2022)
44 pages
HFHFH
No ratings yet
HFHFH
37 pages
Lecture3-Logistic Regression 6-5-08
No ratings yet
Lecture3-Logistic Regression 6-5-08
72 pages
02 Simple-Logistic-Regression-An-Overview Simple Logistic Regression
No ratings yet
02 Simple-Logistic-Regression-An-Overview Simple Logistic Regression
86 pages
Module 4 - Logistic Regression - Afterclass1b
No ratings yet
Module 4 - Logistic Regression - Afterclass1b
54 pages
Factors Determining Weight-For-Age Status of Under-Five Children:-Multiple Logistic Regression Analysis
No ratings yet
Factors Determining Weight-For-Age Status of Under-Five Children:-Multiple Logistic Regression Analysis
14 pages
Logistic Regression: Interaction Terms
No ratings yet
Logistic Regression: Interaction Terms
23 pages
Home Lesson 15: Logistic, Poisson & Nonlinear Regression
No ratings yet
Home Lesson 15: Logistic, Poisson & Nonlinear Regression
32 pages
Logistic Regression Playbook
No ratings yet
Logistic Regression Playbook
19 pages
Introduction To Logistic Regression: Rachid Salmi, Jean-Claude Desenclos, Alain Moren, Thomas Grein
No ratings yet
Introduction To Logistic Regression: Rachid Salmi, Jean-Claude Desenclos, Alain Moren, Thomas Grein
36 pages
Introduction To Logistic Regression: Rachid Salmi, Jean-Claude Desenclos, Thomas Grein, Alain Moren
No ratings yet
Introduction To Logistic Regression: Rachid Salmi, Jean-Claude Desenclos, Thomas Grein, Alain Moren
38 pages
Logistic Regression Analysis
No ratings yet
Logistic Regression Analysis
48 pages
Psy 512 Logistic Regression
No ratings yet
Psy 512 Logistic Regression
12 pages
Logistic Regression
No ratings yet
Logistic Regression
9 pages
Logistic Regression
No ratings yet
Logistic Regression
15 pages
Logistic Regression-Advanced Biostat PDF
No ratings yet
Logistic Regression-Advanced Biostat PDF
86 pages
Regresion Logistica
No ratings yet
Regresion Logistica
71 pages
Final Cc01 Group05-1
No ratings yet
Final Cc01 Group05-1
26 pages
Classification With Logistic Regression: DR Sandipan Karmakar Mnit Jaipur
No ratings yet
Classification With Logistic Regression: DR Sandipan Karmakar Mnit Jaipur
54 pages
T3. Logistic Regressions
No ratings yet
T3. Logistic Regressions
3 pages
Graphing Motion
No ratings yet
Graphing Motion
30 pages
Modeling Ordinal Categorical Data (Agresti)
No ratings yet
Modeling Ordinal Categorical Data (Agresti)
71 pages
Logistic Regression Models: Series: Basic Statistics For Busy Clinicians (Vii)
No ratings yet
Logistic Regression Models: Series: Basic Statistics For Busy Clinicians (Vii)
11 pages
MEB632 Assignment February 2023 (Part-Time and Distance)
No ratings yet
MEB632 Assignment February 2023 (Part-Time and Distance)
7 pages
Bio2 Module 5 - Logistic Regression
No ratings yet
Bio2 Module 5 - Logistic Regression
19 pages
18logistic Regression Yilma
No ratings yet
18logistic Regression Yilma
88 pages
Final Prac So LN
No ratings yet
Final Prac So LN
26 pages
Logistic Regression: Logistic Regression and The New: Residual Logistic Regression
No ratings yet
Logistic Regression: Logistic Regression and The New: Residual Logistic Regression
31 pages
2 Dealing With Logistic Regression
No ratings yet
2 Dealing With Logistic Regression
4 pages
Binary Logistic Regression - 6.2
No ratings yet
Binary Logistic Regression - 6.2
34 pages
Chapter 8 Logistic Regression (Compatibility Mode)
No ratings yet
Chapter 8 Logistic Regression (Compatibility Mode)
22 pages
Regresi Logistik
No ratings yet
Regresi Logistik
34 pages
3 Classification
No ratings yet
3 Classification
26 pages
Logistic Regression
No ratings yet
Logistic Regression
23 pages
T12 Logistic Regression
No ratings yet
T12 Logistic Regression
5 pages
Logistic Regression
No ratings yet
Logistic Regression
25 pages
Predictive Modeling: Logistic Regression
No ratings yet
Predictive Modeling: Logistic Regression
13 pages
Logistic Regression
No ratings yet
Logistic Regression
5 pages
Logistic Regression - 2021 ch-8
No ratings yet
Logistic Regression - 2021 ch-8
52 pages
M8 Logreg
No ratings yet
M8 Logreg
10 pages
Minitab Tip Sheet 15
No ratings yet
Minitab Tip Sheet 15
5 pages
Laboratory 10
No ratings yet
Laboratory 10
8 pages
BHS Inggris Xi Sem-1 TP 2021-2022
No ratings yet
BHS Inggris Xi Sem-1 TP 2021-2022
8 pages
302 F 14 Logistic Regression
No ratings yet
302 F 14 Logistic Regression
23 pages
300 Ohm Twin-Lead J-Pole Portable Antenna
No ratings yet
300 Ohm Twin-Lead J-Pole Portable Antenna
3 pages
Madrid Protocol TMR
No ratings yet
Madrid Protocol TMR
21 pages
Fourier Analysis-A Signal Processing Approach
No ratings yet
Fourier Analysis-A Signal Processing Approach
14 pages
Signature Assignment Art Analysis-Final Paper
No ratings yet
Signature Assignment Art Analysis-Final Paper
5 pages
Matrikulasi - 2
No ratings yet
Matrikulasi - 2
37 pages
Control Clinical Trial
No ratings yet
Control Clinical Trial
14 pages
FULL Version Testbank Coordinate Geometry For JEE Advanced 3rd Edition G Tewani Multiple Formats
No ratings yet
FULL Version Testbank Coordinate Geometry For JEE Advanced 3rd Edition G Tewani Multiple Formats
409 pages
Phrasal Verbs
No ratings yet
Phrasal Verbs
20 pages
Mohammed Azhar Ali Anjum - Quote
No ratings yet
Mohammed Azhar Ali Anjum - Quote
4 pages
Random Details
No ratings yet
Random Details
2 pages
Better Homes & Gardens 8 Cube Organizer EN
No ratings yet
Better Homes & Gardens 8 Cube Organizer EN
26 pages
Kerry Anderson Resume 2017 Weebly
No ratings yet
Kerry Anderson Resume 2017 Weebly
3 pages
SC9b - Homework
No ratings yet
SC9b - Homework
6 pages
FV - Pitch Deck - Company Name
No ratings yet
FV - Pitch Deck - Company Name
12 pages
REVSTAT - v19 n2 06
No ratings yet
REVSTAT - v19 n2 06
16 pages
Bell, SOME EXPERIMENTS IN DIAGNOSTIC TEACHING
No ratings yet
Bell, SOME EXPERIMENTS IN DIAGNOSTIC TEACHING
23 pages
Machine Design, Vol.4 (2012) No.2, ISSN 1821-1259 Pp. 103-106
No ratings yet
Machine Design, Vol.4 (2012) No.2, ISSN 1821-1259 Pp. 103-106
4 pages
Computing The Effect of Measurement Errors On The Use of Auxiliary Information Under Systematic Sampling
No ratings yet
Computing The Effect of Measurement Errors On The Use of Auxiliary Information Under Systematic Sampling
19 pages
Performance Evaluation of Novel Logarithmic Estimators Under Correlated Measurement Errors
No ratings yet
Performance Evaluation of Novel Logarithmic Estimators Under Correlated Measurement Errors
12 pages
Importance of Statistics
No ratings yet
Importance of Statistics
16 pages
Measurement Error For Factor Class of Estimator
No ratings yet
Measurement Error For Factor Class of Estimator
14 pages
Ict2611 Octnov24
No ratings yet
Ict2611 Octnov24
15 pages
Literature Review Last Edit
No ratings yet
Literature Review Last Edit
11 pages
Re Vista Publish PPR
No ratings yet
Re Vista Publish PPR
11 pages
Punzalan, Joshua Mitchell L. Case-Scenarios-NICU
No ratings yet
Punzalan, Joshua Mitchell L. Case-Scenarios-NICU
2 pages
IoT Quantum Computing A Future Concept
No ratings yet
IoT Quantum Computing A Future Concept
8 pages
1
No ratings yet
1
5 pages
Action Reesearch Webinar CPD Certificate April 2025
No ratings yet
Action Reesearch Webinar CPD Certificate April 2025
5 pages
Project Brief 1
No ratings yet
Project Brief 1
2 pages
Plot Exponential Distribution
No ratings yet
Plot Exponential Distribution
2 pages

Logistic Regression-1

Uploaded by

Logistic Regression-1

Uploaded by

Logistic Regression

Dr. Gajendra K. Vishwakarma

E.g.1 Y = Cure / no cure

Examples: Dead / Alive

where ‘X’ is an independent

E.g. X = Packs per day

b = ln(Odds Ratio) associated with unit increase in X

E.g. 4 Vs 3 packs per day

b1 = ln(OR) for X1, adjusted for X2, X3, .....Xk

INTERPRETATION SIMILAR TO LINEAR REGRESSION,

Frequency table of Age group by CHD

A method of estimation (finding the values) for the unknown

Likelihood function for the 10th individual is

10(age = 40)chd-1(1- 10(age = 40)chd-0)

Significance: If the predicted values are better or more

As you add a variable in the model the likelihood estimate

G = -2 (loglikelihood for the model without the variable -

The null hypothesis of 1 = 0, slope coefficient can be tested

The Null hypothesis imply that 1 = 0.

Consider 4 independent variables, age, weight at last menstrual

The 4 independent variables can be represented as follows.

X’ = (age, LWT, Race, FTV)

The likelihood estimate for 4 variables + constant = -222.583

Variables in the Equation

B S.E. Wald df Sig. Exp(B)

General: Compare individuals with X = a to X = b

5 1 = 0.91 e0.91 = 2.5

For change of 10 years (E.g. 45 vs 35)

10 1 = 1.82 e1.82 = 6.2

NOTES: Non - linear effect on OR

You might also like