Logistic Regression

The document outlines a training session on Logistic Regression, covering topics such as classification problems, fitting lines, and interpreting coefficients. It includes examples of predicting machine failures, employee resignations, and patient re-admissions, along with exercises for understanding odds ratios and model coefficients. The session also emphasizes the logistic function and its application in predicting probabilities.

Logistic Regression - Day 1

Agenda
In this session, you will learn:

01 Classification problems
02 Solving classification problems using linear regression
03 Interpreting the coefficients of a logistic model
04 Finding out the optimal coefficients
05 Python demo
Classification Problems

Predict if a machine will fail in the next 14 days.

VibrationX_14day VibrationY_14day VibrationZ_14day Failed


… … … Yes

… … … No
Classification Problems

The HR department of a company wants to understand which employees are at risk of resigning.

# promotions Current salary Market Salary Resigned


... ... ... Yes

... ... ... No


Classification Problems

Can we predict which patients are at risk of re-admission?

Patient ID  Age  Gender  …
001         23   M       …
Classification Problems: Class Exercise

Take five minutes and discuss two scenarios in which the prediction problem is a classification problem. Also discuss what kind of data you would need to collect.

• Customer churn prediction: demographics, past association with the product, and number of complaints registered
• Predicting fraud: demographics, financial history, and circumstances of the transaction
Fitting Lines
Supervised Learning: Fitting lines

Feature 1 (Age_Normalized)  Feature 2 (Income_Normalized)  Response (Good/Bad)
…                           …                              Good = 1
…                           …                              Bad = 0

Fit a model of the form: Response = b0 + b1·Feature1


Supervised Learning: Fitting lines

We fit a straight line to the data, where the response is binary in nature: y ∈ {0, 1}.

Notice the predictions (shown in red on the slide):

1. Some predictions are < 0 or > 1
2. Others satisfy 0 < prediction < 1

We are trying to fit:

ŷ = β0 + β1X1 + β2X2 + ε = f(X1, X2)

The problem with fitting a linear model is:
f(X1, X2) ∈ (−∞, +∞), but y ∈ {0, 1}
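The mismatch can be seen with a tiny sketch. This fits an ordinary least-squares line to hypothetical binary data (plain Python, no libraries) and shows the fitted line predicting values outside [0, 1]:

```python
# Least-squares line fit on a binary response (hypothetical toy data).
# Demonstrates that a straight line can predict values outside [0, 1].
xs = [1, 2, 3, 4, 5, 6]   # feature
ys = [0, 0, 0, 1, 1, 1]   # binary response

n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n

# Ordinary least squares: slope = cov(x, y) / var(x)
b1 = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / \
     sum((x - mean_x) ** 2 for x in xs)
b0 = mean_y - b1 * mean_x

preds = [b0 + b1 * x for x in xs]
print(preds[0])    # negative: below the valid range for a probability
print(preds[-1])   # greater than 1: above the valid range
```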
Supervised Learning: Fitting lines

Age  Target          Age  Proportion(1)
20   1               20   0.50
20   0               21   0.50
21   1
21   0
Supervised Learning: Fitting lines

Instead of estimating 𝑦∈{0,1} we can try to estimate


the Prob(y=1) = 𝑝 ̂
𝑝 ̂∈(0,1)

These estimates make sense now.

We are trying to fit:


𝑝 ̂ = β0+β1𝑋1+β2𝑋2+ε = 𝑓(𝑋1,𝑋2)

The problem still is:


𝑓(𝑋1,𝑋2) ∈ (−∞,+∞)
𝑝 ̂ ∈ (0,1)
Supervised Learning: Fitting lines

Age  Target          Age  Pr(1)  Pr/(1−Pr)
20   1               20   0.75   3
20   1               21   0.50   1
20   1
20   0
21   1
21   0
Supervised Learning: Fitting lines

Age  Target          Age  Pr(1)  Pr/(1−Pr)
20   1               20   0.75   3
20   1               21   0.50   1
20   1
20   0
21   1
21   0

What are the minimum and maximum values of Pr/(1−Pr)?

Minimum value of Pr(1)? Pr(1) = 0
Maximum value of Pr(1)? Pr(1) = 1

At Pr(1) = 0, Pr/(1−Pr) = 0
As Pr(1) → 1, Pr/(1−Pr) → Infinity

0 <= Pr/(1−Pr) < Infinity
Supervised Learning: Fitting lines

Instead of estimating y ∈ {0, 1}, we can try to estimate the odds p̂/(1−p̂):

p̂/(1−p̂) ∈ (0, +∞)

These estimates make sense now.

We are trying to fit:

p̂/(1−p̂) = β0 + β1X1 + β2X2 + ε = f(X1, X2)

The problem still is:
f(X1, X2) ∈ (−∞, +∞), but p̂/(1−p̂) ∈ (0, +∞)
Supervised Learning: Fitting lines

Age  Target          Age  Pr(1)  Pr/(1−Pr)  log(Pr/(1−Pr))
20   1               20   0.75   3          1.10
20   1               21   0.50   1          0.00
20   1
20   0
21   1
21   0

0 <= Pr/(1−Pr) < Infinity
−Infinity < log(Pr/(1−Pr)) < Infinity
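The aggregation can be reproduced in a few lines of Python, treating the slide's (Age, Target) rows as toy data:

```python
import math
from collections import defaultdict

# Toy data: (Age, Target) pairs from the slide.
rows = [(20, 1), (20, 1), (20, 1), (20, 0), (21, 1), (21, 0)]

counts = defaultdict(lambda: [0, 0])   # age -> [n_ones, n_total]
for age, target in rows:
    counts[age][0] += target
    counts[age][1] += 1

for age in sorted(counts):
    ones, total = counts[age]
    p = ones / total            # Pr(1)
    odds = p / (1 - p)          # Pr/(1-Pr)
    print(age, p, odds, math.log(odds))
```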


Supervised Learning: Fitting lines

Instead of estimating y ∈ {0, 1}, we can try to estimate log(p̂/(1−p̂)):

log(p̂/(1−p̂)) ∈ (−∞, +∞)

All these estimates make sense now.

We are trying to fit:

log(p̂/(1−p̂)) = β0 + β1X1 + β2X2 + ε = f(X1, X2)

f(X1, X2) ∈ (−∞, +∞) and log(p̂/(1−p̂)) ∈ (−∞, +∞)
Supervised Learning: Fitting lines

log(p̂/(1−p̂)) = β0 + β1X1 + β2X2 + ε

p̂ = e^(β0 + β1X1 + β2X2 + ε) / (1 + e^(β0 + β1X1 + β2X2 + ε))

This is the logistic function.
Classification Problems: Class Exercise

Imagine that there are 125 customers of age 25 years. Of them, 25 have subscribed to a premium subscription of an OTT platform. Find out the odds ratio.

p = 25/125, 1 − p = 100/125, so odds ratio = p/(1 − p) = (25/125) × (125/100) = 1/4
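The arithmetic checks out directly:

```python
# Odds for the OTT exercise: 25 subscribers out of 125 customers.
p = 25 / 125          # probability of a premium subscription
odds = p / (1 - p)    # p / (1 - p)
print(odds)           # 0.25, i.e. 1/4
```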
Interpreting Coefficients of a Logistic Model
Interpreting Logistic Regression Coefficients

Predicting churn based on a person’s age

log(p/(1−p)) = β0 + β1X1

log(p/(1−p)) = 2.1 + 0.08·Age

Age Churned

28 Yes

32 Yes

40 No
Interpreting Logistic Regression Coefficients

Predicting churn based on a person’s age

log(p/(1−p)) = β0 + β1X1

log(p/(1−p)) = 2.1 + 0.08·Age

Changing Age by 1 unit


will change the log odds
of someone churning by
0.08 units.
Interpreting Logistic Regression Coefficients

Predicting churn based on a person’s age

log(p/(1−p)) = β0 + β1X1

For Age = 20: log(p/(1−p)) = 2.1 + 0.08 × 20
Interpreting Logistic Regression Coefficients

Predicting churn based on a person’s age

log(p/(1−p)) = β0 + β1X1

log(p/(1−p)) = 2.1 + 1.6


Interpreting Logistic Regression Coefficients

Predicting churn based on a person’s age

log(p/(1−p)) = β0 + β1X1

log(p/(1−p)) = 3.7
Interpreting Logistic Regression Coefficients

Predicting churn based on a person’s age

log(p/(1−p)) = β0 + β1X1

Exponentiating both sides: p/(1−p) = e^3.7
Interpreting Logistic Regression Coefficients

Predicting churn based on a person’s age

log(p/(1−p)) = β0 + β1X1

p/(1−p) = e^3.7 ≈ 2.71^3.7
Interpreting Logistic Regression Coefficients

Predicting churn based on a person’s age

log(p/(1−p)) = β0 + β1X1

p/(1−p) ≈ 39.99
Interpreting Logistic Regression Coefficients

Predicting churn based on a person’s age

log(p/(1−p)) = β0 + β1X1

p/(1−p) ≈ 39.99, so p = 39.99/(1 + 39.99) ≈ 0.98
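The whole chain above (log-odds → odds → probability) for a 20-year-old, using the slide's coefficients 2.1 and 0.08:

```python
import math

b0, b1 = 2.1, 0.08          # coefficients from the slide
age = 20

log_odds = b0 + b1 * age    # 2.1 + 1.6 = 3.7
odds = math.exp(log_odds)   # ~40.4 (the slide's 39.99 comes from using e ~ 2.71)
p = odds / (1 + odds)       # convert odds to a probability
print(round(p, 2))          # 0.98
```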
Class Exercise

• Imagine you were trying to model the propensity of customers to churn, and you were able to build the following logistic regression model:

  log(p/(1−p)) = 1.95 − 2.5·Age + 1.68·Income

• Can we conclude that as age increases, the propensity for a customer to churn decreases, keeping all else constant?
• What would be the churn probability for a person aged 25 with an income of 15000?
Class Exercise

• Imagine you were trying to model the propensity of customers to churn, and you were able to build the following logistic regression model:

  log(p/(1−p)) = 0.0001 − 0.005·Age + 0.008·Income

• Can we conclude that as age increases, the propensity for a customer to churn decreases, keeping all else constant? (Yes: the coefficient of Age is negative.)
• What would be the probability of churn for a person with age 25 and income 150? (p = E/(1 + E), where E = e^(0.0001 − 0.005×25 + 0.008×150), so p ≈ 0.75.)
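The answer to the second question can be verified directly:

```python
import math

# Model from the exercise: log(p/(1-p)) = 0.0001 - 0.005*Age + 0.008*Income
z = 0.0001 - 0.005 * 25 + 0.008 * 150   # log-odds for age 25, income 150
E = math.exp(z)
p = E / (1 + E)
print(round(p, 2))                       # 0.75
```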
Finding Out the Optimal Coefficients
Estimating Coefficients of Logistic Model

log(p̂/(1−p̂)) = β0 + β1·Age

p̂ = e^(β0 + β1·Age) / (1 + e^(β0 + β1·Age))

Age  Good_Bad (Good = 1)  Prediction
20   1
21   1
24   0
25   0
29   0
30   1
38   1

Candidate coefficients: β0 = 0.7, β1 = 1.7, giving
p̂ = e^(0.7 + 1.7·Age) / (1 + e^(0.7 + 1.7·Age))
Estimating Coefficients of Logistic Model

log(p̂/(1−p̂)) = β0 + β1·Age

p̂ = e^(β0 + β1·Age) / (1 + e^(β0 + β1·Age))

Age  Good_Bad (Good = 1)  Prediction
20   1
21   1
24   0
25   0
29   0
30   1
38   1

Two candidate coefficient sets: β0 = 0.7, β1 = 1.7 and β0 = 0.3, β1 = 2.2
Estimating Coefficients of Logistic Model

log(p̂/(1−p̂)) = β0 + β1·Age

p̂ = e^(β0 + β1·Age) / (1 + e^(β0 + β1·Age))

Predictions for the candidate β0 = 0.3, β1 = 2.2:

Age  Good_Bad (Good = 1)  Prediction
20   1                    0.70
21   1                    0.60
24   0                    0.50
25   0                    0.45
29   0                    0.70
30   1                    0.62
38   1                    0.40
Estimating Coefficients of Logistic Model

log(p̂/(1−p̂)) = β0 + β1·Age

p̂ = e^(β0 + β1·Age) / (1 + e^(β0 + β1·Age))

Predictions for the candidate β0 = 0.3, β1 = 2.2:

Age  Good_Bad (Good = 1)  Prediction
20   1                    0.70
21   1                    0.60
24   0                    0.50
25   0                    0.45
29   0                    0.70
30   1                    0.62
38   1                    0.40

Clearly β0 = 0.7, β1 = 1.7 is a better choice than β0 = 0.3, β1 = 2.2.
Estimating Coefficients of Logistic Model

• One would like to choose the model coefficients so that the model gives a high score to events and a low score to non-events.
• But how will we measure a model's ability to assign a high score to events and a low score to non-events? With a cost function.

See the Excel sheet logistic_cost.xlsx.
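One standard choice of cost function is the log loss (negative log-likelihood), which is what logistic regression minimizes. A sketch using the slide's labels and predicted probabilities for one candidate model, plus a second, hypothetical, better-aligned prediction set for comparison:

```python
import math

def log_loss(y_true, p_pred):
    """Mean negative log-likelihood: small when events get high scores
    and non-events get low scores."""
    return -sum(y * math.log(p) + (1 - y) * math.log(1 - p)
                for y, p in zip(y_true, p_pred)) / len(y_true)

y = [1, 1, 0, 0, 0, 1, 1]                           # Good_Bad labels from the slide
p_a = [0.70, 0.60, 0.50, 0.45, 0.70, 0.62, 0.40]    # predictions from the slide
p_b = [0.90, 0.80, 0.20, 0.20, 0.30, 0.80, 0.70]    # hypothetical better-aligned set

print(round(log_loss(y, p_a), 2))   # 0.68
print(round(log_loss(y, p_b), 2))   # lower: these scores track the labels better
```

Lower log loss means the predicted probabilities agree better with the observed labels, which is exactly the criterion used to pick the optimal coefficients.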
Thank You!

Copyright © HeroX Private Limited, 2023. All rights reserved.
