Logistic Regression

Logistic regression is used for classification problems where the response variable is categorical. It models the probability of an event occurring versus not occurring. Some examples include predicting loan defaults, fraud detection, customer churn, and propensity to buy models. Unlike linear regression, which predicts absolute values, logistic regression predicts probabilities. It uses a sigmoid function to map predictor variable values to a probability between 0 and 1. Model parameters are estimated using maximum likelihood estimation to minimize the error between predicted and actual probabilities. Thresholds can be selected using methods like ROC curves to optimize sensitivity and specificity for classification.


Logistic regression model

Case studies for choice models

Choice models cater to cases where the response variable is categorical:
- Home loan / credit card / consumer loan defaults {default vs. no default}
- Fraud detection {fraud vs. no fraud}
- Customer churn analysis {churn vs. no churn}
- Propensity-to-buy models {buy vs. no buy}

Linear regression is a bad choice when response variables are categorical

- Clearly the simplest model could be y = 1 when tumor size is greater than 5
- In the first model one could do that by saying y_predicted > 0.5
- Adding a few more grey points should not result in a new model or a new line, because in reality the cut has not changed (see the sketch below)
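
A minimal sketch of this point, using made-up tumor-size data (the sizes, labels, and the boundary at 5 are illustrative): thresholding a least-squares line at y_predicted > 0.5 gives a cut near 5 at first, but adding a few clearly malignant points far to the right moves the fitted line, and with it the implied cut, even though the true boundary has not changed.

```python
import numpy as np

def linear_cutoff(sizes, labels):
    """Fit y = a*size + b by least squares and return the size where y_hat = 0.5."""
    a, b = np.polyfit(sizes, labels, 1)
    return (0.5 - b) / a

# Illustrative data: benign (0) below size 5, malignant (1) above it
sizes = np.array([1, 2, 3, 4, 6, 7, 8, 9], dtype=float)
labels = np.array([0, 0, 0, 0, 1, 1, 1, 1], dtype=float)
print("implied cut:", linear_cutoff(sizes, labels))          # 5.0 by symmetry

# Add a few more obviously malignant points far to the right
sizes2 = np.append(sizes, [20.0, 22.0, 25.0])
labels2 = np.append(labels, [1.0, 1.0, 1.0])
print("implied cut after adding points:", linear_cutoff(sizes2, labels2))  # drifts above 6
```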

General structure for choice models

Example: home loan default. The predictor variables X are mapped to a probability of the event (a sketch follows the list).

X (predictors):
- Income
- Debt to income
- Default on other loans
- Salaried vs. business
- Expense to income
- Credit score

Output: probability of default
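
As an illustration only, here is how that structure might look in code: a hypothetical home-loan dataset with the predictors listed above, fitted with scikit-learn's LogisticRegression to produce a probability of default. All column names and values are invented for the sketch.

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression

# Hypothetical applicants described by the slide's predictors
X = pd.DataFrame({
    "income":            [45_000, 120_000, 30_000, 80_000, 25_000, 95_000],
    "debt_to_income":    [0.60,   0.15,    0.75,   0.30,   0.80,   0.20],
    "defaulted_before":  [1,      0,       1,      0,      1,      0],
    "salaried":          [0,      1,       0,      1,      0,      1],   # 1 = salaried, 0 = business
    "expense_to_income": [0.90,   0.40,    0.95,   0.55,   0.92,   0.45],
    "credit_score":      [580,    760,     540,    700,    520,    740],
})
y = [1, 0, 1, 0, 1, 0]   # 1 = default, 0 = no default

model = LogisticRegression(max_iter=1000).fit(X, y)
print(model.predict_proba(X)[:, 1])   # probability of default for each applicant
```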

Logistic regression model

Instead of predicting an absolute value, we predict the probability of an event. The sigmoid function maps any score z to a probability between 0 and 1:

P(z) = 1/(1+exp(-z))

[Figure: sigmoid curve of probability of cancer vs. tumor size, rising from 0 toward 1 as tumor size increases]
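
A small sketch of the sigmoid above: it squashes any real-valued score z into a probability between 0 and 1. The coefficients below are hypothetical, chosen only so the curve crosses 0.5 near tumor size 5 as in the earlier example.

```python
import numpy as np

def sigmoid(z):
    """P(z) = 1 / (1 + exp(-z)) from the slide."""
    return 1.0 / (1.0 + np.exp(-z))

b0, b1 = -5.0, 1.0                      # illustrative coefficients: z = b0 + b1 * size
for size in [0, 2, 4, 5, 6, 8, 10, 14]:
    p = sigmoid(b0 + b1 * size)
    print(f"tumor size {size:>2} -> P(cancer) = {p:.3f}")
```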

Error function (analogy)

- When y = 0 the error is (p - 0) = p; when y = 1 the error is (1 - p)
- Both cases combine into a single expression per observation: minimize p^(1-y) * (1-p)^y
- Equivalently, maximize p^y * (1-p)^(1-y), which is roughly the likelihood used in MLE
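
A quick numeric check of the analogy (the probabilities are chosen arbitrarily): the error term p^(1-y) * (1-p)^y is large when the predicted probability p disagrees with the label y, while the likelihood term p^y * (1-p)^(1-y) is large when they agree, so minimizing one is the same as maximizing the other.

```python
for y in (0, 1):
    for p in (0.1, 0.9):
        error = p**(1 - y) * (1 - p)**y           # small when p matches y
        likelihood = p**y * (1 - p)**(1 - y)      # large when p matches y
        print(f"y={y}, p={p}: error={error:.1f}, likelihood={likelihood:.1f}")
```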

MLE (Maximum Likelihood)

Estimate the parameters using maximum likelihood:

Max over β:  Σ_i [ y_i ln(p(z_i)) + (1 - y_i) ln(1 - p(z_i)) ]

where z_i = β·x_i
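
A minimal sketch of this estimation step, assuming simple gradient ascent on the log-likelihood (real work would use statsmodels or scikit-learn; the toy tumor data is illustrative):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def fit_mle(X, y, lr=0.1, steps=5000):
    """Maximize sum_i [y_i*ln(p(z_i)) + (1-y_i)*ln(1-p(z_i))] with z_i = beta . x_i."""
    X = np.column_stack([np.ones(len(X)), X])    # prepend an intercept column
    beta = np.zeros(X.shape[1])
    for _ in range(steps):
        p = sigmoid(X @ beta)
        gradient = X.T @ (y - p)                 # gradient of the log-likelihood
        beta += lr * gradient / len(y)
    return beta

# Toy data: tumor sizes and malignant (1) / benign (0) labels
X = np.array([1.0, 2.0, 3.0, 4.0, 6.0, 7.0, 8.0, 9.0]).reshape(-1, 1)
y = np.array([0, 0, 0, 0, 1, 1, 1, 1])
beta = fit_mle(X, y)
print("beta:", beta)
print("P(y=1 | size=5):", sigmoid(beta[0] + beta[1] * 5.0))   # 0.5 by symmetry
```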

Churn Model Example

Setting a threshold for classification

Cases scoring above the threshold are classified as positive, those below as negative.

- High threshold -> high accuracy, low capture
- Low threshold -> low accuracy, high capture (see the sketch below)
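
A sketch of this trade-off, reading "accuracy" as precision among the cases flagged positive and "capture" as the share of actual positives caught; the scores and labels are made up, only the direction of the effect matters.

```python
import numpy as np

scores = np.array([0.95, 0.90, 0.80, 0.70, 0.60, 0.45, 0.40, 0.30, 0.20, 0.10])
actual = np.array([1,    1,    1,    0,    1,    0,    1,    0,    0,    0])

for threshold in (0.75, 0.35):
    flagged = scores >= threshold
    precision = (actual[flagged] == 1).mean()    # "accuracy" among flagged cases
    capture = flagged[actual == 1].mean()        # share of actual positives captured
    print(f"threshold={threshold}: accuracy={precision:.2f}, capture={capture:.2f}")
```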

Picking a threshold: KS chart

- Divide the population into deciles
- Take the upper limit of each decile and plot the cumulative percentage of good and bad examples
- Pick the score/threshold of the decile where the separation between good and bad is at its maximum (a sketch follows)
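
A sketch of the KS procedure above using pandas: split the scored population into deciles, accumulate the percentage of bad (y = 1) and good (y = 0) cases from the highest-scoring decile downward, and take the cut-off of the decile where the gap is widest. The scores and labels here are simulated stand-ins for real model output.

```python
import numpy as np
import pandas as pd

def ks_threshold(scores, y):
    df = pd.DataFrame({"score": scores, "y": y})
    df["decile"] = pd.qcut(df["score"], 10, labels=False, duplicates="drop")
    table = (df.groupby("decile")
               .agg(cutoff=("score", "min"), bad=("y", "sum"), total=("y", "size"))
               .sort_index(ascending=False))           # highest-score decile first
    table["good"] = table["total"] - table["bad"]
    table["cum_bad_pct"] = table["bad"].cumsum() / table["bad"].sum()
    table["cum_good_pct"] = table["good"].cumsum() / table["good"].sum()
    table["ks"] = table["cum_bad_pct"] - table["cum_good_pct"]
    best = table["ks"].idxmax()
    return table.loc[best, "cutoff"], table.loc[best, "ks"]

rng = np.random.default_rng(0)
y = rng.integers(0, 2, 1000)
scores = np.clip(0.3 + 0.4 * y + rng.normal(0, 0.15, 1000), 0, 1)   # simulated scores
cutoff, ks = ks_threshold(scores, y)
print(f"max KS = {ks:.2f} at score cutoff = {cutoff:.2f}")
```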

Truth table to measure accuracy

True Positive Rate = True Positives / Total Actual True (sensitivity)
True Negative Rate = True Negatives / Total Actual False (specificity)

                    Actual True       Actual False
Predicted True      True Positive     False Positive
Predicted False     False Negative    True Negative
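
A short sketch of the truth table and the two rates above, assuming predicted labels have already been produced by thresholding the model's probabilities (the labels below are illustrative):

```python
import numpy as np

actual    = np.array([1, 1, 1, 1, 0, 0, 0, 0, 1, 0])
predicted = np.array([1, 1, 0, 1, 0, 0, 1, 0, 1, 0])

tp = np.sum((predicted == 1) & (actual == 1))
fp = np.sum((predicted == 1) & (actual == 0))
tn = np.sum((predicted == 0) & (actual == 0))
fn = np.sum((predicted == 0) & (actual == 1))

sensitivity = tp / (tp + fn)   # true positive rate: TP / total actual true
specificity = tn / (tn + fp)   # true negative rate: TN / total actual false
print(f"TP={tp} FP={fp} TN={tn} FN={fn}")
print(f"sensitivity={sensitivity:.2f}, specificity={specificity:.2f}")
```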

Max sensitivity and specificity

Choose the threshold where sensitivity and specificity are jointly maximized (one common rule is sketched below).
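
One common way to make "jointly maximized" concrete is to pick the threshold with the highest sensitivity + specificity (Youden's J); the slide does not name a specific rule, so this is one reasonable reading. A sketch with scikit-learn's roc_curve on simulated scores:

```python
import numpy as np
from sklearn.metrics import roc_curve

rng = np.random.default_rng(1)
y_true = rng.integers(0, 2, 500)
y_score = np.clip(0.3 + 0.4 * y_true + rng.normal(0, 0.2, 500), 0, 1)  # simulated scores

fpr, tpr, thresholds = roc_curve(y_true, y_score)
j = tpr + (1 - fpr) - 1                    # sensitivity + specificity - 1
best = np.argmax(j)
print(f"chosen threshold = {thresholds[best]:.2f} "
      f"(sensitivity = {tpr[best]:.2f}, specificity = {1 - fpr[best]:.2f})")
```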

Goodness of fit: ROC curve

- The dotted line represents the case where the model has not learnt anything, i.e. it picks the same percentage of false positives as of true positives
- The area under the blue curve therefore represents the goodness of fit (0.5 < Area < 1)
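
A sketch of this check with scikit-learn's AUC on simulated data: an informative model scores well above the 0.5 "learnt nothing" baseline, while random scores sit near it.

```python
import numpy as np
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(2)
y_true = rng.integers(0, 2, 1000)
informative = np.clip(0.3 + 0.4 * y_true + rng.normal(0, 0.2, 1000), 0, 1)
random_guess = rng.random(1000)

print("AUC, informative scores:", round(roc_auc_score(y_true, informative), 2))   # well above 0.5
print("AUC, random scores:     ", round(roc_auc_score(y_true, random_guess), 2))  # about 0.5
```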
