Logistic Regression
Logistic regression is analogous to multiple linear regression (see Chapter 4), except
the outcome is binary. Various transformations are employed to convert the problem
to one in which a linear model can be fit. Like discriminant analysis, and unlike K-
Nearest Neighbor and naive Bayes, logistic regression is a structured model approach
rather than a data-centric approach. Due to its fast computational speed and its output of a model that lends itself to rapid scoring of new data, it is a popular method.
The first step is to think of the outcome variable not as a binary label but as the probability p that the label is a 1. It might be tempting to model p directly as a linear function of the predictors, but fitting such a model does not ensure that p will end up between 0 and 1, as a probability must.
Instead, we model p by applying a logistic response or inverse logit function to the predictors:

$$ p = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_q x_q)}} $$

This transformation ensures that p stays between 0 and 1. To get the exponential expression out of the denominator, we work with odds instead of probabilities; the odds of an event are the probability of the event divided by the probability that it will not occur:

$$ \mathrm{Odds}(Y = 1) = \frac{p}{1 - p} $$
We can obtain the probability from the odds using the inverse odds function:

$$ p = \frac{\mathrm{Odds}}{1 + \mathrm{Odds}} $$
We combine this with the logistic response function, shown earlier, to get:

$$ \mathrm{Odds}(Y = 1) = e^{\beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_q x_q} $$
Finally, taking the logarithm of both sides, we get an expression that involves a linear function of the predictors:

$$ \log\bigl(\mathrm{Odds}(Y = 1)\bigr) = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_q x_q $$
The log-odds function, also known as the logit function, maps the probability p from (0, 1) to any value in (−∞, +∞); see Figure 5-2. The transformation circle is complete: we have used a linear model to predict a probability, which we can in turn map to a class label by applying a cutoff rule; any record with a probability greater than the cutoff is classified as a 1.
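To make the round trip concrete, here is a minimal sketch in plain Python; the probability value and the 0.5 cutoff are illustrative, not taken from the loan model:
import math

def logit(p):
    # log odds: maps a probability in (0, 1) to a value in (-inf, +inf)
    return math.log(p / (1 - p))

def inverse_logit(y):
    # logistic response function: maps a log odds value back to a probability
    return 1 / (1 + math.exp(-y))

p = 0.8                            # illustrative probability
log_odds = logit(p)                # about 1.386
p_back = inverse_logit(log_odds)   # recovers 0.8
label = 1 if p_back > 0.5 else 0   # cutoff rule: classify as 1 above the cutoff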
Consider a model fit to the loan data, predicting whether a loan defaults. The response is outcome, which takes a 0 if the loan is paid off and a 1 if the loan
defaults. purpose_ and home_ are factor variables representing the purpose of the loan
and the home ownership status. As in linear regression, a factor variable with P levels
is represented with P – 1 columns. By default in R, the reference coding is used, and
the levels are all compared to the reference level (see “Factor Variables in Regression”
on page 163). The reference levels for these factors are credit_card and MORTGAGE,
respectively. The variable borrower_score is a score from 0 to 1 representing the
creditworthiness of the borrower (from poor to excellent). This variable was created
from several other variables using K-Nearest Neighbor—see “KNN as a Feature
Engine” on page 247.
In Python, we use the scikit-learn class LogisticRegression from sklearn.linear_model. The arguments penalty and C are used to prevent overfitting by L1 or L2 regularization. Regularization is switched on by default; in order to fit without regularization, we set C to a very large value. The solver argument selects the minimizer to use; the method liblinear is the default:
predictors = ['payment_inc_ratio', 'purpose_', 'home_', 'emp_len_',
              'borrower_score']
outcome = 'outcome'
X = pd.get_dummies(loan_data[predictors], prefix='', prefix_sep='',
                   drop_first=True)
y = loan_data[outcome]
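The snippet above stops at the data preparation step, while the later snippets refer to a fitted model object named logit_reg. A minimal sketch of that fit, assuming we follow the advice above; the value 1e42 is simply an illustrative "very large" C:
from sklearn.linear_model import LogisticRegression

# a very large C effectively disables regularization; liblinear is the solver
# discussed in the text
logit_reg = LogisticRegression(penalty='l2', C=1e42, solver='liblinear')
logit_reg.fit(X, y)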
In contrast to R, scikit-learn derives the classes from the unique values in y (paid off and default). Internally, the classes are ordered alphabetically. As this is the reverse of the order of the factors used in R, you will see that the coefficients are reversed.
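A quick way to check that ordering is the classes_ attribute of the fitted model; the labels shown are what we would expect given the two outcome values, but are illustrative:
print(logit_reg.classes_)
# expected output along the lines of: ['default' 'paid off']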
Logistic regression is a special case of a generalized linear model (GLM): a linear model combined with a probability distribution (here, the binomial family) and a link function (here, the logit). Logistic regression is by far the most common form of GLM, but a data scientist will encounter other types. Sometimes a log link function is used instead of the logit; in practice, use of a log link is unlikely to lead to very different results for most applications. The Poisson distribution is commonly used to model count data (e.g., the number of times a user visits a web page in a certain amount of time). Other families include negative binomial and gamma, often used to model elapsed time (e.g., time to failure). In contrast to logistic regression, application of GLMs with these families is more nuanced and involves greater care; they are best avoided unless you are familiar with their utility and pitfalls.
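For illustration only, here is a sketch of fitting a Poisson GLM with the same statsmodels interface used later in this section; the count outcome and predictor are made-up data, not from the loan data set:
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
x = rng.uniform(size=100)                     # hypothetical predictor
visits = rng.poisson(np.exp(0.5 + 1.5 * x))   # hypothetical count outcome

# Poisson family with its default log link
poisson_model = sm.GLM(visits, sm.add_constant(x), family=sm.families.Poisson())
poisson_result = poisson_model.fit()
print(poisson_result.summary())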
The predicted values from logistic regression, $\hat{Y}$, are on the scale of the log odds; the predicted probability is obtained by applying the logistic response function:

$$ \hat{p} = \frac{1}{1 + e^{-\hat{Y}}} $$
In Python, we can convert the predicted log probabilities into a data frame and use the describe method to get summary characteristics of the distribution:
pred = pd.DataFrame(logit_reg.predict_log_proba(X),
                    columns=loan_data[outcome].cat.categories)
pred.describe()
The probabilities are directly available using the predict_proba method in scikit-learn:
pred = pd.DataFrame(logit_reg.predict_proba(X),
                    columns=loan_data[outcome].cat.categories)
pred.describe()
These are on a scale from 0 to 1 and don’t yet declare whether the predicted value is
default or paid off. We could declare any value greater than 0.5 as default. In practice,
a lower cutoff is often appropriate if the goal is to identify members of a rare class
(see “The Rare Class Problem” on page 223).
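For example, a sketch of turning the scikit-learn probabilities into class labels with a 0.5 cutoff; the column name 'default' assumes the categories used for pred above:
import numpy as np

prob_default = pred['default']   # per-loan probability of default
labels = np.where(prob_default > 0.5, 'default', 'paid off')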
The key to interpreting the coefficients is the odds ratio, which is easiest to understand for a binary factor variable X:

$$ \text{odds ratio} = \frac{\mathrm{Odds}(Y = 1 \mid X = 1)}{\mathrm{Odds}(Y = 1 \mid X = 0)} $$
This is interpreted as the odds that Y = 1 when X = 1 versus the odds that Y = 1 when
X = 0. If the odds ratio is 2, then the odds that Y = 1 are two times higher when X = 1
versus when X = 0.
Why bother with an odds ratio rather than probabilities? We work with odds because the coefficient $\beta_j$ in the logistic regression is the log of the odds ratio for $X_j$.
An example will make this more explicit. For the model fit in “Logistic Regression and the GLM” on page 210, the regression coefficient for purpose_small_business is 1.21526. This means that a loan to a small business, compared to a loan to pay off credit card debt, increases the odds of defaulting versus being paid off by a factor of exp(1.21526) ≈ 3.4. Clearly, loans for the purpose of creating or expanding a small business are considerably riskier than other types of loans.
Figure 5-3 shows the relationship between the odds ratio and the log-odds ratio for odds ratios greater than 1. Because the coefficients are on the log scale, an increase of 1 in a coefficient multiplies the odds ratio by exp(1) ≈ 2.72.
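A quick numerical check of these statements, using the purpose_small_business coefficient reported above (a sketch with numpy):
import numpy as np

np.exp(1.21526)   # about 3.4: the odds ratio for purpose_small_business
np.exp(1)         # about 2.72: a 1-unit coefficient increase multiplies the odds ratio by e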
Odds ratios for numeric variables X can be interpreted similarly: they measure the change in the odds ratio for a unit change in X. For example, increasing the payment-to-income ratio from, say, 5 to 6 increases the odds of the loan defaulting by a factor of exp(0.08244) ≈ 1.09. The variable borrower_score is a score of the borrowers’ creditworthiness and ranges from 0 (low) to 1 (high). The odds of the best borrowers defaulting on their loans, relative to the worst borrowers, are smaller by a factor of exp(−4.61264) ≈ 0.01. In other words, the default risk from the borrowers with the poorest creditworthiness is 100 times greater than that of the best borrowers!
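The same arithmetic confirms the borrower_score statement (continuing with numpy; the coefficient is the one reported above):
np.exp(-4.61264)       # about 0.01: odds ratio of best vs. worst borrowers
1 / np.exp(-4.61264)   # about 100: how much greater the worst borrowers' default odds are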
Fortunately, most practitioners don’t need to concern themselves with the details of the fitting algorithm, since it is handled by the software; it is enough to understand that it is an iterative way to find a good model under certain assumptions. As with linear regression, summary in R produces detailed output for the fitted model:
Call:
glm(formula = outcome ~ payment_inc_ratio + purpose_ + home_ +
emp_len_ + borrower_score, family = "binomial", data = loan_data)
Deviance Residuals:
Min 1Q Median 3Q Max
-2.51951 -1.06908 -0.05853 1.07421 2.15528
Coefficients:
Estimate Std. Error z value Pr(>|z|)
(Intercept) 1.638092 0.073708 22.224 < 2e-16 ***
payment_inc_ratio 0.079737 0.002487 32.058 < 2e-16 ***
purpose_debt_consolidation 0.249373 0.027615 9.030 < 2e-16 ***
purpose_home_improvement 0.407743 0.046615 8.747 < 2e-16 ***
purpose_major_purchase 0.229628 0.053683 4.277 1.89e-05 ***
purpose_medical 0.510479 0.086780 5.882 4.04e-09 ***
purpose_other 0.620663 0.039436 15.738 < 2e-16 ***
purpose_small_business 1.215261 0.063320 19.192 < 2e-16 ***
home_OWN 0.048330 0.038036 1.271 0.204
home_RENT 0.157320 0.021203 7.420 1.17e-13 ***
emp_len_ > 1 Year -0.356731 0.052622 -6.779 1.21e-11 ***
borrower_score -4.612638 0.083558 -55.203 < 2e-16 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
The package statsmodels has an implementation of generalized linear models (GLMs) that provides similarly detailed information:
y_numbers = [1 if yi == 'default' else 0 for yi in y]
logit_reg_sm = sm.GLM(y_numbers, X.assign(const=1),
                      family=sm.families.Binomial())
logit_result = logit_reg_sm.fit()
logit_result.summary()
Interpretation of the p-value comes with the same caveat as in regression and should
be viewed more as a relative indicator of variable importance (see “Assessing the
Model” on page 153) than as a formal measure of statistical significance. A logistic
regression model, which has a binary response, does not have an associated RMSE or R-squared. Instead, a logistic regression model is typically evaluated using more general metrics for classification; see “Evaluating Classification Models” on page 219.
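As a minimal sketch of such metrics with scikit-learn, using the model and data defined earlier (the metric choice here is illustrative):
from sklearn.metrics import accuracy_score, confusion_matrix

pred_class = logit_reg.predict(X)
print(confusion_matrix(y, pred_class))
print(accuracy_score(y, pred_class))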
Many other concepts for linear regression carry over to the logistic regression setting
(and other GLMs). For example, you can use stepwise regression, fit interaction
terms, or include spline terms. The same concerns regarding confounding and correlated variables apply to logistic regression (see “Interpreting the Regression Equation”
on page 169). You can fit generalized additive models (see “Generalized Additive
Models” on page 192) using the mgcv package in R:
logistic_gam <- gam(outcome ~ s(payment_inc_ratio) + purpose_ +
                    home_ + emp_len_ + s(borrower_score),
                    data=loan_data, family='binomial')
Analysis of residuals
One area where logistic regression differs from linear regression is in the analysis of
the residuals. As in linear regression (see Figure 4-9), it is straightforward to compute
partial residuals in R:
terms <- predict(logistic_gam, type='terms')
partial_resid <- resid(logistic_gam) + terms
df <- data.frame(payment_inc_ratio = loan_data[, 'payment_inc_ratio'],
                 terms = terms[, 's(payment_inc_ratio)'],
                 partial_resid = partial_resid[, 's(payment_inc_ratio)'])
Key Ideas
• Logistic regression is like linear regression, except that the outcome is a binary
variable.
• Several transformations are needed to get the model into a form that can be fit as a linear model, with the log odds as the response variable.
• After the linear model is fit (by an iterative process), the log odds is mapped back
to a probability.
• Logistic regression is popular because it is computationally fast and produces a model that can score new data with only a few arithmetic operations.
Further Reading
• The standard reference on logistic regression is Applied Logistic Regression, 3rd
ed., by David Hosmer, Stanley Lemeshow, and Rodney Sturdivant (Wiley, 2013).
• Also popular are two books by Joseph Hilbe: Logistic Regression Models (very
comprehensive, 2017) and Practical Guide to Logistic Regression (compact, 2015),
both from Chapman & Hall/CRC Press.
• Both The Elements of Statistical Learning, 2nd ed., by Trevor Hastie, Robert Tibshirani, and Jerome Friedman (Springer, 2009), and its shorter cousin, An Introduction to Statistical Learning by Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani (Springer, 2013), have a section on logistic regression.
• Data Mining for Business Analytics by Galit Shmueli, Peter Bruce, Nitin Patel,
Peter Gedeck, Inbal Yahav, and Kenneth Lichtendahl (Wiley, 2007–2020, with
editions for R, Python, Excel, and JMP) has a full chapter on logistic regression.