0% found this document useful (0 votes)

88 views16 pages

Logistic Regression

Logistic regression is a machine learning classification algorithm that predicts the probability of categorical dependent variables based on independent variables. It uses a logistic function to map predictions between 0 and 1, like an S-curve. Logistic regression is similar to linear regression but is used for classification instead of regression. It estimates probabilities instead of providing exact values.

Uploaded by

Bhakti Betageri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

88 views16 pages

Logistic Regression

Uploaded by

Bhakti Betageri

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

You are on page 1/ 16

Logistic Regression

Anita Shirol
Introduction

 Logistic regression is a supervised machine learning algorithm mainly used for

classification tasks where the goal is to predict the probability that an
instance of belonging to a given class.
 It is used for classification algorithms its name is logistic regression. it’s
referred to as regression because it takes the output of the linear regression
function as input.
 Uses a sigmoid function to estimate the probability for the given class.
 The difference between linear regression and logistic regression is that linear
regression output is the continuous value that can be anything while logistic
regression predicts the probability that an instance belongs to a given class or
not.
Cont.
 Logistic regression predicts the output of a categorical dependent variable. Therefore
the outcome must be a categorical or discrete value.
 It can be either Yes or No, 0 or 1, true or False, etc. but instead of giving the exact
value as 0 and 1, it gives the probabilistic values which lie between 0 and 1.
 Logistic Regression is much similar to the Linear Regression except that how they are
used. Linear Regression is used for solving Regression problems, whereas Logistic
regression is used for solving the classification problems.
 In Logistic regression, instead of fitting a regression line, we fit an “S” shaped logistic
function, which predicts two maximum values (0 or 1).
 The curve from the logistic function indicates the likelihood of something such as
whether the cells are cancerous or not, a mouse is obese or not based on its weight,
etc.
 Logistic Regression is a significant machine learning algorithm because it has the ability
to provide probabilities and classify new data using continuous and discrete datasets.
 Logistic Regression can be used to classify the observations using different types of
data and can easily determine the most effective variables used for the classification.
Logistic Function (Sigmoid Function)
 The sigmoid function is a mathematical function used to map the predicted
values to probabilities.

 It maps any real value into another value within a range of 0 and 1. o The
value of the logistic regression must be between 0 and 1, which cannot go
beyond this limit, so it forms a curve like the “S” form.

 The S-form curve is called the Sigmoid function or the logistic function.

 In logistic regression, we use the concept of the threshold value, which

defines the probability of either 0 or 1. Such as values above the threshold
value tends to 1, and a value below the threshold values tends to 0.
Types
 Binomial: In binomial Logistic regression, there can be only two possible
types of the dependent variables, such as 0 or 1, Pass or Fail, etc.

 Multinomial: In multinomial Logistic regression, there can be 3 or more

possible unordered types of the dependent variable, such as “cat”, “dogs”, or
“sheep”

 Ordinal: In ordinal Logistic regression, there can be 3 or more possible

ordered types of dependent variables, such as “low”, “Medium”, or “High”.
Difference between Linear Regression and
Logistic Regression
Terminologies
 Independent variables: The input characteristics or predictor factors applied to the dependent
variable’s predictions.
 Dependent variable: The target variable in a logistic regression model, which we are trying to
predict.
 Logistic function: The formula used to represent how the independent and dependent variables
relate to one another. The logistic function transforms the input variables into a probability value
between 0 and 1, which represents the likelihood of the dependent variable being 1 or 0.
 Odds: It is the ratio of something occurring to something not occurring. it is different from
probability as the probability is the ratio of something occurring to everything that could possibly
occur.
 Log-odds: The log-odds, also known as the logit function, is the natural logarithm of the odds. In
logistic regression, the log odds of the dependent variable are modeled as a linear combination of
the independent variables and the intercept.
 Coefficient: The logistic regression model’s estimated parameters, show how the independent and
dependent variables relate to one another.
 Intercept: A constant term in the logistic regression model, which represents the log odds when all
independent variables are equal to zero.
 Maximum likelihood estimation: The method used to estimate the coefficients of the logistic
regression model, which maximizes the likelihood of observing the data given the model.
How does Logistic Regression work?

 The logistic regression model transforms the linear regression function continuous
value output into categorical value output using a sigmoid function, which maps any
real-valued set of independent variables input into a value between 0 and 1. This
function is known as the logistic function.
 Let the independent input features be:
Sigmoid
Equation
 Odds of occurrence
Likelihood function for Logistic Regression
Gradient of the log-likelihood function
Assumptions for Logistic Regression

 The assumptions for Logistic regression are as follows:

 Independent observations: Each observation is independent of the other.
meaning there is no correlation between any input variables.
 Binary dependent variables: It takes the assumption that the dependent
variable must be binary or dichotomous, meaning it can take only two values.
For more than two categories softmax functions are used.
 Linearity relationship between independent variables and log odds: The
relationship between the independent variables and the log odds of the
dependent variable should be linear.
 No outliers: There should be no outliers in the dataset.
 Large sample size: The sample size is sufficiently large
Binomial

Code
# import the necessary libraries
 from sklearn.datasets import load_breast_cancer
 from sklearn.linear_model import LogisticRegression
 from sklearn.model_selection import train_test_split
 from sklearn.metrics import accuracy_score
 # load the breast cancer dataset
 X, y = load_breast_cancer(return_X_y=True)
 # split the train and test dataset
 X_train, X_test,\
 y_train, y_test = train_test_split(X, y,
 test_size=0.20,
 random_state=23)
 # LogisticRegression
 clf = LogisticRegression(random_state=0)
 clf.fit(X_train, y_train)
 # Prediction
 y_pred = clf.predict(X_test)

 acc = accuracy_score(y_test, y_pred)

 print("Logistic Regression model accuracy (in %):", acc*100)
Multi class
 from sklearn.model_selection import train_test_split
 from sklearn import datasets, linear_model, metrics

 # load the digit dataset

 digits = datasets.load_digits()

 # defining feature matrix(X) and response vector(y)

 X = digits.data
 y = digits.target

 # splitting X and y into training and testing sets

 X_train, X_test,\
 y_train, y_test = train_test_split(X, y, test_size=0.4, random_state=1)

 # create logistic regression object

 reg = linear_model.LogisticRegression()

 # train the model using the training sets

 reg.fit(X_train, y_train)

 # making predictions on the testing set

 y_pred = reg.predict(X_test)
Cost function

‎⁨كتاب التشطيبات م عبدالغني الجند⁩
100% (1)
‎⁨كتاب التشطيبات م عبدالغني الجند⁩
604 pages
ML Lecture 15 Ensemble
No ratings yet
ML Lecture 15 Ensemble
27 pages
Mitsubishi - FD30N
100% (1)
Mitsubishi - FD30N
7 pages
WRM Year8 Spring Block 1 Brackets Equations Inequalities Exemplar Questions and Answers
No ratings yet
WRM Year8 Spring Block 1 Brackets Equations Inequalities Exemplar Questions and Answers
87 pages
Gradient Descent
No ratings yet
Gradient Descent
18 pages
QGB Series: Instruction Manual
No ratings yet
QGB Series: Instruction Manual
100 pages
Headspace 2nd Year
100% (1)
Headspace 2nd Year
401 pages
Bayesian Network
No ratings yet
Bayesian Network
15 pages
Ue22cs342aa2 20241114095341
No ratings yet
Ue22cs342aa2 20241114095341
23 pages
CISSP Mindmap
No ratings yet
CISSP Mindmap
24 pages
Displacement Measurement
No ratings yet
Displacement Measurement
90 pages
CSC2102 Data Structures and Algorithm Program BSSE-3 Sec. A Week 1
No ratings yet
CSC2102 Data Structures and Algorithm Program BSSE-3 Sec. A Week 1
30 pages
Unit - 4 - Modified
No ratings yet
Unit - 4 - Modified
152 pages
9 Distance Measures in Data Science
No ratings yet
9 Distance Measures in Data Science
9 pages
CS229 Lecture 3 PDF
100% (1)
CS229 Lecture 3 PDF
35 pages
Logistic Regression
No ratings yet
Logistic Regression
47 pages
How To Apply Initial Stress Using INISTATE
No ratings yet
How To Apply Initial Stress Using INISTATE
4 pages
SPSS Multiple Linear Regression
No ratings yet
SPSS Multiple Linear Regression
55 pages
Logistic Regression
100% (1)
Logistic Regression
12 pages
Module 1 Notes
100% (1)
Module 1 Notes
73 pages
DBSCAN
No ratings yet
DBSCAN
42 pages
TGTDCL 2018 Question Solution by Design Integrity
No ratings yet
TGTDCL 2018 Question Solution by Design Integrity
6 pages
Chapter 2 IA
No ratings yet
Chapter 2 IA
49 pages
Tabla de Ampacidades 310.16 NEC (NFPA 70) - Como Utilizar La Tabla de Ampacidades
No ratings yet
Tabla de Ampacidades 310.16 NEC (NFPA 70) - Como Utilizar La Tabla de Ampacidades
3 pages
Regression Logistic 4
No ratings yet
Regression Logistic 4
51 pages
ZX2-GC×GC Im 01
No ratings yet
ZX2-GC×GC Im 01
32 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
95 pages
Logistic Regression
100% (1)
Logistic Regression
56 pages
Doing Bayesian Data Analysis With JASP: Darrell A. Worthy
No ratings yet
Doing Bayesian Data Analysis With JASP: Darrell A. Worthy
76 pages
DBSCAN
No ratings yet
DBSCAN
18 pages
Multiple Regression
No ratings yet
Multiple Regression
138 pages
Hotkeys Meshmixer
No ratings yet
Hotkeys Meshmixer
5 pages
Database Management System - Practical File
No ratings yet
Database Management System - Practical File
11 pages
SOP For E-Mail Security Policy - v. 1.0
No ratings yet
SOP For E-Mail Security Policy - v. 1.0
8 pages
Manual
No ratings yet
Manual
64 pages
UNIT-2.1 - Angular Measurement
No ratings yet
UNIT-2.1 - Angular Measurement
74 pages
Euclidean Distance
No ratings yet
Euclidean Distance
10 pages
Conjoint Tutorial
No ratings yet
Conjoint Tutorial
20 pages
Bayes Network
100% (1)
Bayes Network
80 pages
Sankosha 2017 Laundry the-Higher-Standard
No ratings yet
Sankosha 2017 Laundry the-Higher-Standard
10 pages
Logistic Regression
100% (1)
Logistic Regression
21 pages
MMM Lecture - Unit 1 - Displacement Measurement
No ratings yet
MMM Lecture - Unit 1 - Displacement Measurement
22 pages
Logistic Regression
100% (1)
Logistic Regression
21 pages
CV - (Hadziq Mufid Mahmud) (Middleware Developer)
No ratings yet
CV - (Hadziq Mufid Mahmud) (Middleware Developer)
6 pages
ML - Unit 2
No ratings yet
ML - Unit 2
15 pages
Microsoft Malware Prediction
100% (1)
Microsoft Malware Prediction
16 pages
Bayesian Network - Problem
100% (1)
Bayesian Network - Problem
4 pages
AI - 02 (Intelligent Agents)
No ratings yet
AI - 02 (Intelligent Agents)
36 pages
Nibha Dubey
No ratings yet
Nibha Dubey
5 pages
SensaGuard Switches With EStop To MSR138.1DP Relay
No ratings yet
SensaGuard Switches With EStop To MSR138.1DP Relay
4 pages
Augmented Reality in Quality Control
No ratings yet
Augmented Reality in Quality Control
6 pages
Logistic+Regression - Done
100% (1)
Logistic+Regression - Done
41 pages
Classification Metrics in Machine Learning
No ratings yet
Classification Metrics in Machine Learning
6 pages
A Tutorial On Bayesian Belief Networks: Mark L. Krieg DSTO-TN-0403
No ratings yet
A Tutorial On Bayesian Belief Networks: Mark L. Krieg DSTO-TN-0403
66 pages
Catalogo
No ratings yet
Catalogo
3 pages
DE Ch21
No ratings yet
DE Ch21
20 pages
D-4856 Vensim Conversion Guide (Aaron Diamond)
No ratings yet
D-4856 Vensim Conversion Guide (Aaron Diamond)
6 pages
Minitab Statguide Multivariate
No ratings yet
Minitab Statguide Multivariate
25 pages
Measurements of Stress and Strain
100% (1)
Measurements of Stress and Strain
26 pages
Aa270625068397p - SCN25062025 GST
No ratings yet
Aa270625068397p - SCN25062025 GST
1 page
Cluster
100% (1)
Cluster
72 pages
Chapter 14 - Cluster Analysis: Data Mining For Business Intelligence
No ratings yet
Chapter 14 - Cluster Analysis: Data Mining For Business Intelligence
31 pages
SR Vibratory Ripper
No ratings yet
SR Vibratory Ripper
4 pages
Big Data Presentation
No ratings yet
Big Data Presentation
45 pages
Naive Bayes Classifier: Coin Toss and Fair Dice Example
No ratings yet
Naive Bayes Classifier: Coin Toss and Fair Dice Example
16 pages
Logistic Regression
No ratings yet
Logistic Regression
9 pages
Bess White-Paper Explosion-Protection Final
No ratings yet
Bess White-Paper Explosion-Protection Final
2 pages
Whitepaper Top Benefits of Video Conferencing Polycom
No ratings yet
Whitepaper Top Benefits of Video Conferencing Polycom
2 pages
Naive Bayes and Sentiment
No ratings yet
Naive Bayes and Sentiment
19 pages
Bar Graph-Wps Office
No ratings yet
Bar Graph-Wps Office
16 pages
Lecture-5 (Ch. 4, Uma) - Types of Variables
No ratings yet
Lecture-5 (Ch. 4, Uma) - Types of Variables
19 pages
Agglomerative Hierarchical Clustering
No ratings yet
Agglomerative Hierarchical Clustering
21 pages
Touch Screen Technology: Let'S Touch The Future
No ratings yet
Touch Screen Technology: Let'S Touch The Future
45 pages
Non Linear Regression
No ratings yet
Non Linear Regression
20 pages
Linear & Angular Measuring Instruments-Prianka
No ratings yet
Linear & Angular Measuring Instruments-Prianka
22 pages
Evaluation Mcqs
No ratings yet
Evaluation Mcqs
2 pages
OUTLIERS
100% (1)
OUTLIERS
5 pages
Preliminary Water Utility Report
No ratings yet
Preliminary Water Utility Report
24 pages
Customer Choice Tutorial
No ratings yet
Customer Choice Tutorial
15 pages
Blank: CFC Cumulative Forecast Error or Bias Error
100% (1)
Blank: CFC Cumulative Forecast Error or Bias Error
2 pages
Development of Smart Multi-Level Inverter With Remote Monitoring System
No ratings yet
Development of Smart Multi-Level Inverter With Remote Monitoring System
5 pages
Logistic Regression
No ratings yet
Logistic Regression
10 pages
Gujarat Technological University: External Examiner's Feedback Form
0% (1)
Gujarat Technological University: External Examiner's Feedback Form
1 page
1.5 S Shift in The Six Sigma Process
No ratings yet
1.5 S Shift in The Six Sigma Process
3 pages
Design of A Latent Heat Storage System For The Replacement of Cooling Tower For DG Set
No ratings yet
Design of A Latent Heat Storage System For The Replacement of Cooling Tower For DG Set
6 pages
Greek Architecture:: Golden Ratio in Use
No ratings yet
Greek Architecture:: Golden Ratio in Use
4 pages
Logistic Regression
No ratings yet
Logistic Regression
11 pages
Cheatsheet Midterms 2 - 3
No ratings yet
Cheatsheet Midterms 2 - 3
2 pages
VMware KB - Required VMware Vcenter Converter Ports
No ratings yet
VMware KB - Required VMware Vcenter Converter Ports
4 pages
1.5 Sigma Process Shift Explanation
No ratings yet
1.5 Sigma Process Shift Explanation
1 page

Logistic Regression

Uploaded by

Logistic Regression

Uploaded by

Logistic Regression

 Logistic regression is a supervised machine learning algorithm mainly used for

 In logistic regression, we use the concept of the threshold value, which

 Multinomial: In multinomial Logistic regression, there can be 3 or more

 Ordinal: In ordinal Logistic regression, there can be 3 or more possible

 The assumptions for Logistic regression are as follows:

 acc = accuracy_score(y_test, y_pred)

 # load the digit dataset

 # defining feature matrix(X) and response vector(y)

 # splitting X and y into training and testing sets

 # create logistic regression object

 # train the model using the training sets

 # making predictions on the testing set

You might also like