Logistic Regression Example

Logistic regression is an algorithm for binary classification that learns coefficients to predict the probability of an output being 1 or 0 based on the input features. It works by iteratively updating the coefficients to minimize error using stochastic gradient descent. On a sample dataset, a logistic regression model achieved 100% accuracy after 10 epochs of training.

Uploaded by

LUV ARORA
Copyright
© All Rights Reserved

Logistic Regression

• Logistic regression is one of the most popular machine learning algorithms for binary classification, because it is a simple algorithm that performs very well on a wide range of problems.

This example covers:
• How to calculate the logistic function.
• How to learn the coefficients for a logistic regression model using stochastic gradient descent.
• How to make predictions using a logistic regression model.
Raw Dataset
This dataset has two input variables (X1 and X2) and one output variable (Y).
The input variables are real-valued random numbers drawn from a Gaussian
distribution. The output variable has two values, making this a binary
classification problem.

X1         X2            Y
2.7810836  2.550537003   0
1.4654893  2.362125076   0
3.3965616  4.400293529   0
1.3880701  1.850220317   0
3.0640723  3.005305973   0
7.6275312  2.759262235   1
5.3324412  2.088626775   1
6.9225967  1.77106367    1
8.6754186  -0.242068654  1
7.6737564  3.508563011   1
• Below is a plot of the dataset. You can see that
it is completely contrived and that we can
easily draw a line to separate the classes.
• This is exactly what we are going to do with
the logistic regression model.
[Scatter plot of the dataset: the class 0 and class 1 points are clearly separable by a straight line.]
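The raw dataset above can be written down directly in code. This is a minimal sketch; the variable name `dataset` and the (x1, x2, y) tuple layout are my own choices, not from the slides:

```python
# The contrived dataset from the slides: one (x1, x2, y) tuple per row.
dataset = [
    (2.7810836, 2.550537003, 0),
    (1.4654893, 2.362125076, 0),
    (3.3965616, 4.400293529, 0),
    (1.3880701, 1.850220317, 0),
    (3.0640723, 3.005305973, 0),
    (7.6275312, 2.759262235, 1),
    (5.3324412, 2.088626775, 1),
    (6.9225967, 1.77106367, 1),
    (8.6754186, -0.242068654, 1),
    (7.6737564, 3.508563011, 1),
]

# Ten instances, five per class, so the problem is balanced.
print(len(dataset), sum(y for _, _, y in dataset))  # 10 5
```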
Logistic Function

• The logistic function is defined as:
• transformed = 1 / (1 + e^-x)
• Where e is the numerical constant Euler’s number and x is an input we plug into the function.
• Take a series of numbers from -5 to +5 and see how the logistic function transforms them:

X    Transformed
-5   0.006692850924
-4   0.01798620996
-3   0.04742587318
-2   0.119202922
-1   0.2689414214
0    0.5
1    0.7310585786
2    0.880797078
3    0.9525741268
4    0.98201379
5    0.9933071491

• All of the inputs have been transformed into the range [0, 1].
• The smallest negative numbers resulted in values close to zero.
• The larger positive numbers resulted in values close to one.
• 0 transformed to 0.5, the midpoint of the new range.
• As long as the inputs are centered around a mean of zero, we can plug
positive and negative values into the function and always get a consistent
transform into the new range.
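The logistic function above can be sketched in a few lines of Python; the function name `logistic` is my own choice:

```python
import math

def logistic(x):
    # transformed = 1 / (1 + e^-x): squashes any real x into (0, 1)
    return 1.0 / (1.0 + math.exp(-x))

# Reproduce the transform table from the slides for x = -5 .. +5.
for x in range(-5, 6):
    print(x, logistic(x))
```

Note that logistic(0) is exactly 0.5, matching the midpoint in the table above.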
Logistic Regression Model

• The logistic regression model takes real-valued inputs and makes a prediction as to the probability of the input belonging to the default class (class 1).
• If the probability is > 0.5 we can take the output as a prediction for the default class (class 1), otherwise the prediction is for the other class (class 0).
• For this dataset, the logistic regression model has three coefficients, just like linear regression, for example:
• output = b0 + b1*x1 + b2*x2
• The job of the learning algorithm will be to discover the best values for the coefficients (b0, b1 and b2) based on the training data.
• Unlike linear regression, the output is transformed into a probability using the logistic function:
• p(class=1) = 1 / (1 + e^(-output))
Logistic Regression by Stochastic Gradient Descent

• We can estimate the values of the coefficients using stochastic gradient descent.
• It works by using the model to calculate a prediction for each instance in the training set and calculating the error for each prediction.

• Given each training instance:
1. Calculate a prediction using the current values of the coefficients.
2. Calculate new coefficient values based on the error in the prediction.

• The process is repeated until the model is accurate enough (e.g. the error drops to some desirable level) or for a fixed number of iterations.
• We continue to update the model for each training instance, correcting errors, until the model is accurate enough or cannot be made any more accurate.
• The order of the training instances shown to the model may be randomized to mix up the corrections made.
Calculate Prediction

• Let’s start off by assigning 0.0 to each coefficient and calculating the prediction for the first training instance, which belongs to class 0.
• B0 = 0.0
• B1 = 0.0
• B2 = 0.0
• The first training instance is: x1=2.7810836, x2=2.550537003, Y=0
• Using the above equation we can plug in all of
these numbers and calculate a prediction:
• prediction = 1 / (1 + e^(-(b0 + b1*x1 + b2*x2)))
• prediction = 1 / (1 + e^(-(0.0 + 0.0*2.7810836
+ 0.0*2.550537003)))
• prediction = 0.5
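The calculation above can be checked with a small helper function; the name `predict` and the way the coefficients are passed are my own choices for illustration:

```python
import math

def predict(x1, x2, b0, b1, b2):
    # prediction = 1 / (1 + e^(-(b0 + b1*x1 + b2*x2)))
    output = b0 + b1 * x1 + b2 * x2
    return 1.0 / (1.0 + math.exp(-output))

# All-zero coefficients on the first training instance give exactly 0.5,
# because the linear output is 0 and logistic(0) = 0.5.
p = predict(2.7810836, 2.550537003, 0.0, 0.0, 0.0)
print(p)  # 0.5
```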
Calculate New Coefficients

• We can calculate the new coefficient values using a simple update equation:
• b = b + alpha * (y – prediction) * prediction * (1 – prediction) * x
• alpha is the learning rate and controls how much the coefficients (and therefore the model) change or learn each time they are updated.
• Good values might be in the range 0.1 to 0.3. Here, alpha = 0.3.
• Let’s update the coefficients using the prediction
(0.5) and coefficient values (0.0) from the
previous section.
• b0 = b0 + 0.3 * (0 – 0.5) * 0.5 * (1 – 0.5) * 1.0
• b1 = b1 + 0.3 * (0 – 0.5) * 0.5 * (1 – 0.5) * 2.7810836
• b2 = b2 + 0.3 * (0 – 0.5) * 0.5 * (1 – 0.5) * 2.550537003

b0 = -0.0375
b1 = -0.104290635
b2 = -0.09564513761
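The three updates above can be reproduced directly. The variable names are my own; note that the intercept b0 uses a constant input of 1.0:

```python
alpha = 0.3
y, prediction = 0, 0.5            # first instance; prediction from the previous slide
x1, x2 = 2.7810836, 2.550537003

# Shared factor of the update: alpha * (y - prediction) * prediction * (1 - prediction)
step = alpha * (y - prediction) * prediction * (1 - prediction)

b0 = 0.0 + step * 1.0             # intercept: constant input of 1.0
b1 = 0.0 + step * x1
b2 = 0.0 + step * x2
print(b0, b1, b2)  # approximately -0.0375, -0.104290635, -0.09564513761
```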
Repeat the Process

• We repeat this process, updating the model for each training instance in the dataset.
• A single iteration through the training dataset is called an epoch. It is common to repeat the stochastic gradient descent procedure for a fixed number of epochs.
• At the end of each epoch you can calculate error values for the model. Because this is a classification problem, it would be nice to get an idea of how accurate the model is at each iteration.
• A plot of the model’s accuracy over the 10 epochs shows that it very
quickly achieves 100% accuracy on the training dataset.
• The coefficients calculated after 10 epochs of
stochastic gradient descent are:
• b0 = -0.4066054641
• b1 = 0.8525733164
• b2 = -1.104746259
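The whole procedure, 10 epochs of stochastic gradient descent in dataset order with alpha = 0.3, can be sketched as below. The dataset values are taken from the slides (truncated there to a few decimal places, so the final coefficients may differ from the quoted ones in the last digits); the function and variable names are my own:

```python
import math

dataset = [
    (2.7810836, 2.550537003, 0),
    (1.4654893, 2.362125076, 0),
    (3.3965616, 4.400293529, 0),
    (1.3880701, 1.850220317, 0),
    (3.0640723, 3.005305973, 0),
    (7.6275312, 2.759262235, 1),
    (5.3324412, 2.088626775, 1),
    (6.9225967, 1.77106367, 1),
    (8.6754186, -0.242068654, 1),
    (7.6737564, 3.508563011, 1),
]

def train(data, alpha=0.3, n_epochs=10):
    b0 = b1 = b2 = 0.0
    for _ in range(n_epochs):
        for x1, x2, y in data:      # no shuffling: same order every epoch
            yhat = 1.0 / (1.0 + math.exp(-(b0 + b1 * x1 + b2 * x2)))
            step = alpha * (y - yhat) * yhat * (1 - yhat)
            b0 += step * 1.0        # intercept: constant input of 1.0
            b1 += step * x1
            b2 += step * x2
    return b0, b1, b2

b0, b1, b2 = train(dataset)
print(b0, b1, b2)  # close to -0.4066054641, 0.8525733164, -1.104746259
```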
Make Predictions

• Using the coefficients learned after 10 epochs, we can calculate the output value for each training instance:

Instance  Output
1   0.2987569857
2   0.145951056
3   0.08533326531
4   0.2197373144
5   0.2470590002
6   0.9547021348
7   0.8620341908
8   0.9717729051
9   0.9992954521
10  0.905489323
• We can convert these into crisp class values using:
• prediction = IF (output < 0.5) THEN 0 ELSE 1

Instance  Prediction
1   0
2   0
3   0
4   0
5   0
6   1
7   1
8   1
9   1
10  1
• Finally, calculate the accuracy of the model on the training dataset:
• accuracy = (correct predictions / num predictions made) * 100
• accuracy = (10 / 10) * 100
• accuracy = 100%
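The outputs, crisp class values and accuracy above can be reproduced with the quoted coefficients. Variable names here are my own:

```python
import math

# Coefficients reported after 10 epochs of stochastic gradient descent.
b0, b1, b2 = -0.4066054641, 0.8525733164, -1.104746259

dataset = [
    (2.7810836, 2.550537003, 0),
    (1.4654893, 2.362125076, 0),
    (3.3965616, 4.400293529, 0),
    (1.3880701, 1.850220317, 0),
    (3.0640723, 3.005305973, 0),
    (7.6275312, 2.759262235, 1),
    (5.3324412, 2.088626775, 1),
    (6.9225967, 1.77106367, 1),
    (8.6754186, -0.242068654, 1),
    (7.6737564, 3.508563011, 1),
]

# Probability output for each instance, then crisp 0/1 predictions.
outputs = [1.0 / (1.0 + math.exp(-(b0 + b1 * x1 + b2 * x2)))
           for x1, x2, _ in dataset]
crisp = [0 if o < 0.5 else 1 for o in outputs]   # IF output < 0.5 THEN 0 ELSE 1

correct = sum(p == y for p, (_, _, y) in zip(crisp, dataset))
accuracy = correct / len(dataset) * 100

print(outputs[0])  # approximately 0.2987569857
print(accuracy)    # 100.0
```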
