0% found this document useful (0 votes)

135 views13 pages

How To Do A Logistic Regression in Excel

Uploaded by

Farook Shaikh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

135 views13 pages

How To Do A Logistic Regression in Excel

Uploaded by

Farook Shaikh

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

How to Do a Logistic Regression in Excel

Logistic regression is a statistical analysis technique for transforming a linear function’s output
into a probability value. Unlike linear regression, which predicts continuous outcomes, logistic
regression predicts the probability of an event occurring by using a logistic function to predict
the probability of a binary outcome. These types of predictions that categorize based on two
outcomes are called binary classification tasks.

For our example, you’ll perform a logistic regression in Excel to determine whether a college
basketball player is likely to get drafted into the NBA. Your dataset includes basic performance
metrics from the previous season:

 Average points

 Rebounds

 Assists
Because logistic regression is a binary classification problem, the target prediction is a simple
binary classification value of the likelihood of being drafted:

 0 = no

 1 = yes

Step 1: Insert Historical Data and Regression Coefficients

The first step is to create the tabular structure in Excel for holding your dataset and performing
calculations and transformations.

1. In a new Excel sheet, create four columns labeled ““Points,” “Rebounds,” “Assists,” and
“Drafted?”

2. Insert the dataset from the table below.

Step 2: Create Corresponding Cells for Variables

Create a corresponding cell for each of your columnar variables—Points, Rebounds, Assists—to
hold your regression coefficients.

1. Skipping a row after the dataset, create three subsequent cells labeled B1, B2, and B3.

2. On the next row, create a cell for the logistic regression’s intercept.

3. Set all four of these values to 0.001 for now; we’ll optimize them in a later step.
Step 3: Create Columns for Coefficient Optimizations

Next we’ll create columns for optimizing the regression coefficients. We’ll need these to
calculate predictions in later steps, but for now we’ll focus on populating four new columns:

 Logit: The logarithm of the odds of the probability p of a player getting drafted.

 Elogit: The inverse transformation of logit.

 Probability: The probability of being drafted, expressed as a real number.

 Log Likelihood: Goodness of fit, expressed as a negative number—the closer to zero, the
better.

1. Beginning in the first empty column to the right of the dataset, label the four subsequent
columns as follows: “logit,” “elogit,” “probability,” and “log likelihood.”
2. Calculate logit values by taking the logarithm of the odds of the probability (p) of a certain
event occurring:

In Excel, you can use the formula $B$15+$B$16B2+$B$17C2+$B$18*D2 to easily derive

the logit value. Place this formula into the first logit cell and drag the bottom right corner of the
highlighted cell to the last logit cell to populate the column.
3. Create elogit values by returning the result of the constant (e), which is the base of the natural
logarithm raised to the power of the value in the logit column. In this example, the base of the
natural logarithm comes out to about 2.718.
You can use Excel’s EXP function to get this value. Place the formula =EXP(E2) into the
first elogit cell and drag the bottom right corner of the highlighted cell to the last elogit cell to
populate the column.
4. Calculate probability values using the following formula for calculating probability (p):

In our example:

 p is the probability of a 1 value (the proportion of 1s, the mean of Y)

 e is the constant with the value ~2.718

 a and b are the parameters of the algorithm

In Excel, you can use the formula =IF(A2=1, F2/(1+F2), 1-(F2/(1+F2))) to derive the
probability values by placing this formula into the first probability cell and dragging the bottom
right corner of the highlighted cell to the last probability cell to populate the column.

Your spreadsheet should now look like below

Step 4: Create And Sum Log Likelihood Values

Because adding logarithms is computationally more efficient than multiplying probabilities

directly, you’ll need to calculate the log likelihood values to simplify your calculations and make
them more practical.

Log likelihood values are calculated by using the following formula:

Log likelihood = LN(probability)

1. Use the formula =LN(G2) to easily derive the log likelihood values in Excel by placing this
formula into the first log likelihood cell and dragging the bottom right corner of the
highlighted cell to the last log likelihood cell to populate the column.

2. Sum up all the log likelihood values in order to derive the number to maximize to solve for
the regression coefficients. You do this easily by placing the formula =SUM(H2:H13) in the
cell below the last log likelihood cell.
Your spreadsheet should now look like this
Step 5: Solve For Regression Coefficients

The last step involves using Excel’s Solver add-in to automatically calculate the regression
coefficient estimates.

1. Install Excel’s Solver add-in by clicking first on the Home menu and then the Add-Ins menu.

2. Search for and install Solver by following the prompts.

3. Select the Data menu from the top-level navigation and click Solver on the right-hand side to
run the add-in.

4. In the Solver Parameters pane, insert the following values:

 Set Objective: select cell H14 with the sum of the log likelihoods

 To: Max

 By Changing Variable Cells: Select cells B15:B18 containing your regression coefficients

 Make Unconstrained Variables Non-Negative: Uncheck

 Select a Solving Method: GRG Nonlinear

1. Click the “Solve” button.
After Solver finishes automatically calculating your regression coefficient estimates, your
spreadsheet should look like below
The current regression coefficients default to determining the probability of a non-draft:

Draft? = 0

To get the probability of being drafted (Draft? = 1), simply reverse the regression coefficients
signs—for example, reverse the -4.643753 in the p(x=0) column for a positive 4.643753 value
in the p(x=1) column.

Step 6: Add New Data for New Prediction

Now that you have your regression coefficient estimates, you can plug them into the probability
equation to find out whether a new player will get drafted. For this example, let’s say the new
player averages 15 points per game, 4 rebounds per game, and 6 assists per game. Again, the
formula for calculating the probability of being drafted is:
In this example, the formula would look like the following:

Evaluating this equation yields 0.66, or a 66 percent probability this new player will get drafted.

1. To calculate this in Excel, add the new player’s data to your Excel spreadsheet in a new row
to calculate their probability of getting drafted.

Evaluating this equation yields 0.66, or a 66 percent probability this new player will get drafted.

1. To calculate this in Excel, add the new player’s data to your Excel spreadsheet in a new row
to calculate their probability of getting drafted.
As you can see, the probability of the new player being drafted is also 66 percent, which lines up
with the previous manual calculation.

How Does Logistic Regression Work?

Logistic regression involves predicting the probability of a binary event occurring—for example,
success/failure, yes/no, churn/no churn). By definition, probability is a measure of the likelihood
of an event occurring, ranging from 0 (impossible) to 1 (certain).

Odds, on the other hand, express the likelihood of success compared to the likelihood of failure.
For example, if the probability of success is 0.8, the odds of success are 0.8 / (1 – 0.8) = 4. This
means there are four times as many favorable outcomes as unfavorable ones.

Log Odds and The Sigmoid Function

Log odds ratio is a calculation method for transforming these odds into a more workable range of
values. Specifically, the logistic regression model uses the sigmoid function—denoted as σ(z)—
to calculate the log odds ratio, or the logarithm of the odds of success. Mathematically, log odds
ratio is represented as:

Four Factors Celtics Start Jupyter Notebook
No ratings yet
Four Factors Celtics Start Jupyter Notebook
13 pages
Lecture 7 Logistic Regression
No ratings yet
Lecture 7 Logistic Regression
33 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
208 pages
Lecture 2.3.1
No ratings yet
Lecture 2.3.1
50 pages
Logistic Regression
No ratings yet
Logistic Regression
72 pages
Detailed Logistic Regression
No ratings yet
Detailed Logistic Regression
30 pages
Byron Gottfried-Spreadsheet Tools For Engineers Using Excel ® 2007-McGraw-Hill Education (2009)
100% (3)
Byron Gottfried-Spreadsheet Tools For Engineers Using Excel ® 2007-McGraw-Hill Education (2009)
529 pages
Logistic Regression
100% (3)
Logistic Regression
41 pages
Logistic Regression
No ratings yet
Logistic Regression
21 pages
C4.1 - Logistic Regression
No ratings yet
C4.1 - Logistic Regression
6 pages
Automation Anywhere Certified Advanced RPA Professional (Automation 360) Assessment Was Completed by On
86% (7)
Automation Anywhere Certified Advanced RPA Professional (Automation 360) Assessment Was Completed by On
19 pages
Introduction To Logistic Regression
No ratings yet
Introduction To Logistic Regression
12 pages
Logistic
No ratings yet
Logistic
5 pages
Full Using & Understanding Mathematics: A Quantitative Reasoning Approach, 7th Edition (Ebook PDF) PDF All Chapters
100% (1)
Full Using & Understanding Mathematics: A Quantitative Reasoning Approach, 7th Edition (Ebook PDF) PDF All Chapters
55 pages
Lecture 22. GLM
No ratings yet
Lecture 22. GLM
41 pages
DPSWarrior Classic
No ratings yet
DPSWarrior Classic
72 pages
First Steps With Jedox For Excel
No ratings yet
First Steps With Jedox For Excel
104 pages
Bcom 1st Sem Fit Lab Record
No ratings yet
Bcom 1st Sem Fit Lab Record
25 pages
Logistic Regression
No ratings yet
Logistic Regression
49 pages
Lec-4 Logistic Regression
No ratings yet
Lec-4 Logistic Regression
54 pages
Logistic Regression
No ratings yet
Logistic Regression
4 pages
Ms Excel2007 Part 1
No ratings yet
Ms Excel2007 Part 1
34 pages
Chapter 15 Percentage Formula
No ratings yet
Chapter 15 Percentage Formula
7 pages
Q 10 A Q 6B Logistic Regression Class
No ratings yet
Q 10 A Q 6B Logistic Regression Class
18 pages
Logistic Regression in R and Python
No ratings yet
Logistic Regression in R and Python
9 pages
Logistic Regression
No ratings yet
Logistic Regression
25 pages
Logit PDF
No ratings yet
Logit PDF
44 pages
ML Unit 3
No ratings yet
ML Unit 3
40 pages
Logistic Regression
No ratings yet
Logistic Regression
49 pages
Logistic Reg
No ratings yet
Logistic Reg
87 pages
1 LogisticRegressionNotes1
No ratings yet
1 LogisticRegressionNotes1
11 pages
Logistic Regression
No ratings yet
Logistic Regression
25 pages
Logistic Regression: 30 March 2016
No ratings yet
Logistic Regression: 30 March 2016
49 pages
Generalized Ordered Logit: Notes
No ratings yet
Generalized Ordered Logit: Notes
48 pages
Experiment No 3
No ratings yet
Experiment No 3
7 pages
Data Analytics Using R
No ratings yet
Data Analytics Using R
23 pages
Decision Science - June - 2023
No ratings yet
Decision Science - June - 2023
8 pages
Logistic Regression Example Illustrated
No ratings yet
Logistic Regression Example Illustrated
20 pages
Introduction To Logistic Regression
No ratings yet
Introduction To Logistic Regression
20 pages
Logisticregression PDF
No ratings yet
Logisticregression PDF
48 pages
Excel Data Validation Detailed Questions
No ratings yet
Excel Data Validation Detailed Questions
7 pages
Logistic Regression
0% (1)
Logistic Regression
49 pages
Log Reg
No ratings yet
Log Reg
32 pages
Introduction To Generalized Linear Models: Logit Model With Categorical Predictors. Before
No ratings yet
Introduction To Generalized Linear Models: Logit Model With Categorical Predictors. Before
24 pages
Resume Adam Zumbado V
No ratings yet
Resume Adam Zumbado V
1 page
Fleetrun: The Solution For Fleet Maintenance Control
No ratings yet
Fleetrun: The Solution For Fleet Maintenance Control
13 pages
Regression3 Slides
No ratings yet
Regression3 Slides
47 pages
Logistic Regression For Machine Learning Complete TutorialUnderstand This Popular Supervised Classifi
No ratings yet
Logistic Regression For Machine Learning Complete TutorialUnderstand This Popular Supervised Classifi
10 pages
Logistic Regression
No ratings yet
Logistic Regression
17 pages
Binary Logistic Regression
No ratings yet
Binary Logistic Regression
8 pages
48 - (2018) CCT & EH (Slopes MC Ground-Sevilla-conference)
No ratings yet
48 - (2018) CCT & EH (Slopes MC Ground-Sevilla-conference)
19 pages
Ordered Logit: Notes
No ratings yet
Ordered Logit: Notes
34 pages
Excel Functions for the Daily User - Vol 2
From Everand
Excel Functions for the Daily User - Vol 2
Palani Murugappan
No ratings yet
Logit Regression - R Data Analysis Examples
No ratings yet
Logit Regression - R Data Analysis Examples
12 pages
Tosca Course Syllabus Content From Inventateq
No ratings yet
Tosca Course Syllabus Content From Inventateq
3 pages
What Is Logistic Regression
No ratings yet
What Is Logistic Regression
20 pages
MGMT 469 Maximum Likelihood Estimation
No ratings yet
MGMT 469 Maximum Likelihood Estimation
6 pages
Excel Skills For Business: Intermediate II: Week 3: Automating Lookups
0% (1)
Excel Skills For Business: Intermediate II: Week 3: Automating Lookups
10 pages
Binary Logit: Notes
No ratings yet
Binary Logit: Notes
22 pages
Empowerment Technologies: Quarter 1 - Module 4: Advanced Techniques Using Microsoft Word
No ratings yet
Empowerment Technologies: Quarter 1 - Module 4: Advanced Techniques Using Microsoft Word
28 pages
MODULE 3 Introduction To Construction Estimates
No ratings yet
MODULE 3 Introduction To Construction Estimates
7 pages
Lab-4: Regression Analysis: Logistic & Multinomial Logistic Regression
No ratings yet
Lab-4: Regression Analysis: Logistic & Multinomial Logistic Regression
10 pages
Betting Tracker v2 21 Basic GBP
No ratings yet
Betting Tracker v2 21 Basic GBP
92 pages
E Book Computer MCQ Final
No ratings yet
E Book Computer MCQ Final
84 pages
Regresi Logistik
No ratings yet
Regresi Logistik
34 pages
VB Script
No ratings yet
VB Script
52 pages
L9 Logistical Regression Models Updated
No ratings yet
L9 Logistical Regression Models Updated
10 pages
Logistic Regression: Jia Li
No ratings yet
Logistic Regression: Jia Li
44 pages
Basic Concepts of Logistic Regression
No ratings yet
Basic Concepts of Logistic Regression
5 pages
Using SAS To Extend Logistic Regression
No ratings yet
Using SAS To Extend Logistic Regression
8 pages
Microsoft Excel Formulas: Master Microsoft Excel 2016 Formulas in 30 days
From Everand
Microsoft Excel Formulas: Master Microsoft Excel 2016 Formulas in 30 days
Tina E. Bernard
4/5 (7)
What Is Logistic Regression?
No ratings yet
What Is Logistic Regression?
1 page
Binary Logistic Regression - 6.2
No ratings yet
Binary Logistic Regression - 6.2
34 pages
A Simple Method For Determining Fault Location On Distribution Lines
No ratings yet
A Simple Method For Determining Fault Location On Distribution Lines
13 pages
A Simple But Effective Logistic Regression Derivation
No ratings yet
A Simple But Effective Logistic Regression Derivation
6 pages
Appendix: How To Use The Excel Solver To Fit Data To Any Equation
No ratings yet
Appendix: How To Use The Excel Solver To Fit Data To Any Equation
2 pages
Assignment 2
No ratings yet
Assignment 2
11 pages
Division Order Analyst Resume
100% (1)
Division Order Analyst Resume
8 pages
Puter Literacy 40 Q & A SR
No ratings yet
Puter Literacy 40 Q & A SR
6 pages
Demo Instruction Manual
No ratings yet
Demo Instruction Manual
26 pages
Empowerment Technology
No ratings yet
Empowerment Technology
4 pages
Sensitivity Analysis
No ratings yet
Sensitivity Analysis
6 pages
S.No Topic Link: Ilogic - Content Centre
No ratings yet
S.No Topic Link: Ilogic - Content Centre
21 pages
Ourse Notes Ogistic Egression: Course Notes: Descriptive Statistics Course Notes: Descriptive Statistics
No ratings yet
Ourse Notes Ogistic Egression: Course Notes: Descriptive Statistics Course Notes: Descriptive Statistics
6 pages
Lab 4: Logistic Regression: PSTAT 131/231, Winter 2019
No ratings yet
Lab 4: Logistic Regression: PSTAT 131/231, Winter 2019
10 pages
Machine Learning (Analytics Vidhya) : What Is Logistic Regression?
100% (1)
Machine Learning (Analytics Vidhya) : What Is Logistic Regression?
5 pages
Access Form Design
No ratings yet
Access Form Design
18 pages
Raci Matrix Template
No ratings yet
Raci Matrix Template
10 pages
Learning Open Office: Calc & Base
From Everand
Learning Open Office: Calc & Base
Durgesh
No ratings yet
Ict Specialization Grade 9 3rd Quarterly Exam
100% (1)
Ict Specialization Grade 9 3rd Quarterly Exam
2 pages

How To Do A Logistic Regression in Excel

Uploaded by

How To Do A Logistic Regression in Excel

Uploaded by

How to Do a Logistic Regression in Excel

Step 1: Insert Historical Data and Regression Coefficients

2. Insert the dataset from the table below.

 Elogit: The inverse transformation of logit.

 Probability: The probability of being drafted, expressed as a real number.

In Excel, you can use the formula $B$15+$B$16*B2+$B$17*C2+$B$18*D2 to easily derive

 p is the probability of a 1 value (the proportion of 1s, the mean of Y)

 e is the constant with the value ~2.718

 a and b are the parameters of the algorithm

Your spreadsheet should now look like below

Because adding logarithms is computationally more efficient than multiplying probabilities

Log likelihood values are calculated by using the following formula:

Log likelihood = LN(probability)

2. Search for and install Solver by following the prompts.

4. In the Solver Parameters pane, insert the following values:

 Make Unconstrained Variables Non-Negative: Uncheck

 Select a Solving Method: GRG Nonlinear

Step 6: Add New Data for New Prediction

How Does Logistic Regression Work?

Log Odds and The Sigmoid Function

You might also like

In Excel, you can use the formula $B$15+$B$16B2+$B$17C2+$B$18*D2 to easily derive