
Breaking down Logistic Regression to its basics

#1. The Path from Data to Decision


In the vast expanse of ML algorithms, Logistic Regression stands as a go-to model for binary
classification problems.
It is the trusted path we take when the outcome is categorical and the destination is decision-making.
Logistic Regression is not merely a statistical tool but a storytelling device that translates numerical
tales into binary outcomes.

#2. Introduction to Logistic Regression


Imagine you are at a crossroads where each path leads to a distinct outcome, and your choice is
binary: yes or no, true or false, A or B.
Logistic regression is the queen in this field of dichotomies.
At its core, Logistic Regression is about probabilities. It measures the likelihood of an event
occurring.
Its main goal? 🎯
Logistic regression aims to find the probability that a given input belongs to a certain class.
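To make that goal concrete, here is a minimal sketch using scikit-learn (a library choice made for this illustration, not part of the original walkthrough) on a tiny made-up dataset of study hours versus pass/fail:

```python
# A minimal sketch: fit a logistic regression and read off class probabilities.
# The data and feature ("hours studied") are hypothetical.
import numpy as np
from sklearn.linear_model import LogisticRegression

X = np.array([[1.0], [2.0], [3.0], [4.0], [5.0], [6.0]])  # hours studied
y = np.array([0, 0, 0, 1, 1, 1])                          # 0 = fail, 1 = pass

model = LogisticRegression()
model.fit(X, y)

# predict_proba returns [P(class 0), P(class 1)] for each input.
print(model.predict_proba([[3.5]]))   # near the boundary, roughly [[0.5, 0.5]]
print(model.predict([[5.5]]))         # hard class decision, e.g. [1]
```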

#3. The Sigmoid Function


Logistic regression is based on the sigmoid function, a mathematical curve that maps any real-
valued input into a value between 0 and 1, suitable for probability interpretation.
This is the probability space where Logistic Regression composes its symphony.

The elegance of this function lies in its simplicity: it takes the linear equation, akin to a straight
road, and bends it into an S-shaped path that gracefully transitions from one state to another.
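A minimal NumPy sketch of that squashing behaviour (the sample inputs are arbitrary):

```python
# The sigmoid maps any real number into the open interval (0, 1).
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

z = np.array([-6.0, -2.0, 0.0, 2.0, 6.0])
print(sigmoid(z))  # roughly [0.0025, 0.119, 0.5, 0.881, 0.9975]
```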
#3.1 From Linear To Logistic Regression
Step 1. Linear Regression Foundation: We begin with the familiar linear regression equation:
Y = Ax + B,
where:
• Y is the dependent variable (the outcome we're trying to predict).
• x is the independent variable (the predictor).
• A and B are the coefficients that represent the slope and the y-intercept of the regression
line, respectively.
The main problem?
Linear regression fits a straight line through the data points, and the values along that line are
unbounded, which does not suit a yes/no outcome.
Step 2. Probability Adjustment: Since linear regression outputs can extend beyond the range of
[0,1], which is not suitable for probability, the equation is adjusted to P = Ax + B to reflect that
P (probability) is being modeled instead of a direct measurement Y.
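A tiny numerical illustration of that range problem, assuming some made-up coefficients A and B:

```python
# With hypothetical coefficients, the raw linear output is not a valid probability.
import numpy as np

A, B = 0.8, -1.5                  # made-up slope and intercept
x = np.array([-2.0, 0.0, 2.0, 5.0])
P_linear = A * x + B
print(P_linear)                   # [-3.1, -1.5, 0.1, 2.5]: values below 0 and above 1
```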

Step 3. Odds Transformation: However, P = Ax + B can still take on values less than 0 or greater
than 1, which is not valid for a probability. To address this, the model is rewritten in terms of the
odds, P / (1 - P), which map probabilities in [0, 1] onto the range [0, ∞).

Step 4. Log Transformation: A log transformation is applied, leading to the equation


log(P / (1 - P)) = Ax + B. (LOGIT function)

This is a pivotal step in moving from linear to logistic regression. This transformation allows us to
model P as a linear combination of x but in the log-odds space, not the probability space.

Step 5. Sigmoid Function Derivation: By rearranging the log-odds equation, we obtain


P = e^(Ax + B) / (1 + e^(Ax + B)).

This equation represents the sigmoid function, which bounds P between 0 and 1. It translates the
linear combination of x into a probability.
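A short sketch tying Steps 3 to 5 together, reusing the same hypothetical coefficients as above: the sigmoid turns the unbounded linear part into a probability, and applying the logit to that probability recovers Ax + B.

```python
# Sigmoid bounds the output in (0, 1); the logit is its inverse.
import numpy as np

A, B = 0.8, -1.5                               # hypothetical coefficients
x = np.array([-2.0, 0.0, 2.0, 5.0])

linear = A * x + B                              # log-odds space: unbounded
P = np.exp(linear) / (1.0 + np.exp(linear))     # Step 5: every value lies in (0, 1)
print(P)

# Sanity check (Step 4): the logit of P recovers A*x + B, up to float rounding.
print(np.log(P / (1.0 - P)))
```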

#4. How to obtain the cost function mathematically?


For a binary classification problem, the model output corresponds to the probability of the
prediction y being:
• y = 1 when the output is one class, let's say A.
• y = 0 when the output is the other class, B.
Of course, this coding could just as well be reversed.
If we define this as our hypothesis, we can mathematically derive the cost function to minimize,
known as binary cross-entropy or log loss.

For a single example with true label y and predicted probability P, this loss is

Loss = -[ y log(P) + (1 - y) log(1 - P) ].

By looking at this loss function we see:

• The loss approaches infinity if we predict incorrectly with high confidence.
• The loss approaches 0 when we predict correctly, and is thus at a minimum.
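A small Python sketch of this behaviour, plugging a few arbitrary probabilities into the loss above for a single example whose true label is y = 1:

```python
# Binary cross-entropy for one example: near-zero when confident and correct,
# exploding when confident and wrong.
import numpy as np

def log_loss_single(y, p):
    return -(y * np.log(p) + (1 - y) * np.log(1 - p))

for p in [0.99, 0.9, 0.5, 0.1, 0.01]:
    print(f"p = {p:4.2f} -> loss = {log_loss_single(1, p):.3f}")
# p = 0.99 gives a loss of about 0.010; p = 0.01 gives about 4.605,
# and the loss grows without bound as p approaches 0.
```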
And the next step is…

#5. Find the optimal solution with Gradient Descent


Gradient descent is a pivotal optimization algorithm used to minimize the cost function, aiding us in
our aim to find the most accurate weight values for our predictive model.
Envision standing atop a hill, your objective is the valley below — this represents our cost
function's minimum point.
To reach it, we begin with initial guesses for our weights, A and B, and iteratively refine these
guesses.
The process is akin to descending a hill: with each step, we assess our surroundings and adjust our
trajectory to ensure each subsequent step brings us closer to the valley floor.
These steps are guided by the learning rate — a vital hyperparameter symbolized as lr in the
equations. This learning rate controls the size of our steps or adjustments to the parameters A and B,
ensuring that we do not overshoot the minimum.
As we take each step, we calculate the partial derivatives of the cost function with respect to A and
B, denoted as dA and dB respectively. These derivatives point us in the direction where the cost
function decreases the fastest, akin to finding the steepest descent on our metaphorical hill.
The updated equations for A and B in each iteration, factoring in the learning rate, are:

A = A - lr · dA
B = B - lr · dB

This process is repeated until we reach a point where the cost function's decrease is negligible,
suggesting we've arrived at or near the global minimum — our destination where the predictive
error is minimized, and our model's accuracy is maximized.
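Putting the whole loop together, here is a minimal, hypothetical sketch of gradient descent for a one-feature logistic regression, following the notation above (A, B, lr, dA, dB); the data is made up purely for illustration:

```python
# Gradient descent on the log loss for a single-feature logistic regression.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])   # hypothetical predictor
y = np.array([0, 0, 0, 1, 1, 1])               # binary outcome

A, B = 0.0, 0.0          # initial guesses for the weights
lr = 0.1                 # learning rate: size of each step downhill

for _ in range(5000):
    p = sigmoid(A * x + B)          # current predicted probabilities
    dA = np.mean((p - y) * x)       # partial derivative of the log loss w.r.t. A
    dB = np.mean(p - y)             # partial derivative of the log loss w.r.t. B
    A -= lr * dA                    # update, scaled by the learning rate
    B -= lr * dB

print(A, B)                    # learned slope and intercept
print(sigmoid(A * 3.5 + B))    # should sit near 0.5 for this made-up boundary
```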

#6. Model Evaluation


To evaluate the performance of our model, there are different approaches:
1. Confusion Matrix: A table used to describe the performance of a classification model. It
categorizes predictions into true positives, true negatives, false positives, and false
negatives. With these counts, we have a clear picture of the model's predictive accuracy and the
nature of its errors.
2. ROC Curve: A graph that illustrates the model's ability to correctly predict the positive class
at various threshold levels, providing insight into the balance between sensitivity and
specificity.
3. AUC: Standing for "Area Under the ROC Curve," this metric quantifies the overall ability
of the model to distinguish between classes, with higher values indicating better performance.
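As a hedged sketch of how these three tools are typically computed in practice, here is one common approach using scikit-learn on hypothetical labels and predicted probabilities:

```python
# Confusion matrix, ROC curve points and AUC for made-up predictions.
import numpy as np
from sklearn.metrics import confusion_matrix, roc_curve, roc_auc_score

y_true = np.array([0, 0, 0, 1, 1, 1, 0, 1])
y_prob = np.array([0.1, 0.4, 0.35, 0.8, 0.7, 0.9, 0.6, 0.3])  # model probabilities
y_pred = (y_prob >= 0.5).astype(int)                           # default 0.5 threshold

print(confusion_matrix(y_true, y_pred))      # rows: actual class, columns: predicted class
fpr, tpr, thresholds = roc_curve(y_true, y_prob)
print(list(zip(fpr, tpr)))                   # points along the ROC curve
print(roc_auc_score(y_true, y_prob))         # area under that curve
```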

#7. The Assumptions Behind the Algorithm


Every model rests on assumptions, and Logistic Regression is no different. It presumes:
• A binary outcome.
• A linear relationship between the log odds and the independent variables.
• Little or no multicollinearity among the independent variables (a quick check is sketched below).
• A sample size large enough to give reliable insights into the patterns of the data.
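One simple, informal way to eyeball the multicollinearity assumption is to inspect pairwise correlations between the independent variables; the sketch below uses made-up data in which one predictor is nearly a copy of another:

```python
# Pairwise correlations as a quick multicollinearity check (synthetic data).
import numpy as np

rng = np.random.default_rng(0)
x1 = rng.normal(size=200)
x2 = 0.95 * x1 + 0.05 * rng.normal(size=200)   # nearly a copy of x1
x3 = rng.normal(size=200)

X = np.column_stack([x1, x2, x3])
print(np.corrcoef(X, rowvar=False))   # the strong x1-x2 correlation is a red flag
```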
