SMDS Unit 5
Logistic Regression
Syllabus:
The classification problem
Logistic Regression Setup
Interpreting the results
Comparing models
Classification using Logistic Regression
The Classification Problem:
Classification is a supervised machine learning technique
used to predict categorical outcomes.
In simple terms, it is a way of sorting things into different
groups.
It involves identifying which category (class) an
observation belongs to.
Example: Sorting emails as "Spam" or "Not Spam."
Helps in decision-making, like predicting diseases or
fraud detection.
Used in AI, machine learning, and daily applications.
Types of Classification:
Binary Classification:
Only two groups (e.g., Pass/Fail, Yes/No).
Multi-Class Classification:
More than two groups (e.g., Cat/Dog/Rabbit).
Multi-Label Classification:
One item can belong to multiple groups
(e.g., A movie can be both Action &
Comedy).
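A rough sketch of how these three types differ in the shape of the target labels; the arrays below are made-up toy examples:

```python
import numpy as np

# Binary classification: one label per item, only two possible values
# (e.g., Fail = 0, Pass = 1)
y_binary = np.array([0, 1, 1, 0, 1])

# Multi-class classification: one label per item, more than two possible values
# (e.g., Cat = 0, Dog = 1, Rabbit = 2)
y_multiclass = np.array([0, 2, 1, 1, 0])

# Multi-label classification: several labels can apply at once
# (columns could stand for Action and Comedy; a movie can be both)
y_multilabel = np.array([[1, 0],
                         [1, 1],
                         [0, 1]])
```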
Working:
A computer looks at past data.
It learns patterns and applies them to new data.
Example: A bank can predict whether a loan will be repaid.
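A minimal sketch of this idea with scikit-learn; the customer numbers and repayment labels are made up for illustration:

```python
from sklearn.linear_model import LogisticRegression

# Past data: [income in thousands, credit score] for previous customers
X_past = [[25, 580], [48, 700], [60, 720], [30, 610], [75, 760], [22, 540]]
y_past = [0, 1, 1, 0, 1, 0]          # 1 = loan repaid, 0 = not repaid

# The model learns patterns from the past data...
model = LogisticRegression()
model.fit(X_past, y_past)

# ...and applies them to a new customer it has never seen
new_customer = [[50, 690]]
print(model.predict(new_customer))   # e.g., [1] -> likely to repay
```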
Real-Life Examples:
Face Recognition
(e.g., Unlocking your phone).
Medical Diagnosis
(e.g., Checking if a person has a disease).
Online Shopping
(e.g., Recommending products based on your interests).
General Steps in Classification:
Step 1:
Collect Data (e.g., Student’s study hours & exam results).
Step 2:
Clean and prepare the data (remove errors, missing
values).
Step 3:
Split data into two parts
– Training Set & Testing Set.
Step 4:
Train the model (let the computer learn from training
data).
Step 5:
Test the model (check how well it predicts on new data).
Step 6:
Evaluate performance using accuracy, precision, recall,
etc.
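A minimal sketch of these six steps with scikit-learn, assuming a made-up study-hours dataset:

```python
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, precision_score, recall_score

# Steps 1-2: collected and cleaned data (study hours -> pass/fail)
X = [[1], [2], [3], [4], [5], [6], [7], [8], [9], [10]]
y = [0, 0, 0, 0, 1, 0, 1, 1, 1, 1]

# Step 3: split into a training set and a testing set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# Step 4: train the model on the training data
model = LogisticRegression().fit(X_train, y_train)

# Step 5: test the model on data it has not seen before
y_pred = model.predict(X_test)

# Step 6: evaluate performance
print("Accuracy :", accuracy_score(y_test, y_pred))
print("Precision:", precision_score(y_test, y_pred, zero_division=0))
print("Recall   :", recall_score(y_test, y_pred, zero_division=0))
```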
Example Interpretation:
Suppose a model predicts if a student will pass an exam:
Probability = 0.92 → Predict "Pass"
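A small sketch of reading off such a probability with scikit-learn's predict_proba; the study-hours data are invented:

```python
from sklearn.linear_model import LogisticRegression

# Tiny made-up dataset: hours studied -> pass (1) / fail (0)
X = [[1], [2], [3], [6], [7], [8]]
y = [0, 0, 0, 1, 1, 1]
model = LogisticRegression().fit(X, y)

# Predicted probability of passing for a student who studied 7.5 hours
p_pass = model.predict_proba([[7.5]])[0, 1]
print(f"P(pass) = {p_pass:.2f}")
print("Predict 'Pass'" if p_pass > 0.5 else "Predict 'Fail'")
```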
Extensions:
Multinomial logistic regression for more than two classes
Regularized logistic regression (L1, L2) for feature selection or
to avoid overfitting
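In scikit-learn, these extensions correspond roughly to constructor options; the settings below are illustrative defaults, not tuned values:

```python
from sklearn.linear_model import LogisticRegression

# Multinomial logistic regression for more than two classes
# (the classes are inferred from y when fit() is called)
multinomial_model = LogisticRegression(solver="lbfgs", max_iter=1000)

# L2-regularized logistic regression: shrinks coefficients to avoid overfitting
l2_model = LogisticRegression(penalty="l2", C=1.0)

# L1-regularized logistic regression: can set coefficients exactly to zero,
# which acts as a form of feature selection
l1_model = LogisticRegression(penalty="l1", solver="liblinear", C=1.0)
```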
Logistic regression predicts the probability of an event occurring (e.g., "Spam"
or "Not Spam").
If the predicted probability > 0.5, classify as Class 1; otherwise classify as Class 0.
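The probability comes from applying the sigmoid (logistic) function to a linear combination of the features; a minimal sketch with made-up coefficients:

```python
import numpy as np

def sigmoid(z):
    # Maps any real number to a probability between 0 and 1
    return 1.0 / (1.0 + np.exp(-z))

# Hypothetical fitted coefficients: intercept b0 and one feature weight b1
b0, b1 = -4.0, 1.2
x = 5.0                       # e.g., hours studied

p = sigmoid(b0 + b1 * x)      # predicted probability of Class 1
label = 1 if p > 0.5 else 0   # apply the 0.5 decision threshold
print(p, label)               # ~0.88 -> Class 1
```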
Steps in Classification Using Logistic Regression:
Step 1:
Collect Data
Example: A bank wants to classify whether a customer will repay a
loan (Yes/No).
Data includes income, credit score, loan amount, etc.
Step 2:
Preprocess Data
Handle missing values and remove unnecessary features.
Convert categorical data (e.g., "Male/Female") into numerical
format.
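A small sketch of this preprocessing with pandas; the column names and values are hypothetical:

```python
import pandas as pd

# Hypothetical raw loan data with one categorical column
df = pd.DataFrame({
    "income":       [25000, 48000, 60000],
    "gender":       ["Male", "Female", "Male"],
    "credit_score": [580, 700, 720],
})

# Drop rows with missing values, then one-hot encode the categorical column
df = df.dropna()
df = pd.get_dummies(df, columns=["gender"], drop_first=True)
print(df)   # 'gender' becomes an indicator column such as 'gender_Male'
```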
Step 3:
Split Data
Divide data into Training Set (80%) and Testing Set (20%).
The model learns patterns from the training data.
Step 4:
Train the Model
Use the Sigmoid function to predict probabilities.
Adjust model parameters to improve accuracy.
Step 5:
Make Predictions
Apply the trained model to new data.
If probability > 0.5, classify as "Yes";
otherwise, classify as "No."
Step 6:
Evaluate Performance
Check accuracy, precision, recall, and F1-score.
Use a Confusion Matrix to see correct vs. incorrect
predictions.
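A sketch of this evaluation step with scikit-learn's metrics; y_test and y_pred below are short made-up label lists standing in for the real test labels and model predictions:

```python
from sklearn.metrics import (accuracy_score, precision_score, recall_score,
                             f1_score, confusion_matrix)

# Hypothetical true labels from the test set and the model's predictions
y_test = [1, 0, 1, 1, 0, 0, 1, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 0, 1, 1, 1, 0]

print("Accuracy :", accuracy_score(y_test, y_pred))
print("Precision:", precision_score(y_test, y_pred))
print("Recall   :", recall_score(y_test, y_pred))
print("F1-score :", f1_score(y_test, y_pred))

# Confusion matrix: rows = actual class, columns = predicted class
print(confusion_matrix(y_test, y_pred))
```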
Applications of Logistic Regression in Classification:
Logistic regression is widely used for classification tasks,
particularly when the target variable is binary (e.g., yes/no,
spam/not spam, disease/no disease).
1. Medical Diagnosis
Application: Predicting whether a patient has a disease (e.g.,
cancer, diabetes) based on symptoms, lab results, or other
medical parameters.
Example: Predicting the presence of heart disease using
features like age, cholesterol, blood pressure, etc.
2. Fraud Detection
Application: Classifying financial transactions as fraudulent or
legitimate.
Example: Features could include transaction amount,
location, time, and user behavior patterns.
3. Churn Prediction
Application: Predicting whether a customer will stop using a
service or product.
Example: Telecom companies use logistic regression to
identify customers likely to cancel their plans.
4. Image Recognition (Binary Classification)
Application: Classifying simple images into two categories
(e.g., cat vs. not-cat).
Example: Flattened pixel values serve as features in a logistic
regression model.
5. Text Classification
Application: Classifying short texts, such as tweets, into
categories like positive/negative sentiment.
Example: Logistic regression can handle bag-of-words or TF-
IDF features for this purpose.
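A small sentiment-classification sketch along these lines; the example texts and labels are invented:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Made-up short texts with sentiment labels (1 = positive, 0 = negative)
texts = ["loved this movie", "great product, works well",
         "terrible service", "worst purchase ever",
         "really happy with it", "very disappointing"]
labels = [1, 1, 0, 0, 1, 0]

# TF-IDF features feeding a logistic regression classifier
clf = make_pipeline(TfidfVectorizer(), LogisticRegression())
clf.fit(texts, labels)

print(clf.predict(["happy with the service", "this was terrible"]))
```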
Classification Workflow:
Data Preprocessing
Splitting Data into Training and Testing Sets
Model Training
Model Evaluation
Decision Threshold:
If probability > 0.5 → Class 1
If probability ≤ 0.5 → Class 0
Important Metrics:
Accuracy
Precision
Recall
F1 Score
Confusion Matrix
Comparing Models:
Candidate models are trained on the same training data and compared on the same test data using the metrics above (accuracy, precision, recall, F1 score, confusion matrix), as in the sketch below.
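An illustrative comparison of two logistic regression variants, trained on the same training set and scored on the same test set; scikit-learn's built-in breast cancer dataset is used purely as an example:

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, f1_score

X, y = load_breast_cancer(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)

# Two candidate models trained on the same training data...
candidates = {
    "L2-regularized": LogisticRegression(solver="liblinear", penalty="l2", C=1.0),
    "L1-regularized": LogisticRegression(solver="liblinear", penalty="l1", C=1.0),
}

# ...and compared on the same test data with the same metrics
for name, model in candidates.items():
    model.fit(X_train, y_train)
    y_pred = model.predict(X_test)
    print(f"{name}: accuracy = {accuracy_score(y_test, y_pred):.3f}, "
          f"F1 = {f1_score(y_test, y_pred):.3f}")
```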
Steps in Classification:
Import Libraries
Load Dataset
Data Cleaning
Feature Scaling
Model Training
Predictions
Model Evaluation
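A sketch that follows these steps in order, again on scikit-learn's built-in breast cancer dataset (any binary dataset would do):

```python
# Import libraries
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import classification_report

# Load dataset
X, y = load_breast_cancer(return_X_y=True)

# Data cleaning (this dataset has no missing values; the check is shown for completeness)
assert not np.isnan(X).any()

# Feature scaling, fitted on the training data only
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
scaler = StandardScaler().fit(X_train)
X_train, X_test = scaler.transform(X_train), scaler.transform(X_test)

# Model training
model = LogisticRegression(max_iter=1000).fit(X_train, y_train)

# Predictions
y_pred = model.predict(X_test)

# Model evaluation
print(classification_report(y_test, y_pred))
```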