0% found this document useful (0 votes)

14 views7 pages

Experiment No 3

Uploaded by

aniruddha.kambleiot

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

14 views7 pages

Experiment No 3

Uploaded by

aniruddha.kambleiot

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

0ICPC455

AY 2024-25 Machine Learning Lab

Quality Laboratory Manual

Experiment No. 3

Study and Implementation of Logistic

Regression

Course Instructor -
DR. TAHSEEN A. MULLA
ASSOCIATE PROFESSOR
QUALITY LABORATORY MANUAL
Prepared by – Dr. Tahseen A. Mulla
Machine Learning Laboratory [0ICPC455]
Final Year – AY 2024-25 [Odd Semester]

Experiment No. 3

Title of Experiment: To study and implement Logistic Regression

Aim of Experiment: To implement and understand the working of Logistic Regression, a

statistical method used for binary classification problems in Machine Learning

System Requirements – Win 8 and above OS, 4GB RAM, 2.33 GHz Processor

Software/s Needed for Experiment – Jupyter Notebook/ Anaconda Navigator/ Google

Colaboratory/ Spyder, Python 3.x [With libraries such as Numpy, Pandas, Matplotlib and Scikit-
Learn]

Experiment Outcomes –
1. Understand the principles and fundamentals of logistic regression as binary classifier
2. Gain insights into how logistic regression fits into the broader landscape of Machine Learning
models
3. Able to evaluate the performance of logistic regression model using appropriate metrics
4. Extend the possibilities to handle multi-class classification problems with logistic regression

Theory –
Logistic Regression is a type of regression analysis used for predicting the outcome of a
categorical dependent variable based on one or more predictor variables (independent variables).
Unlike linear regression, which predicts continuous outcomes, logistic regression predicts a
probability that the dependent variable belongs to a particular category.
Logistic regression uses a logistic function called a sigmoid function to map predictions
and their probabilities. The sigmoid function refers to an S-shaped curve that converts any real
value to a range between 0 and 1.
If the output of the sigmoid function (estimated probability) is greater than a predefined
threshold on the graph, the model predicts that the instance belongs to that class. If the estimated
probability is less than the predefined threshold, the model predicts that the instance does not
belong to the class.
For binary classification, the outcome is often coded as 0 or 1, where 1 typically represents
the presence of an event (e.g., success, yes) and 0 represents its absence (e.g., failure, no).

The sigmoid function is referred to as an activation function for logistic regression and is defined
as:
1
𝑓(𝑥) =
1 + 𝑒 −𝑥
Where,

Study and Implementation of Logistic Regression Page 1 of 6

QUALITY LABORATORY MANUAL
Prepared by – Dr. Tahseen A. Mulla
Machine Learning Laboratory [0ICPC455]
Final Year – AY 2024-25 [Odd Semester]

e = base of natural logarithms

The logistic regression model uses the following logistic function (also known as the
sigmoid function) to map predicted values to probabilities:
1
𝑃(𝑦 = 1) =
1 + 𝑒 −(𝛽0 + 𝛽1 𝑥1+ 𝛽2𝑥2 +⋯+ 𝛽𝑛𝑥𝑛)

where:

 P(y=1) is the probability that the dependent variable y equals 1

 β0 is the intercept
 β1, β2, …, βn are the coefficients for the independent variables x1, x2, … , xn

The model aims to find the best-fit coefficients that maximize the likelihood of observing the
given data.

Type of Logistic Regression:

On the basis of the categories, Logistic Regression can be classified into three types:
1. Binomial: In binomial Logistic regression, there can be only two possible types of the
dependent variables, such as 0 or 1, Pass or Fail, etc.

2. Multinomial: In multinomial Logistic regression, there can be 3 or more possible

unordered types of the dependent variable, such as "cat", "dogs", or "sheep"

3. Ordinal: In ordinal Logistic regression, there can be 3 or more possible ordered types of
dependent variables, such as "low", "Medium", or "High"

Study and Implementation of Logistic Regression Page 2 of 6

QUALITY LABORATORY MANUAL
Prepared by – Dr. Tahseen A. Mulla
Machine Learning Laboratory [0ICPC455]
Final Year – AY 2024-25 [Odd Semester]

Procedure to implement Logistic Regression:

1. Data Collection: Obtain a dataset suitable for binary classification. A dataset such as
predicting whether a student passes or fails based on study hours and attendance

2. Data Preprocessing:
a. Load the dataset into a Pandas DataFrame.
b. Handle missing values, outliers, and encode categorical variables if necessary.
c. Normalize or standardize the data if needed.

3. Exploratory Data Analysis (EDA): Visualize relationships between the dependent and
independent variables using scatter plots and correlation matrices.

4. Splitting the Data: Split the dataset into training and test sets to evaluate the model's
performance.

5. Implementing Multiple Linear Regression:

a. Use the Scikit-learn library to fit a logistic regression model.
b. Train the model using the training data.

6. Model Prediction:
a. Use the trained model to make predictions on the test data.
b. Generate a confusion matrix and classification report to assess accuracy

7. Model Evaluation: Evaluate the model using metrics such as accuracy, precision, recall,
F1-score and ROC-AUC curve

Sample Code:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import confusion_matrix,
classification_report, roc_auc_score, roc_curve

# Step 1: Data Collection (Example dataset)

# Dataset: Predicting whether a student passes or fails based on study hours and attendance

data = {
'Study_Hours': [10, 9, 8, 7, 6, 5, 4, 3, 2, 1],
'Attendance': [90, 85, 80, 75, 70, 65, 60, 55, 50, 45],
'Pass': [1, 1, 1, 1, 1, 0, 0, 0, 0, 0] # 1: Pass, 0: Fail
}

df = pd.DataFrame(data)

Study and Implementation of Logistic Regression Page 3 of 6

QUALITY LABORATORY MANUAL
Prepared by – Dr. Tahseen A. Mulla
Machine Learning Laboratory [0ICPC455]
Final Year – AY 2024-25 [Odd Semester]

# Step 2: Data Preprocessing

# No missing values or categorical variables in this simple dataset.

# Step 3: Exploratory Data Analysis (EDA)

# Plotting scatter plots for independent variables against the dependent variable (Pass/Fail)
plt.figure(figsize=(10, 5))

plt.subplot(1, 2, 1)
plt.scatter(df['Study_Hours'], df['Pass'], color='blue')
plt.xlabel('Study Hours')
plt.ylabel('Pass (1) / Fail (0)')
plt.title('Study Hours vs Pass/Fail')

plt.subplot(1, 2, 2)
plt.scatter(df['Attendance'], df['Pass'], color='green')
plt.xlabel('Attendance (%)')
plt.ylabel('Pass (1) / Fail (0)')
plt.title('Attendance vs Pass/Fail')

plt.show()

# Step 4: Splitting the Data

X = df[['Study_Hours', 'Attendance']] # Independent variables
y = df['Pass'] # Dependent variable

X_train, X_test, y_train, y_test = train_test_split(X, y,

test_size=0.2, random_state=0)

# Step 5: Implementing Logistic Regression

logreg = LogisticRegression()
logreg.fit(X_train, y_train)

# Step 6: Model Prediction

y_pred = logreg.predict(X_test)

# Comparing Actual vs Predicted

comparison_df = pd.DataFrame({'Actual': y_test, 'Predicted':
y_pred})
print(comparison_df)

# Step 7: Model Evaluation

# Confusion Matrix
conf_matrix = confusion_matrix(y_test, y_pred)
print(f"Confusion Matrix:\n{conf_matrix}")

Study and Implementation of Logistic Regression Page 4 of 6

QUALITY LABORATORY MANUAL
Prepared by – Dr. Tahseen A. Mulla
Machine Learning Laboratory [0ICPC455]
Final Year – AY 2024-25 [Odd Semester]

# Classification Report
class_report = classification_report(y_test, y_pred)
print(f"Classification Report:\n{class_report}")

# ROC-AUC Score
roc_auc = roc_auc_score(y_test, y_pred)
print(f"ROC-AUC Score: {roc_auc}")

# ROC Curve
fpr, tpr, thresholds = roc_curve(y_test,
logreg.predict_proba(X_test)[:,1])
plt.plot(fpr, tpr, color='blue')
plt.plot([0, 1], [0, 1], color='red', linestyle='--')
plt.xlabel('False Positive Rate')
plt.ylabel('True Positive Rate')
plt.title('ROC Curve')
plt.show()

# Coefficients of the model

print(f"Intercept (β0): {logreg.intercept_[0]}")
print(f"Coefficients (β1, β2): {logreg.coef_[0]}")

Observations -
 Record the predicted outcomes (pass/fail) for the test data
 Observe the performance of the model using the confusion matrix and classification report

Conclusion –
Hence, the model summarize the findings from the experiment, such as the relationship
between study hours, attendance and the likelihood of passing

References –
a. Textbook –
i. Machine Learning with Python – An approach to Applied ML – Abhishek
Vijayvargiya, BPB Publications, 1st Edition 2018
ii. Machine Learning, Tom Mitchell, McGraw Hill Education, 1st Edition 1997
b. Online references –
i. https://www.analyticsvidhya.com/blog/2021/08/conceptual-understanding-of-logistic-
regression-for-data-science-beginners/
ii. https://www.simplilearn.com/tutorials/machine-learning-tutorial/logistic-regression-
in-python
iii. https://www.kaggle.com/code/nargisbegum82/logistic-regression-in-machine-
learning

Study and Implementation of Logistic Regression Page 5 of 6

QUALITY LABORATORY MANUAL
Prepared by – Dr. Tahseen A. Mulla
Machine Learning Laboratory [0ICPC455]
Final Year – AY 2024-25 [Odd Semester]

Expected Oral Questions –

1. What is Logistic Regression and how does it differ from Linear Regression?
2. When would you use Logistic Regression instead of Linear Regression?
3. Explain the logistic function and how it is used in Logistic Regression?
4. What is the range of the output of a Logistic Regression model and what does it represent?
5. How do you handle the categorical predictors in Logistic Regression?
6. What are some methods to assess the performance of a Logistic Regression model?
7. What is the purpose of using a confusion matrix in the context of Logistic Regression?
8. How can you handle imbalanced datasets in Logistic Regression?
9. What is the difference between binary logistic regression and multinomial logistic
regression?
10. State a basic example for Logisitc Regression?

FAQ’s in Interview –
1. What is Logistic Regression and how does it differ from Linear Regression?
2. Explain the concept of the logit function in Logistic Regression?
3. How is Logistic Regression used for classification tasks?
4. What is sigmoid function and why is it important in Logistic Regression?
5. How do you interpret the coefficients of Logistic Regression model?

Study and Implementation of Logistic Regression Page 6 of 6

Module-2 - Logistic Regression in Machine Learning
No ratings yet
Module-2 - Logistic Regression in Machine Learning
28 pages
ML Lab Manual
100% (1)
ML Lab Manual
37 pages
EPPP Statistics and Research Design
No ratings yet
EPPP Statistics and Research Design
17 pages
Exp2 Milf
No ratings yet
Exp2 Milf
7 pages
Logistic Regression
100% (1)
Logistic Regression
10 pages
Logistic Regression
100% (2)
Logistic Regression
30 pages
ML Unit 3
No ratings yet
ML Unit 3
40 pages
Machine Learning Lab Manual 06
100% (1)
Machine Learning Lab Manual 06
8 pages
Business Statistics,: 9e, GE (Groebner/Shannon/Fry) Chapter 3 Describing Data Using Numerical Measures
No ratings yet
Business Statistics,: 9e, GE (Groebner/Shannon/Fry) Chapter 3 Describing Data Using Numerical Measures
43 pages
Broadly, There Are 3 Types of Machine Learning Algorithms.
No ratings yet
Broadly, There Are 3 Types of Machine Learning Algorithms.
33 pages
A Levels Stats 2 Chapter 6
100% (1)
A Levels Stats 2 Chapter 6
19 pages
Logistic Regression Lecture Notes
No ratings yet
Logistic Regression Lecture Notes
11 pages
Logistic Regression
No ratings yet
Logistic Regression
13 pages
Introduction To Logistics Regression.
No ratings yet
Introduction To Logistics Regression.
4 pages
Logistic Regression in R and Python
No ratings yet
Logistic Regression in R and Python
9 pages
Logistic Regression
No ratings yet
Logistic Regression
25 pages
Chapter 4 Statistical Classification Methods
No ratings yet
Chapter 4 Statistical Classification Methods
63 pages
Dav Exp4 66
No ratings yet
Dav Exp4 66
5 pages
Practical - Logistic Regression
No ratings yet
Practical - Logistic Regression
84 pages
Rain in Australia Logistic Regression Classifier
No ratings yet
Rain in Australia Logistic Regression Classifier
10 pages
Supervised Machine Learning
No ratings yet
Supervised Machine Learning
56 pages
Logistic Regression in Machine Learning
No ratings yet
Logistic Regression in Machine Learning
11 pages
Wa0004.
No ratings yet
Wa0004.
9 pages
Logistic Regression
No ratings yet
Logistic Regression
18 pages
B24 ML Exp-1
No ratings yet
B24 ML Exp-1
10 pages
ML DSBA Lab2
No ratings yet
ML DSBA Lab2
4 pages
Logistic Regression
No ratings yet
Logistic Regression
6 pages
Logistic Regression
No ratings yet
Logistic Regression
14 pages
Tolerances and Resultant Fits - SKF PDF
No ratings yet
Tolerances and Resultant Fits - SKF PDF
4 pages
Logistic Regression
No ratings yet
Logistic Regression
25 pages
Regression Analysis
No ratings yet
Regression Analysis
16 pages
ML Exp 8
No ratings yet
ML Exp 8
22 pages
AI Lab8
No ratings yet
AI Lab8
8 pages
Logistic Regression
No ratings yet
Logistic Regression
21 pages
ML Record
No ratings yet
ML Record
6 pages
Logistic Regression Algorithm
No ratings yet
Logistic Regression Algorithm
8 pages
29 - ML Exp - 03
No ratings yet
29 - ML Exp - 03
4 pages
Logistic Regression
No ratings yet
Logistic Regression
12 pages
ML-Unit 4
No ratings yet
ML-Unit 4
29 pages
Task 1
No ratings yet
Task 1
7 pages
Write A Lab Report On Linear Regression and Logistic Regression. Include The Cost Function Differentiation and The Code in The Report.
No ratings yet
Write A Lab Report On Linear Regression and Logistic Regression. Include The Cost Function Differentiation and The Code in The Report.
7 pages
Session 9-Logistic Regression
No ratings yet
Session 9-Logistic Regression
33 pages
Mla 4
No ratings yet
Mla 4
2 pages
Linear and Logistic Regression
No ratings yet
Linear and Logistic Regression
21 pages
B-56 Sanket Jambhulkar MLA-3
No ratings yet
B-56 Sanket Jambhulkar MLA-3
7 pages
Logistic Regression
No ratings yet
Logistic Regression
25 pages
Logistic Regression
No ratings yet
Logistic Regression
6 pages
Lab 4: Logistic Regression: PSTAT 131/231, Winter 2019
No ratings yet
Lab 4: Logistic Regression: PSTAT 131/231, Winter 2019
10 pages
Misc 5
No ratings yet
Misc 5
1 page
AIML - Lab7 - Manual (Model Eval-Cross Validation)
No ratings yet
AIML - Lab7 - Manual (Model Eval-Cross Validation)
6 pages
Logistic Regression
No ratings yet
Logistic Regression
21 pages
Shivansh Exp7
No ratings yet
Shivansh Exp7
5 pages
B55 MLExp 1
No ratings yet
B55 MLExp 1
4 pages
Experiment 5B - Minor
No ratings yet
Experiment 5B - Minor
1 page
Practical - 4 Aim:: Experiment
No ratings yet
Practical - 4 Aim:: Experiment
5 pages
ML Lab Programs
No ratings yet
ML Lab Programs
9 pages
Chp2 Logistic Regression
No ratings yet
Chp2 Logistic Regression
6 pages
Intro To Linear and Logistic Reg
No ratings yet
Intro To Linear and Logistic Reg
5 pages
09 23ECE216 LogisticRegression
No ratings yet
09 23ECE216 LogisticRegression
40 pages
S6 Skewness2
No ratings yet
S6 Skewness2
42 pages
Logistic Regression
No ratings yet
Logistic Regression
36 pages
DTS 101 Lecture 3
No ratings yet
DTS 101 Lecture 3
21 pages
Exercise 4
No ratings yet
Exercise 4
3 pages
Short Term Water Demand Forecast Modelling Using Artificial Intelligence For Smart Water Management
No ratings yet
Short Term Water Demand Forecast Modelling Using Artificial Intelligence For Smart Water Management
22 pages
ML TW-PW 01
No ratings yet
ML TW-PW 01
5 pages
Past 5 Manual
No ratings yet
Past 5 Manual
314 pages
NAMJ 15 (3) Sept 2020-11
No ratings yet
NAMJ 15 (3) Sept 2020-11
17 pages
Classification and Prediction
No ratings yet
Classification and Prediction
81 pages
6 Correlation Analysis - Solving For R
No ratings yet
6 Correlation Analysis - Solving For R
32 pages
Exam
No ratings yet
Exam
90 pages
9709 s20 QP 31-Solved (Handwritten)
No ratings yet
9709 s20 QP 31-Solved (Handwritten)
12 pages
Week 2 Homework - Summer 2020: Attempt History
No ratings yet
Week 2 Homework - Summer 2020: Attempt History
27 pages
Warranty Data Analysis: A Review: Shaomin Wu
No ratings yet
Warranty Data Analysis: A Review: Shaomin Wu
21 pages
Stationary and Nonstationary Series: T y y E T y S S T y T y S T y T y T y
No ratings yet
Stationary and Nonstationary Series: T y y E T y S S T y T y S T y T y T y
17 pages
MAE 108 - Probability and Statistical Methods For Engineers - Spring 2014 Final Exam, June 10 Instructions
No ratings yet
MAE 108 - Probability and Statistical Methods For Engineers - Spring 2014 Final Exam, June 10 Instructions
8 pages
Session 09 - BS - 2020-Z Score
No ratings yet
Session 09 - BS - 2020-Z Score
32 pages
Group 1 Ba165 Pilot Testing Result
No ratings yet
Group 1 Ba165 Pilot Testing Result
24 pages
Activity # 1 - Statistics and Data Handling in Analytical Chemistry Treatment of Data
No ratings yet
Activity # 1 - Statistics and Data Handling in Analytical Chemistry Treatment of Data
4 pages
METHODS OF DETE-WPS Office
No ratings yet
METHODS OF DETE-WPS Office
8 pages
47-Article Text-103-1-10-20220403
No ratings yet
47-Article Text-103-1-10-20220403
16 pages
Ceng317 Gc32 Final Exam: Two-Way Anova
No ratings yet
Ceng317 Gc32 Final Exam: Two-Way Anova
6 pages
Annex 19 - Agenda 14.1.2 (II) Doc 23 - 14.1.1 Report Joint Progeny Trial Progress 2018 Malaysia ACC-21 VietNam (21st Meeting)
No ratings yet
Annex 19 - Agenda 14.1.2 (II) Doc 23 - 14.1.1 Report Joint Progeny Trial Progress 2018 Malaysia ACC-21 VietNam (21st Meeting)
9 pages
21 Goodness of Fit
No ratings yet
21 Goodness of Fit
16 pages
(ST-APP) Summary of probability distributions: n x π, x = 0, 1, - . -, n
No ratings yet
(ST-APP) Summary of probability distributions: n x π, x = 0, 1, - . -, n
2 pages
STAT5002 Midterm Review Solutions N
No ratings yet
STAT5002 Midterm Review Solutions N
8 pages
Output - Group - Work - Project - 4652 - GWP1.ipynb - Colaboratory
No ratings yet
Output - Group - Work - Project - 4652 - GWP1.ipynb - Colaboratory
6 pages
Forecasting
No ratings yet
Forecasting
6 pages
The Purpose of This Feasibility Study Is To Forecast The Sales of Renewable Stationary Generators Over The Next Three Years
No ratings yet
The Purpose of This Feasibility Study Is To Forecast The Sales of Renewable Stationary Generators Over The Next Three Years
2 pages
Mindmap QUANT - M6
No ratings yet
Mindmap QUANT - M6
1 page
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
From Everand
Acceptance-Rejection Sampling and Multi-dimensional Monte Carlo Integrations Utilizing Mathematica®
SUJAUL CHOWDHURY
No ratings yet

Experiment No 3

Uploaded by

Experiment No 3

Uploaded by

0ICPC455

AY 2024-25 Machine Learning Lab

Study and Implementation of Logistic

Title of Experiment: To study and implement Logistic Regression

Aim of Experiment: To implement and understand the working of Logistic Regression, a

Software/s Needed for Experiment – Jupyter Notebook/ Anaconda Navigator/ Google

Study and Implementation of Logistic Regression Page 1 of 6

e = base of natural logarithms

 P(y=1) is the probability that the dependent variable y equals 1

Type of Logistic Regression:

2. Multinomial: In multinomial Logistic regression, there can be 3 or more possible

Study and Implementation of Logistic Regression Page 2 of 6

Procedure to implement Logistic Regression:

5. Implementing Multiple Linear Regression:

# Step 1: Data Collection (Example dataset)

Study and Implementation of Logistic Regression Page 3 of 6

# Step 2: Data Preprocessing

# Step 3: Exploratory Data Analysis (EDA)

# Step 4: Splitting the Data

X_train, X_test, y_train, y_test = train_test_split(X, y,

# Step 5: Implementing Logistic Regression

# Step 6: Model Prediction

# Comparing Actual vs Predicted

# Step 7: Model Evaluation

Study and Implementation of Logistic Regression Page 4 of 6

# Coefficients of the model

Study and Implementation of Logistic Regression Page 5 of 6

Expected Oral Questions –

Study and Implementation of Logistic Regression Page 6 of 6

You might also like