Apply Logistic Regression Model Techniques To Predict Data On Any Dataset

Uploaded by

sonawaneabhishek69


5 - Jupyter Notebook http://localhost:8888/notebooks/Practicals_AI/5.ipynb

5. Apply Logistic Regression Model techniques to predict data on any dataset.

In [20]: import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.preprocessing import StandardScaler

In [21]: df = pd.read_csv("blood_pressure.csv")

In [22]: df.head()

Out[22]: Patient_Number  Blood_Pressure_Abnormality  Level_of_Hemoglobin  Genetic_Pedigree_Coefficient
0                     1                           1                11.28                          0.90
1                     2                           0                 9.75                          0.23
2                     3                           1                10.79                          0.91
3                     4                           0                11.00                          0.43
4                     5                           1                14.17                          0.83

In [23]: df.tail()

Out[23]: Patient_Number  Blood_Pressure_Abnormality  Level_of_Hemoglobin  Genetic_Pedigree_Coefficient
1995               1996                           1                10.14
1996               1997                           1                11.77
1997               1998                           1                16.91
1998               1999                           0                11.15
1999               2000                           1                11.36

In [24]: df.shape

Out[24]: (2000, 15)

1 of 5 30-10-2024, 22:07

In [25]: df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 2000 entries, 0 to 1999
Data columns (total 15 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 Patient_Number 2000 non-null int64
1 Blood_Pressure_Abnormality 2000 non-null int64
2 Level_of_Hemoglobin 2000 non-null float64
3 Genetic_Pedigree_Coefficient 1908 non-null float64
4 Age 2000 non-null int64
5 BMI 2000 non-null int64
6 Sex 2000 non-null int64
7 Pregnancy 442 non-null float64
8 Smoking 2000 non-null int64
9 Physical_activity 2000 non-null int64
10 salt_content_in_the_diet 2000 non-null int64
11 alcohol_consumption_per_day 1758 non-null float64
12 Level_of_Stress 2000 non-null int64
13 Chronic_kidney_disease 2000 non-null int64
14 Adrenal_and_thyroid_disorders 2000 non-null int64
dtypes: float64(4), int64(11)
memory usage: 234.5 KB

In [26]: df.isnull().sum()

Out[26]: Patient_Number 0
Blood_Pressure_Abnormality 0
Level_of_Hemoglobin 0
Genetic_Pedigree_Coefficient 92
Age 0
BMI 0
Sex 0
Pregnancy 1558
Smoking 0
Physical_activity 0
salt_content_in_the_diet 0
alcohol_consumption_per_day 242
Level_of_Stress 0
Chronic_kidney_disease 0
Adrenal_and_thyroid_disorders 0
dtype: int64


In [53]: sns.countplot(x = df.Blood_Pressure_Abnormality)

Out[53]: <Axes: xlabel='Blood_Pressure_Abnormality', ylabel='count'>

In [28]: df['Genetic_Pedigree_Coefficient'] = df.Genetic_Pedigree_Coefficient.fillna(df

In [29]: df['alcohol_consumption_per_day'] = df['alcohol_consumption_per_day'].fillna(df
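The two `fillna` calls above are cut off in this export, so the fill value the notebook actually used is not visible. As a minimal sketch, assuming a median fill (an assumption, not necessarily the author's choice), imputing numeric columns could look like the following. The small frame `demo` holds made-up values for illustration only.

```python
import numpy as np
import pandas as pd

# Hypothetical toy frame; the values are illustrative, not taken from blood_pressure.csv.
demo = pd.DataFrame({
    "Genetic_Pedigree_Coefficient": [0.90, np.nan, 0.91, 0.43],
    "alcohol_consumption_per_day": [100.0, 250.0, np.nan, 50.0],
})

# Assumption: fill each column's gaps with that column's median.
# median() skips NaN values by default.
for col in demo.columns:
    demo[col] = demo[col].fillna(demo[col].median())

print(demo.isnull().sum())
```

A median fill is less sensitive to outliers than a mean fill, which is one reason it is a common default for skewed clinical measurements.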

In [30]: df = df.drop(['Pregnancy'],axis=1)

In [31]: df = df.drop(['Patient_Number'],axis=1)

In [32]: df.isnull().sum()

Out[32]: Blood_Pressure_Abnormality 0
Level_of_Hemoglobin 0
Genetic_Pedigree_Coefficient 0
Age 0
BMI 0
Sex 0
Smoking 0
Physical_activity 0
salt_content_in_the_diet 0
alcohol_consumption_per_day 0
Level_of_Stress 0
Chronic_kidney_disease 0
Adrenal_and_thyroid_disorders 0
dtype: int64

In [33]: X = df.drop(['Blood_Pressure_Abnormality'],axis=1)
y = df['Blood_Pressure_Abnormality']

In [34]: X_train,X_test,y_train,y_test = train_test_split(X,y,test_size=0.2,random_state
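The `random_state` value in the cell above is truncated in this export. The sketch below shows the same kind of 80/20 split on synthetic data, with an arbitrary assumed seed of 42 and the optional `stratify` argument, which the notebook may or may not have used. The names `X_demo` and `y_demo` are hypothetical stand-ins for `X` and `y`.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Synthetic stand-in for X and y: 100 rows, 3 features, 60/40 class balance.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(100, 3))
y_demo = np.array([0] * 60 + [1] * 40)

# random_state=42 is an assumed seed; stratify keeps the class ratio
# the same in the training and test splits.
X_tr, X_te, y_tr, y_te = train_test_split(
    X_demo, y_demo, test_size=0.2, random_state=42, stratify=y_demo
)

print(X_tr.shape, X_te.shape)
print(int(y_te.sum()), "positives in the test split")
```

Stratifying matters most when classes are imbalanced; with a fixed seed the split is also reproducible between runs.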


In [35]: lg = LogisticRegression(C=1438.44988828766,
                                 max_iter=100,
                                 penalty='l2',
                                 solver='liblinear')

In [36]: lg.fit(X_train,y_train)

Out[36]: LogisticRegression(C=1438.44988828766, solver='liblinear')
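`StandardScaler` is imported in the first cell but is not applied in any cell shown here. As a hedged sketch (not the notebook's method), scaling can be combined with the same estimator settings in a pipeline, so that the scaler is fitted on the training split only. The data below comes from `make_classification` and is purely synthetic; `model` and the `*_demo` names are hypothetical.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic data with 12 features, the same feature count as X in the notebook.
X_demo, y_demo = make_classification(n_samples=500, n_features=12, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(
    X_demo, y_demo, test_size=0.2, random_state=0
)

# The pipeline fits the scaler on X_tr only, then reuses those statistics
# when transforming X_te inside predict() and score().
model = make_pipeline(
    StandardScaler(),
    LogisticRegression(C=1438.44988828766, solver="liblinear"),
)
model.fit(X_tr, y_tr)

print(model.score(X_te, y_te))  # accuracy on the held-out split
```

Whether scaling changes the results on the blood pressure data is not something this export can confirm; with such a large `C` the regularization is weak, so the effect may be small.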


In [37]: y_pred = lg.predict(X_test)

In [38]: from sklearn.metrics import confusion_matrix,classification_report,accuracy_score,precision_score,recall_score

In [39]: confusion_matrix(y_test,y_pred)

Out[39]: array([[159,  64],
                [ 49, 128]], dtype=int64)

In [40]: print(classification_report(y_test,y_pred))

              precision    recall  f1-score   support

           0       0.76      0.71      0.74       223
           1       0.67      0.72      0.69       177

    accuracy                           0.72       400
   macro avg       0.72      0.72      0.72       400
weighted avg       0.72      0.72      0.72       400

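In [41]: recall = 128/(128+49)   # TP/(TP+FN), taken from the confusion matrix in Out[39]

In [42]: FPR = 64/(64+159)   # FP/(FP+TN); parentheses are required because / binds tighter than +

In [43]: round(FPR, 3)

Out[43]: 0.287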
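The rates can be checked by hand from the confusion matrix in Out[39], where rows are true labels and columns are predicted labels. The sketch below uses plain Python; the variable names are illustrative. Note that division binds tighter than addition in Python, so an expression such as `64/64+159` evaluates to 160.0 rather than a rate between 0 and 1.

```python
# Confusion matrix from Out[39]: [[TN, FP], [FN, TP]]
tn, fp = 159, 64
fn, tp = 49, 128

recall = tp / (tp + fn)                      # true positive rate
precision = tp / (tp + fp)
fpr = fp / (fp + tn)                         # false positive rate
accuracy = (tp + tn) / (tp + tn + fp + fn)

print(round(recall, 4), round(precision, 4), round(fpr, 4), round(accuracy, 4))
# prints: 0.7232 0.6667 0.287 0.7175
```

The recall, precision, and accuracy values agree with the `recall_score`, `precision_score`, and `accuracy_score` outputs reported in the notebook.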

In [44]: accuracy_score(y_test,y_pred)

Out[44]: 0.7175

In [45]: precision_score(y_test,y_pred)

Out[45]: 0.6666666666666666


In [46]: recall_score(y_test,y_pred)

Out[46]: 0.7231638418079096

In [47]: from sklearn.metrics import roc_curve, roc_auc_score

In [48]: pred_proba = lg.predict_proba(X_test)

In [49]: roc_auc_score(y_test,y_pred)*100   # computed from hard class labels (y_pred), not from pred_proba

Out[49]: 71.80841630564213

In [50]: fpr,tpr,thresholds = roc_curve(y_test,pred_proba[:,-1])

In [51]: plt.plot(fpr,tpr,c = 'g')
plt.xlabel("FPR")
plt.ylabel("TPR")
plt.grid()
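The `roc_auc_score` call in the notebook is given the hard class labels from `predict`, whereas the ROC curve is built from the class-1 probabilities. AUC is usually computed from the same continuous scores as the curve. The sketch below illustrates this on synthetic data from `make_classification`; the names `clf`, `proba_pos`, and the `*_demo` arrays are hypothetical, and the numbers it prints say nothing about the blood pressure dataset.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import auc, roc_auc_score, roc_curve
from sklearn.model_selection import train_test_split

# Synthetic binary classification data.
X_demo, y_demo = make_classification(n_samples=500, n_features=12, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(
    X_demo, y_demo, test_size=0.2, random_state=0
)

clf = LogisticRegression(solver="liblinear").fit(X_tr, y_tr)
proba_pos = clf.predict_proba(X_te)[:, 1]   # probability of class 1

# AUC from probabilities, and the same area integrated from the ROC curve.
auc_from_scores = roc_auc_score(y_te, proba_pos)
fpr, tpr, thresholds = roc_curve(y_te, proba_pos)
auc_from_curve = auc(fpr, tpr)

print(auc_from_scores, auc_from_curve)
```

Both numbers should agree, since the score is the area under the plotted curve. Passing hard labels instead reduces the curve to a single operating point, which typically gives a lower value.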

