0% found this document useful (0 votes)

20 views6 pages

LP Practical ! Jupyter Notebook

Uploaded by

xifavo8319

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

20 views6 pages

LP Practical ! Jupyter Notebook

Uploaded by

xifavo8319

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

In [1]: import os

os.getcwd()

Out[1]: 'C:\\Users\\kunal'

In [2]: import pandas as pd # used to access data which in table format

In [5]: #import the database

df = pd.read_csv('Heart.csv')

In [6]: df.head() #df stands for data frame (it shows the first 5 entry of dataset)

Out[6]:
Unnamed: 0 Age Sex ChestPain RestBP Chol Fbs RestECG MaxHR ExAng Oldpeak Slope Ca Thal AHD

0 1 63 1 typical 145 233 1 2 150 0 2.3 3 0.0 fixed No

1 2 67 1 asymptomatic 160 286 0 2 108 1 1.5 2 3.0 normal Yes

2 3 67 1 asymptomatic 120 229 0 2 129 1 2.6 2 2.0 reversable Yes

3 4 37 1 nonanginal 130 250 0 0 187 0 3.5 3 0.0 normal No

4 5 41 0 nontypical 130 204 0 2 172 0 1.4 1 0.0 normal No

In [7]: #shape find no. of rows and columns

df.shape

Out[7]: (303, 15)

In [8]: # Finding missing values

df.isnull()

Out[8]:
Unnamed: 0 Age Sex ChestPain RestBP Chol Fbs RestECG MaxHR ExAng Oldpeak Slope Ca Thal AHD

0 False False False False False False False False False False False False False False False

1 False False False False False False False False False False False False False False False

2 False False False False False False False False False False False False False False False

3 False False False False False False False False False False False False False False False

4 False False False False False False False False False False False False False False False

... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...

298 False False False False False False False False False False False False False False False

299 False False False False False False False False False False False False False False False

300 False False False False False False False False False False False False False False False

301 False False False False False False False False False False False False False False False

302 False False False False False False False False False False False False True False False

303 rows × 15 columns

In [9]: # view in summary format false 0 true 1 we add every column
df.isnull().sum()

Out[9]: Unnamed: 0 0
Age 0
Sex 0
ChestPain 0
RestBP 0
Chol 0
Fbs 0
RestECG 0
MaxHR 0
ExAng 0
Oldpeak 0
Slope 0
Ca 4
Thal 2
AHD 0
dtype: int64

In [10]: # we can use other method this gives the not null values
df.count()

Out[10]: Unnamed: 0 303

Age 303
Sex 303
ChestPain 303
RestBP 303
Chol 303
Fbs 303
RestECG 303
MaxHR 303
ExAng 303
Oldpeak 303
Slope 303
Ca 299
Thal 301
AHD 303
dtype: int64

In [11]: # find data type of each column by using attribute not method
df.dtypes

Out[11]: Unnamed: 0 int64

Age int64
Sex int64
ChestPain object
RestBP int64
Chol int64
Fbs int64
RestECG int64
MaxHR int64
ExAng int64
Oldpeak float64
Slope int64
Ca float64
Thal object
AHD object
dtype: object
In [12]: # find where out zeros in column mark 0 as true
df==0

Out[12]:
Unnamed: 0 Age Sex ChestPain RestBP Chol Fbs RestECG MaxHR ExAng Oldpeak Slope Ca Thal AHD

0 False False False False False False False False False True False False True False False

1 False False False False False False True False False False False False False False False

2 False False False False False False True False False False False False False False False

3 False False False False False False True True False True False False True False False

4 False False True False False False True False False True False False True False False

... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...

298 False False False False False False True True False True False False True False False

299 False False False False False False False True False True False False False False False

300 False False False False False False True True False False False False False False False

301 False False True False False False True False False True True False False False False

302 False False False False False False True True False True True False False False False

303 rows × 15 columns

In [13]: # to highlight zeros

df[df ==0]

Out[13]:
Unnamed: 0 Age Sex ChestPain RestBP Chol Fbs RestECG MaxHR ExAng Oldpeak Slope Ca Thal AHD

0 NaN NaN NaN NaN NaN NaN NaN NaN NaN 0.0 NaN NaN 0.0 NaN NaN

1 NaN NaN NaN NaN NaN NaN 0.0 NaN NaN NaN NaN NaN NaN NaN NaN

2 NaN NaN NaN NaN NaN NaN 0.0 NaN NaN NaN NaN NaN NaN NaN NaN

3 NaN NaN NaN NaN NaN NaN 0.0 0.0 NaN 0.0 NaN NaN 0.0 NaN NaN

4 NaN NaN 0.0 NaN NaN NaN 0.0 NaN NaN 0.0 NaN NaN 0.0 NaN NaN

... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...

298 NaN NaN NaN NaN NaN NaN 0.0 0.0 NaN 0.0 NaN NaN 0.0 NaN NaN

299 NaN NaN NaN NaN NaN NaN NaN 0.0 NaN 0.0 NaN NaN NaN NaN NaN

300 NaN NaN NaN NaN NaN NaN 0.0 0.0 NaN NaN NaN NaN NaN NaN NaN

301 NaN NaN 0.0 NaN NaN NaN 0.0 NaN NaN 0.0 0.0 NaN NaN NaN NaN

302 NaN NaN NaN NaN NaN NaN 0.0 0.0 NaN 0.0 0.0 NaN NaN NaN NaN

303 rows × 15 columns

In [14]: # count number of zeros in each column

df[df==0].count()

Out[14]: Unnamed: 0 0
Age 0
Sex 97
ChestPain 0
RestBP 0
Chol 0
Fbs 258
RestECG 151
MaxHR 0
ExAng 204
Oldpeak 99
Slope 0
Ca 176
Thal 0
AHD 0
dtype: int64
In [15]: # find mean age from age column so we first list the all columns name
df.columns

Out[15]: Index(['Unnamed: 0', 'Age', 'Sex', 'ChestPain', 'RestBP', 'Chol', 'Fbs',

'RestECG', 'MaxHR', 'ExAng', 'Oldpeak', 'Slope', 'Ca', 'Thal', 'AHD'],
dtype='object')

In [16]: # accessing age column called as label based listing and also want to find mean hence .mean()
df['Age'].mean()

Out[16]: 54.43894389438944

In [21]: # extracting given columns only for more than one column use double brackets
newdf =df[['Age' , 'Sex' , 'ChestPain' , 'Chol']]

In [22]: #store above data in one variable and show it

newdf

Out[22]:
Age Sex ChestPain Chol

0 63 1 typical 233

1 67 1 asymptomatic 286

2 67 1 asymptomatic 229

3 37 1 nonanginal 250

4 41 0 nontypical 204

... ... ... ... ...

298 45 1 typical 264

299 68 1 asymptomatic 193

300 57 1 asymptomatic 131

301 57 0 nontypical 236

302 38 1 nonanginal 175

303 rows × 4 columns

In [24]: # for cross validation we pass 75% data for training sklearn is library in which train_test method is present
#cross validation
from sklearn.model_selection import train_test_split

In [26]: train, test= train_test_split(df, random_state=0 ,test_size=0.25) # we can give any random state to shuffle da
# by default also size is given as 75% and 25%

In [27]: train.shape

Out[27]: (227, 15)

In [28]: test.shape

Out[28]: (76, 15)

In [29]: import numpy as np # import if you want to create array we take some randdom data for testing

In [30]: actual=list(np.ones(45)) + list(np.zeros(55)) # create array as actual ones mesans aray of 1,1,1....
#zeros for remaining 55 values
In [31]: np.array(actual)

Out[31]: array([1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 0., 0., 0., 0., 0., 0.,
0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.])

In [32]: predicted=list(np.ones(40)) + list(np.zeros(52)) + list(np.ones(8))

np.array(predicted)

Out[32]: array([1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1., 1.,
1., 1., 1., 1., 1., 1., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0., 0.,
0., 0., 0., 0., 0., 0., 0., 1., 1., 1., 1., 1., 1., 1., 1.])

In [33]: from sklearn.metrics import ConfusionMatrixDisplay

In [36]: ConfusionMatrixDisplay.from_predictions(actual,predicted)

Out[36]: <sklearn.metrics._plot.confusion_matrix.ConfusionMatrixDisplay at 0x1f7fe3394f0>

In [38]: # in above matrix 47 and 40 values are matching

from sklearn.metrics import classification_report

In [39]: print(classification_report(actual, predicted))

precision recall f1-score support

0.0 0.90 0.85 0.88 55

1.0 0.83 0.89 0.86 45

accuracy 0.87 100

macro avg 0.87 0.87 0.87 100
weighted avg 0.87 0.87 0.87 100
In [40]: # recall indicate accuracy for individual class out of 55 i.e 44+8 , 47 are matching hence 47/55=0.85
#40/45=0.89

#for precison 47+5= 52 , 47/52=0.90

#f1-score is mean of 0.90 and 0.85 (harmonic mean)

# direct formula for accuracy
from sklearn.metrics import accuracy_score
accuracy_score(actual , predicted)

Out[40]: 0.87

In [ ]:

SPF Admin Course - Modeling and Mapping PDF
100% (1)
SPF Admin Course - Modeling and Mapping PDF
778 pages
Basic Python Book PDF
No ratings yet
Basic Python Book PDF
41 pages
Fitzpatrick Dermatology
100% (4)
Fitzpatrick Dermatology
2,576 pages
Beginners Guide To Making Money Online
100% (8)
Beginners Guide To Making Money Online
129 pages
Computer Graphics Programs Using C
No ratings yet
Computer Graphics Programs Using C
73 pages
Understanding VAPT - Non Technical Guide To Cybersecurity
No ratings yet
Understanding VAPT - Non Technical Guide To Cybersecurity
32 pages
Secure Webmail: Sending Mail Using Stunnel, Mail Submission Port and
No ratings yet
Secure Webmail: Sending Mail Using Stunnel, Mail Submission Port and
103 pages
Software Mining Repository Practical
No ratings yet
Software Mining Repository Practical
28 pages
Heart Failure Prediction
100% (1)
Heart Failure Prediction
41 pages
Measurement & Control Question Paper
No ratings yet
Measurement & Control Question Paper
4 pages
BDSL456B Lab Manual
No ratings yet
BDSL456B Lab Manual
36 pages
Heart Disease Prediction - Jupyter Notebook
100% (1)
Heart Disease Prediction - Jupyter Notebook
9 pages
Manual-Kojair - Microbiological Safety Cabinet
No ratings yet
Manual-Kojair - Microbiological Safety Cabinet
18 pages
Aids
No ratings yet
Aids
88 pages
Stroke Prediction Dataset
No ratings yet
Stroke Prediction Dataset
48 pages
Lab Manual DAR
No ratings yet
Lab Manual DAR
81 pages
Setting Up and Importing UTAS Email Into Exchange Mailboxes
No ratings yet
Setting Up and Importing UTAS Email Into Exchange Mailboxes
80 pages
Kodiaq Accessories 2018 en
No ratings yet
Kodiaq Accessories 2018 en
33 pages
Fire Protection Engineer CV
No ratings yet
Fire Protection Engineer CV
5 pages
Heart Disease Prediction! ?
No ratings yet
Heart Disease Prediction! ?
52 pages
Python Solution
No ratings yet
Python Solution
30 pages
Project Report
No ratings yet
Project Report
18 pages
Ide To 6 Classification Algorithms
No ratings yet
Ide To 6 Classification Algorithms
34 pages
QUIZ Week 2 CART Practice PDF
No ratings yet
QUIZ Week 2 CART Practice PDF
10 pages
Bacdeaf 23032025 115708 Split 1
No ratings yet
Bacdeaf 23032025 115708 Split 1
37 pages
LAB8 LogisticReg HeartDisease
No ratings yet
LAB8 LogisticReg HeartDisease
31 pages
Heart Dataset Analysis
No ratings yet
Heart Dataset Analysis
24 pages
Kubernetes Security Best Practices
No ratings yet
Kubernetes Security Best Practices
20 pages
Heart Disease Diagnosis Using Machine Learning
No ratings yet
Heart Disease Diagnosis Using Machine Learning
26 pages
Binary Prediction of Smoker Status Using Bio-Signals
No ratings yet
Binary Prediction of Smoker Status Using Bio-Signals
20 pages
# Load Packages: Pandas Pandas PD PD Numpy Numpy NP NP
No ratings yet
# Load Packages: Pandas Pandas PD PD Numpy Numpy NP NP
17 pages
Sensors: Extrinsic Calibration of Camera and 2D Laser Sensors Without Overlap
No ratings yet
Sensors: Extrinsic Calibration of Camera and 2D Laser Sensors Without Overlap
24 pages
10-Maintenance of GeneXpert
No ratings yet
10-Maintenance of GeneXpert
18 pages
Heart Disease Indicator Prediction Model
No ratings yet
Heart Disease Indicator Prediction Model
17 pages
Assignment 1 - LP1
No ratings yet
Assignment 1 - LP1
14 pages
Openlab 1
No ratings yet
Openlab 1
17 pages
C2 W4 Lab 02 Tree Ensemble
No ratings yet
C2 W4 Lab 02 Tree Ensemble
16 pages
Major Project - Colab
No ratings yet
Major Project - Colab
15 pages
Heart Attack Prediction
No ratings yet
Heart Attack Prediction
17 pages
ANSYS HFSS L05 1 HFSS 3D Optimetrics
No ratings yet
ANSYS HFSS L05 1 HFSS 3D Optimetrics
20 pages
Comfar III - Brochure 2022
No ratings yet
Comfar III - Brochure 2022
12 pages
Week - 6 - SWI - MLP - LogisticRegression - Ipynb - Colaboratory
No ratings yet
Week - 6 - SWI - MLP - LogisticRegression - Ipynb - Colaboratory
15 pages
Machine Learning Laboratory (21AIL66)
No ratings yet
Machine Learning Laboratory (21AIL66)
7 pages
Logistic Regression
No ratings yet
Logistic Regression
12 pages
2 Niessen Cursor On Target
No ratings yet
2 Niessen Cursor On Target
13 pages
DWDM Lab 3
No ratings yet
DWDM Lab 3
10 pages
The Integration of A Unified Permit License System For Engineering Office
No ratings yet
The Integration of A Unified Permit License System For Engineering Office
8 pages
Preprocessing1.ipynb - Colab
No ratings yet
Preprocessing1.ipynb - Colab
13 pages
Heart Failure Prediction With Detailed Headings
No ratings yet
Heart Failure Prediction With Detailed Headings
12 pages
Assignment 1
No ratings yet
Assignment 1
11 pages
Dsbda 5
No ratings yet
Dsbda 5
12 pages
Stroke Prediction
No ratings yet
Stroke Prediction
10 pages
C ML1
No ratings yet
C ML1
10 pages
Assignment 1
No ratings yet
Assignment 1
10 pages
Model2.ipynb - Colab
No ratings yet
Model2.ipynb - Colab
11 pages
Cheat Python
No ratings yet
Cheat Python
8 pages
Prediction of Heart Disease Final-1-20 April - Jupyter Notebook
No ratings yet
Prediction of Heart Disease Final-1-20 April - Jupyter Notebook
10 pages
Adaboost 2
No ratings yet
Adaboost 2
9 pages
Chapter 7
No ratings yet
Chapter 7
14 pages
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
No ratings yet
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
12 pages
Ai in HC - 2
No ratings yet
Ai in HC - 2
9 pages
Exp 5
No ratings yet
Exp 5
7 pages
Heart - Disease - 1.ipynb - Colaboratory
No ratings yet
Heart - Disease - 1.ipynb - Colaboratory
9 pages
Untitled2.Ipynb - Colab
No ratings yet
Untitled2.Ipynb - Colab
8 pages
Heart Disease Classification Using Ann Hands-On
No ratings yet
Heart Disease Classification Using Ann Hands-On
7 pages
Heart - Cleveland - Ipynb - Colab
No ratings yet
Heart - Cleveland - Ipynb - Colab
5 pages
KNN For Classification
No ratings yet
KNN For Classification
5 pages
Heart Disease Classification ML Assignment - Jupyter Notebook
No ratings yet
Heart Disease Classification ML Assignment - Jupyter Notebook
7 pages
AI Mini Project
No ratings yet
AI Mini Project
6 pages
Data Science Practical 9
No ratings yet
Data Science Practical 9
6 pages
Dataset 912
No ratings yet
Dataset 912
6 pages
Practical 1
No ratings yet
Practical 1
7 pages
Project Proposal Sample Reference56
No ratings yet
Project Proposal Sample Reference56
7 pages
Baseline - Ipynb - Colab
No ratings yet
Baseline - Ipynb - Colab
5 pages
Evosys
No ratings yet
Evosys
5 pages
Loading The Dataset: 'Diabetes - CSV'
No ratings yet
Loading The Dataset: 'Diabetes - CSV'
4 pages
KNN - Jupyter Notebook
No ratings yet
KNN - Jupyter Notebook
5 pages
Heart Disease Prediction System
No ratings yet
Heart Disease Prediction System
5 pages
B58 - Handling Missing Values, Feature - Selection
No ratings yet
B58 - Handling Missing Values, Feature - Selection
4 pages
List of E-Learning - B
No ratings yet
List of E-Learning - B
3 pages
Dovdush KN-305 Lab2
No ratings yet
Dovdush KN-305 Lab2
2 pages
Dovdush KN-305 Lab3
No ratings yet
Dovdush KN-305 Lab3
2 pages
DocScanner Oct 22, 2024 17-38
No ratings yet
DocScanner Oct 22, 2024 17-38
2 pages
Bio-Signal Analysis For Smoking
No ratings yet
Bio-Signal Analysis For Smoking
1 page
Sincnet
No ratings yet
Sincnet
2 pages
Hare Krishna
No ratings yet
Hare Krishna
1 page
Shubham Prashar (SDE1)
No ratings yet
Shubham Prashar (SDE1)
1 page
Classical Feedback Control With MATLAB: B. J Lurie Paul J Enright
No ratings yet
Classical Feedback Control With MATLAB: B. J Lurie Paul J Enright
1 page
The Beast and His Number (666) On the Calculator: Volume I
From Everand
The Beast and His Number (666) On the Calculator: Volume I
John R. Garay
No ratings yet
The Laws That Govern the Roulette Wheel
From Everand
The Laws That Govern the Roulette Wheel
Beng M. Jabier
No ratings yet

LP Practical ! Jupyter Notebook

Uploaded by

LP Practical ! Jupyter Notebook

Uploaded by

In [1]: import os

In [2]: import pandas as pd # used to access data which in table format

In [5]: #import the database

0 1 63 1 typical 145 233 1 2 150 0 2.3 3 0.0 fixed No

1 2 67 1 asymptomatic 160 286 0 2 108 1 1.5 2 3.0 normal Yes

2 3 67 1 asymptomatic 120 229 0 2 129 1 2.6 2 2.0 reversable Yes

3 4 37 1 nonanginal 130 250 0 0 187 0 3.5 3 0.0 normal No

4 5 41 0 nontypical 130 204 0 2 172 0 1.4 1 0.0 normal No

In [7]: #shape find no. of rows and columns

Out[7]: (303, 15)

In [8]: # Finding missing values

303 rows × 15 columns

Out[10]: Unnamed: 0 303

Out[11]: Unnamed: 0 int64

303 rows × 15 columns

In [13]: # to highlight zeros

303 rows × 15 columns

In [14]: # count number of zeros in each column

Out[15]: Index(['Unnamed: 0', 'Age', 'Sex', 'ChestPain', 'RestBP', 'Chol', 'Fbs',

In [22]: #store above data in one variable and show it

... ... ... ... ...

298 45 1 typical 264

299 68 1 asymptomatic 193

300 57 1 asymptomatic 131

301 57 0 nontypical 236

302 38 1 nonanginal 175

303 rows × 4 columns

Out[27]: (227, 15)

Out[28]: (76, 15)

In [32]: predicted=list(np.ones(40)) + list(np.zeros(52)) + list(np.ones(8))

In [33]: from sklearn.metrics import ConfusionMatrixDisplay

Out[36]: <sklearn.metrics._plot.confusion_matrix.ConfusionMatrixDisplay at 0x1f7fe3394f0>

In [38]: # in above matrix 47 and 40 values are matching

In [39]: print(classification_report(actual, predicted))

precision recall f1-score support

0.0 0.90 0.85 0.88 55

accuracy 0.87 100

You might also like