ML Practical 3D
ML Practical 3D
Compute confusion matrix, accuracy, error rate, precision and recall on the given dataset. Dataset link :
https://www.kaggle.com/datasets/abdallamahgoub/diabetes
In [3]: df.head()
Out[3]: Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Pedigree Age Outcome
1 1 85 66 29 0 26.6 0.351 31 0
3 1 89 66 23 94 28.1 0.167 21 0
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 768 entries, 0 to 767 Data
columns (total 9 columns):
# Column Non-Null Count Dtype
(768, 9)
Out[5]:
df.columns
In [6]:
Index(['Pregnancies', 'Glucose', 'BloodPressure', 'SkinThickness', 'Insulin','BMI', 'Pedigree',
Out[6]: 'Age', 'Outcome'],
dtype='object')
In [7]: df.describe()
count 768.000000 768.000000 768.000000 768.000000 768.000000 768.000000 768.000000 768.000000 768
In [8]: df.isna().sum()
Out[8]: Pregnancies 0
Glucose 0
BloodPressure 0
SkinThickness 0
Insulin 0
BMI 0
Pedigree 0
Age 0
Outcome 0
dtype: int64
Out[10]: Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Pedigree Age Outcome
1 1 85 66 29 0 26.6 0.351 31 0
3 1 89 66 23 94 28.1 0.167 21 0
Out[13]:
▾ KNeighborsClassifier
KNeighborsClassifier()
Confusion Matrix:
print(conf_matrix)
In [17]:
[[119 32]
[ 37 43]]
Accuracy: 0.7012987012987013
Precision: 0.5733333333333334
Recall: 0.5375
In [ ]: