0% found this document useful (0 votes)

27 views4 pages

ML Practical 3D

Uploaded by

Samir Bhosale

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views4 pages

ML Practical 3D

Uploaded by

Samir Bhosale

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Assignment No: 03

Name: Bhosale Samir Shamkant Roll no: CO407 Class: BE COMP

Title: Implement K-Nearest Neighbors algorithm on diabetes.csv dataset.

Compute confusion matrix, accuracy, error rate, precision and recall on the given dataset. Dataset link :
https://www.kaggle.com/datasets/abdallamahgoub/diabetes

Importing the libraries

In [1]: import pandas as pd
import numpy as np
from sklearn.preprocessing import StandardScaler
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import confusion_matrix, accuracy_score, precision_score, recall_sc

Import the dataset

In [2]: df = pd.read_csv("diabetes.csv")

In [3]: df.head()

Out[3]: Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Pedigree Age Outcome

0 6 148 72 35 0 33.6 0.627 50 1

1 1 85 66 29 0 26.6 0.351 31 0

2 8 183 64 0 0 23.3 0.672 32 1

3 1 89 66 23 94 28.1 0.167 21 0

4 0 137 40 35 168 43.1 2.288 33 1

Preprocess the dataset
In [4]: df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 768 entries, 0 to 767 Data
columns (total 9 columns):
# Column Non-Null Count Dtype

0 Pregnancies 768 non-null int64

1 Glucose 768 non-null int64
2 BloodPressure 768 non-null int64
3 SkinThickness 768 non-null int64
4 Insulin 768 non-null int64
5 BMI 768 non-null float64
6 Pedigree 768 non-null float64
7 Age 768 non-null int64
8 Outcome 768 non-null int64
dtypes: float64(2), int64(7)memory
usage: 54.1 KB
In [5]: df.shape

(768, 9)
Out[5]:

df.columns
In [6]:
Index(['Pregnancies', 'Glucose', 'BloodPressure', 'SkinThickness', 'Insulin','BMI', 'Pedigree',
Out[6]: 'Age', 'Outcome'],
dtype='object')

In [7]: df.describe()

Out[7]: Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Pedigree Age O

count 768.000000 768.000000 768.000000 768.000000 768.000000 768.000000 768.000000 768.000000 768

mean 3.845052 120.894531 69.105469 20.536458 79.799479 31.992578 0.471876 33.240885

std 3.369578 31.972618 19.355807 15.952218 115.244002 7.884160 0.331329 11.760232 0

min 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.078000 21.000000

25% 1.000000 99.000000 62.000000 0.000000 0.000000 27.300000 0.243750 24.000000 0

50% 3.000000 117.000000 72.000000 23.000000 30.500000 32.000000 0.372500 29.000000

75% 6.000000 140.250000 80.000000 32.000000 127.250000 36.600000 0.626250 41.000000 1

max 17.000000 199.000000 122.000000 99.000000 846.000000 67.100000 2.420000 81.000000

In [8]: df.isna().sum()

Out[8]: Pregnancies 0
Glucose 0
BloodPressure 0
SkinThickness 0
Insulin 0
BMI 0
Pedigree 0
Age 0
Outcome 0
dtype: int64

In [9]: # Features and target variable

X = df.drop('Outcome', axis=1)
y = df['Outcome']
In [10]: df.head()

Out[10]: Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Pedigree Age Outcome

0 6 148 72 35 0 33.6 0.627 50 1

1 1 85 66 29 0 26.6 0.351 31 0

2 8 183 64 0 0 23.3 0.672 32 1

3 1 89 66 23 94 28.1 0.167 21 0

4 0 137 40 35 168 43.1 2.288 33 1

In [11]: # Split the dataset into training and testing sets

from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42
In [12]: # Normalize the features
scaler = StandardScaler()
X_train_scaled = scaler.fit_transform(X_train)
X_test_scaled = scaler.transform(X_test)

In [13]: # Initialize and train KNN classifier

k = 5
knn = KNeighborsClassifier(n_neighbors=k)
knn.fit(X_train_scaled, y_train)

Out[13]:
▾ KNeighborsClassifier
KNeighborsClassifier()

In [14]: # Predict on the test set

y_pred = knn.predict(X_test_scaled)

In [15]: # Compute evaluation metrics

conf_matrix = confusion_matrix(y_test, y_pred)
accuracy = accuracy_score(y_test, y_pred)
error_rate = 1 - accuracy
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)

In [16]: # Print metrics

print("Confusion Matrix:")

Confusion Matrix:
print(conf_matrix)
In [17]:
[[119 32]
[ 37 43]]

In [18]: print("Accuracy:", accuracy)

Accuracy: 0.7012987012987013

In [19]: print("Error Rate:", error_rate)

Error Rate: 0.2987012987012987

In [20]: print("Precision:", precision)

Precision: 0.5733333333333334

In [21]: print("Recall:", recall)

Recall: 0.5375

In [ ]:

Grade 3 Project Anall Numerates
50% (2)
Grade 3 Project Anall Numerates
8 pages
Topic 3 Characteristics and Principles of Assessment
100% (1)
Topic 3 Characteristics and Principles of Assessment
45 pages
Q1 Module 2 MIL
No ratings yet
Q1 Module 2 MIL
10 pages
Cardio Screen RF
100% (1)
Cardio Screen RF
27 pages
Health-Optimizing P.E. (H.O.P.E.) 2: Sports: Organizing A Sports Events
No ratings yet
Health-Optimizing P.E. (H.O.P.E.) 2: Sports: Organizing A Sports Events
6 pages
COMP5318
No ratings yet
COMP5318
42 pages
Side by Side Extra L1 U3 - Teacher's Guide
No ratings yet
Side by Side Extra L1 U3 - Teacher's Guide
22 pages
Experiment No-4 Code
No ratings yet
Experiment No-4 Code
16 pages
AIML Report (1) 11
No ratings yet
AIML Report (1) 11
13 pages
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
ML Manual Final
No ratings yet
ML Manual Final
35 pages
Sense of Belonging Lit Review
No ratings yet
Sense of Belonging Lit Review
16 pages
AIML Report.
No ratings yet
AIML Report.
12 pages
Stroke Prediction
No ratings yet
Stroke Prediction
10 pages
KNN - Jupyter Notebook
No ratings yet
KNN - Jupyter Notebook
5 pages
Ethics and Ai Lab Final
No ratings yet
Ethics and Ai Lab Final
31 pages
Ccs333 Final QB
No ratings yet
Ccs333 Final QB
7 pages
Assignment 5 - SourceCode - Ipynb - Colab
No ratings yet
Assignment 5 - SourceCode - Ipynb - Colab
4 pages
lab - 8 - - (6) عفان عبدالله احمد - التكليف -
No ratings yet
lab - 8 - - (6) عفان عبدالله احمد - التكليف -
18 pages
Data Science Practical 9
No ratings yet
Data Science Practical 9
6 pages
Experiment 4
No ratings yet
Experiment 4
8 pages
KNN For Classification
No ratings yet
KNN For Classification
5 pages
KNN Rainfall
No ratings yet
KNN Rainfall
9 pages
AML Sessional 1 Students
No ratings yet
AML Sessional 1 Students
16 pages
Major Project - Colab
No ratings yet
Major Project - Colab
15 pages
SPPUML5
No ratings yet
SPPUML5
4 pages
Openlab 1
No ratings yet
Openlab 1
17 pages
Data Pre-Processing
No ratings yet
Data Pre-Processing
22 pages
17.11.24 - Jupyter Notebook - Doc
No ratings yet
17.11.24 - Jupyter Notebook - Doc
6 pages
MLLABDA2
No ratings yet
MLLABDA2
5 pages
Experiment 4
No ratings yet
Experiment 4
5 pages
KNN Model
No ratings yet
KNN Model
5 pages
Model2.ipynb - Colab
No ratings yet
Model2.ipynb - Colab
11 pages
Macaingalan Elementary School / National High School: School Memorandum No. 24, S., 2021
No ratings yet
Macaingalan Elementary School / National High School: School Memorandum No. 24, S., 2021
5 pages
Project 10 Movie Recommendation - Ipynb - Colaboratory
No ratings yet
Project 10 Movie Recommendation - Ipynb - Colaboratory
6 pages
ML Practical 04
No ratings yet
ML Practical 04
20 pages
Practical 4
No ratings yet
Practical 4
2 pages
Brand Management
No ratings yet
Brand Management
2 pages
ML 7
No ratings yet
ML 7
6 pages
EX - NO:3: Algorithm
No ratings yet
EX - NO:3: Algorithm
11 pages
Discussion Text (s1)
No ratings yet
Discussion Text (s1)
5 pages
Diabetes
No ratings yet
Diabetes
7 pages
Loading The Dataset: 'Diabetes - CSV'
No ratings yet
Loading The Dataset: 'Diabetes - CSV'
4 pages
Documentation Code
No ratings yet
Documentation Code
20 pages
Exp 5
No ratings yet
Exp 5
7 pages
ML 4
No ratings yet
ML 4
2 pages
Ml4.ipynb - Colab
No ratings yet
Ml4.ipynb - Colab
3 pages
ML 5
No ratings yet
ML 5
3 pages
KNN For Classification
No ratings yet
KNN For Classification
4 pages
Healthcare-Project-Simplilearn - Week1
No ratings yet
Healthcare-Project-Simplilearn - Week1
6 pages
SVM - RF - Diabetes - CSV - 26 - 6 - 2023.ipynb - Colaboratory
No ratings yet
SVM - RF - Diabetes - CSV - 26 - 6 - 2023.ipynb - Colaboratory
8 pages
Assignment 4
No ratings yet
Assignment 4
2 pages
Diabetes
No ratings yet
Diabetes
10 pages
B58 - Handling Missing Values, Feature - Selection
No ratings yet
B58 - Handling Missing Values, Feature - Selection
4 pages
KNN
No ratings yet
KNN
2 pages
Doc
No ratings yet
Doc
550 pages
ML Data Preprocessing in Python
No ratings yet
ML Data Preprocessing in Python
9 pages
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
No ratings yet
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
8 pages
Project 3 - Diabetes Prediction - Ipynb - Colab
No ratings yet
Project 3 - Diabetes Prediction - Ipynb - Colab
4 pages
ML Exp 7
No ratings yet
ML Exp 7
3 pages
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
No ratings yet
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
71 pages
Logistic - Ipynb - Colaboratory
No ratings yet
Logistic - Ipynb - Colaboratory
6 pages
Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
Omml
No ratings yet
Omml
1 page
Iligan Medical Center College
No ratings yet
Iligan Medical Center College
14 pages
Diabetic Prediction Using LogicalRegression
No ratings yet
Diabetic Prediction Using LogicalRegression
9 pages
Question Paper With Solutions
No ratings yet
Question Paper With Solutions
6 pages
Unit 2 Univariate Data Unit Plan
No ratings yet
Unit 2 Univariate Data Unit Plan
5 pages
Lab 5
No ratings yet
Lab 5
2 pages
20MIS7043 (LAB 7) .Ipynb Colaboratory
No ratings yet
20MIS7043 (LAB 7) .Ipynb Colaboratory
4 pages
20MIS7095 (LAB 7) .Ipynb Colaboratory
No ratings yet
20MIS7095 (LAB 7) .Ipynb Colaboratory
4 pages
09 - Lecture 4 Language Maintenance and Shift
No ratings yet
09 - Lecture 4 Language Maintenance and Shift
9 pages
Running Head: Fusing Creativity in Multicultural Teams: Dubai School of Government
No ratings yet
Running Head: Fusing Creativity in Multicultural Teams: Dubai School of Government
44 pages
ExNo 08ml
No ratings yet
ExNo 08ml
4 pages
Diabetis Project
No ratings yet
Diabetis Project
7 pages
Intent Letter Food Packs
No ratings yet
Intent Letter Food Packs
4 pages
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
No ratings yet
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
12 pages
DLP - Perdev - 10-01-24 - Personal Relationships
No ratings yet
DLP - Perdev - 10-01-24 - Personal Relationships
6 pages
2 Класс 2 Четверть
No ratings yet
2 Класс 2 Четверть
27 pages
Sony Walkman Digital Music Player NWZ
No ratings yet
Sony Walkman Digital Music Player NWZ
10 pages
Sample LAS Template For Q2 LAS English - Tagalog 1
No ratings yet
Sample LAS Template For Q2 LAS English - Tagalog 1
10 pages
Transkrip Nilai Sementara
No ratings yet
Transkrip Nilai Sementara
4 pages
TPCN Monthly List of Subcontractors 06-2017
No ratings yet
TPCN Monthly List of Subcontractors 06-2017
3 pages
Venkatesh Resume
No ratings yet
Venkatesh Resume
2 pages
FSD BIS601 Syllabus
No ratings yet
FSD BIS601 Syllabus
4 pages
Draft 2023 EWF Side Meeting
No ratings yet
Draft 2023 EWF Side Meeting
2 pages
Call For Papers - IJAIKE Inaugural Issues - Rev3
No ratings yet
Call For Papers - IJAIKE Inaugural Issues - Rev3
2 pages
Summer Winning Camp Calendar 2026 PO BE-Batch, ME & MCA, BCA & B.SC Batch 2026 PO
No ratings yet
Summer Winning Camp Calendar 2026 PO BE-Batch, ME & MCA, BCA & B.SC Batch 2026 PO
1 page
Call For Abstracts - Innovation and Research Summit 2025
No ratings yet
Call For Abstracts - Innovation and Research Summit 2025
1 page
Sample Business Plan For Starting A Nursery School
100% (1)
Sample Business Plan For Starting A Nursery School
13 pages

ML Practical 3D

Uploaded by

ML Practical 3D

Uploaded by

Assignment No: 03

Name: Bhosale Samir Shamkant Roll no: CO407 Class: BE COMP

Title: Implement K-Nearest Neighbors algorithm on diabetes.csv dataset.

Importing the libraries

Import the dataset

0 6 148 72 35 0 33.6 0.627 50 1

2 8 183 64 0 0 23.3 0.672 32 1

4 0 137 40 35 168 43.1 2.288 33 1

0 Pregnancies 768 non-null int64

Out[7]: Pregnancies Glucose BloodPressure SkinThickness Insulin BMI Pedigree Age O

mean 3.845052 120.894531 69.105469 20.536458 79.799479 31.992578 0.471876 33.240885

std 3.369578 31.972618 19.355807 15.952218 115.244002 7.884160 0.331329 11.760232 0

min 0.000000 0.000000 0.000000 0.000000 0.000000 0.000000 0.078000 21.000000

25% 1.000000 99.000000 62.000000 0.000000 0.000000 27.300000 0.243750 24.000000 0

50% 3.000000 117.000000 72.000000 23.000000 30.500000 32.000000 0.372500 29.000000

75% 6.000000 140.250000 80.000000 32.000000 127.250000 36.600000 0.626250 41.000000 1

max 17.000000 199.000000 122.000000 99.000000 846.000000 67.100000 2.420000 81.000000

In [9]: # Features and target variable

0 6 148 72 35 0 33.6 0.627 50 1

2 8 183 64 0 0 23.3 0.672 32 1

4 0 137 40 35 168 43.1 2.288 33 1

In [11]: # Split the dataset into training and testing sets

In [13]: # Initialize and train KNN classifier

In [14]: # Predict on the test set

In [15]: # Compute evaluation metrics

In [16]: # Print metrics

In [18]: print("Accuracy:", accuracy)

In [19]: print("Error Rate:", error_rate)

Error Rate: 0.2987012987012987

In [20]: print("Precision:", precision)

In [21]: print("Recall:", recall)

You might also like