KNN For Classification

Uploaded by

snehalkotar1153

Name : Snehal Kotkar
Div : A
Roll No. : 46

Practical No. : 2

Problem Statement : Build a machine learning model using the k-Nearest Neighbors algorithm to predict whether the patients in the "Pima Indians Diabetes Dataset" have diabetes or not.
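As a rough illustration of the idea behind the problem statement, the sketch below classifies a point by majority vote among its k nearest training points, using Euclidean distance. The helper `knn_predict` and the toy arrays `X_toy` and `y_toy` are hypothetical names made up for this example; the practical itself relies on scikit-learn's `KNeighborsClassifier`, not on this function.

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_query, k=3):
    """Predict the label of x_query by majority vote among its k nearest training points."""
    # Euclidean distance from the query to every training point
    distances = np.linalg.norm(X_train - x_query, axis=1)
    # Indices of the k smallest distances
    nearest = np.argsort(distances)[:k]
    # Most common label among those neighbours
    votes = Counter(y_train[nearest].tolist())
    return votes.most_common(1)[0][0]

# Hypothetical toy data: two features, two well-separated classes
X_toy = np.array([[1.0, 1.0], [1.2, 0.8], [0.9, 1.1],
                  [5.0, 5.0], [5.2, 4.9], [4.8, 5.1]])
y_toy = np.array([0, 0, 0, 1, 1, 1])

pred_near_zero = knn_predict(X_toy, y_toy, np.array([1.1, 1.0]), k=3)
pred_near_one = knn_predict(X_toy, y_toy, np.array([5.1, 5.0]), k=3)
print(pred_near_zero, pred_near_one)  # prints: 0 1
```

Because every prediction depends on raw distances, features with large numeric ranges can dominate the vote.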

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
plt.style.use('ggplot')

from google.colab import drive

drive.mount('/content/drive')

Drive already mounted at /content/drive; to attempt to forcibly remount, call drive.mount("/content/drive", force_remount=True).

df = pd.read_csv('/content/drive/MyDrive/ML /diabetes.csv')
df.head()

Summary of df (768 rows, all columns of dtype "number"):

Column                     std                  min     max     unique  samples
Pregnancies                3                    0       17      17      6, 1, 3
Glucose                    31                   0       199     136     151, 101, 112
BloodPressure              19                   0       122     47      86, 46, 85
SkinThickness              15                   0       99      51      7, 12, 48
Insulin                    115                  0       846     186     52, 41, 183
BMI                        7.884160320375446    0.0     67.1    248     19.9, 31.0, 38.1
DiabetesPedigreeFunction   0.3313285950127749   0.078   2.42    517     1.731, 0.426, 0.138
Age                        11                   21      81      52      60, 47, 72
Outcome                    0                    0       1       2       0, 1

df.shape

(768, 9)

df.isnull().sum()

Pregnancies 0
Glucose 0
BloodPressure 0
SkinThickness 0
Insulin 0
BMI 0
DiabetesPedigreeFunction 0
Age 0
Outcome 0
dtype: int64
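The check above reports no nulls, yet the summary of df lists a minimum of 0 for Glucose, BloodPressure, SkinThickness, Insulin and BMI. A value of 0 is implausible for several of these measurements, so zeros may be standing in for unrecorded values. That reading is an assumption; the notebook does not treat them as missing. A minimal sketch of how such zeros could be counted, on a hypothetical miniature frame `toy` with made-up values:

```python
import pandas as pd

# Hypothetical miniature frame standing in for df; the values are invented
toy = pd.DataFrame({
    "Glucose": [148, 0, 183, 89],
    "BloodPressure": [72, 66, 0, 0],
    "BMI": [33.6, 26.6, 23.3, 0.0],
    "Outcome": [1, 0, 1, 0],
})

# Columns where a zero is ASSUMED (not confirmed by the source) to mean "not recorded"
suspect_cols = ["Glucose", "BloodPressure", "BMI"]

null_counts = toy.isnull().sum()               # what isnull() sees
zero_counts = (toy[suspect_cols] == 0).sum()   # zeros per suspect column

print(null_counts)
print(zero_counts)
```

On the real data the same two lines would be applied to `df` with whichever columns are judged suspect.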

X = df.drop('Outcome',axis=1).values
y = df['Outcome'].values

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=42, stratify=y)
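The `stratify=y` argument asks `train_test_split` to keep the class proportions of `y` roughly equal in both subsets. The sketch below shows the effect on hypothetical labels (`y_demo`, 65 negatives and 35 positives) that are not taken from the diabetes data.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical imbalanced labels: 65 negatives, 35 positives
y_demo = np.array([0] * 65 + [1] * 35)
X_demo = np.arange(len(y_demo)).reshape(-1, 1)

X_tr, X_te, y_tr, y_te = train_test_split(
    X_demo, y_demo, test_size=0.2, random_state=42, stratify=y_demo
)

# With stratification, each subset keeps the 35% positive rate
print("overall positive rate:", y_demo.mean())
print("train positive rate:  ", y_tr.mean())
print("test positive rate:   ", y_te.mean())
```

Without `stratify`, a small test set could by chance contain noticeably more or fewer positives than the full data.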

#import KNeighborsClassifier
from sklearn.neighbors import KNeighborsClassifier

#Setup arrays to store training and test accuracies

neighbors = np.arange(1,15)
train_accuracy =np.empty(len(neighbors))
test_accuracy = np.empty(len(neighbors))

for i,k in enumerate(neighbors):
    #Setup a knn classifier with k neighbors
    knn = KNeighborsClassifier(n_neighbors=k)

    #Fit the model
    knn.fit(X_train, y_train)

    #Compute accuracy on the training set
    train_accuracy[i] = knn.score(X_train, y_train)

    #Compute accuracy on the test set
    test_accuracy[i] = knn.score(X_test, y_test)

#Generate plot
plt.title('k-NN Varying number of neighbors')
plt.plot(neighbors, test_accuracy, label='Testing Accuracy')
plt.plot(neighbors, train_accuracy, label='Training accuracy')
plt.legend()
plt.xlabel('Number of neighbors')
plt.ylabel('Accuracy')
plt.show()
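Besides reading the plot, the k with the highest test accuracy can be taken directly from the stored array. The sketch below repeats the loop on synthetic data from `make_classification` (a stand-in, not the diabetes file) so that it runs on its own; the names `X_syn`, `y_syn` and `best_k` are illustrative.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for the diabetes data (8 features, binary target)
X_syn, y_syn = make_classification(n_samples=400, n_features=8, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(
    X_syn, y_syn, test_size=0.25, random_state=42, stratify=y_syn
)

neighbors = np.arange(1, 15)
test_accuracy = np.empty(len(neighbors))
for i, k in enumerate(neighbors):
    knn = KNeighborsClassifier(n_neighbors=k).fit(X_tr, y_tr)
    test_accuracy[i] = knn.score(X_te, y_te)

# np.argmax returns the first index of the maximum, i.e. the smallest k among ties
best_k = neighbors[np.argmax(test_accuracy)]
print("best k:", best_k, "test accuracy:", test_accuracy.max())
```

A k chosen this way is tuned to the test set, so the accuracy reported for it is likely to be somewhat optimistic.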
#Setup a knn classifier with k neighbors
knn = KNeighborsClassifier(n_neighbors=4)

#Fit the model

knn.fit(X_train,y_train)

KNeighborsClassifier(n_neighbors=4)

#Get accuracy. Note: for classification algorithms, the score method returns accuracy.
knn.score(X_test,y_test)

0.7291666666666666
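To make the note about `score` concrete, the sketch below checks that `score` matches `accuracy_score` applied to the predictions, which is simply the fraction of correct predictions. It uses synthetic data from `make_classification` rather than the diabetes file, and the names `clf`, `score_value` and `manual_value` are illustrative.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in data (8 features, binary target)
X_syn, y_syn = make_classification(n_samples=300, n_features=8, random_state=1)
X_tr, X_te, y_tr, y_te = train_test_split(
    X_syn, y_syn, test_size=0.25, random_state=42, stratify=y_syn
)

clf = KNeighborsClassifier(n_neighbors=4).fit(X_tr, y_tr)

score_value = clf.score(X_te, y_te)                      # built-in score
manual_value = accuracy_score(y_te, clf.predict(X_te))   # explicit accuracy
fraction_correct = np.mean(clf.predict(X_te) == y_te)    # same thing by hand

print(score_value, manual_value, fraction_correct)
```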

#let us get the predictions using the classifier we had fit above
y_pred = knn.predict(X_test)

y_pred

array([0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0,
       0, 1, 1, 0, 0, 0, 0, 1, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 0, 0, 0,
       1, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 0, 0, 0, 0,
       0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 0, 0,
       0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
       0, 0, 0, 1, 1, 0, 0, 0, 0, 1, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
       0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 0, 0, 1, 1, 0,
       1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 1,
       0, 0, 0, 1, 0, 0, 0, 1, 0, 1, 1, 0, 0, 0, 0, 0])
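An array of predictions like the one above is easier to judge once it is compared with the true labels. One common way, sketched here on hypothetical labels (`y_true_demo`, `y_pred_demo`) and not on the notebook's own results, is a confusion matrix from `sklearn.metrics`.

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Hypothetical true labels and predictions, invented for illustration
y_true_demo = np.array([0, 0, 0, 0, 1, 1, 1, 0, 1, 0])
y_pred_demo = np.array([0, 0, 1, 0, 1, 0, 1, 0, 0, 0])

# Rows are true classes, columns are predicted classes
cm = confusion_matrix(y_true_demo, y_pred_demo)
tn, fp, fn, tp = cm.ravel()

accuracy = (tn + tp) / cm.sum()
print(cm)
print("TN, FP, FN, TP:", tn, fp, fn, tp)  # prints: TN, FP, FN, TP: 5 1 2 2
print("accuracy:", accuracy)              # prints: accuracy: 0.7
```

In the notebook, the equivalent call would take `y_test` and `y_pred` as its two arguments.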
