Practical 7

Uploaded by

Vinut P Maradur

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views6 pages

Practical 7

Uploaded by

Vinut P Maradur

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Practical: Week 7

Aim:

Write a program to analysis a cancer patient . We are given a data of cancer patient and we have to
find whether the cancer is Primary or Secondary with the help of KNN Machine Learning models.

Theory:

KNN is one of the most basic yet essential classification algorithms in machine learning. It belongs
to the supervised learning domain and finds intense application in pattern recognition, data
mining, and intrusion detection.
It is widely disposable in real-life scenarios since it is non-parametric, meaning it does not make
any underlying assumptions about the distribution of data (as opposed to other algorithms such as
GMM, which assume a Gaussian distribution of the given data). We are given some prior data (also
called training data), which classifies coordinates into groups identified by an attribute.
If we plot these points on a graph, we may be able to locate some clusters or groups. Now, given
an unclassified point, we can assign it to a group by observing what group its nearest neighbors
belong to. This means a point close to a cluster of points classified as ‘Red’ has a higher probability
of getting classified as ‘Red’.
Intuitively, we can see that the first point (2.5, 7) should be classified as ‘Green’, and the second
point (5.5, 4.5) should be classified as ‘Red’.

(K-NN) algorithm is a versatile and widely used machine learning algorithm that is primarily used
for its simplicity and ease of implementation. It does not require any assumptions about the
underlying data distribution. It can also handle both numerical and categorical data, making it a
flexible choice for various types of datasets in classification and regression tasks. It is a non-
parametric method that makes predictions based on the similarity of data points in a given
dataset. K-NN is less sensitive to outliers compared to other algorithms.
The K-NN algorithm works by finding the K nearest neighbors to a given data point based on a
distance metric, such as Euclidean distance. The class or value of the data point is then
determined by the majority vote or average of the K neighbors. This approach allows the
algorithm to adapt to different patterns and make predictions based on the local structure of the
data.
Step-by-Step explanation of how KNN works is discussed below:
Step 1: Selecting the optimal value of K
 K represents the number of nearest neighbors that needs to be considered while making
prediction.
Step 2: Calculating distance
 To measure the similarity between target and training data points, Euclidean distance is used.
Distance is calculated between each of the data points in the dataset and target point.
Step 3: Finding Nearest Neighbors
 The k data points with the smallest distances to the target point are the nearest neighbors.
Step 4: Voting for Classification or Taking Average for Regression
 In the classification problem, the class labels of K-nearest neighbors are determined by
performing majority voting. The class with the most occurrences among the neighbors
becomes the predicted class for the target data point.
Program:

# Prediction for breast cancer using KNN Classifier

import pandas as pd

# Load the data from data file into the data frame frame
df=pd.read_csv('breast-cancer-wisconsin.csv')
df.head()

# display the column names

df.columns

# since the column names have spaces, remove them

# Convert columns into strings and then replace
# the space with empty string
df.columns = df.columns.str. replace (' ','')
df.columns

# we can find ? mark in the bare_nulei column

# find 0ut such rows there are 16 such rows
df[df['barenuclei'] =='?']
#copy those rows into df where ‘?’ is not found
df = df[df['barenuclei']!='?']

df.drop(['id'], axis=1, errors='ignore', inplace=True)

# take 0 to 8th cols in x.

x = df.iloc[:, :9]
x
# take 9th column, i.e. class column in y
y = df.iloc [:, 9] # y can be 2 or 4
y

#split the data

from sklearn.model_selection import train_test_split
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.2, random_state = 0)

#k value is square root of number of rows in test data

import math

k=math.sqrt(len(y_test))
k

# see k value is even, convert it to odd

if k%2==0: k+=1
k = int(k)
k
11

# import KNeighborsClassifier class

from sklearn.neighbors import KNeighborsClassifier
# create the model with k value obtained above
model = KNeighborsClassifier(n_neighbors=k)
model.fit(x_train, y_train)

# find accuracy
accuracy = model.score (x_test, y_test)
accuracy

# let us find k values and accuracy levels for each k

k_range = range (1, 16)
scores= []
for k in k_range:
model= KNeighborsClassifier(n_neighbors=k)
model.fit(x_train, y_train)
accuracy = model.score(x_test, y_test)
scores. append (accuracy)
print('k= %d Accuracy= %.2f%%' % (k, accuracy*100) )

#show the k values and scores in 1ine plot

#we can see highest accuracy when k=1,3,4,5,7
import matplotlib.pyplot as plt
plt.plot (k_range, scores)
plt.xlabel ("Value of k")
plt.ylabel ("Accuracy")
#Take k=3 for achieving highest accuracy.
model = KNeighborsClassifier(n_neighbors=3)
model.fit(x_train, y_train)

# find accuracy
accuracy = model.score (x_test, y_test)
accuracy

# predict for the given data

model.predict([[4, 2, 1, 1, 1, 2, 3, 2, 1]])

model.predict([[4,2,1,1,1,2,3,2,1], [8,10,10,8,7,10,9,7,1]])

K-Nearest Neighbor (KNN) Algorithm For Machine Learning - Javatpoint
No ratings yet
K-Nearest Neighbor (KNN) Algorithm For Machine Learning - Javatpoint
18 pages
K-Nearest Neighbor (KNN) Algorithm For Machine Learning
No ratings yet
K-Nearest Neighbor (KNN) Algorithm For Machine Learning
17 pages
Rahul Raj - Ipynb - Colab
No ratings yet
Rahul Raj - Ipynb - Colab
50 pages
Unit 2 Notes
No ratings yet
Unit 2 Notes
105 pages
4K-Nearest Neighbor
No ratings yet
4K-Nearest Neighbor
38 pages
Slide 2 ML Basics
No ratings yet
Slide 2 ML Basics
42 pages
Classification and K Nearest Neighbour Algorithm
No ratings yet
Classification and K Nearest Neighbour Algorithm
53 pages
K-Nearest Neighbors
No ratings yet
K-Nearest Neighbors
35 pages
Lecture 12 K-Nearest Neighbors
No ratings yet
Lecture 12 K-Nearest Neighbors
24 pages
UNIT 3 - Final
No ratings yet
UNIT 3 - Final
37 pages
INSY446 - 5 - Classification Part 2
No ratings yet
INSY446 - 5 - Classification Part 2
37 pages
Machine Learning With Python - Machine Learning Algorithms - KNN
No ratings yet
Machine Learning With Python - Machine Learning Algorithms - KNN
15 pages
Python For Data Science IA 1 Programs
No ratings yet
Python For Data Science IA 1 Programs
14 pages
DSASSign 4
No ratings yet
DSASSign 4
11 pages
KNN Model Implementation
No ratings yet
KNN Model Implementation
12 pages
06 KNN
No ratings yet
06 KNN
41 pages
Experiment 4
No ratings yet
Experiment 4
8 pages
Updated K-Nearest Neighbors in Machine Learning
No ratings yet
Updated K-Nearest Neighbors in Machine Learning
11 pages
Python For Data Science IA 1 Programs
No ratings yet
Python For Data Science IA 1 Programs
14 pages
Artificial Intelligence Lab 7
No ratings yet
Artificial Intelligence Lab 7
10 pages
A Complete Guide To KNN
No ratings yet
A Complete Guide To KNN
16 pages
MLLABDA2
No ratings yet
MLLABDA2
5 pages
K-Nearest Neighbor On Python Ken Ocuma
100% (2)
K-Nearest Neighbor On Python Ken Ocuma
9 pages
Sample KNN
No ratings yet
Sample KNN
7 pages
Assignment No 2 AI
No ratings yet
Assignment No 2 AI
4 pages
Unit 5 Learning With Algorithm
No ratings yet
Unit 5 Learning With Algorithm
7 pages
KNN Lab
No ratings yet
KNN Lab
4 pages
B-56 Sanket Jambhulkar MLA-7
No ratings yet
B-56 Sanket Jambhulkar MLA-7
9 pages
Python Code For KNN Classifier 1. Initial Message
No ratings yet
Python Code For KNN Classifier 1. Initial Message
7 pages
A Complete Guide To K Nearest Neighbors Algorithm 1598272616
No ratings yet
A Complete Guide To K Nearest Neighbors Algorithm 1598272616
13 pages
K-Nearest Neighbors: Marcel Van Velzen Junior Marte Garcia
No ratings yet
K-Nearest Neighbors: Marcel Van Velzen Junior Marte Garcia
8 pages
KNN - Predictive Analysis
No ratings yet
KNN - Predictive Analysis
6 pages
Untitled 9
No ratings yet
Untitled 9
17 pages
ML Lab Week 7
No ratings yet
ML Lab Week 7
4 pages
ML Lab2 PGM
No ratings yet
ML Lab2 PGM
3 pages
KnnClassifier - Jupyter Notebook
No ratings yet
KnnClassifier - Jupyter Notebook
2 pages
KMEANS
No ratings yet
KMEANS
9 pages
Dhanashree ML Report
No ratings yet
Dhanashree ML Report
3 pages
Lab 8
No ratings yet
Lab 8
7 pages
Lab Session 9
No ratings yet
Lab Session 9
2 pages
Jntuk R20 ML Unit-Ii
No ratings yet
Jntuk R20 ML Unit-Ii
37 pages
STAT 479: Machine Learning Lecture Notes: Sebastian Raschka Department of Statistics University of Wisconsin-Madison
No ratings yet
STAT 479: Machine Learning Lecture Notes: Sebastian Raschka Department of Statistics University of Wisconsin-Madison
23 pages
K Nearest Neighbors
No ratings yet
K Nearest Neighbors
5 pages
Part A 3. KNN Classification
No ratings yet
Part A 3. KNN Classification
35 pages
Experiment 4: Aim/Overview of The Practical: Task To Be Done
No ratings yet
Experiment 4: Aim/Overview of The Practical: Task To Be Done
7 pages
K-Means Clustering From Scratch
No ratings yet
K-Means Clustering From Scratch
3 pages
Experiment 2.2 KNN Classifier
No ratings yet
Experiment 2.2 KNN Classifier
7 pages
Experiment No 7 ML
No ratings yet
Experiment No 7 ML
4 pages
Machine Learning and Data Analytics Using Python Lab
No ratings yet
Machine Learning and Data Analytics Using Python Lab
36 pages
Week10 KNN Practical
No ratings yet
Week10 KNN Practical
4 pages
K-Nearest Neighbors
100% (1)
K-Nearest Neighbors
32 pages
K Nearest Neighbor Algorithm in Python - Towards Data Science
No ratings yet
K Nearest Neighbor Algorithm in Python - Towards Data Science
7 pages
ChatGPT and Higher Education
No ratings yet
ChatGPT and Higher Education
51 pages
ML Notes
100% (2)
ML Notes
125 pages
Lecture Week 2 KNN and Model Evaluation PDF
100% (1)
Lecture Week 2 KNN and Model Evaluation PDF
53 pages
Introduction To K-Nearest Neighbors: Simplified (With Implementation in Python)
100% (1)
Introduction To K-Nearest Neighbors: Simplified (With Implementation in Python)
125 pages
Machine Learning
100% (5)
Machine Learning
56 pages
KNN Solution
No ratings yet
KNN Solution
2 pages
Here's An Visualization of The K-Nearest Neighbors Algorithm
No ratings yet
Here's An Visualization of The K-Nearest Neighbors Algorithm
5 pages
Decision Tree KNN
No ratings yet
Decision Tree KNN
9 pages
Machine Learning Tutorial
100% (1)
Machine Learning Tutorial
44 pages
Essential Guide To Python For All Levels (2024 Collection
No ratings yet
Essential Guide To Python For All Levels (2024 Collection
184 pages
AI Impact Assessment
No ratings yet
AI Impact Assessment
29 pages
ML Fresher JD
No ratings yet
ML Fresher JD
2 pages
21551A05C8 3-2 Internship Report
No ratings yet
21551A05C8 3-2 Internship Report
49 pages
1406 and 1550 - Law of Corporate Finance Project
100% (1)
1406 and 1550 - Law of Corporate Finance Project
25 pages
A Detailed Analysis of Use of AI in Inventory Management For Technically Better Management
100% (1)
A Detailed Analysis of Use of AI in Inventory Management For Technically Better Management
5 pages
Complete ML Concepts
No ratings yet
Complete ML Concepts
30 pages
Complete Basic Stats
No ratings yet
Complete Basic Stats
18 pages
Resume For Data Analyst
No ratings yet
Resume For Data Analyst
1 page
Modern ABC Chemistry For Class 12 Part I - Dr. S.P. Jauhar
No ratings yet
Modern ABC Chemistry For Class 12 Part I - Dr. S.P. Jauhar
6 pages
PowerPoint Presentation
No ratings yet
PowerPoint Presentation
41 pages
17BEC096
No ratings yet
17BEC096
61 pages
The Impact of Artificial Intelligence On The Future of Work
No ratings yet
The Impact of Artificial Intelligence On The Future of Work
3 pages
15 Mlops Interview Questions For 2025
No ratings yet
15 Mlops Interview Questions For 2025
13 pages
Ilovepdf Merged
No ratings yet
Ilovepdf Merged
78 pages
Advance House Price Prediction Completed
No ratings yet
Advance House Price Prediction Completed
38 pages
Analyzing Activation Functions With Transfer Learning-Based Layer Customization For Improved Brain Tumor Classification
No ratings yet
Analyzing Activation Functions With Transfer Learning-Based Layer Customization For Improved Brain Tumor Classification
21 pages
Perfect Crowd Counting Presentation
No ratings yet
Perfect Crowd Counting Presentation
13 pages
Complete Linear Regression Algorithm
No ratings yet
Complete Linear Regression Algorithm
4 pages
Online Analysis of Ingredient Safety Leveraging OCR and Machine Learning For Enhanced Consumer Product Safety
No ratings yet
Online Analysis of Ingredient Safety Leveraging OCR and Machine Learning For Enhanced Consumer Product Safety
6 pages
(Huawei, KD) One-for-All - Bridge The Gap Between Heterogeneous Architectures in Knowledge Distillation
No ratings yet
(Huawei, KD) One-for-All - Bridge The Gap Between Heterogeneous Architectures in Knowledge Distillation
13 pages
ML Guess Paper - 2
No ratings yet
ML Guess Paper - 2
4 pages
Large Scale GAN Training For High Fidelity Natural Image Synthesis
No ratings yet
Large Scale GAN Training For High Fidelity Natural Image Synthesis
28 pages
Data Mining Unitwise Imp Questions
No ratings yet
Data Mining Unitwise Imp Questions
3 pages
Research Cloud
No ratings yet
Research Cloud
8 pages
Cindicator WhitePaper en
No ratings yet
Cindicator WhitePaper en
37 pages
Here
No ratings yet
Here
3 pages
Purple and White Clean and Professional Resume
No ratings yet
Purple and White Clean and Professional Resume
2 pages
Core Technical Hard Skills For Data Scientists
No ratings yet
Core Technical Hard Skills For Data Scientists
3 pages
Early Detection of Heart Disease Using Machine Learning
No ratings yet
Early Detection of Heart Disease Using Machine Learning
8 pages
Education Statement: Iit Roorkee
No ratings yet
Education Statement: Iit Roorkee
1 page