ML Lab2 PGM
Apply the k-Nearest Neighbors technique to identify whether users purchased the item or not.
Working:
For a given data point, the algorithm finds the distances between that point and every other point in the dataset, takes the K closest points, and assigns the category that occurs most frequently among them (a majority vote). Usually, Euclidean distance is taken as the distance measure. Thus the resulting "model" is simply the labeled data placed in a space. The algorithm is popularly used in applications such as genetics and forecasting, and tends to work best when the features used are informative.
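The voting procedure described above can be sketched from scratch. This is a minimal illustration with made-up points and illustrative names, not the lab's code:

```python
# Minimal from-scratch sketch of the voting procedure described above, using
# Euclidean distance; the points and names are illustrative, not the lab code.
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=3):
    dists = np.linalg.norm(X_train - x, axis=1)      # distance to every point
    nearest = np.argsort(dists)[:k]                  # indices of the k closest
    return Counter(y_train[nearest]).most_common(1)[0][0]  # majority vote

X_train = np.array([[1.0, 1.0], [1.2, 0.8], [8.0, 8.0], [8.2, 7.9]])
y_train = np.array([0, 0, 1, 1])
print(knn_predict(X_train, y_train, np.array([1.1, 0.9])))   # -> 0
```

The query point lies next to the two class-0 points, so the majority vote among its 3 nearest neighbours returns 0.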
KNN also tends to reduce overfitting, but on the other hand the value of K must be chosen well. So how do we choose K? A common heuristic is to use the square root of the number of samples in the dataset as the value for K. An optimal value still has to be found, since a low K may lead to overfitting, while a high K increases the cost of the distance computations and can over-smooth the decision boundary. Plotting the error rate against K can help; the elbow method is another option. You can prefer the square-root heuristic or follow the elbow method instead.
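A sketch of the error-plot approach, using synthetic data from sklearn's make_classification (an assumption, since the lab's real dataset is only introduced later):

```python
# Sketch of the error-plot approach to choosing K, on synthetic data from
# make_classification (an assumption; the lab's real dataset appears later).
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

X, y = make_classification(n_samples=400, n_features=4, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

# Record the test error for each candidate K
errors = []
for k in range(1, 21):
    model = KNeighborsClassifier(n_neighbors=k).fit(X_tr, y_tr)
    errors.append(1 - model.score(X_te, y_te))

best_k = int(np.argmin(errors)) + 1        # K with the lowest test error
print(best_k, int(np.sqrt(len(X_tr))))     # compare with the sqrt heuristic
```

Plotting `errors` against K with matplotlib would show the elbow; here we simply pick the K with the smallest error and compare it with the square-root heuristic (sqrt(320) ≈ 17).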
Example:
Consider an example problem to build a clear intuition for K-Nearest Neighbors classification. We use the Social Network Ads dataset, which contains details of users on a social networking site; the task is to predict whether a user buys a product after clicking an ad on the site, based on their gender, age, and salary.
Importing essential libraries:
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
import sklearn
Importing of the dataset and slicing it into independent and dependent variables:
dataset = pd.read_csv('Social_Network_Ads.csv')
X = dataset.iloc[:, [1, 2, 3]].values   # Gender, Age, EstimatedSalary
y = dataset.iloc[:, -1].values          # Purchased (0/1)
Since the dataset contains a character variable (Gender), it needs to be encoded using LabelEncoder.
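The encoding step can be sketched as below. A small stand-in array replaces the CSV slice (the real X comes from the iloc slicing above); LabelEncoder maps the sorted class names to integers:

```python
# Sketch of encoding the Gender column; the array stands in for the real X.
import numpy as np
from sklearn.preprocessing import LabelEncoder

X = np.array([['Male', 19, 19000],
              ['Female', 26, 43000],
              ['Female', 27, 57000]], dtype=object)

le = LabelEncoder()
X[:, 0] = le.fit_transform(X[:, 0])   # 'Female' -> 0, 'Male' -> 1
X = X.astype(float)                   # now every column is numeric
print(X[:, 0])                        # [1. 0. 0.]
```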
Split the dataset into a train and a test set. With a test size of 0.20, the training sample contains 320 rows and the test sample contains 80 rows.
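The split might look like the following; the stand-in arrays mimic the dataset's 400 rows, and the random_state is an arbitrary choice for reproducibility:

```python
# Illustrative split; stand-in arrays mimic the 400-row dataset.
import numpy as np
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.random((400, 3))        # stands in for Gender/Age/Salary
y = rng.integers(0, 2, 400)     # stands in for Purchased

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.20, random_state=0)
print(len(X_train), len(X_test))   # 320 80
```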
Next, feature scaling is applied to the training and test sets of independent variables, so that features measured on large scales (such as salary) do not dominate the distance computation.
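A sketch of the scaling step with StandardScaler, one common choice for this lab; the small arrays stand in for the real split:

```python
# Sketch of feature scaling; the small arrays stand in for the real split.
import numpy as np
from sklearn.preprocessing import StandardScaler

X_train = np.array([[19., 19000.], [26., 43000.], [27., 57000.], [35., 76000.]])
X_test = np.array([[30., 87000.]])

sc = StandardScaler()
X_train = sc.fit_transform(X_train)   # fit the mean/std on the training set only
X_test = sc.transform(X_test)         # apply the same scaling to the test set
print(X_train.mean(axis=0))           # each scaled column now has mean ~0
```

Fitting the scaler on the training set only, then reusing it on the test set, prevents information from the test set leaking into training.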
Build and train the K Nearest Neighbor model with the training set.
Three different parameters are used when creating the model. n_neighbors is set to 5, which means the 5 nearest neighbouring points are used to classify a given point. The distance metric used is Minkowski, whose equation is

    d(x, y) = ( Σ_i |x_i − y_i|^p )^(1/p)

In this example we choose p = 2, which reduces the Minkowski distance to the Euclidean distance. Once the model is created and trained, we predict the output for the test set.
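The model-building step can be sketched as below; the tiny stand-in training set forces n_neighbors=3 here, whereas the lab uses 5 on the full data:

```python
# Sketch of the model-building step; tiny stand-in data forces n_neighbors=3,
# whereas the lab uses n_neighbors=5 on the full dataset.
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

X_train = np.array([[0.1, 0.2], [0.0, 0.1], [0.9, 1.0], [1.0, 0.8]])
y_train = np.array([0, 0, 1, 1])

# metric='minkowski' with p=2 is exactly the Euclidean distance
classifier = KNeighborsClassifier(n_neighbors=3, metric='minkowski', p=2)
classifier.fit(X_train, y_train)
print(classifier.predict(np.array([[0.05, 0.15]])))   # nearest points are class 0
```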
y_pred = classifier.predict(X_test)
y_test
>>
array([0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 1,
       0, 1, 0, 1, 0, 0, 0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0,
       1, 0, 0, 1, 0, 1, 1, 0, 0, 0, 1, 1, 0, 0, 1, 0, 0, 1, 0, 1, 0, 1,
       0, 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 0, 1, 1, 1, 0, 0, 0, 1, 1, 0, 1,
       1, 0, 0, 1, 0, 0, 0, 1, 0, 1, 1, 1], dtype=int64)
y_pred
>>
array([0, 0, 0, 0, 0, 0, 0, 1, 0, 1, 0, 0, 0, 0, 0, 1, 0, 0, 1, 0, 0, 1,
       0, 1, 0, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0,
       1, 0, 0, 1, 0, 1, 1, 0, 0, 1, 1, 1, 0, 0, 1, 0, 0, 1, 0, 1, 0, 1,
       0, 0, 0, 0, 1, 0, 0, 1, 0, 0, 0, 0, 1, 1, 1, 1, 0, 0, 1, 0, 0, 1,
       1, 0, 0, 1, 0, 0, 0, 0, 0, 1, 1, 1], dtype=int64)
Evaluate the model using the confusion matrix and the accuracy score, comparing the predicted values with the actual test values.
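The evaluation can be sketched as follows, with short stand-in label arrays in place of the real y_test and y_pred shown above:

```python
# Sketch of the evaluation step; short stand-in label arrays replace the
# real y_test / y_pred shown above.
import numpy as np
from sklearn.metrics import confusion_matrix, accuracy_score

y_true = np.array([0, 0, 1, 1, 0, 1])
y_hat = np.array([0, 1, 1, 1, 0, 0])

cm = confusion_matrix(y_true, y_hat)   # rows = actual class, columns = predicted
ac = accuracy_score(y_true, y_hat)     # fraction of correct predictions
print(cm)
print(ac)
```

The accuracy equals the sum of the diagonal of the confusion matrix divided by the total number of samples.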
Confusion matrix :
cm
>>
[[64  4]
 [ 3 29]]
ac
>>
0.93