Experiment No: 9
Objective: Write a program to implement the k-Nearest Neighbour algorithm to classify the Iris
data set. Print both correct and wrong predictions. Java/Python ML library classes can be used
for this problem.
Description:
K-Nearest Neighbour (K-NN) is based on the Supervised Learning technique. The algorithm assumes
similarity between the new case and the available cases, and puts the new case into the category
that is most similar to the available categories. K-NN stores all the available data and
classifies a new data point based on that similarity, so whenever new data appears it can be
easily assigned to a well-suited category.
K-NN can be used for Regression as well as Classification, but it is mostly used for
Classification problems. It is a non-parametric algorithm, which means it makes no
assumption about the underlying data. It is also called a lazy learner algorithm because it does
not learn from the training set immediately; instead, it stores the dataset and performs the
computation only at classification time.
Thus, at the training phase the K-NN algorithm just stores the dataset, and when it receives new
data it classifies that data into the category most similar to it.
The working of K-NN can be explained by the following steps (a minimal from-scratch sketch
follows the list):
1. Select the number K of neighbours.
2. Calculate the Euclidean distance from the query point to every training point.
3. Take the K nearest neighbours as per the calculated Euclidean distances.
4. Among these K neighbours, count the number of data points in each category.
5. Assign the new data point to the category for which the number of neighbours is
maximum.
6. The model is ready.
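To make these steps concrete, here is a minimal from-scratch sketch of steps 2 to 5 in Python
(the names knn_predict, X_train, y_train and the choice k = 3 are illustrative; this is not the
library-based program given below):

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_query, k=3):
    # steps 2-3: Euclidean distance from the query to every training
    # point, then the indices of the k nearest points
    distances = np.sqrt(((X_train - x_query) ** 2).sum(axis=1))
    nearest = np.argsort(distances)[:k]
    # steps 4-5: majority vote among the k neighbours
    votes = Counter(y_train[i] for i in nearest)
    return votes.most_common(1)[0][0]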
Training Algorithm:
For each training example (x, f(x)), add the example to the list training_examples.
Classification Algorithm:
Given a query instance x_q to be classified, let x_1, x_2, ..., x_k denote the k instances
from training_examples that are nearest to x_q, and
return f̂(x_q) ← (1/k) · ∑_{i=1}^{k} f(x_i)
where f̂(x_q) is the mean of the target values f(x_i) of the k nearest training examples; for
example, with k = 3 and neighbour values 4, 5 and 6, f̂(x_q) = 5. (For a discrete class label,
the majority vote of step 5 above is used instead of the mean.)
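A minimal sketch of this averaging rule in Python (the names knn_regress and f_train are ours,
for illustration only; it differs from the knn_predict sketch above only in the final step):

import numpy as np

def knn_regress(X_train, f_train, x_query, k=3):
    # distances from the query to every stored training example
    distances = np.sqrt(((X_train - x_query) ** 2).sum(axis=1))
    # mean target value of the k nearest examples
    nearest = np.argsort(distances)[:k]
    return f_train[nearest].mean()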
Data Set:
Iris plants data set: the data set contains 150 instances (50 in each of three classes).
Number of attributes: 4 numeric, predictive attributes and the class.
S. No sepal_length sepal_width petal_length petal_width class
0 5.1 3.5 1.4 0.2 Iris-setosa
1 4.9 3.0 1.4 0.2 Iris-setosa
2 4.7 3.2 1.3 0.2 Iris-setosa
3 4.6 3.1 1.5 0.2 Iris-setosa
4 5.0 3.6 1.4 0.2 Iris-setosa
Program:
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import classification_report, confusion_matrix
from sklearn import datasets
iris=datasets.load_iris()
x = iris.data
y = iris.target
# split the data into training and test sets; an 80/20 split
# matches the 30 test samples in the output below
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.2)
# fit a k-NN classifier (k = 5 assumed) and predict on the test set
classifier = KNeighborsClassifier(n_neighbors=5)
classifier.fit(x_train, y_train)
y_pred = classifier.predict(x_test)
print('Confusion Matrix')
print(confusion_matrix(y_test,y_pred))
print('Accuracy Metrics')
print(classification_report(y_test,y_pred))
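The objective also asks for the correct and wrong predictions to be printed. A minimal way to
do that with the variables above (this loop is our addition, not part of the original listing):

for i in range(len(y_test)):
    predicted = iris.target_names[y_pred[i]]
    actual = iris.target_names[y_test[i]]
    if y_pred[i] == y_test[i]:
        print('Correct prediction:', x_test[i], '->', predicted)
    else:
        print('Wrong prediction:', x_test[i], '->', predicted, '(actual:', actual, ')')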
Output:
Confusion matrix is as follows
[[11 0 0]
[0 9 1]
[0 1 8]]
Accuracy metrics
              precision    recall  f1-score   support
           0       1.00      1.00      1.00        11
           1       0.90      0.90      0.90        10
           2       0.89      0.89      0.89         9
 avg / total       0.93      0.93      0.93        30
Experiment No: 10
Objective: Implement the non-parametric Locally Weighted Regression algorithm in order to
fit data points. Select appropriate data set for your experiment and draw graphs.
Description:
Locally Weighted Regression Algorithm
Regression:
Regression is a technique from statistics that is used to predict values of a desired target
quantity when the target quantity is continuous.
In regression, we seek to identify (or estimate) a continuous variable y associated with a
given input vector x.
y is called the dependent variable.
x is called the independent variable.
Loess/Lowess Regression:
Loess regression is a nonparametric technique that uses local weighted regression to fit a smooth
curve through points in a scatter plot.
Lowess Algorithm:
Locally weighted regression is a very powerful nonparametric model used in statistical
learning.
Given a dataset X, y, we attempt to find a model parameter β(x) that minimizes the
residual sum of weighted squared errors.
The weights are given by a kernel function (k or w), which can be chosen arbitrarily.
Algorithm
1. Read the given data sample into X and the target curve (linear or non-linear) into Y.
2. Set the value of the smoothing (free) parameter, say τ.
3. Set the point of interest x0, drawn from the domain of X.
4. Determine the weight matrix using:
   w(x, x0) = exp(−(x − x0)² / (2τ²))
5. Determine the value of the model parameter β using the weighted normal equation:
   β(x0) = (XᵀWX)⁻¹ XᵀWy
6. The prediction at the point of interest is then ŷ = x0 · β(x0).
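Putting the weight formula and the normal equation together, a single locally weighted
prediction can be sketched in a few lines of NumPy (the function lwr_predict and its variable
names are ours, for illustration; this is not the lab program itself):

import numpy as np

def lwr_predict(x0, X, y, tau):
    # Gaussian weight of every training point relative to the query x0
    w = np.exp(-(X - x0) ** 2 / (2 * tau ** 2))
    W = np.diag(w)
    # design matrix with a bias column, then beta = (X^T W X)^(-1) X^T W y
    A = np.c_[np.ones(len(X)), X]
    beta = np.linalg.pinv(A.T @ W @ A) @ A.T @ W @ y
    # prediction at the query point
    return np.array([1.0, x0]) @ beta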
Program:
import numpy as np
from bokeh.plotting import figure, show, output_notebook
from bokeh.layouts import gridplot
from bokeh.io import push_notebook
def local_regression(x0, X, Y, tau):
    # add a bias (intercept) term to the query point and the design matrix
    x0 = np.r_[1, x0]
    X = np.c_[np.ones(len(X)), X]
    # fit the weighted model: beta = (X^T W X)^(-1) X^T W Y
    xw = X.T * radial_kernel(x0, X, tau)
    beta = np.linalg.pinv(xw @ X) @ xw @ Y
    # predict value
    return x0 @ beta  # @ Matrix Multiplication or Dot Product for prediction

def radial_kernel(x0, X, tau):
    # Gaussian kernel weights: exp(-||x - x0||^2 / (2 tau^2))
    return np.exp(np.sum((X - x0) ** 2, axis=1) / (-2 * tau * tau))
# generate dataset of n points (the value of n is assumed)
n = 1000
X = np.linspace(-3, 3, num=n)
print("The Data Set ( 10 Samples) X :\n",X[1:10])
Y = np.log(np.abs(X ** 2 - 1) + .5)
print("The Fitting Curve Data Set (10 Samples) Y :\n",Y[1:10])
# jitter X with Gaussian noise
X += np.random.normal(scale=.1, size=n)
print("Jittered (10 Samples) X :\n",X[1:10])
domain = np.linspace(-3, 3, num=300)
print(" Xo Domain Space(10 Samples) :\n",domain[1:10])
def plot_lwr(tau):
    # fit the locally weighted model over the whole domain and plot it
    prediction = [local_regression(x0, X, Y, tau) for x0 in domain]
    plot = figure(title='tau=%g' % tau)
    plot.scatter(X, Y, alpha=.3)
    plot.line(domain, prediction, line_width=2, color='red')
    return plot

output_notebook()
show(gridplot([
    [plot_lwr(10.), plot_lwr(1.)],
    [plot_lwr(0.1), plot_lwr(0.01)]]))
Output:
[Four Bokeh plots of the jittered sample data with the fitted locally weighted regression
curve, one per smoothing value: tau = 10, 1, 0.1 and 0.01.]
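An alternative version of this program, written for the restaurant tips data set with
matplotlib, also appears in the lab file, but only its core functions survived the page breaks.
The imports and the kernel() helper below are a reconstruction of the standard form of this
program, so treat them as an assumption rather than the original listing.
Alternative Program (tips data set):

import numpy as np1
import pandas as pd
import matplotlib.pyplot as plt

def kernel(point,xmat,k):
    # build a diagonal matrix of Gaussian weights for one query point
    m,n = np1.shape(xmat)
    weights = np1.mat(np1.eye(m))
    for j in range(m):
        diff = point - xmat[j]
        weights[j,j] = np1.exp(diff*diff.T/(-2.0*k**2))
    return weights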
def localWeight(point,xmat,ymat,k):
    wei = kernel(point,xmat,k)
    # weighted normal equation: beta = (X^T W X)^(-1) X^T W y
    W = (xmat.T*(wei*xmat)).I*(xmat.T*(wei*ymat.T))
    return W
def localWeightRegression(xmat,ymat,k):
    m,n = np1.shape(xmat)
    ypred = np1.zeros(m)
    for i in range(m):
        ypred[i] = xmat[i]*localWeight(xmat[i],xmat,ymat,k)
    return ypred
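The data-loading and plotting code that originally preceded the final two lines is also
missing; below is a reconstruction under the usual setup for this program (the file name
10-dataset.csv and the smoothing value k = 0.5 are assumptions):

# load the tips data set (file name assumed)
data = pd.read_csv('10-dataset.csv')
bill = np1.array(data.total_bill)
tip = np1.array(data.tip)
# design matrix with a bias column
mbill = np1.mat(bill)
mtip = np1.mat(tip)
m = np1.shape(mbill)[1]
one = np1.mat(np1.ones(m))
X = np1.hstack((one.T, mbill.T))
# run locally weighted regression and plot the fitted curve
ypred = localWeightRegression(X, mtip, 0.5)
SortIndex = X[:,1].argsort(0)
xsort = X[SortIndex][:,0]
plt.scatter(bill, tip, color='green')
plt.plot(xsort[:,1], ypred[SortIndex], color='red', linewidth=2)
plt.xlabel('Total bill')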
plt.ylabel('Tip')
plt.show()