0% found this document useful (0 votes)

23 views8 pages

MMDS Da3

The document describes a lab activity to implement the stochastic gradient descent algorithm for linear regression using Python. Students are instructed to code the algorithm, test it on randomly generated datasets using the Boston housing data, and compare the results to Scikit-Learn's SGD regressor. The code implements SGD from scratch and with Scikit-Learn, trains models on housing data, makes predictions, and calculates error metrics to evaluate performance.

Uploaded by

Good Boy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

23 views8 pages

MMDS Da3

Uploaded by

Good Boy

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

School of Computer Science and Engineering

CSE3045- Mathematical Modelling for Data Science

Semester: Fall 2022-23
Slot: L21+L22

LAB Activity-3

Faculty: Dr. Arup Ghosh

1) Implement the Stochastic Gradient Descent Algorithm for

Linear Regression using Python and Test it for some randomly
generated datasets.

Code
import warnings
warnings.filterwarnings("ignore")
from sklearn.datasets import load_boston
from sklearn import preprocessing
import pandas as pd
import numpy as np
import matplotlib.pyplot as plt
from prettytable import PrettyTable
from sklearn.linear_model import SGDRegressor
from sklearn import preprocessing
from sklearn.metrics import mean_squared_error
from numpy import random
from sklearn.model_selection import train_test_split
print("DONE")

# Linear Regression on Boston Housing data

boston_data=pd.DataFrame(load_boston().data,columns=load_boston().feature_names)
Y=load_boston().target
X=load_boston().data
x_train,x_test,y_train,y_test=train_test_split(X,Y,test_size=0.3)

# standardizing data
scaler = preprocessing.StandardScaler().fit(x_train)
x_train = scaler.transform(x_train)
x_test=scaler.transform(x_test)

## Adding the PRICE Column in the data

train_data=pd.DataFrame(x_train)
train_data['price']=y_train
train_data.head(3)

# implemented SGD Classifier

def GradientDescentRegressor(train_data,learning_rate=0.001,n_itr=1000,k=10):
w_cur=np.zeros(shape=(1,train_data.shape[1]-1))
b_cur=0
cur_itr=1
while(cur_itr<=n_itr):
w_old=w_cur
b_old=b_cur
w_temp=np.zeros(shape=(1,train_data.shape[1]-1))
b_temp=0
temp=train_data.sample(k)
#print(temp.head(3))
y=np.array(temp['price'])
x=np.array(temp.drop('price',axis=1))
for i in range(k):
w_temp+=x[i]*(y[i]-(np.dot(w_old,x[i])+b_old))*(-2/k)
b_temp+=(y[i]-(np.dot(w_old,x[i])+b_old))*(-2/k)
w_cur=w_old-learning_rate*w_temp
b_cur=b_old-learning_rate*b_temp
if(w_old==w_cur).all():
break
cur_itr+=1
return w_cur,b_cur
def predict(x,w,b):
y_pred=[]
for i in range(len(x)):
y=np.asscalar(np.dot(w,x[i])+b)
y_pred.append(y)
return np.array(y_pred)

def plot_(test_data,y_pred):
#scatter plot
plt.scatter(test_data,y_pred)
plt.grid()
plt.title('scatter plot between actual y and predicted y')
plt.xlabel('actual y')
plt.ylabel('predicted y')
plt.show()

w,b = GradientDescentRegressor(train_data,learning_rate=0.001,n_itr=1000)
y_pred_sgd=predict(x_test,w,b)

plot_(y_test,y_pred_sgd)
print('Mean Squared Error :',mean_squared_error(y_test, y_pred_sgd))
OUTPUT:

learning_rate=0.001, n_itr=1000

Mean Squared Error : 37.646704089438025

On Changing learning rate to 0.01 i.e., 1%.

learning_rate=0.01,

n_itr=1000

Mean Squared Error : 23.285739328791426

SCREENSHOT OF CODE and OUTPUT:

Implementing Stochastic Gradient Descent using SKLEARN library:
# SkLearn SGD classifier
n_iter=100
clf_ = SGDRegressor(max_iter=n_iter)
clf_.fit(x_train, y_train)
y_pred_sksgd=clf_.predict(x_test)
plt.scatter(y_test,y_pred_sksgd)
plt.grid()
plt.xlabel('Actual y')
plt.ylabel('Predicted y')
plt.title('Scatter plot from actual y and predicted y')
plt.show()

print('Mean Squared Error :',mean_squared_error(y_test, y_pred_sksgd))

# SkLearn SGD classifier predicted weight matrix

sklearn_w=clf_.coef_
sklearn_w
Comparing Both Methods:

Machine Learnin
100% (2)
Machine Learnin
23 pages
Machine
100% (1)
Machine
45 pages
MIT Ans
No ratings yet
MIT Ans
216 pages
04 Training Linear Models
No ratings yet
04 Training Linear Models
35 pages
Ann Experiential Learning
No ratings yet
Ann Experiential Learning
43 pages
ML Lab....... 3-Converted New
No ratings yet
ML Lab....... 3-Converted New
27 pages
Mayhoc
No ratings yet
Mayhoc
51 pages
Lecture04. Training Models (Regression in Chapter 4)
No ratings yet
Lecture04. Training Models (Regression in Chapter 4)
44 pages
ML Labs
No ratings yet
ML Labs
46 pages
ML Lab Record
No ratings yet
ML Lab Record
17 pages
Linear Regression
No ratings yet
Linear Regression
18 pages
Naive Bayes
No ratings yet
Naive Bayes
58 pages
Ai Lab
No ratings yet
Ai Lab
19 pages
Experiment No
No ratings yet
Experiment No
29 pages
Data Mining Practicals
No ratings yet
Data Mining Practicals
22 pages
Lecture3 Upload
No ratings yet
Lecture3 Upload
28 pages
'/content/drive': From Import Import As Import As Import As
No ratings yet
'/content/drive': From Import Import As Import As Import As
9 pages
Mlee Lab4
No ratings yet
Mlee Lab4
11 pages
C2W3 Lab 01 Model Evaluation and Selection
No ratings yet
C2W3 Lab 01 Model Evaluation and Selection
21 pages
LinearRegression Tutorial
No ratings yet
LinearRegression Tutorial
40 pages
ML TW-PW 02-2
No ratings yet
ML TW-PW 02-2
9 pages
Assignment No. 3: 1. Plot of Loss Function J Vs Number of Iterations
No ratings yet
Assignment No. 3: 1. Plot of Loss Function J Vs Number of Iterations
6 pages
C2W3 Lab 01 Model Evaluation and Selection
No ratings yet
C2W3 Lab 01 Model Evaluation and Selection
21 pages
Mlee Lab1
No ratings yet
Mlee Lab1
9 pages
ML Record Print
No ratings yet
ML Record Print
20 pages
ML Journal External
No ratings yet
ML Journal External
14 pages
1710993830340
No ratings yet
1710993830340
9 pages
H2 AndresAlcivar
No ratings yet
H2 AndresAlcivar
12 pages
21 CP 46 - (ML LAB 3)
No ratings yet
21 CP 46 - (ML LAB 3)
13 pages
Sofcomputing Da2
No ratings yet
Sofcomputing Da2
7 pages
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 4
No ratings yet
Setup: This Notebook Contains All The Sample Code and Solutions To The Exercises in Chapter 4
24 pages
Aiml Practicals
No ratings yet
Aiml Practicals
22 pages
ANN PR Code and Output
No ratings yet
ANN PR Code and Output
25 pages
Index: Name - JINESH PRAJAPAT Class - B. Tech, III Year Branch - AI & DS Sem - V
No ratings yet
Index: Name - JINESH PRAJAPAT Class - B. Tech, III Year Branch - AI & DS Sem - V
35 pages
Chapter 6 - Advanced Machine Learning PDF
No ratings yet
Chapter 6 - Advanced Machine Learning PDF
37 pages
ML Assignment
No ratings yet
ML Assignment
5 pages
To Improve The Performance of Models Predicting Ba
No ratings yet
To Improve The Performance of Models Predicting Ba
6 pages
Neural Net Python Sleep Study
No ratings yet
Neural Net Python Sleep Study
3 pages
Da 012307
No ratings yet
Da 012307
8 pages
Assignment 7
No ratings yet
Assignment 7
5 pages
22051001 (2)
No ratings yet
22051001 (2)
5 pages
ML LAB Manual-1
No ratings yet
ML LAB Manual-1
4 pages
20102A0071 DL Experiment5.b
No ratings yet
20102A0071 DL Experiment5.b
5 pages
Btech1007022 Lab5
No ratings yet
Btech1007022 Lab5
14 pages
21bit0706 VL2024250106861 Da
No ratings yet
21bit0706 VL2024250106861 Da
7 pages
Neural Network Code
No ratings yet
Neural Network Code
5 pages
C1 W2 Lab05 Sklearn GD Soln
No ratings yet
C1 W2 Lab05 Sklearn GD Soln
3 pages
Stochastic 2205317
No ratings yet
Stochastic 2205317
3 pages
Btech1007022 Lab5.1
No ratings yet
Btech1007022 Lab5.1
9 pages
Lab-5 Report
No ratings yet
Lab-5 Report
11 pages
AI Lab Final - 2
No ratings yet
AI Lab Final - 2
9 pages
Machine Learning Lab (3) Report (21 CP 81)
No ratings yet
Machine Learning Lab (3) Report (21 CP 81)
7 pages
Ai Last 5
No ratings yet
Ai Last 5
4 pages
16BCB0126 VL2018195002535 Pe003
No ratings yet
16BCB0126 VL2018195002535 Pe003
40 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
23 pages
Shailesh020902@gmail - Com 6
No ratings yet
Shailesh020902@gmail - Com 6
2 pages
''' Function To Load Dataset ''': Open List Range Len Float
No ratings yet
''' Function To Load Dataset ''': Open List Range Len Float
3 pages
CH - En.u4cse19101 Cheduri Linearregression
No ratings yet
CH - En.u4cse19101 Cheduri Linearregression
8 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet

MMDS Da3

Uploaded by

MMDS Da3

Uploaded by

School of Computer Science and Engineering

CSE3045- Mathematical Modelling for Data Science

Faculty: Dr. Arup Ghosh

1) Implement the Stochastic Gradient Descent Algorithm for

# Linear Regression on Boston Housing data

## Adding the PRICE Column in the data

# implemented SGD Classifier

Mean Squared Error : 37.646704089438025

On Changing learning rate to 0.01 i.e., 1%.

Mean Squared Error : 23.285739328791426

SCREENSHOT OF CODE and OUTPUT:

print('Mean Squared Error :',mean_squared_error(y_test, y_pred_sksgd))

# SkLearn SGD classifier predicted weight matrix

You might also like