Assignment 1

The document outlines an assignment to develop a Linear Regression model using the Least Squares Estimation method to predict salaries based on years of experience. It includes code for data preprocessing, model training, and performance evaluation using Mean Squared Error (MSE). The implementation features data visualization and error calculation for both training and testing datasets.

Uploaded by

Yash Shirsat

Assignment 1

Name: Satyajit Shinde

Div: TY AI C Roll No.: 41
PRN: 12211701

Develop and implement a Linear Regression model using the Least Squares Estimation method to predict a target variable based on a given dataset. Calculate the sum of squared differences between the actual and predicted values. The implementation should include dataset preprocessing, model training, and performance evaluation using metrics such as Mean Squared Error (MSE).
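For reference, the quantities the task asks for can be written out as follows. The symbols a (intercept), b (slope) and N (number of training points) match the variable names used in the code; the notation \hat{y}_i and SSE is introduced here for illustration only.

```latex
\hat{y}_i = a + b\,x_i

b = \frac{N \sum_{i=1}^{N} x_i y_i - \sum_{i=1}^{N} x_i \sum_{i=1}^{N} y_i}
         {N \sum_{i=1}^{N} x_i^2 - \left( \sum_{i=1}^{N} x_i \right)^2},
\qquad
a = \frac{\sum_{i=1}^{N} y_i - b \sum_{i=1}^{N} x_i}{N}

\mathrm{SSE} = \sum_{i=1}^{N} \left( y_i - \hat{y}_i \right)^2,
\qquad
\mathrm{MSE} = \frac{\mathrm{SSE}}{N}
```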

Code:

import pandas as pd
import numpy as np
import matplotlib.pyplot as plt

df = pd.read_csv("C:\\Users\\user\\Desktop\\Sem 6\\SI\\salary_data.csv")
df.head()
df.shape

# Split the data into X and y,
# where X holds the independent variable and y the dependent variable.
X = df.loc[:, "YearsExperience"]
y = df.loc[:, "Salary"]

# Split X and y into X_train, y_train, X_test, y_test:
# the first 21 rows are used for training, the remaining rows for testing.
X_train = X.iloc[:21]
y_train = y.iloc[:21]
X_test = X.iloc[21:]
y_test = y.iloc[21:]

X_train, y_train
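The split above takes the first 21 rows in file order. If the rows happen to be sorted by experience, the test set would then contain only the largest experience values. A minimal sketch of a shuffled split is shown here as one possible alternative, not as the method used in this assignment; the function name `shuffled_split`, the seed and the demo data are assumptions made for illustration.

```python
import numpy as np

def shuffled_split(x, y, n_train, seed=0):
    """Shuffle paired samples, then split them into train and test parts."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    rng = np.random.default_rng(seed)      # fixed seed keeps the split reproducible
    order = rng.permutation(len(x))        # random ordering of the row positions
    train_idx, test_idx = order[:n_train], order[n_train:]
    return x[train_idx], y[train_idx], x[test_idx], y[test_idx]

# Made-up data, not the salary dataset
x_demo = np.arange(10, dtype=float)
y_demo = 2.0 * x_demo + 1.0

x_tr, y_tr, x_te, y_te = shuffled_split(x_demo, y_demo, n_train=7)
print(len(x_tr), len(x_te))
```

Each x value stays paired with its y value because the same index array is applied to both.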

# Calculating the line equation y = a + b*x by least squares
N = len(X_train)
sum_X = sum(X_train)
sum_Y = sum(y_train)
sum_XY = sum(X_train * y_train)
sum_X_square = sum(X_train ** 2)

# b is the slope and a is the intercept of the fitted line
b = ((N * sum_XY) - (sum_X * sum_Y)) / ((N * sum_X_square) - (sum_X ** 2))
a = (sum_Y - (b * sum_X)) / N
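One way to sanity-check the closed-form slope and intercept is to compare them with `np.polyfit` on a small example. The sketch below wraps the same formulas in a hypothetical helper `fit_line` and uses made-up data rather than the salary dataset.

```python
import numpy as np

def fit_line(x, y):
    """Closed-form least squares fit of y = a + b*x; returns (a, b)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = len(x)
    sum_x, sum_y = x.sum(), y.sum()
    sum_xy = (x * y).sum()
    sum_x_sq = (x ** 2).sum()
    b = (n * sum_xy - sum_x * sum_y) / (n * sum_x_sq - sum_x ** 2)  # slope
    a = (sum_y - b * sum_x) / n                                      # intercept
    return a, b

# Made-up data for illustration
x_demo = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y_demo = np.array([3.1, 4.9, 7.2, 8.8, 11.0])

a_demo, b_demo = fit_line(x_demo, y_demo)

# np.polyfit returns coefficients from the highest degree down: [slope, intercept]
slope_ref, intercept_ref = np.polyfit(x_demo, y_demo, deg=1)

print(np.isclose(a_demo, intercept_ref), np.isclose(b_demo, slope_ref))
```

If the two results disagree, the usual suspects are a mistyped sum or a denominator of zero, which happens when all x values are identical.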

# Predicting values
def pred(a, b, x):
    return a + b * x

for x in X_train:
    print(f"Experience : {x} and expected salary is : {pred(a,b,x)}")

for x in X_test:
    print(f"Experience : {x} and expected salary is : {pred(a,b,x)}")

c = pred(a, b, 6)
c

# Predicting on the test and training sets
pred_test = pred(a, b, X_test)
pred_train = pred(a, b, X_train)

pred_test
pred_train

# Plotting the training data (scatter) and the fitted line
plt.plot(X_train, pred_train, color="yellow")
plt.scatter(X_train, y_train)
plt.show()

# Plotting predicted values and actual values
plt.plot(X_test, pred_test, label='Model Prediction')
plt.scatter(X_test, pred_test, color='red', label='Predicted')
plt.scatter(X_test, y_test, label='Actual')

# Label each predicted point (x_val/y_val avoid overwriting the y Series)
for x_val, y_val in zip(X_test, pred_test):
    plt.annotate(f'{y_val:.2f}', (x_val, y_val), textcoords="offset points", xytext=(5, 5), ha='left', color='red')

# Label each actual point
for x_val, y_val in zip(X_test, y_test):
    plt.annotate(f'{y_val:.2f}', (x_val, y_val), textcoords="offset points", xytext=(5, 5), ha='right')

# Dashed line between each predicted and actual value (the residual)
for x_val, y_pred, y_actual in zip(X_test, pred_test, y_test):
    plt.plot([x_val, x_val], [y_pred, y_actual], color='gray', linestyle='--')

plt.xlabel('X_test')
plt.ylabel('Values')
plt.legend()
plt.show()

# Calculating Mean Squared Error
error_list = []

def mean_squared_error(true, pred):
    squared_error = (true - pred) ** 2
    error_list.append(squared_error)  # keep the per-sample squared errors
    mse = sum(squared_error) / len(true)
    return mse

mse_test = mean_squared_error(y_test, pred_test)
mse_train = mean_squared_error(y_train, pred_train)

print(f"Mean Squared Error for testing set is : {mse_test}")
print(f"Mean Squared Error for training set is : {mse_train}")
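The task statement also asks for the sum of squared differences, which is the numerator of the MSE. A minimal NumPy sketch of both quantities follows; the function names `sum_squared_error` and `mse_from_sse` and the demo values are assumptions made for illustration.

```python
import numpy as np

def sum_squared_error(y_true, y_pred):
    """Sum of squared differences between actual and predicted values."""
    residuals = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.sum(residuals ** 2))

def mse_from_sse(y_true, y_pred):
    """Mean squared error: the sum of squared errors divided by the sample count."""
    return sum_squared_error(y_true, y_pred) / len(y_true)

# Made-up values: residuals are 1, 0, -2, so the squares are 1, 0, 4
y_true_demo = [3.0, 5.0, 7.0]
y_pred_demo = [2.0, 5.0, 9.0]

sse_demo = sum_squared_error(y_true_demo, y_pred_demo)
mse_demo = mse_from_sse(y_true_demo, y_pred_demo)
print(sse_demo, mse_demo)
```

Unlike the version above, these helpers do not append to a global list, so calling them repeatedly has no side effects.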

# Calculating mean absolute error
def abs_error(true, pred):
    error = abs(true - pred)
    print(f"Error is:\n{error}")
    final = sum(error)
    ae = final / len(true)
    return ae

error_list_mse = []
error_list_mae = []

# The fitted line is fixed, so the training errors are computed once
# rather than once per training sample.
y_pred = np.array([pred(a, b, x) for x in X_train])
mae = abs_error(y_train, y_pred)
mse = mean_squared_error(y_train, y_pred)
error_list_mse.append(mse)
error_list_mae.append(mae)
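The mean absolute error can also be computed in vectorized form. The sketch below is one possible version; the name `mean_absolute_error` and the demo values are assumptions, not part of the assignment code. MAE is expressed in the same units as the target, whereas MSE is in squared units.

```python
import numpy as np

def mean_absolute_error(y_true, y_pred):
    """Average of the absolute differences between actual and predicted values."""
    residuals = np.asarray(y_true, dtype=float) - np.asarray(y_pred, dtype=float)
    return float(np.mean(np.abs(residuals)))

# Made-up values: absolute residuals are 1, 0, 2
y_true_demo = [3.0, 5.0, 7.0]
y_pred_demo = [2.0, 5.0, 9.0]

mae_demo = mean_absolute_error(y_true_demo, y_pred_demo)
print(mae_demo)  # prints 1.0
```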
