0% found this document useful (0 votes)

11 views6 pages

Exp 1

The experiment implements a linear regression model to predict salary based on years of experience using a dataset from a Salary_Data.csv file. The model is trained on a training set split from the dataset and used to predict salaries for the test set. Visualizations of the training and test results show the regression line plotted against actual data points. The model is able to accurately learn the correlation between experience and salary from the training data and make predictions for the test set.

Uploaded by

Mr. S

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views6 pages

Exp 1

Uploaded by

Mr. S

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Machine Learning 1

EXPERIMENT NO 1
Title: To implement Linear Regression
Lab Objective: To implement an appropriate machine learning model for the given
application.

Theory:
1. We will begin with importing the dataset using pandas and also import other libraries
such as numpy and matplotlib.
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

dataset = pd.read_csv('Salary_Data.csv')
dataset.head()

2. Now that we have imported the dataset, we will perform data preprocessing.
X = dataset.iloc[:,:-1].values #independent variable array
y = dataset.iloc[:,1].values #dependent variable vector
The X is independent variable array and y is the dependent variable vector. Note the
difference between the array and vector. The dependent variable must be in vector and
independent variable must be an array itself.
3. Now that we have imported the dataset, we will perform data preprocessing.

X = dataset.iloc[:,:-1].values #independent variable array

y = dataset.iloc[:,1].values #dependent variable vector
The X is independent variable array and y is the dependent variable vector. Note the
difference between the array and vector. The dependent variable must be in vector and
independent variable must be an array itself.
4. We need to split our dataset into the test and train set. Generally, we follow the 20-80
policy or the 30-70 policy respectively.

Why is it necessary to perform splitting? This is because we wish to train our model
according to the years and salary. We then test our model on the test set.
We check whether the predictions made by the model on the test set data matches what was
given in the dataset.
If it matches, it implies that our model is accurate and is making the right predictions.

from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X,y,test_size=1/3,random_state=0)
We don’t need to apply feature scaling for linear regression as libraries take care of it.

5. From sklearn’s linear model library, import linear regression class. Create an object
for a linear regression class called regressor.

Name – Aarushi Tiwari Roll no. 60

Machine Learning 2

To fit the regressor into the training set, we will call the fit method – function to fit the
regressor into the training set.
We need to fit X_train (training data of matrix of features) into the target values y_train. Thus
the model learns the correlation and learns how to predict the dependent variables based on
the independent variable.

from sklearn.linear_model import LinearRegression

regressor = LinearRegression()
regressor.fit(X_train,y_train) #actually produces the linear eqn for the data

6. We create a vector containing all the predictions of the test set salaries. The predicted
salaries are then put into the vector called y_pred.(contains prediction for all observations in
the test set)
predict method makes the predictions for the test set. Hence, the input is the test set. The
parameter for predict must be an array or sparse matrix, hence input is X_test.
y_pred = regressor.predict(X_test)
y_pred

y-pred output
y_test

y-test output
y_test is the real salary of the test set.
y_pred are the predicted salaries.
Visualizing the results
Let’s see what the results of our code will look like when we visualize it.
1. Plotting the points (observations)
To visualize the data, we plot graphs using matplotlib. To plot real observation points ie
plotting the real given values.
The X-axis will have years of experience and the Y-axis will have the predicted salaries.
plt.scatter plots a scatter plot of the data. Parameters include :

1. X – coordinate (X_train: number of years)

2. Y – coordinate (y_train: real salaries of the employees)
3. Color ( Regression line in red and observation line in blue)
2. Plotting the regression line
plt.plot have the following parameters :

1. X coordinates (X_train) – number of years

2. Y coordinates (predict on X_train) – prediction of X-train (based on a number of years).

Name – Aarushi Tiwari Roll no. 60

Machine Learning 3

Note : The y-coordinate is not y_pred because y_pred is predicted salaries of the test set
observations.

#plot for the TRAIN

plt.scatter(X_train, y_train, color='red') # plotting the observation line
plt.plot(X_train, regressor.predict(X_train), color='blue') # plotting the regression line
plt.title("Salary vs Experience (Training set)") # stating the title of the graph
plt.xlabel("Years of experience") # adding the name of x-axis
plt.ylabel("Salaries") # adding the name of y-axis
plt.show() # specifies end of graph
Prerequisite Software and Command:
 Python 3 and above
 Pip install numpy
 Pip install pandas
 Pip install matplotlib
 Pip install sklearn
(These above command should be run only once)
Program Code:
# importing the dataset
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

dataset = pd.read_csv('Salary_Data.csv')
dataset.head()

# data preprocessing
X = dataset.iloc[:, :-1].values #independent variable array
y = dataset.iloc[:,1].values #dependent variable vector

# splitting the dataset

from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X,y,test_size=1/3,random_state=0)

# fitting the regression model

from sklearn.linear_model import LinearRegression
regressor = LinearRegression()
regressor.fit(X_train,y_train) #actually produces the linear eqn for the data

# predicting the test set results

y_pred = regressor.predict(X_test)
y_pred

Name – Aarushi Tiwari Roll no. 60

Machine Learning 4

y_test

# visualizing the results

#plot for the TRAIN

plt.scatter(X_train, y_train, color='red') # plotting the observation line

plt.plot(X_train, regressor.predict(X_train), color='blue') # plotting the regression line
plt.title("Salary vs Experience (Training set)") # stating the title of the graph

plt.xlabel("Years of experience") # adding the name of x-axis

plt.ylabel("Salaries") # adding the name of y-axis
plt.show() # specifies end of graph

#plot for the TEST

plt.scatter(X_test, y_test, color='red')

plt.plot(X_train, regressor.predict(X_train), color='blue') # plotting the regression line
plt.title("Salary vs Experience (Testing set)")

plt.xlabel("Years of experience")
plt.ylabel("Salaries")
plt.show()

Sample Output:

Program Output:

Name – Aarushi Tiwari Roll no. 60

Machine Learning 5

Name – Aarushi Tiwari Roll no. 60

Machine Learning 6

Conclusion: Linear Regression model implemented with experiential experimental

model on given data set of Salary Data csv file for prediction of salary and experience.

Name – Aarushi Tiwari Roll no. 60

ML Experiment No 1 Linear Regression Analysis
No ratings yet
ML Experiment No 1 Linear Regression Analysis
3 pages
Linear Regression
No ratings yet
Linear Regression
6 pages
Data Science Chapitre 2
No ratings yet
Data Science Chapitre 2
132 pages
2.1 ML (Implementation of Simple Linear Regression in Python)
No ratings yet
2.1 ML (Implementation of Simple Linear Regression in Python)
8 pages
Web II & DA Slip Solution
No ratings yet
Web II & DA Slip Solution
40 pages
Unit5 - Linear Regression
No ratings yet
Unit5 - Linear Regression
4 pages
Machine Learning
No ratings yet
Machine Learning
158 pages
Regression
No ratings yet
Regression
16 pages
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
No ratings yet
DMV Unit 3 PPT - RSK - 250419 - 125620 Jfhuehiwhu
89 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
30 pages
Statement of Account
No ratings yet
Statement of Account
109 pages
Linear Regression - Numpy and Sklearn
No ratings yet
Linear Regression - Numpy and Sklearn
7 pages
9.2. Data Science - Machine Learning - Simple Linear Regression - Example
No ratings yet
9.2. Data Science - Machine Learning - Simple Linear Regression - Example
10 pages
Department of Electronics& Computer Science (PP Experiment No.6)
No ratings yet
Department of Electronics& Computer Science (PP Experiment No.6)
4 pages
Cisco Certified Expert Firewall Fundamentals: Optional
No ratings yet
Cisco Certified Expert Firewall Fundamentals: Optional
4 pages
Python 1
No ratings yet
Python 1
3 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
43 pages
Data Science Chapitre 2
No ratings yet
Data Science Chapitre 2
98 pages
Word Embedding Generation For Telugu Corpus
No ratings yet
Word Embedding Generation For Telugu Corpus
28 pages
Canon I350 Waste Tank Full - Fixyourownprinter
No ratings yet
Canon I350 Waste Tank Full - Fixyourownprinter
22 pages
Perform Prediction Using Regression Algorithm: Ex No: 1 Date
No ratings yet
Perform Prediction Using Regression Algorithm: Ex No: 1 Date
13 pages
ML 6 7 8
No ratings yet
ML 6 7 8
10 pages
21BEI052 2EI503 ML SpecialAssignmentReport
No ratings yet
21BEI052 2EI503 ML SpecialAssignmentReport
12 pages
Linear Regression
No ratings yet
Linear Regression
20 pages
Machine Learning Hands-On
100% (1)
Machine Learning Hands-On
18 pages
Practical # 10
No ratings yet
Practical # 10
5 pages
Solution To Task 1
No ratings yet
Solution To Task 1
2 pages
DS P6 Yash
No ratings yet
DS P6 Yash
8 pages
CO2 Pre-Test & Functional Test Sheet
No ratings yet
CO2 Pre-Test & Functional Test Sheet
10 pages
Linear Regression2
No ratings yet
Linear Regression2
9 pages
Simple Linear Regression Code
No ratings yet
Simple Linear Regression Code
3 pages
2.3 ML (Implementation of Polynomial Regression Using Python)
No ratings yet
2.3 ML (Implementation of Polynomial Regression Using Python)
9 pages
Assignment 1
No ratings yet
Assignment 1
5 pages
Superseded
No ratings yet
Superseded
19 pages
EXP-4 DMusingPYTHON
No ratings yet
EXP-4 DMusingPYTHON
7 pages
Rittal White Paper 401: The Benefits of Busbar Power Distribution Systems For North American & Global Applications
No ratings yet
Rittal White Paper 401: The Benefits of Busbar Power Distribution Systems For North American & Global Applications
9 pages
Regression Dataset Example
No ratings yet
Regression Dataset Example
14 pages
Task 1
No ratings yet
Task 1
5 pages
SAP S - 4HANA Sourcing and Procurement - 1
100% (2)
SAP S - 4HANA Sourcing and Procurement - 1
36 pages
Business Technology
No ratings yet
Business Technology
4 pages
ml1 PRG
No ratings yet
ml1 PRG
2 pages
MiniWave Manual
No ratings yet
MiniWave Manual
16 pages
Data 98
No ratings yet
Data 98
4 pages
Exam 4 Training Grile
No ratings yet
Exam 4 Training Grile
15 pages
Lab Experiment 5
No ratings yet
Lab Experiment 5
5 pages
Machine Learning-SEAIML-241P (PR) Bharat
No ratings yet
Machine Learning-SEAIML-241P (PR) Bharat
42 pages
Power Grid Substation Report
No ratings yet
Power Grid Substation Report
49 pages
ICT 9 7.2 Design
No ratings yet
ICT 9 7.2 Design
70 pages
NTCC Sem VI Major Project WPR
No ratings yet
NTCC Sem VI Major Project WPR
12 pages
Lecture-2 Unit 2
No ratings yet
Lecture-2 Unit 2
56 pages
Epaycard - Customer - Account - Opening - Form - BUSTAMANTE, ARGEE L
No ratings yet
Epaycard - Customer - Account - Opening - Form - BUSTAMANTE, ARGEE L
1 page
ER04242
No ratings yet
ER04242
5 pages
Task 8
No ratings yet
Task 8
2 pages
ML Recordjp
No ratings yet
ML Recordjp
35 pages
Lecture Note-2
No ratings yet
Lecture Note-2
7 pages
The Evaluation of Operating System
No ratings yet
The Evaluation of Operating System
6 pages
Machine Learning Assignment
No ratings yet
Machine Learning Assignment
2 pages
Final Lab Manual
No ratings yet
Final Lab Manual
34 pages
Summer Internship Format May 2023 New
No ratings yet
Summer Internship Format May 2023 New
67 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
4 pages
Regression Demo
No ratings yet
Regression Demo
8 pages
Machine Learning 2
No ratings yet
Machine Learning 2
45 pages
Simple Linear Regression in Machine Learning
No ratings yet
Simple Linear Regression in Machine Learning
7 pages
Agniva
No ratings yet
Agniva
16 pages
BS en Iso 28927-5-2009 PDF
No ratings yet
BS en Iso 28927-5-2009 PDF
32 pages
Assignment 13 AICT
No ratings yet
Assignment 13 AICT
5 pages
Machine Learning Algorithm With Python Implementation
No ratings yet
Machine Learning Algorithm With Python Implementation
34 pages
Spiral Wound Gasket - Type LS
No ratings yet
Spiral Wound Gasket - Type LS
1 page
An Analysis of QSAR Research Based On Machine Learning Concepts
No ratings yet
An Analysis of QSAR Research Based On Machine Learning Concepts
15 pages
Quick Reference: 45/545RFE Component Location and I.D
No ratings yet
Quick Reference: 45/545RFE Component Location and I.D
8 pages
Ashwani Kumar Yadav Chief Mechanic
No ratings yet
Ashwani Kumar Yadav Chief Mechanic
5 pages
Simple Linear Regression Lab II
No ratings yet
Simple Linear Regression Lab II
5 pages
Duracell CR2 Datasheet
No ratings yet
Duracell CR2 Datasheet
2 pages
Lab Mannual of ML
No ratings yet
Lab Mannual of ML
43 pages
Experiment No.8
No ratings yet
Experiment No.8
5 pages
Linear Regression Research Paper
No ratings yet
Linear Regression Research Paper
2 pages
Praktikum 1 Jupiter Machine Learning
No ratings yet
Praktikum 1 Jupiter Machine Learning
1 page
ML 1-11
No ratings yet
ML 1-11
27 pages
ML Manoj
No ratings yet
ML Manoj
51 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
23 pages
Kashi Vishwanath Entry Ticket (5 Persons)
No ratings yet
Kashi Vishwanath Entry Ticket (5 Persons)
1 page
Unit 2 Regression Analysis
No ratings yet
Unit 2 Regression Analysis
16 pages
Btech1007022 Lab5.1
No ratings yet
Btech1007022 Lab5.1
9 pages
Easy Pract ML
No ratings yet
Easy Pract ML
7 pages
Lab Experiments Vi Sem-1
No ratings yet
Lab Experiments Vi Sem-1
10 pages
20dit073 Jay Prajapati ML
No ratings yet
20dit073 Jay Prajapati ML
68 pages
Shristi CV Latest
No ratings yet
Shristi CV Latest
2 pages
Linear
No ratings yet
Linear
2 pages
Learn Etabs With Fundamentals OF Structural Engineering
No ratings yet
Learn Etabs With Fundamentals OF Structural Engineering
8 pages
Top Numerical Methods With Matlab For Beginners!
From Everand
Top Numerical Methods With Matlab For Beginners!
Andrei Besedin
No ratings yet

Exp 1

Uploaded by

Exp 1

Uploaded by

Machine Learning 1

X = dataset.iloc[:,:-1].values #independent variable array

from sklearn.model_selection import train_test_split

Name – Aarushi Tiwari Roll no. 60

from sklearn.linear_model import LinearRegression

1. X – coordinate (X_train: number of years)

1. X coordinates (X_train) – number of years

Name – Aarushi Tiwari Roll no. 60

#plot for the TRAIN

# splitting the dataset

# fitting the regression model

# predicting the test set results

Name – Aarushi Tiwari Roll no. 60

# visualizing the results

plt.scatter(X_train, y_train, color='red') # plotting the observation line

plt.xlabel("Years of experience") # adding the name of x-axis

#plot for the TEST

plt.scatter(X_test, y_test, color='red')

Name – Aarushi Tiwari Roll no. 60

Name – Aarushi Tiwari Roll no. 60

Conclusion: Linear Regression model implemented with experiential experimental

Name – Aarushi Tiwari Roll no. 60

You might also like