Implementation of Simple Linear Regression Algorithm Using Python

Uploaded by ayushisahoo2004

4/4/24, 11:12 PM Untitled - Jupyter Notebook

Implementation of Simple Linear Regression Algorithm using Python

Step-1: Data Pre-processing


First, we will import three important libraries, which help us load the dataset, plot graphs, and create the Simple Linear Regression model.

In [1]: import numpy as np
        import pandas as pd
        import matplotlib.pyplot as plt

Load the dataset


In [2]: Thomas_df = pd.read_csv("Iris (1).csv")
Thomas_df

Out[2]:
Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

... ... ... ... ... ... ...

145 146 6.7 3.0 5.2 2.3 Iris-virginica

146 147 6.3 2.5 5.0 1.9 Iris-virginica

147 148 6.5 3.0 5.2 2.0 Iris-virginica

148 149 6.2 3.4 5.4 2.3 Iris-virginica

149 150 5.9 3.0 5.1 1.8 Iris-virginica

150 rows × 6 columns

Data frame columns

localhost:8888/notebooks/ML class/Untitled.ipynb 1/12



In [3]: Thomas_df.columns

Out[3]: Index(['Id', 'SepalLengthCm', 'SepalWidthCm', 'PetalLengthCm', 'PetalWidthCm',
               'Species'],
              dtype='object')

Data frame head


Definition and usage: the head() method returns a specified number of rows from the top of the DataFrame (five by default).

In [4]: Thomas_df.head()

Out[4]:
Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

0 1 5.1 3.5 1.4 0.2 Iris-setosa

1 2 4.9 3.0 1.4 0.2 Iris-setosa

2 3 4.7 3.2 1.3 0.2 Iris-setosa

3 4 4.6 3.1 1.5 0.2 Iris-setosa

4 5 5.0 3.6 1.4 0.2 Iris-setosa

Data frame tail


Definition and usage: the tail() method returns a specified number of rows from the bottom of the DataFrame (five by default).

In [5]: Thomas_df.tail()

Out[5]:
Id SepalLengthCm SepalWidthCm PetalLengthCm PetalWidthCm Species

145 146 6.7 3.0 5.2 2.3 Iris-virginica

146 147 6.3 2.5 5.0 1.9 Iris-virginica

147 148 6.5 3.0 5.2 2.0 Iris-virginica

148 149 6.2 3.4 5.4 2.3 Iris-virginica

149 150 5.9 3.0 5.1 1.8 Iris-virginica

Data frame shape

In [6]: Thomas_df.shape

Out[6]: (150, 6)


After that, we need to extract the dependent and independent variables from the dataset. Here the dependent variable (target) y is SepalLengthCm, and the remaining numeric columns form the feature matrix x.
In [7]: columns = Thomas_df.select_dtypes(include=['number']).columns
x = Thomas_df[columns].drop(columns=['Id', 'SepalLengthCm'])
y = Thomas_df['SepalLengthCm']
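The extraction step above can be sketched on a hypothetical two-row frame (the column names mirror the Iris dataset, but the values are made up for illustration): select_dtypes(include=['number']) keeps only the numeric columns, and drop(columns=...) then removes the Id column and the target.

```python
import pandas as pd

# Hypothetical mini-frame mirroring the Iris columns (values are made up)
df = pd.DataFrame({
    "Id": [1, 2],
    "SepalLengthCm": [5.1, 4.9],
    "SepalWidthCm": [3.5, 3.0],
    "Species": ["Iris-setosa", "Iris-setosa"],
})

# Keep numeric columns, then drop the Id and the target column
num_cols = df.select_dtypes(include=["number"]).columns
X = df[num_cols].drop(columns=["Id", "SepalLengthCm"])
y = df["SepalLengthCm"]

features = list(X.columns)  # only SepalWidthCm remains as a feature here
```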

In [8]: x

Out[8]:
SepalWidthCm PetalLengthCm PetalWidthCm

0 3.5 1.4 0.2

1 3.0 1.4 0.2

2 3.2 1.3 0.2

3 3.1 1.5 0.2

4 3.6 1.4 0.2

... ... ... ...

145 3.0 5.2 2.3

146 2.5 5.0 1.9

147 3.0 5.2 2.0

148 3.4 5.4 2.3

149 3.0 5.1 1.8

150 rows × 3 columns

In [9]: y

Out[9]: 0 5.1
1 4.9
2 4.7
3 4.6
4 5.0
...
145 6.7
146 6.3
147 6.5
148 6.2
149 5.9
Name: SepalLengthCm, Length: 150, dtype: float64

Split dataset


In [10]: from sklearn.model_selection import train_test_split
         from sklearn.linear_model import LinearRegression
         from sklearn.metrics import mean_absolute_error

In [11]: x_train,x_test,y_train,y_test=train_test_split(x,y, test_size=1/3,random_state
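As a quick sanity check (a sketch on synthetic data, not the Iris frame itself): with 150 samples and test_size=1/3, train_test_split should leave 100 rows for training and 50 for testing.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# 150 synthetic samples with 2 features, mimicking the dataset's size
X = np.arange(300, dtype=float).reshape(150, 2)
y = np.arange(150, dtype=float)

# test_size=1/3 puts ceil(150 / 3) = 50 rows in the test split
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=1/3, random_state=0)
sizes = (len(X_tr), len(X_te))  # (100, 50)
```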

In [12]: x_train

Out[12]:
SepalWidthCm PetalLengthCm PetalWidthCm

69 2.5 3.9 1.1

135 3.0 6.1 2.3

56 3.3 4.7 1.6

80 2.4 3.8 1.1

123 2.7 4.9 1.8

... ... ... ...

9 3.1 1.5 0.1

103 2.9 5.6 1.8

67 2.7 4.1 1.0

117 3.8 6.7 2.2

47 3.2 1.4 0.2

100 rows × 3 columns


In [13]: x_test


Out[13]:
SepalWidthCm PetalLengthCm PetalWidthCm

114 2.8 5.1 2.4

62 2.2 4.0 1.0

33 4.2 1.4 0.2

107 2.9 6.3 1.8

7 3.4 1.5 0.2

100 3.3 6.0 2.5

40 3.5 1.3 0.3

86 3.1 4.7 1.5

76 2.8 4.8 1.4

71 2.8 4.0 1.3

134 2.6 5.6 1.4

51 3.2 4.5 1.5

73 2.8 4.7 1.2

54 2.8 4.6 1.5

63 2.9 4.7 1.4

37 3.1 1.5 0.1

78 2.9 4.5 1.5

90 2.6 4.4 1.2

45 3.0 1.4 0.3

16 3.9 1.3 0.4

121 2.8 4.9 2.0

66 3.0 4.5 1.5

24 3.4 1.9 0.2

8 2.9 1.4 0.2

126 2.8 4.8 1.8

22 3.6 1.0 0.2

44 3.8 1.9 0.4

97 2.9 4.3 1.3

93 2.3 3.3 1.0

26 3.4 1.6 0.4

137 3.1 5.5 1.8

84 3.0 4.5 1.5

27 3.5 1.5 0.2

127 3.0 4.9 1.8

132 2.8 5.6 2.2

59 2.7 3.9 1.4


18 3.8 1.7 0.3

83 2.7 5.1 1.6

61 3.0 4.2 1.5

92 2.6 4.0 1.2

112 3.0 5.5 2.1

2 3.2 1.3 0.2

141 3.1 5.1 2.3

43 3.5 1.6 0.6

10 3.7 1.5 0.2

60 2.0 3.5 1.0

116 3.0 5.5 1.8

144 3.3 5.7 2.5

119 2.2 5.0 1.5

108 2.5 5.8 1.8

In [14]: y_train

Out[14]: 69 5.6
135 7.7
56 6.3
80 5.5
123 6.3
...
9 4.9
103 6.3
67 5.8
117 7.7
47 4.6
Name: SepalLengthCm, Length: 100, dtype: float64


In [15]: y_test

Out[15]: 114 5.8
62 6.0
33 5.5
107 7.3
7 5.0
100 6.3
40 5.0
86 6.7
76 6.8
71 6.1
134 6.1
51 6.4
73 6.1
54 6.5
63 6.1
37 4.9
78 6.0
90 5.5
45 4.8
16 5.4
121 5.6
66 5.6
24 4.8
8 4.4
126 6.2
22 4.6
44 5.1
97 6.2
93 5.0
26 5.0
137 6.4
84 5.4
27 5.2
127 6.1
132 6.4
59 5.2
18 5.7
83 6.0
61 5.9
92 5.8
112 6.8
2 4.7
141 6.9
43 5.0
10 5.4
60 5.0
116 6.5
144 6.7
119 6.0
108 6.7
Name: SepalLengthCm, dtype: float64


Step-2: Fitting the Simple Linear Regression to the Training Set
In [16]: from sklearn.linear_model import LinearRegression

In [17]: model = LinearRegression()
         model.fit(x_train, y_train)

Out[17]: ▾ LinearRegression
LinearRegression()
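After fitting, the learned parameters are available as coef_ (one slope per feature) and intercept_. A minimal sketch on synthetic data where the true relationship is y = 2x + 1 (the data here is illustrative, not from the notebook):

```python
import numpy as np
from sklearn.linear_model import LinearRegression

# Exact linear data: y = 2*x + 1
X = np.array([[0.0], [1.0], [2.0], [3.0]])
y = 2.0 * X.ravel() + 1.0

m = LinearRegression().fit(X, y)
slope = m.coef_[0]        # close to 2.0
intercept = m.intercept_  # close to 1.0
```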

Prediction of test set results


In [18]: y_pred = model.predict(x_test)
x_pred = model.predict(x_train)

In [19]: y_pred

Out[19]: array([5.90763683, 5.64942975, 5.46582046, 7.32357947, 5.03281158,
6.85611663, 4.87129874, 6.41802591, 6.37416826, 5.82257278,
6.86804939, 6.3264746 , 6.43639669, 6.14881748, 6.36031157,
4.9112593 , 6.13496079, 6.07563694, 4.62980368, 5.05668896,
6.0320937 , 6.19879872, 5.34359009, 4.63592727, 6.09432214,
4.77201433, 5.45901878, 6.1194946 , 5.1694053 , 4.97058315,
6.82969833, 6.19879872, 5.09664952, 6.29969264, 6.43603302,
5.61107868, 5.37359105, 6.40349114, 5.96571485, 5.76485843,
6.5559758 , 4.74974645, 6.16911217, 4.89449802, 5.2243254 ,
5.13328074, 6.7658604 , 6.62303275, 6.07656836, 6.67975459])


In [20]: x_pred

Out[20]: array([5.6932874 , 6.8822205 , 6.47574026, 5.55175484, 6.10817882,
6.53729061, 5.73968598, 5.98823605, 6.55182538, 6.39285346,
6.38418894, 4.9189924 , 6.19360655, 5.91412409, 5.57563221,
5.63105897, 6.34645488, 7.2820094 , 5.2243254 , 4.53664286,
6.3489958 , 6.17559944, 5.18820084, 5.5312679 , 6.57341517,
4.84129777, 6.45508189, 4.99403576, 4.88515543, 5.68809522,
7.12532506, 6.04179997, 4.76972674, 6.86675431, 6.69911787,
6.71909815, 6.50599455, 5.36585796, 5.11050621, 6.4428347 ,
6.90287887, 4.10524349, 6.59370987, 4.69976521, 5.91827451,
6.54211911, 4.74974645, 5.08279283, 7.13886734, 4.94899336,
4.62207058, 5.36746746, 5.50338309, 6.85450712, 7.41061669,
5.01895489, 4.9112593 , 4.95511696, 6.21104591, 6.19106563,
4.67205183, 4.91447831, 6.1194946 , 4.89288852, 7.24842576,
5.23324324, 7.83589247, 6.25490357, 5.54963868, 6.22199801,
5.18275533, 7.43059698, 5.2182018 , 4.98283033, 7.03439075,
4.89127902, 6.67913759, 5.9002674 , 5.75100175, 5.51630836,
6.38833936, 6.18296886, 6.19038754, 6.44734879, 4.85515446,
5.54402174, 6.48762378, 6.19360655, 5.03281158, 6.35257847,
6.02794328, 6.34967389, 5.8141616 , 4.94126027, 5.08440233,
4.9112593 , 6.77971708, 6.04631406, 7.92905329, 4.82744108])


Step-3: Visualizing the Training and Test set results


In [21]: import matplotlib.pyplot as plt

         # With three features, the fitted model is a plane in 4-D space, so a
         # single straight line against one feature cannot represent it;
         # predictions are therefore shown as points against the first feature.
         plt.figure(figsize=(10, 7))
         plt.scatter(x_train.iloc[:, 0], y_train, color="blue", label="Actual values")
         plt.scatter(x_train.iloc[:, 0], x_pred, color="red", label="Predicted values")
         plt.title("Training set")
         plt.xlabel("SepalWidthCm")
         plt.ylabel("Sepal Length (cm)")
         plt.legend()
         plt.show()

         plt.figure(figsize=(10, 7))
         plt.scatter(x_test.iloc[:, 0], y_test, color="blue", label="Actual values")
         plt.scatter(x_test.iloc[:, 0], y_pred, color="red", label="Predicted values")
         plt.title("Testing set")
         plt.xlabel("SepalWidthCm")
         plt.ylabel("Sepal Length (cm)")
         plt.legend()
         plt.show()

         mae = mean_absolute_error(y_test, y_pred)
         print("Here is the Linear Regression Mean Absolute Error:", mae)


Here is the Linear Regression Mean Absolute Error: 0.25316544984473643
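The MAE reported above is simply the average absolute gap between predictions and true values. A minimal sketch (with made-up numbers) verifying that mean_absolute_error matches the hand computation:

```python
import numpy as np
from sklearn.metrics import mean_absolute_error

y_true = np.array([5.0, 6.0, 7.0])
y_hat = np.array([5.5, 5.5, 7.5])  # each prediction is off by 0.5

mae_sklearn = mean_absolute_error(y_true, y_hat)
mae_manual = float(np.mean(np.abs(y_true - y_hat)))  # 0.5
```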
