5 Types Regression in 45 Lines of Code

The document describes implementing multiple linear regression in Python using scikit-learn. It loads and preprocesses an abalone dataset, trains a linear regression model on 80% of the data, evaluates the model performance on the remaining 20% using various metrics, and saves the trained model for future use. The code demonstrates encoding categorical variables, fitting and predicting with the linear regression model, and comparing actual vs predicted values both numerically and visually.

Uploaded by

Teto Schedule


5 Types Regression in 45 lines of code | by Bob Rupak Roy - II... https://bobrupakroy.medium.com/5-types-regression-in-45-lin...

1 of 43 10/14/2021, 10:14 AM

#Multiple Linear Regression

#importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

#Importing the dataset

dataset = pd.read_csv('abalone.csv',header=None)
# header=None because the file has no column names

# iloc takes [rows, columns], each written as a start:stop slice.
# [:, :-1] selects all rows and every column except the last:
# these are the independent variables X.
X = dataset.iloc[:, :-1].values

# Column index 8 is the last (ninth) column: the target y
y = dataset.iloc[:, 8].values

# Encoding categorical data

from sklearn.preprocessing import LabelEncoder, OneHotEncoder
from sklearn.compose import ColumnTransformer

#Gender column
ct = ColumnTransformer([("Gender", OneHotEncoder(), [0])],
remainder = 'passthrough')
X = ct.fit_transform(X)

#to avoid dummy variable trap

X = X[:, 1:]
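
The dummy variable trap can be made concrete with a small sketch. One-hot encoding a column with three categories yields three columns that always sum to 1, so one of them is redundant once the model has an intercept; dropping the first column keeps the same information. The helper `one_hot_drop_first` and the sample values are illustrative assumptions (they are not part of the original code), and the helper uses only the standard library so it runs without the dataset.

```python
# Illustrative stand-in for OneHotEncoder followed by X[:, 1:].
# The function name and the sample category labels are assumptions.
def one_hot_drop_first(values):
    """One-hot encode `values`, dropping the column of the first sorted category."""
    categories = sorted(set(values))
    kept = categories[1:]  # the first category becomes the all-zeros baseline
    return [[1.0 if v == c else 0.0 for c in kept] for v in values]

sexes = ["M", "F", "I", "M"]  # made-up sample column
encoded = one_hot_drop_first(sexes)
for sex, row in zip(sexes, encoded):
    print(sex, row)
```

Each row now has two columns instead of three, and the dropped category is the row of zeros.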

# Split the dataset into the Training set and Test set
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size = 0.2, random_state = 0)

# Fitting Multiple Linear Regression to the Training set

from sklearn.linear_model import LinearRegression
regressor = LinearRegression()
regressor.fit(X_train, y_train)

#Predicting the Test set results

y_pred = regressor.predict(X_test)

#To get the intercept:

print(regressor.intercept_)
#To view the coefficient values
print(regressor.coef_)
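
The intercept and coefficients are all a fitted linear model needs to make a prediction: the intercept plus the sum of each coefficient times its feature value. A minimal sketch of that arithmetic follows; the function `predict_linear` and the numbers are made up for illustration and are not values fitted on the abalone data.

```python
# Sketch of how a linear model turns an intercept and coefficients
# into a prediction. All names and numbers here are hypothetical.
def predict_linear(intercept, coefficients, features):
    """Return intercept + sum(coefficient_i * feature_i)."""
    return intercept + sum(c * x for c, x in zip(coefficients, features))

intercept = 2.0
coefficients = [0.5, -1.0, 3.0]
row = [4.0, 1.0, 2.0]
prediction = predict_linear(intercept, coefficients, row)
print(prediction)
```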

#We can also compare the actual versus prediction

df = pd.DataFrame({'Actual': y_test.flatten(),
'Predicted': y_pred.flatten()})

df_1 = df.head(25)
df_1.plot(kind='bar', figsize=(16,10))
plt.grid(which='major', linestyle='-', linewidth=0.5,
color='green')
plt.grid(which='minor', linestyle=':', linewidth=0.5,
color='black')
plt.show()

#evaluation Metrics
from sklearn import metrics
print('Mean Absolute Error:',
metrics.mean_absolute_error(y_test, y_pred))
print('Mean Squared Error:',
metrics.mean_squared_error(y_test, y_pred))
print('Root Mean Squared Error:',
np.sqrt(metrics.mean_squared_error(y_test, y_pred)))
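
To see what the three metrics measure, here is a small hand computation using only the standard library. The helper name `regression_errors` and the sample values are assumptions made for this sketch; they are not taken from the abalone results.

```python
import math

# Hand-computed versions of the three metrics, on made-up values.
def regression_errors(actual, predicted):
    """Return (MAE, MSE, RMSE) for two equal-length sequences."""
    errors = [a - p for a, p in zip(actual, predicted)]
    mae = sum(abs(e) for e in errors) / len(errors)
    mse = sum(e * e for e in errors) / len(errors)
    return mae, mse, math.sqrt(mse)

actual = [9.0, 10.0, 7.0, 12.0]
predicted = [8.0, 12.0, 7.0, 10.0]
mae, mse, rmse = regression_errors(actual, predicted)
print(mae, mse, rmse)
```

MSE penalizes large errors more heavily than MAE, and RMSE brings the result back to the units of the target.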

#save the model to disk

import pickle
filename = 'reg_model.sav'
with open(filename, 'wb') as f:
    pickle.dump(regressor, f)

# load the model from disk

with open(filename, 'rb') as f:
    loaded_model = pickle.load(f)

#another method using joblib

'''Saving the model to a file using joblib: joblib is an
alternative to pickle that is more efficient on objects
that carry large numpy arrays. '''

# joblib is a standalone package; sklearn.externals.joblib
# has been removed from scikit-learn
import joblib

# Save the model as a pickle in a file
joblib.dump(regressor, 'regressor.pkl')

# Load the model from the file

loaded_model2 = joblib.load('regressor.pkl')

# Use the loaded model to make predictions

loaded_model2.predict(X_test)
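
The save and load steps can be checked with a quick round trip. This sketch uses a plain dictionary as a stand-in for the fitted model (an assumption, so the example runs without scikit-learn) and writes to a temporary directory rather than the working directory.

```python
import os
import pickle
import tempfile

# A plain dict stands in for the fitted regressor (hypothetical values).
model_stand_in = {"intercept": 2.0, "coef": [0.5, -1.0]}

# Write to a temporary directory so no file is left in the project folder.
path = os.path.join(tempfile.mkdtemp(), "reg_model.sav")
with open(path, "wb") as f:
    pickle.dump(model_stand_in, f)

with open(path, "rb") as f:
    restored = pickle.load(f)

print(restored == model_stand_in)  # the restored object is an equal copy
```

Opening the file in a `with` block closes it even if dumping or loading fails. Only load pickle files from sources you trust, since unpickling can execute arbitrary code.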

#Multiple Linear Regression

#Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

#Importing the dataset

dataset = pd.read_csv('abalone.csv',header=None)

X = dataset.iloc[:,:-1].values
#All columns except the last column (set by the slice's upper bound)
y = dataset.iloc[:, 8].values

#Encoding categorical data

from sklearn.preprocessing import LabelEncoder, OneHotEncoder
from sklearn.compose import ColumnTransformer

#Gender column
ct = ColumnTransformer([("Gender", OneHotEncoder(), [0])],
remainder = 'passthrough')
X = ct.fit_transform(X)

#to avoid dummy variable trap

X = X[:, 1:]

#Splitting the dataset into the Training set and Test set
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y,
test_size = 0.2, random_state = 0)

#Fitting Multiple Linear Regression to the Training set

from sklearn.linear_model import LinearRegression
regressor = LinearRegression()
regressor.fit(X_train, y_train)

#Predicting the Test set results

y_pred = regressor.predict(X_test)

#To predict from manually entered values, pass one value per
#feature column, in the same order as the columns of X

regressor.predict([[1.0,1.0,0.55,0.45,0.15,0.91,0.277,0.243,0.33]])

#To get the intercept:

print(regressor.intercept_)
#To view the coefficient values
print(regressor.coef_)

#We can also compare the actual versus predicted

df = pd.DataFrame({'Actual': y_test.flatten(),
'Predicted': y_pred.flatten()})
df

#we can also visualize the actual vs predicted

df_1 = df.head(25)
df_1.plot(kind='bar',figsize=(16,10))
plt.grid(which='major', linestyle='-', linewidth='0.5',
color='green')
plt.grid(which='minor', linestyle=':', linewidth='0.5',
color='black')
plt.show()

#save the model to disk

import pickle
filename = 'reg_model.sav'
with open(filename, 'wb') as f:
    pickle.dump(regressor, f)

#load the model from disk

with open(filename, 'rb') as f:
    loaded_model = pickle.load(f)

#another method using joblib

'''Saving the model to a file using joblib: joblib is an
alternative to pickle that is more efficient on objects
that carry large numpy arrays.'''

# joblib is a standalone package; sklearn.externals.joblib
# has been removed from scikit-learn
import joblib

#Save the model as a pickle in a file

joblib.dump(regressor, 'regressor.pkl')

#Load the model from the file

loaded_model2 = joblib.load('regressor.pkl')

#Use the loaded model to make predictions

loaded_model2.predict(X_test)

#Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

#Importing the dataset

dataset = pd.read_csv('abalone.csv', header = None)

#Encoding categorical data

from sklearn.preprocessing import LabelEncoder, OneHotEncoder
from sklearn.compose import ColumnTransformer

#Gender column
ct = ColumnTransformer([("Gender", OneHotEncoder(), [0])],
remainder = 'passthrough')
dataset = ct.fit_transform(dataset)
# The encoded Gender columns are not used here. We keep a single
# independent variable X, the number of rings (the last column,
# index 10 after encoding), and a dependent variable y, the
# 'Length' column (index 3 after encoding).
# Only two columns are used so that the performance of both
# algorithms can be compared on a plot.

X = dataset[:,10:]
y = dataset[:,3]

# Fitting Polynomial Regression to the dataset

from sklearn.preprocessing import PolynomialFeatures

poly_reg = PolynomialFeatures(degree = 2)

X_poly = poly_reg.fit_transform(X)
# For a single feature x, degree = 2 expands X into the
# columns [1, x, x^2]

# View the generated polynomial features

X_poly
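
To make the expansion concrete, the following sketch builds the degree-2 columns by hand for a single feature. The helper `polynomial_features` is a hypothetical name used for illustration only; it mirrors the column layout 1, x, x^2 for one input feature and is not a replacement for the library class.

```python
# Hand-rolled degree-2 expansion of a single feature (illustrative only).
def polynomial_features(x, degree=2):
    """Return [x**0, x**1, ..., x**degree] for a single value x."""
    return [x ** p for p in range(degree + 1)]

rows = [polynomial_features(x) for x in [2.0, 3.0]]
for row in rows:
    print(row)
```

A linear regression fitted on these columns is still linear in its coefficients, which is why the same `LinearRegression` class can fit a curve.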
# Build the polynomial regression model: a linear regression
# fitted on the polynomial features

from sklearn.linear_model import LinearRegression
linear_reg2 = LinearRegression()
linear_reg2.fit(X_poly, y)

# Fitting Linear Regression to the dataset

from sklearn.linear_model import LinearRegression
linear_reg = LinearRegression()
linear_reg.fit(X, y)

# Visualizing the Linear Regression results

plt.scatter(X, y, color = 'red')

plt.plot(X, linear_reg.predict(X), color = 'blue')
plt.title('Predicting the age of abalone from physical measurements.')
plt.xlabel('Rings')
plt.ylabel('Length')
plt.show()

# Visualizing the Polynomial Regression results

plt.scatter(X, y, color = 'red')

plt.plot(X, linear_reg2.predict(poly_reg.fit_transform(X)),
color = 'blue')
plt.title('Predicting the age of abalone from physical measurements.')
plt.xlabel('Rings')
plt.ylabel('Length')
plt.show()

# Visualizing the Polynomial Regression results (for a smoother curve)

X_grid = np.arange(X.min(), X.max(), 0.1)
X_grid = X_grid.reshape((len(X_grid), 1))
plt.scatter(X, y, color = 'red')
plt.plot(X_grid, linear_reg2.predict(poly_reg.fit_transform(X_grid)),
color = 'blue')
plt.title('Predicting abalone from physical measurements.')
plt.xlabel('Rings')
plt.ylabel('Length')
plt.show()

# Predicting a new result with Linear Regression

linear_reg.predict([[10]])

# Predicting a new result with Polynomial Regression

linear_reg2.predict(poly_reg.fit_transform([[10]]))

#Polynomial Regression

#Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

#Importing the dataset

dataset = pd.read_csv('abalone.csv', header = None)

# Encoding categorical data

from sklearn.preprocessing import LabelEncoder, OneHotEncoder
from sklearn.compose import ColumnTransformer

#Gender column
ct = ColumnTransformer([("Gender", OneHotEncoder(), [0])],
remainder = 'passthrough')
dataset = ct.fit_transform(dataset)
# The encoded Gender columns are not used here. We keep a single
# independent variable X, the number of rings (the last column,
# index 10 after encoding), and a dependent variable y, the
# 'Length' column (index 3 after encoding).
# Only two columns are used so that the performance of both
# algorithms can be compared on a plot.

X = dataset[:,10:]
y = dataset[:,3]

# Fitting Linear Regression to the dataset

from sklearn.linear_model import LinearRegression
linear_reg = LinearRegression()
linear_reg.fit(X, y)

# Fitting Polynomial Regression to the dataset

from sklearn.preprocessing import PolynomialFeatures
poly_reg = PolynomialFeatures(degree = 2)

X_poly = poly_reg.fit_transform(X)

# View the generated polynomial features

X_poly

# Build the polynomial regression model: a linear regression
# fitted on the polynomial features

linear_reg2 = LinearRegression()
linear_reg2.fit(X_poly, y)

# Visualizing the Linear Regression results

plt.scatter(X, y, color = 'red')
plt.plot(X, linear_reg.predict(X), color = 'blue')
plt.title('Predicting abalone from physical measurements.')
plt.xlabel('Rings')
plt.ylabel('Length')
plt.show()

# Visualizing the Polynomial Regression results

plt.scatter(X, y, color = 'red')
plt.plot(X, linear_reg2.predict(poly_reg.fit_transform(X)),
color = 'blue')
plt.title('Predicting abalone from physical measurements.')
plt.xlabel('Rings')
plt.ylabel('length')
plt.show()

# Visualizing the Polynomial Regression results (for a smoother curve)

X_grid = np.arange(X.min(), X.max(), 0.1)
X_grid = X_grid.reshape((len(X_grid), 1))
plt.scatter(X, y, color = 'red')
plt.plot(X_grid, linear_reg2.predict(poly_reg.fit_transform(X_grid)),
color = 'blue')
plt.title('Predicting abalone from physical measurements.')
plt.xlabel('Rings')
plt.ylabel('Length')
plt.show()

# Predicting a new result with Linear Regression

#linear_reg.predict([[rings]])
linear_reg.predict([[10]])

# Predicting a new result with Polynomial Regression

linear_reg2.predict(poly_reg.fit_transform([[10]]))

# Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Importing the dataset

data = pd.read_csv('AirPressure.csv')
data

#dividing the dataset into X and y

X = data.iloc[:, 1:2].values
y = data.iloc[:, 2].values

# Feature Scaling for SVR

from sklearn.preprocessing import StandardScaler

sc_X = StandardScaler()
sc_y = StandardScaler()

X = sc_X.fit_transform(X.reshape(-1,1))
# ravel() returns y to the 1-D shape the regressors expect
y = sc_y.fit_transform(y.reshape(-1,1)).ravel()
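
Standardization subtracts the column mean and divides by the population standard deviation, and the inverse step undoes it. The sketch below reproduces that arithmetic with the standard library; the function names and the temperature values are assumptions made for illustration and do not come from AirPressure.csv.

```python
import statistics

# Hand-rolled standardization on made-up temperatures (illustrative only).
def fit_scaler(values):
    """Return the mean and population standard deviation of `values`."""
    return statistics.mean(values), statistics.pstdev(values)

def scale(values, mean, std):
    return [(v - mean) / std for v in values]

def unscale(values, mean, std):
    return [v * std + mean for v in values]

temperatures = [10.0, 20.0, 30.0, 40.0]
mean, std = fit_scaler(temperatures)
scaled = scale(temperatures, mean, std)
restored = unscale(scaled, mean, std)
print(scaled)
print(restored)
```

A model trained on scaled data expects scaled inputs and returns scaled outputs, so a raw value has to go through `scale` first and the prediction through `unscale` afterwards.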

# Fitting Linear Regression to the dataset

from sklearn.linear_model import LinearRegression
lin = LinearRegression()
lin.fit(X, y)

# Fitting SVR to the dataset

from sklearn.svm import SVR
regressor = SVR(kernel = 'rbf')
regressor.fit(X, y)
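
One way to see how SVR differs from ordinary least squares is its epsilon-insensitive loss: errors smaller than a margin epsilon cost nothing, and larger errors are charged only for the part beyond the margin. The sketch below shows that loss on made-up numbers; the function name is hypothetical, and epsilon = 0.1 matches the scikit-learn default for `SVR`.

```python
# Epsilon-insensitive loss on made-up values (illustrative only).
def epsilon_insensitive_loss(actual, predicted, epsilon=0.1):
    """Zero inside the epsilon tube, linear outside it."""
    return max(0.0, abs(actual - predicted) - epsilon)

inside = epsilon_insensitive_loss(1.0, 1.05)   # error 0.05 is within the tube
outside = epsilon_insensitive_loss(1.0, 1.5)   # error 0.5 exceeds the tube
print(inside, outside)
```

Because epsilon is an absolute margin on the target, its effect depends on the scale of y, which is one reason the target is standardized before fitting.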

# Visualising the Linear Regression results

plt.scatter(X, y, color = 'blue')
plt.plot(X, lin.predict(X), color = 'red')
plt.title('Linear Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

# Visualising the SVR results

plt.scatter(X, y, color = 'blue')
plt.plot(X, regressor.predict(X), color = 'red')
plt.title('Support Vector Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

#Predicting a new result with Linear Regression
#(the models were fitted on standardized data, so the inputs
#and outputs here are in scaled units)

lin.predict([[150.0]])

#Predicting a new result (pressure) with Support Vector Regression

y_Pressure = regressor.predict([[55]])

# Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Importing the dataset

data = pd.read_csv('AirPressure.csv')
data

#dividing the dataset into X and y

X = data.iloc[:, 1:2].values
y = data.iloc[:, 2].values

#Feature Scaling for SVR

from sklearn.preprocessing import StandardScaler
sc_X = StandardScaler()
sc_y = StandardScaler()

X = sc_X.fit_transform(X.reshape(-1,1))
# ravel() returns y to the 1-D shape the regressors expect
y = sc_y.fit_transform(y.reshape(-1,1)).ravel()

#Fitting Linear Regression to the dataset

from sklearn.linear_model import LinearRegression
lin = LinearRegression()

lin.fit(X, y)

#Fitting SVR to the dataset

from sklearn.svm import SVR
regressor = SVR(kernel = 'rbf')
regressor.fit(X, y)

#Visualizing the Linear Regression results

plt.scatter(X, y, color = 'blue')
plt.plot(X, lin.predict(X), color = 'red')
plt.title('Linear Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')

plt.show()

# Visualizing the SVR results

plt.scatter(X, y, color = 'blue')
plt.plot(X, regressor.predict(X), color = 'red')
plt.title('Support Vector Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

#Predicting a new result with Linear Regression

lin.predict([[150.0]])

#Predicting a new result (pressure) with Support Vector Regression

y_Pressure = regressor.predict([[55]])

# Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Importing the dataset

data = pd.read_csv('AirPressure.csv')
data

#dividing the dataset into X and y

X = data.iloc[:, 1:2].values
y = data.iloc[:, 2].values

# Feature Scaling for SVR

from sklearn.preprocessing import StandardScaler
sc_X = StandardScaler()
sc_y = StandardScaler()

X = sc_X.fit_transform(X.reshape(-1,1))
# ravel() returns y to the 1-D shape the regressors expect
y = sc_y.fit_transform(y.reshape(-1,1)).ravel()

#Fitting Linear Regression to the dataset

from sklearn.linear_model import LinearRegression
lin = LinearRegression()
lin.fit(X, y)

#Fitting SVR to the dataset

from sklearn.svm import SVR
regressor = SVR(kernel = 'rbf')
regressor.fit(X, y)

#Fitting Decision Tree Regression

from sklearn.tree import DecisionTreeRegressor
dt_model = DecisionTreeRegressor(random_state = 0)
dt_model.fit(X, y)
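
A regression tree is built from repeated splits, each chosen to reduce the squared error of predicting the mean on either side. The following sketch finds a single best split (a "stump") on made-up data; `best_split` is a hypothetical helper written for this illustration, and it assumes the inputs contain at least two distinct x values.

```python
# One split of a regression tree on made-up data (illustrative only).
def best_split(xs, ys):
    """Return (threshold, left_mean, right_mean) with the lowest squared error."""
    pairs = sorted(zip(xs, ys))
    best = None
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot split between identical x values
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((y - left_mean) ** 2 for y in left)
               + sum((y - right_mean) ** 2 for y in right))
        if best is None or sse < best[0]:
            threshold = (pairs[i - 1][0] + pairs[i][0]) / 2
            best = (sse, threshold, left_mean, right_mean)
    return best[1:]

xs = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
ys = [5.0, 5.0, 5.0, 20.0, 20.0, 20.0]
threshold, left_mean, right_mean = best_split(xs, ys)

def predict(x):
    """Predict the mean of whichever side of the threshold x falls on."""
    return left_mean if x <= threshold else right_mean

print(threshold, predict(2.5), predict(55))
```

Note that the prediction for 55 equals the prediction for 12: a tree returns the mean of a leaf, so it cannot extrapolate beyond the range of values seen in training.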

#Visualizing the Linear Regression results

plt.scatter(X, y, color = 'blue')
plt.plot(X, lin.predict(X), color = 'red')
plt.title('Linear Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

#Visualizing the SVR results

plt.scatter(X, y, color = 'blue')

plt.plot(X, regressor.predict(X), color = 'red')

plt.title('Support Vector Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

#Visualizing the Decision Trees Regression results

plt.scatter(X, y, color = 'blue')
plt.plot(X, dt_model.predict(X), color = 'red')
plt.title('Decision Trees Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

#Predicting a new result with Linear Regression

lin.predict([[55.0]])

#Predicting a new result (pressure) with Decision Tree Regression

y_Pressure = dt_model.predict([[55]])

# import export_graphviz
from sklearn.tree import export_graphviz

# export the decision tree to a tree.dot file
# so the plot can be visualized easily anywhere
# (the single input feature is the temperature column)
export_graphviz(dt_model, out_file ='e:/tree.dot',
feature_names =['Temperature'])

# Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Importing the dataset

data = pd.read_csv('AirPressure.csv')
data

#dividing the dataset into X and y

X = data.iloc[:, 1:2].values
y = data.iloc[:, 2].values

#Feature Scaling for SVR

from sklearn.preprocessing import StandardScaler
sc_X = StandardScaler()
sc_y = StandardScaler()

X = sc_X.fit_transform(X.reshape(-1,1))
# ravel() returns y to the 1-D shape the regressors expect
y = sc_y.fit_transform(y.reshape(-1,1)).ravel()

#Fitting Linear Regression to the dataset

from sklearn.linear_model import LinearRegression
lin = LinearRegression()

lin.fit(X, y)

#Fitting SVR to the dataset

from sklearn.svm import SVR
regressor = SVR(kernel = 'rbf')
regressor.fit(X, y)

#Fitting Decision Tree Regression

from sklearn.tree import DecisionTreeRegressor
dt_model = DecisionTreeRegressor(random_state = 0)
dt_model.fit(X, y)

#Visualising the Linear Regression results

plt.scatter(X, y, color = 'blue')
plt.plot(X, lin.predict(X), color = 'red')
plt.title('Linear Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

#Visualising the SVR results

plt.scatter(X, y, color = 'blue')
plt.plot(X, regressor.predict(X), color = 'red')
plt.title('Support Vector Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

#Visualising the Decision Trees Regression results

plt.scatter(X, y, color = 'blue')
plt.plot(X, dt_model.predict(X), color = 'red')
plt.title('Decision Trees Regression')
plt.xlabel('Temperature')

plt.ylabel('Pressure')
plt.show()

#The red line passes through the blue points, which suggests
#the decision tree fits this data more closely than the models above

#Predicting a new result with Linear Regression

lin.predict([[55.0]])

#Predicting a new result (pressure) with Decision Tree Regression

y_Pressure = dt_model.predict([[55]])

from sklearn.tree import export_graphviz

#export the decision tree to a tree.dot file
#for visualizing the plot easily anywhere
export_graphviz(dt_model, out_file ='e:/tree.dot',
feature_names =['Temperature'])

"""
The tree is now exported and can be visualized at
http://www.webgraphviz.com/ by copying the contents of the
'tree.dot' file."""

import pickle
#save the model to disk
filename = 'final_model.sav'
with open(filename, 'wb') as f:
    pickle.dump(dt_model, f)

#load the model from disk

with open(filename, 'rb') as f:
    loaded_model = pickle.load(f)

#another method using joblib

'''Saving the model to a file using joblib: joblib is an
alternative to pickle that is more efficient on objects
that carry large numpy arrays.
'''

# joblib is a standalone package; sklearn.externals.joblib
# has been removed from scikit-learn
import joblib

#Save the model as a pickle in a file
joblib.dump(dt_model, 'dt_model.pkl')

#Load the model from the file

loaded_model2 = joblib.load('dt_model.pkl')

#Use the loaded model to make predictions

loaded_model2.predict([[55]])

# Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

# Importing the dataset

data = pd.read_csv('AirPressure.csv')
data

#dividing the dataset into X and y

X = data.iloc[:, 1:2].values
y = data.iloc[:, 2].values

# Feature Scaling for SVR

from sklearn.preprocessing import StandardScaler
sc_X = StandardScaler()
sc_y = StandardScaler()

X = sc_X.fit_transform(X.reshape(-1,1))
# ravel() returns y to the 1-D shape the regressors expect
y = sc_y.fit_transform(y.reshape(-1,1)).ravel()

#Fitting Decision Tree Regression

from sklearn.tree import DecisionTreeRegressor

dt_model = DecisionTreeRegressor(random_state = 0)
dt_model.fit(X, y)

#Fitting Random Forest Regression to the dataset

from sklearn.ensemble import RandomForestRegressor
rf_model = RandomForestRegressor(n_estimators = 500,
random_state = 0)
rf_model.fit(X, y)
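
A random forest averages many trees, each fitted on a bootstrap sample (rows drawn with replacement) of the training data. The sketch below illustrates that idea with one-split trees on made-up data. It is a simplified illustration under stated assumptions, not scikit-learn's implementation: the helpers `fit_stump` and `forest_predict` are hypothetical, the trees have a single split, and 25 trees are used instead of 500 to keep it quick.

```python
import random

# Bagging of one-split regression trees on made-up data (illustrative only).
def fit_stump(xs, ys):
    """Return (threshold, left_mean, right_mean) for the best single split."""
    pairs = sorted(zip(xs, ys))
    overall = sum(y for _, y in pairs) / len(pairs)
    # Fallback when every x in the sample is identical: constant prediction.
    best = (float("inf"), float("inf"), overall, overall)
    for i in range(1, len(pairs)):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot split between identical x values
        left = [y for _, y in pairs[:i]]
        right = [y for _, y in pairs[i:]]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((y - left_mean) ** 2 for y in left)
               + sum((y - right_mean) ** 2 for y in right))
        if sse < best[0]:
            best = (sse, (pairs[i - 1][0] + pairs[i][0]) / 2, left_mean, right_mean)
    return best[1:]

def forest_predict(stumps, x):
    """Average the predictions of all stumps for input x."""
    preds = [left if x <= threshold else right for threshold, left, right in stumps]
    return sum(preds) / len(preds)

xs = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
ys = [5.0, 5.0, 5.0, 20.0, 20.0, 20.0]

rng = random.Random(0)  # fixed seed, like random_state = 0
stumps = []
for _ in range(25):
    idx = [rng.randrange(len(xs)) for _ in xs]  # bootstrap sample of row indices
    stumps.append(fit_stump([xs[i] for i in idx], [ys[i] for i in idx]))

low = forest_predict(stumps, 1.0)
high = forest_predict(stumps, 12.0)
print(low, high)
```

Averaging over resampled trees smooths out the step-like predictions of any single tree, which is the main reason a forest tends to be more stable than one decision tree.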

# Visualizing the Decision Trees Regression results

plt.scatter(X, y, color = 'blue')
plt.plot(X, dt_model.predict(X), color = 'red')
plt.title('Decision Trees Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

# Visualizing the Random Forest results

plt.scatter(X, y, color = 'blue')
plt.plot(X, rf_model.predict(X), color = 'red')
plt.title('Random Forest Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

# Visualizing the Random Forest results at a higher resolution

X_grid = np.arange(X.min(), X.max(), 0.01)
X_grid = X_grid.reshape((len(X_grid), 1))
plt.scatter(X, y, color = 'red')
plt.plot(X_grid, rf_model.predict(X_grid), color = 'blue')
plt.title('Random Forest Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

#Predicting a new result (pressure) with Random Forest Regression

rf_model.predict([[55]])

#Predicting a new result (pressure) with Decision Tree Regression

dt_model.predict([[55]])

#Importing the libraries

import numpy as np
import matplotlib.pyplot as plt
import pandas as pd

#Importing the dataset

data = pd.read_csv('AirPressure.csv')
data

#dividing the dataset into X and y

X = data.iloc[:, 1:2].values
y = data.iloc[:, 2].values

#Feature Scaling for SVR

from sklearn.preprocessing import StandardScaler
sc_X = StandardScaler()
sc_y = StandardScaler()

X = sc_X.fit_transform(X.reshape(-1,1))
# ravel() returns y to the 1-D shape the regressors expect
y = sc_y.fit_transform(y.reshape(-1,1)).ravel()

#Fitting Decision Tree Regression

from sklearn.tree import DecisionTreeRegressor
dt_model = DecisionTreeRegressor(random_state = 0)
dt_model.fit(X, y)

#Fitting Random Forest Regression to the dataset

from sklearn.ensemble import RandomForestRegressor
rf_model = RandomForestRegressor(n_estimators = 500,
random_state = 0)
rf_model.fit(X, y)

# Visualizing the Decision Trees Regression results

plt.scatter(X, y, color = 'blue')
plt.plot(X, dt_model.predict(X), color = 'red')
plt.title('Decision Trees Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

# Visualizing the Random Forest results

plt.scatter(X, y, color = 'blue')
plt.plot(X, rf_model.predict(X), color = 'red')
plt.title('Random Forest Regression')
plt.xlabel('Temperature')

plt.ylabel('Pressure')
plt.show()

# Visualizing the Random Forest results in high resolution

X_grid = np.arange(X.min(), X.max(), 0.01)
X_grid = X_grid.reshape((len(X_grid), 1))
plt.scatter(X, y, color = 'red')
plt.plot(X_grid, rf_model.predict(X_grid), color = 'blue')
plt.title('Random Forest Regression')
plt.xlabel('Temperature')
plt.ylabel('Pressure')
plt.show()

#Predicting a new result (pressure) with Random Forest Regression

rf_model.predict([[55]])

#Predicting a new result (pressure) with Decision Tree Regression

dt_model.predict([[55]])

22 pages
Simple Linear Regression in Machine Learning
No ratings yet
Simple Linear Regression in Machine Learning
7 pages
1 Regression
No ratings yet
1 Regression
23 pages
Assignment No.4 - (20-Ele-68)
No ratings yet
Assignment No.4 - (20-Ele-68)
17 pages
FYMCA IDSLab A6 Submission
No ratings yet
FYMCA IDSLab A6 Submission
9 pages
Aih Lab1
No ratings yet
Aih Lab1
10 pages
Import Pandas As PD
No ratings yet
Import Pandas As PD
3 pages
Regression Models
No ratings yet
Regression Models
5 pages
ML Regression Documentation
No ratings yet
ML Regression Documentation
7 pages
A Stochastic Model For Demand Forecating in Python
No ratings yet
A Stochastic Model For Demand Forecating in Python
32 pages
Day 3 ML
No ratings yet
Day 3 ML
4 pages
22 Python Libraries For Geospatial Data Analysis
No ratings yet
22 Python Libraries For Geospatial Data Analysis
10 pages
Demand Forecasting II: Evidence-Based Methods and Checklists
No ratings yet
Demand Forecasting II: Evidence-Based Methods and Checklists
36 pages
Hemraj Python Ass1
No ratings yet
Hemraj Python Ass1
7 pages
Lecture Material 11
No ratings yet
Lecture Material 11
14 pages
Broadly, There Are 3 Types of Machine Learning Algorithms.
No ratings yet
Broadly, There Are 3 Types of Machine Learning Algorithms.
33 pages
Bus Impedance Matrix Method For Analysis of Unsymmetrical Shunt Faults
No ratings yet
Bus Impedance Matrix Method For Analysis of Unsymmetrical Shunt Faults
8 pages
Working With Time Series in Pandas
No ratings yet
Working With Time Series in Pandas
7 pages
Lesson1 - Simple Linier Regression
No ratings yet
Lesson1 - Simple Linier Regression
40 pages
A Time Series Forecasting Case Study - PART 1
No ratings yet
A Time Series Forecasting Case Study - PART 1
24 pages
Week 3 and 4
No ratings yet
Week 3 and 4
19 pages
Lab Manual 04
No ratings yet
Lab Manual 04
12 pages
Regression Model
No ratings yet
Regression Model
6 pages
Write A Lab Report On Linear Regression and Logistic Regression. Include The Cost Function Differentiation and The Code in The Report.
No ratings yet
Write A Lab Report On Linear Regression and Logistic Regression. Include The Cost Function Differentiation and The Code in The Report.
7 pages
Intro To Linear and Logistic Reg
No ratings yet
Intro To Linear and Logistic Reg
5 pages
Experiment1 Explanation
No ratings yet
Experiment1 Explanation
6 pages
Practical # 10
No ratings yet
Practical # 10
5 pages
Support Vector Regression (SVR) Model For Seasonal Time Series Data
No ratings yet
Support Vector Regression (SVR) Model For Seasonal Time Series Data
10 pages
A Short Tutorial On Fuzzy Time Series - Part I
No ratings yet
A Short Tutorial On Fuzzy Time Series - Part I
18 pages
A Short Tutorial On Fuzzy Time Series - Part III
No ratings yet
A Short Tutorial On Fuzzy Time Series - Part III
17 pages
Anomaly Detection With Machine Learning
No ratings yet
Anomaly Detection With Machine Learning
12 pages
Marketing - MCDONALD'S - BRAND ANALYSIS OF A PRODUCT SPECIFICS IN RELATION TO MCDONALD'S
No ratings yet
Marketing - MCDONALD'S - BRAND ANALYSIS OF A PRODUCT SPECIFICS IN RELATION TO MCDONALD'S
73 pages
Linear Regression
No ratings yet
Linear Regression
8 pages
A Step-by-Step Guide To Calculating Autocorrelation and Partial Autocorrelation
No ratings yet
A Step-by-Step Guide To Calculating Autocorrelation and Partial Autocorrelation
13 pages
8 Guidelines To Create Professional Data Science Notebooks
No ratings yet
8 Guidelines To Create Professional Data Science Notebooks
9 pages
2.3 ML (Implementation of Polynomial Regression Using Python)
No ratings yet
2.3 ML (Implementation of Polynomial Regression Using Python)
9 pages
DSML Project Report - Group05
No ratings yet
DSML Project Report - Group05
14 pages
Journal of Mass Spectrometry and Advances in The Clinical Lab
No ratings yet
Journal of Mass Spectrometry and Advances in The Clinical Lab
10 pages
1984.huesmann Lagerspetz Etal - interveningVariablesintheTeleViol AggRel - Developpsych
No ratings yet
1984.huesmann Lagerspetz Etal - interveningVariablesintheTeleViol AggRel - Developpsych
30 pages
An Efficient Method For Paired-Comparison: D. Amnon Silverstein Joyce E. Farrell
No ratings yet
An Efficient Method For Paired-Comparison: D. Amnon Silverstein Joyce E. Farrell
15 pages
Walmart Sales Time Series Forecasting Using Deep Learning
No ratings yet
Walmart Sales Time Series Forecasting Using Deep Learning
12 pages
Linear Regression Mca Lab - Jupyter Notebook
No ratings yet
Linear Regression Mca Lab - Jupyter Notebook
2 pages
ANOVA, T-Test and Other Statistical Tests With Python
No ratings yet
ANOVA, T-Test and Other Statistical Tests With Python
11 pages
Auto Tuning Multiple Timeseries SARIMAX Model - With A Case Study and Detailed Code Explanation
No ratings yet
Auto Tuning Multiple Timeseries SARIMAX Model - With A Case Study and Detailed Code Explanation
10 pages
Wavelet & Fourier Analysis On The ENSO and Monsoon Data in Python
No ratings yet
Wavelet & Fourier Analysis On The ENSO and Monsoon Data in Python
10 pages
AutoViz and Lux
No ratings yet
AutoViz and Lux
9 pages
19BCS2059 DL1
No ratings yet
19BCS2059 DL1
4 pages
5 Unknown Tricks For Python Classes
No ratings yet
5 Unknown Tricks For Python Classes
8 pages
01app - 2012 Board Characteristics and The Financial Performance of Nigerian Quoted Firms
No ratings yet
01app - 2012 Board Characteristics and The Financial Performance of Nigerian Quoted Firms
19 pages
Create Data Classes in Python
No ratings yet
Create Data Classes in Python
7 pages
DID101R
No ratings yet
DID101R
5 pages
9845 19595 1 SM PDF
No ratings yet
9845 19595 1 SM PDF
9 pages
Literature Review On Corporate Capital Structure and Dividend Policy
No ratings yet
Literature Review On Corporate Capital Structure and Dividend Policy
10 pages
What Are Degrees of Freedom in Statistics
No ratings yet
What Are Degrees of Freedom in Statistics
6 pages
Capacity of U-Turn at Median Opening: Ite Journal June 1999
No ratings yet
Capacity of U-Turn at Median Opening: Ite Journal June 1999
6 pages
Problem Set 5
No ratings yet
Problem Set 5
4 pages
8.0 Lakeland College
No ratings yet
8.0 Lakeland College
2 pages
Econometrics For ECO 2022 Tutorial 4
No ratings yet
Econometrics For ECO 2022 Tutorial 4
2 pages
Ex01 Linear Regression
No ratings yet
Ex01 Linear Regression
2 pages
Hydrodynamic of High Speed Vessels - Lectures
No ratings yet
Hydrodynamic of High Speed Vessels - Lectures
49 pages