0% found this document useful (0 votes)

0 views

Multi_Regression

The document outlines a lab exercise focused on supervised machine learning using regression with the Scikit Learn library. It includes instructions for performing linear regression on a fuel consumption dataset, extracting features and labels, fitting a model, and evaluating its performance using R-squared. Additionally, it covers multi-regression with multiple independent variables and provides steps for predicting outcomes based on various input features.

Uploaded by

nagulxlugan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

0 views

Multi_Regression

Uploaded by

nagulxlugan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

1 Supervised Machine Learning - Regression with Scikit Learn Library

In this exercise, we will use Linear Regression model from Scikit Learn.
1. Linear-Regression with Scikit Learn Library
2. Multi-Regression with Scikit Learn Library
Instruction to compplete lab exercises:
1. Open python notebook file under Lab folder
2. Read the problem statement in the exercise and expected output
3. Uncomment and remove the lines and fill in wiht your answer
4. Run your code to produce expected output.
Noted: Data files are stored in dataset folder

1.1 Linear-Regression with Scikit Learn Library

Now, we will try out the same fuel consumption example to develop simple linear regression
model using Scikit Learn Library.
Perform simple linear regression using sklearn lib on the fuel consumption dataset. Uese the data
in auto-mpg-clean.csv file, predict the fuel consumption (mpg) of car based on weight.
Show your R-squared. Please refer to the following output as your reference.

1.1.1 Following steps will be performed:

1
6. R-square using r2_score function from Scikit Learn metrics package
7. Predict a single data using with model

[3]: import numpy as np

import pandas as pd
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error, r2_score

[4]: # 1) Load auto-mpg-clean data

car_data = pd.read_csv('dataset/auto-mpg-clean.csv',header=0,
,→skipinitialspace=True) #read the data

## Uncomment following lines and fill in wiht your answer

## display the top 5 data

car_data.head(5)

[4]: mpg cylinders displacement horsepower weight acceleration \

0 26.0 4 97.0 46 1835 20.5
1 26.0 4 97.0 46 1950 21.0
2 43.1 4 90.0 48 1985 21.5
3 44.3 4 90.0 48 2085 21.7
4 43.4 4 90.0 48 2335 23.7

model year origin car name

0 70 2 volkswagen 1131 deluxe sedan
1 73 2 volkswagen super beetle
2 78 2 volkswagen rabbit custom diesel
3 80 2 vw rabbit c (diesel)
4 80 2 vw dasher (diesel)

[5]: ## Uncomment following lines and fill in wiht your answer

## Describe the statistics of the data

[6]: car_data.describe()

[6]: mpg cylinders displacement horsepower weight \

count 392.000000 392.000000 392.000000 392.000000 392.000000
mean 23.445918 5.471939 194.411990 104.469388 2977.584184
std 7.805007 1.705783 104.644004 38.491160 849.402560
min 9.000000 3.000000 68.000000 46.000000 1613.000000
25% 17.000000 4.000000 105.000000 75.000000 2225.250000
50% 22.750000 4.000000 151.000000 93.500000 2803.500000
75% 29.000000 8.000000 275.750000 126.000000 3614.750000
max 46.600000 8.000000 455.000000 230.000000 5140.000000

2
acceleration model year origin
count 392.000000 392.000000 392.000000
mean 15.541327 75.979592 1.576531
std 2.758864 3.683737 0.805518
min 8.000000 70.000000 1.000000
25% 13.775000 73.000000 1.000000
50% 15.500000 76.000000 1.000000
75% 17.025000 79.000000 2.000000
max 24.800000 82.000000 3.000000

1.2 Extract weight as X (feature) and fuel consumption (mpg) as Y (label)

[7]: # 2) Extract weight as X (feature) and fuel consumption (mpg) as Y (label)

# 3) Create LinearRegression model from Sckit Learn package and fit the data
# You may add additional variables within the brackets or reshape the data

'''
X = car_data.iloc[:,4:5]
Y = car_data.iloc[:,0:1]

# or

X = car_data['weight'].to_numpy()
Y = car_data['mpg'].to_numpy()

X = X.reshape(-1, 1)
Y = Y.reshape(-1, 1)
'''
# or

X = car_data[['weight']]
Y = car_data['mpg']

[8]: ## Create LinearRegression model from Sckit Learn package and fit the data
## You may add additional variables within the brackets or reshape the data

[9]: lin_reg = LinearRegression(normalize=True)

## Uncomment following lines and fill in wiht your answer

lin_reg.fit(X,Y)

[9]: LinearRegression(normalize=True)

3
1.3 Predict the all data of X
1.4 Display the Coefficients, Intercept and r-squared value

[10]: ## Uncomment following lines and fill in wiht your answer

Y_pred = lin_reg.predict(X)

r2_value = r2_score(Y, Y_pred)

print("Coefficients: ", lin_reg.coef_)

print("Intercept: ", lin_reg.intercept_)

# The coefficient of determination: 1 is perfect prediction

print('R-squared Coefficient of determination: %.2f'

%r2_value)

Coefficients: [-0.00764734]
Intercept: 46.216524549017585
R-squared Coefficient of determination: 0.69

1.5 Display the predicted data by appending existing data

Append the predicted data and display together with original data as shown:

[11]: # 6) Append the predicted data

car_data['Y_pred'] = Y_pred
print(car_data.head())

mpg cylinders displacement horsepower weight acceleration \

4
0 26.0 4 97.0 46 1835 20.5
1 26.0 4 97.0 46 1950 21.0
2 43.1 4 90.0 48 1985 21.5
3 44.3 4 90.0 48 2085 21.7
4 43.4 4 90.0 48 2335 23.7

model year origin car name Y_pred

0 70 2 volkswagen 1131 deluxe sedan 32.183651
1 73 2 volkswagen super beetle 31.304207
2 78 2 volkswagen rabbit custom diesel 31.036550
3 80 2 vw rabbit c (diesel) 30.271815
4 80 2 vw dasher (diesel) 28.359980

2 Exercise 1:
Perform linear regression using sklearn lib on the Income3 dataset. Predict the Income based on
the Year of Education.

2.0.1 Perform following steps:

1. Load Input dataset
2. Extract feature and label data
3. The create Linear Regression class and fit (train) the data e.g LinearRegres-
sion(normalize=True)
4. Predict(Y) based on (X)
5. Display Coefficient, Intercept
6. R-square using r2_score function from Scikit Learn metrics package
7. Predict a single data using with model

[2]: # 1) Load Input dataset

#my_data = ___________________________________________________________

#print the first 5 data

#_________________________________________

# 2) Extract data

#X = ___________________________________________________________

#Y = = ___________________________________________________________

# 3) The create Linear Regression class and fit the data.

#lin_reg = ___________________________________________________________

5
#= ___________________________________________________________

# 4) Predict income data (Y) based on number of years in higher education (X)

#Y_pred = ___________________________________________________________

# 5) Display Coefficient, Intercept

#___________________________________________________________

# 6) R-square using r2_score function from Scikit Learn metrics package

#r2_data = ___________________________________________________________

#___________________________________________________________

# 7) Predict a single data eg. print the output expected income for 3 years in
,→higher education

#yeasofeducation = ___________________________________________________________

#predicted_income = __________________________________________________________

#print("Predicted income with 3 years higher education is :%.2f"

,→%predicted_income)

Observation Years of Higher Education (x) Income (y)

0 1 6 89617
1 2 0 39826
2 3 6 79894
3 4 3 56547
4 5 4 64795
Coefficients: [[7692.92437864]]
Intercept: [37264.82601798]
R-squared Coefficient of determination: 0.95
Predicted income with 3 years higher education is :60343.60

2.1 Multi-Regression with Scikit Learn Library

Multi-Regression find the relationship between multiple independent variables and one depen-
dent variable. A dependent variable is modeled as a function of several independent variables

6
with corresponding coefficients, along with the constant term. Multiple regression requires two
or more independent variables, and this is why it is called multiple regression.
Use the same fuel consumption dataset and implement Multiple regression model. Select the MPG
data as target and the rest of the data such as ‘cylinders’, ‘displacement’, ‘horsepower’, ‘weight’,
‘acceleration’, ‘model year’ as input data X.

2.2 Load data from csv

[ ]: car_data = pd.read_csv('dataset/auto-mpg-clean.csv',header=0,
,→skipinitialspace=True) #read the data

2.2.1 Prepare the input data and target (label)

• Split the column from index 1 onwards from the dataset as input X or
• Select ‘cylinders’, ‘displacement’, ‘horsepower’, ‘weight’, ‘acceleration’, ‘model year’ as X
• Select the column 0 or ‘mpg’ from the dataset as the label variable Y

[10]: # Prepare input and target label data

#setting the matrixes
# For 2 variables for multiple regression. You may add additional variables
,→within the brackets

#split
#X = car_data.to_numpy()
#X = X[:, 1:]

#or
#select the column

X = car_data[['cylinders', 'displacement', 'horsepower', 'weight',

,→'acceleration', 'model year']]

Y = car_data['mpg']

#make sure the dimension is correct

print (X.shape)
print (Y.shape)

(392, 6)
(392,)

2.2.2 Create LinearRegression model from Sckit Learn package

• Create LinearRegression model from Sckit Learn package and fit the data
• Display the Coefficients, Intercept and r-squared value
Note it will display coefficiens of the corresponding features.

7
[11]: lin_reg = LinearRegression(normalize=True)
lin_reg.fit(X, Y)

print('Intercept: \n', lin_reg.intercept_)

print('Coefficients: \n', lin_reg.coef_)

Intercept:
-14.535250480506125
Coefficients:
[-3.29859089e-01 7.67843024e-03 -3.91355574e-04 -6.79461791e-03
8.52732469e-02 7.53367180e-01]

2.2.3 Predict the all data of X and display r2_value

[12]: # 4) Predict the all data of X

# 5) Print out the Coefficients, Intercept and r-squared value

Y_pred = lin_reg.predict(X)
r2_value = r2_score(Y, Y_pred)

print("Coefficients: ", lin_reg.coef_)

print("Intercept: ", lin_reg.intercept_)

# The coefficient of determination: 1 is perfect prediction

print('R-squared Coefficient of determination: %.2f'
%r2_value)

# 6) Append the predicted data

car_data['Y_pred'] = Y_pred
print(car_data.head())

Coefficients: [-3.29859089e-01 7.67843024e-03 -3.91355574e-04 -6.79461791e-03

8.52732469e-02 7.53367180e-01]
Intercept: -14.535250480506125
R-squared Coefficient of determination: 0.81
mpg cylinders displacement horsepower weight acceleration \
0 26.0 4 97.0 46 1835 20.5
1 26.0 4 97.0 46 1950 21.0
2 43.1 4 90.0 48 1985 21.5
3 44.3 4 90.0 48 2085 21.7
4 43.4 4 90.0 48 2335 23.7

model year origin car name Y_pred

0 70 2 volkswagen 1131 deluxe sedan 26.887799
1 73 2 volkswagen super beetle 28.409156
2 78 2 volkswagen rabbit custom diesel 31.926285
3 80 2 vw rabbit c (diesel) 32.770612

8
4 80 2 vw dasher (diesel) 31.242504

3 Exercise 1: Predict the mpg data of the car with following informa-
tion
• ‘cylinders’ = 4
• ‘displacement’ = 97
• ‘horsepower’ = 48
• ‘weight’ = 2000
• ‘acceleration’ = 23.8
• ‘model year’ = 80
Expected output: Predicted mpg info is 33.58.

[13]: # Crate two dimensional array called new_data with proposed data

#new_data = __________________________________________________________

# Predict mpg and display

#p_mpg = __________________________________________________________

#__________________________________________________________________

Predicted mpg info is 33.58.

4 Exercise 2:
Perform multi regression using sklearn lib on the fishcatch dataset. Predict the weight of fish
based on all features EXCEPT sex and species
Show your R-squared and Adjusted R-squared.

4.1 Load the data from fishcatch.csv

9
[14]: ## Load csv data and display top 5 data

#fishcatch_data =
,→_____________________________________________________________________________

#= __________________________________________________________

[14]: Observation Species Weight Length1 Length2 Length3 Height Width \

0 1 1 242.0 23.2 25.4 30.0 38.4 13.4
1 2 1 290.0 24.0 26.3 31.2 40.0 13.8
2 3 1 340.0 23.9 26.5 31.1 39.8 15.1
3 4 1 363.0 26.3 29.0 33.5 38.0 13.3
4 5 1 430.0 26.5 29.0 34.0 36.6 15.1

Sex
0 NaN
1 NaN
2 NaN
3 NaN
4 NaN

4.2 Selecte the ‘Length1’, ‘Length2’, ‘Length3’, ‘Height’, ‘Width’ as features

4.3 Selecte the ‘Weight’ as label

[15]: #Select features X and label Y with proposed columns

#X = = __________________________________________________________

#Y = = __________________________________________________________

#Display the dimension of X and Y

#__________________________________________________________

(157, 5)
(157,)

4.4 Create LinearRegression model and train with X and Y data

4.5 Display the intercept and coefficient similar to figure shown
• Intercept: -725.5440014015896
• Coefficients: [ 35.74869687 -13.20361087 9.50021993 4.89828407 9.06157899]

10
[16]: # Create model and train

#lin_reg1 = _________________________________________________________

#__________________________________________________________

# Display intercept and coefficients

#__________________________________________________________

Intercept:
-725.5440014015896
Coefficients:
[ 35.74869687 -13.20361087 9.50021993 4.89828407 9.06157899]

4.6 Make prediction of input data X

4.7 Compute R2 score and display

[17]: ## Make prediction

#Y_pred = __________________________________________________________

## Calculate R2 value and display

#r2_value = __________________________________________________________

#__________________________________________________________

R-squared Coefficient of determination: 0.87

4.8 Display the predicted data together with the original data

[18]: ## Display the predicted data

#__________________________________________________________

Observation Species Weight Length1 Length2 Length3 Height Width \

0 1 1 242.0 23.2 25.4 30.0 38.4 13.4
1 2 1 290.0 24.0 26.3 31.2 40.0 13.8
2 3 1 340.0 23.9 26.5 31.1 39.8 15.1
3 4 1 363.0 26.3 29.0 33.5 38.0 13.3
4 5 1 430.0 26.5 29.0 34.0 36.6 15.1

11
Sex Y_pred
0 NaN 362.979915
1 NaN 402.557772
2 NaN 406.192554
3 NaN 456.653174
4 NaN 478.006268

5 Exercise 3:
5.1 Predict the weight of the fish with following input data
• ‘length1’ = 24
• ‘length2’ = 28
• ‘length3’ = 34
• ‘height’ = 41
• ‘width’ = 15
Expected output (estimate): Predicted weight of the fish is 422.48.

[19]: #Crate two dimensional array called new_data with proposed data

#new_data = __________________________________________________________

#predict fish weight and display

#fish_weight = __________________________________________________________

#__________________________________________________________

Predicted weight of the fish is 422.48.

The Elements of Quantitative Investing
From Everand
The Elements of Quantitative Investing
Giuseppe A. Paleologo
No ratings yet
Praxis 2 Scores
No ratings yet
Praxis 2 Scores
3 pages
Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques: A Guide to Data Science for Fraud Detection
From Everand
Fraud Analytics Using Descriptive, Predictive, and Social Network Techniques: A Guide to Data Science for Fraud Detection
Bart Baesens
No ratings yet
Nihal Pathan BT32027
No ratings yet
Nihal Pathan BT32027
4 pages
INSY446 - 02 - Linear Model Part 1
No ratings yet
INSY446 - 02 - Linear Model Part 1
27 pages
Iml 51
No ratings yet
Iml 51
10 pages
ML0101EN Reg Simple Linear Regression Co2 Py v1
No ratings yet
ML0101EN Reg Simple Linear Regression Co2 Py v1
4 pages
Assignment 2 ML
No ratings yet
Assignment 2 ML
11 pages
Linear Regression
100% (1)
Linear Regression
16 pages
Simple Linear Regression With Jupyter Notebook: Dr. Alvin Ang
No ratings yet
Simple Linear Regression With Jupyter Notebook: Dr. Alvin Ang
16 pages
Linear Regression on Car Dataset
No ratings yet
Linear Regression on Car Dataset
2 pages
ML0101EN Reg Mulitple Linear Regression Co2 Py v1
No ratings yet
ML0101EN Reg Mulitple Linear Regression Co2 Py v1
5 pages
DS EXP6
No ratings yet
DS EXP6
5 pages
vertopal.com_UCD_linear_reg2
No ratings yet
vertopal.com_UCD_linear_reg2
3 pages
Exercises d'Application Regression Analysis
No ratings yet
Exercises d'Application Regression Analysis
4 pages
Exp_6-Model Development_sdk_ok
No ratings yet
Exp_6-Model Development_sdk_ok
11 pages
pgm7.doc
No ratings yet
pgm7.doc
3 pages
Exercises 2 Unfinished
No ratings yet
Exercises 2 Unfinished
8 pages
Simple Linear Regression
No ratings yet
Simple Linear Regression
10 pages
Car Fuel Efficiency Presentation
No ratings yet
Car Fuel Efficiency Presentation
7 pages
Car Fuel Efficiency Presentation Pro
No ratings yet
Car Fuel Efficiency Presentation Pro
7 pages
Lab 6
No ratings yet
Lab 6
2 pages
Automobile_Linear_Regression
No ratings yet
Automobile_Linear_Regression
1 page
Program -7
No ratings yet
Program -7
4 pages
Generative AI For Models Development
No ratings yet
Generative AI For Models Development
8 pages
19BCS2059 DL1
No ratings yet
19BCS2059 DL1
4 pages
7.PRGM
No ratings yet
7.PRGM
4 pages
Untitled Document
No ratings yet
Untitled Document
6 pages
Lecture 3
No ratings yet
Lecture 3
90 pages
Multiple Regression1
No ratings yet
Multiple Regression1
27 pages
In Class Exercise Linear Regression in R
No ratings yet
In Class Exercise Linear Regression in R
6 pages
unit 3 8
No ratings yet
unit 3 8
5 pages
LinearRegression HandsOn
No ratings yet
LinearRegression HandsOn
3 pages
R lab
No ratings yet
R lab
3 pages
Lab3 Report Revathy
No ratings yet
Lab3 Report Revathy
8 pages
Int AI TW-PW 03
No ratings yet
Int AI TW-PW 03
4 pages
Lab4
No ratings yet
Lab4
4 pages
Linear Regression Using R Computer Labs: Cars - CSV Data Set - 2
No ratings yet
Linear Regression Using R Computer Labs: Cars - CSV Data Set - 2
9 pages
Exercise 3 Linear Regression Modeling
No ratings yet
Exercise 3 Linear Regression Modeling
1 page
Artificial Intelligence Semester Project: Topic: Car Mileage Predictor Presented by Abdullah Farooq
No ratings yet
Artificial Intelligence Semester Project: Topic: Car Mileage Predictor Presented by Abdullah Farooq
17 pages
Chapter 4 Exercise 11
No ratings yet
Chapter 4 Exercise 11
5 pages
ML Foram
No ratings yet
ML Foram
17 pages
Session7 LinearRegression
No ratings yet
Session7 LinearRegression
52 pages
Predicting-Vehicle-Fuel-Efficiency-with-Regression-Modeling
No ratings yet
Predicting-Vehicle-Fuel-Efficiency-with-Regression-Modeling
9 pages
CO2 Emission Project Source Code
No ratings yet
CO2 Emission Project Source Code
2 pages
ISyE7406 Homework3
No ratings yet
ISyE7406 Homework3
20 pages
Unit 5
No ratings yet
Unit 5
171 pages
1 Regression
No ratings yet
1 Regression
4 pages
Linear Regression - Numpy and Sklearn
No ratings yet
Linear Regression - Numpy and Sklearn
7 pages
Regression Practice - MLR
No ratings yet
Regression Practice - MLR
9 pages
Linear Regression
No ratings yet
Linear Regression
36 pages
Mtcars: Choosing The Most Related Variable (S) To The Response
No ratings yet
Mtcars: Choosing The Most Related Variable (S) To The Response
13 pages
En Tanagra Python StatsModels PDF
No ratings yet
En Tanagra Python StatsModels PDF
20 pages
7406HW03
No ratings yet
7406HW03
2 pages
MULTIPLE LINEAR REGRESSION 3
No ratings yet
MULTIPLE LINEAR REGRESSION 3
68 pages
Lecture-2 Unit 2
No ratings yet
Lecture-2 Unit 2
56 pages
ML manoj
No ratings yet
ML manoj
51 pages
Regression
No ratings yet
Regression
5 pages
20BCE1205 Lab3
No ratings yet
20BCE1205 Lab3
9 pages
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
From Everand
Advanced Multiplayer Game Development with Ureal Engine 5: A Comprehensive Guide to C++ Scripting
Vladimir Kiselev
No ratings yet
Data Mining Models: Techniques and Applications
From Everand
Data Mining Models: Techniques and Applications
Ravi Deshpande
No ratings yet
KNN
No ratings yet
KNN
14 pages
Linear_Regression
No ratings yet
Linear_Regression
18 pages
Python_for_AIML2
No ratings yet
Python_for_AIML2
21 pages
Python_for_AIML1
No ratings yet
Python_for_AIML1
15 pages
Mathematics Parachutes
No ratings yet
Mathematics Parachutes
13 pages
KSEEB 1st PUC Statistics Syllabus 2021 22
No ratings yet
KSEEB 1st PUC Statistics Syllabus 2021 22
2 pages
Trigonometry Sheet 1
No ratings yet
Trigonometry Sheet 1
29 pages
02 Assign Electrostatics Gauss Law SC
No ratings yet
02 Assign Electrostatics Gauss Law SC
6 pages
Power Measurement With Cadence EDA: Microelectronics Students Group December 31, 2009
No ratings yet
Power Measurement With Cadence EDA: Microelectronics Students Group December 31, 2009
8 pages
RBC Statistics Overview RBC
No ratings yet
RBC Statistics Overview RBC
31 pages
Simplifying Radical Expressions: Algebra 1
No ratings yet
Simplifying Radical Expressions: Algebra 1
26 pages
Motion in A Straight Line Worksheet
100% (1)
Motion in A Straight Line Worksheet
2 pages
Cambridge International AS & A Level: PHYSICS 9702/32
No ratings yet
Cambridge International AS & A Level: PHYSICS 9702/32
16 pages
Mmpo 02
No ratings yet
Mmpo 02
48 pages
Measurement of Distance and Direction PDF
100% (1)
Measurement of Distance and Direction PDF
9 pages
Graphs Book For Printing
67% (3)
Graphs Book For Printing
112 pages
A Level Further Mathematics For OCR A Pure Core Student Book 1 Practice Paper Solutions
No ratings yet
A Level Further Mathematics For OCR A Pure Core Student Book 1 Practice Paper Solutions
4 pages
Torsional Moments
No ratings yet
Torsional Moments
16 pages
DLL Quarter 1 Week 1
No ratings yet
DLL Quarter 1 Week 1
7 pages
01a 1MA1 1H Spring 2024 Aiming For Grade 7 PDF
No ratings yet
01a 1MA1 1H Spring 2024 Aiming For Grade 7 PDF
10 pages
Marquardt method (1)
No ratings yet
Marquardt method (1)
4 pages
28-1 Ion Formulae
No ratings yet
28-1 Ion Formulae
16 pages
Emfesoln chp08 PDF
No ratings yet
Emfesoln chp08 PDF
26 pages
Iso 11423 2 en PDF
No ratings yet
Iso 11423 2 en PDF
11 pages
Continous Time Signal Vs Discrete Time Signal
No ratings yet
Continous Time Signal Vs Discrete Time Signal
9 pages
3rd Term - Revision For Quiz#1 - GR 3
No ratings yet
3rd Term - Revision For Quiz#1 - GR 3
7 pages
PDF Level Sets and Extrema of Random Processes and Fields 1st Edition Jean-Marc Azais download
100% (9)
PDF Level Sets and Extrema of Random Processes and Fields 1st Edition Jean-Marc Azais download
67 pages
Holiday Homework
No ratings yet
Holiday Homework
4 pages
Stability of Columns
No ratings yet
Stability of Columns
45 pages
Solved - Three Groups of Students From The Geotechnical Engineer
100% (1)
Solved - Three Groups of Students From The Geotechnical Engineer
5 pages
Gujarat Technological University
No ratings yet
Gujarat Technological University
3 pages
Staad Details
No ratings yet
Staad Details
7 pages

Multi_Regression

Uploaded by

Multi_Regression

Uploaded by

1 Supervised Machine Learning - Regression with Scikit Learn Library

1.1 Linear-Regression with Scikit Learn Library

1.1.1 Following steps will be performed:

[3]: import numpy as np

[4]: # 1) Load auto-mpg-clean data

## Uncomment following lines and fill in wiht your answer

[4]: mpg cylinders displacement horsepower weight acceleration \

model year origin car name

[5]: ## Uncomment following lines and fill in wiht your answer

[6]: mpg cylinders displacement horsepower weight \

1.2 Extract weight as X (feature) and fuel consumption (mpg) as Y (label)

[7]: # 2) Extract weight as X (feature) and fuel consumption (mpg) as Y (label)

[9]: lin_reg = LinearRegression(normalize=True)

## Uncomment following lines and fill in wiht your answer

[10]: ## Uncomment following lines and fill in wiht your answer

r2_value = r2_score(Y, Y_pred)

print("Coefficients: ", lin_reg.coef_)

# The coefficient of determination: 1 is perfect prediction

print('R-squared Coefficient of determination: %.2f'

1.5 Display the predicted data by appending existing data

[11]: # 6) Append the predicted data

mpg cylinders displacement horsepower weight acceleration \

model year origin car name Y_pred

2.0.1 Perform following steps:

[2]: # 1) Load Input dataset

#print the first 5 data

# 3) The create Linear Regression class and fit the data.

# 5) Display Coefficient, Intercept

# 6) R-square using r2_score function from Scikit Learn metrics package

#print("Predicted income with 3 years higher education is :%.2f"

Observation Years of Higher Education (x) Income (y)

2.1 Multi-Regression with Scikit Learn Library

2.2 Load data from csv

2.2.1 Prepare the input data and target (label)

[10]: # Prepare input and target label data

X = car_data[['cylinders', 'displacement', 'horsepower', 'weight',

#make sure the dimension is correct

2.2.2 Create LinearRegression model from Sckit Learn package

print('Intercept: \n', lin_reg.intercept_)

2.2.3 Predict the all data of X and display r2_value

[12]: # 4) Predict the all data of X

print("Coefficients: ", lin_reg.coef_)

# The coefficient of determination: 1 is perfect prediction

# 6) Append the predicted data

Coefficients: [-3.29859089e-01 7.67843024e-03 -3.91355574e-04 -6.79461791e-03

model year origin car name Y_pred

# Predict mpg and display

Predicted mpg info is 33.58.

4.1 Load the data from fishcatch.csv

[14]: Observation Species Weight Length1 Length2 Length3 Height Width \

4.2 Selecte the ‘Length1’, ‘Length2’, ‘Length3’, ‘Height’, ‘Width’ as features

[15]: #Select features X and label Y with proposed columns

#Display the dimension of X and Y

4.4 Create LinearRegression model and train with X and Y data

# Display intercept and coefficients

4.6 Make prediction of input data X

[17]: ## Make prediction

## Calculate R2 value and display

R-squared Coefficient of determination: 0.87

[18]: ## Display the predicted data

Observation Species Weight Length1 Length2 Length3 Height Width \

#predict fish weight and display

Predicted weight of the fish is 422.48.

You might also like