0% found this document useful (0 votes)

34 views5 pages

ML-Lab07-Building and Evaluating Multivariate Regression Models in Python

Machine learning multiple regression

Uploaded by

muneebgoraya60

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

34 views5 pages

ML-Lab07-Building and Evaluating Multivariate Regression Models in Python

Machine learning multiple regression

Uploaded by

muneebgoraya60

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

| Lab 07 |

Building, Analyzing, and Testing Different Types of Regression Models

Lab Objective:

This lab tutorial will guide you on analyzing, building, and testing regression models in Python. We will use
the popular scikit-learn library for this purpose.

Setting up Python with Google Colab

1. Go to Google Colab.
2. Create a new notebook by clicking on File > New Notebook.
3. You're now ready to start writing and executing Python code in the notebook!

Building and Testing Linear Regression Models in Python

1. Load and Explore: Diabetes Dataset

Let us use the "Diabetes" dataset available in scikit-learn, which contains ten baseline variables, six blood
serum measurements, age, sex, body mass index, average blood pressure, and six blood serum measurements
for 442 diabetes patients.

import numpy as np
import pandas as pd
from sklearn.datasets import load_diabetes

# Load the dataset

diabetes = load_diabetes()
data = pd.DataFrame(data=diabetes.data, columns=diabetes.feature_names)
data['TARGET'] = diabetes.target

# Explore the dataset

print(data.head())
print(data.info())
print(data.describe())

Machine Learning Lab – Fall 2024

Acknowledgement: Air University, Islamabad
2. Dataset Preprocessing

from sklearn.model_selection import train_test_split

from sklearn.preprocessing import StandardScaler

# Split data into features (X) and target variable (y)

X = data.drop('TARGET', axis=1)
y = data['TARGET']

# Split data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

3. Building Regression Models

3.1. Linear Regression

from sklearn.linear_model import LinearRegression

# Initialize the model

model_lr = LinearRegression()

# Train the model

model_lr.fit(X_train, y_train)

3.2. Ridge Regression

from sklearn.linear_model import Ridge

# Initialize the model

model_ridge = Ridge(alpha=1.0)

# Train the model

model_ridge.fit(X_train, y_train)

3.3. Lasso Regression

from sklearn.linear_model import Lasso

# Initialize the model

model_lasso = Lasso(alpha=1.0)

# Train the model

model_lasso.fit(X_train, y_train)

Machine Learning Lab – Fall 2024

Acknowledgement: Air University, Islamabad
4. Building Linear Regression Models

from sklearn.metrics import mean_squared_error

# Predict using the models

y_pred_lr = model_lr.predict(X_test)
y_pred_ridge = model_ridge.predict(X_test)
y_pred_lasso = model_lasso.predict(X_test)

# Calculate Mean Squared Error (MSE)

mse_lr = mean_squared_error(y_test, y_pred_lr)
mse_ridge = mean_squared_error(y_test, y_pred_ridge)
mse_lasso = mean_squared_error(y_test, y_pred_lasso)

print(f'MSE Linear Regression: {mse_lr}')

print(f'MSE Ridge Regression: {mse_ridge}')
print(f'MSE Lasso Regression: {mse_lasso}')

POINTS TO PONDER:

1) Differentiate between Linear, Ridge, and Lasso Regression.

2) Analyze the MSE values to determine which model performed best and why.
3) Observe the results with different hyperparameters for Ridge and Lasso regression.
4) Explore what method is being used at the backend for computing the values of theta parameters.

5. Building Non-Linear Regression Model

Nonlinear regression models are used when the relationship between the independent and dependent variables
is not linear. One popular nonlinear regression model is the Polynomial Regression. In the next step, we will
perform Polynomial Regression on the "Diabetes" dataset.

from sklearn.preprocessing import PolynomialFeatures

from sklearn.linear_model import LinearRegression
from sklearn.pipeline import make_pipeline

# Define the degree of the polynomial

degree = 2

# Create a pipeline with Polynomial Features and Linear Regression

model_poly = make_pipeline(PolynomialFeatures(degree), LinearRegression())

# Train the model

model_poly.fit(X_train, y_train)

# Predict using the model

y_pred_poly = model_poly.predict(X_test)

Machine Learning Lab – Fall 2024

Acknowledgement: Air University, Islamabad
# Calculate Mean Squared Error (MSE) for Polynomial Regression
mse_poly = mean_squared_error(y_test, y_pred_poly)

print(f'MSE Polynomial Regression (Degree {degree}): {mse_poly}')

POINTS TO PONDER:

What is the working mechanism behind the above code?

The use of a pipeline with Polynomial Features and Linear Regression allows us to seamlessly combine the
process of transforming the features into polynomial features and fitting a linear regression model in a single
step. Here is a breakdown of why we use this approach:

Step-1 - Polynomial Features: Polynomial regression is essentially a linear regression on transformed

features. It transforms the original features into higher-degree polynomial features. For example, if we have a
single feature, `x`, and we want to perform polynomial regression with degree 2, it will transform the feature
into `x` and `x2`.This transformation allows the model to capture nonlinear relationships between the features
and the target variable.

Step-2 - Linear Regression: After transforming the features using Polynomial Features, we are left with a set
of new features, which may be of higher degree. However, the relationship between these transformed features
and the target variable is still linear.

Step-3 - Pipeline: A pipeline in scikit-learn allows us to chain multiple processing steps together. In this
case, we first apply the Polynomial Features transformation, followed by fitting a Linear Regression model.
The pipeline ensures that the same preprocessing steps are applied to both the training and testing data.

This approach simplifies the modeling process and allows us to utilize the existing tools for linear regression,
including evaluation metrics like Mean Squared Error.

Can we directly use Polynomial Regression without using the Pipeline?

Yes, you can apply Polynomial Regression directly without using a pipeline. In scikit-learn, you can use the
PolynomialFeatures class to transform your features, and then apply a Linear Regression model on the
transformed features. This approach is also valid and provides you with more flexibility if you want to explore
the intermediate steps or apply additional customizations.

Here is an example of how you can do it:

from sklearn.preprocessing import PolynomialFeatures
from sklearn.linear_model import LinearRegression

# Define the degree of the polynomial

degree = 2

# Create polynomial features

Machine Learning Lab – Fall 2024

Acknowledgement: Air University, Islamabad
poly_features = PolynomialFeatures(degree=degree)
X_poly = poly_features.fit_transform(X_train)

# Train a linear regression model on the polynomial features

model_poly = LinearRegression()
model_poly.fit(X_poly, y_train)

# Predict using the model

X_test_poly = poly_features.transform(X_test)
y_pred_poly = model_poly.predict(X_test_poly)

You can access these coefficients using model_poly.coef_ and model_poly.intercept_. Remember that the
equation becomes more complex for higher-degree polynomials and involves multiple coefficients for each
feature.

Lab Task:
Use any of the publically available datasets for the Multivariate Regression problem. The dataset must include
a mix of numerical and categorical/ordinal features. Perform the following steps on the data.

1) Load and Explore the Dataset

a) Check for missing values, if any, and handle them appropriately.

b) Generate summary statistics for the dataset.
c) Split the data into features (X) and target variable (y).
d) Encode the categorical/ordinal variables using relevant encoding techniques.
e) Find and plot the correlation between different variables.

2) Build and Train Regression Models

a) Choose different regression models (e.g., Linear Regression, Ridge Regression, Lasso
Regression, and Polynomial Regression). Train them using the features and target variable.
b) Train these models with Feature Normalization and Standardization as well.

3) Evaluate Model Performance

a) Predict the target variable using the trained models and calculate the Mean Squared Error (MSE).
b) Compare the performance of the chosen regression models with and without feature scaling.
c) Visualize the predictions against the actual values for better understanding.
d) Take a random test sample and predict its value.
e) Prepare a 1-2 page report on the analysis of results for different regression models.

Machine Learning Lab – Fall 2024

Acknowledgement: Air University, Islamabad

M.J.D. Powell - Approximation Theory and Methods-Cambridge University Press (1981)
No ratings yet
M.J.D. Powell - Approximation Theory and Methods-Cambridge University Press (1981)
351 pages
M832 Approximation Theory Course Notes
100% (1)
M832 Approximation Theory Course Notes
94 pages
ML Regression Documentation
No ratings yet
ML Regression Documentation
7 pages
Experiment 7 ML Vtu
No ratings yet
Experiment 7 ML Vtu
5 pages
Extracted Text
No ratings yet
Extracted Text
391 pages
ML Lab 05
No ratings yet
ML Lab 05
10 pages
UNIT-1 Polynomial Regression
No ratings yet
UNIT-1 Polynomial Regression
7 pages
Unit 3 7
No ratings yet
Unit 3 7
4 pages
(Slide) Non Linear Regression
No ratings yet
(Slide) Non Linear Regression
39 pages
SML - Week 3
No ratings yet
SML - Week 3
5 pages
22 Practice Polynomial Regression
No ratings yet
22 Practice Polynomial Regression
6 pages
19BCS2059 DL1
No ratings yet
19BCS2059 DL1
4 pages
2.3 ML (Implementation of Polynomial Regression Using Python)
No ratings yet
2.3 ML (Implementation of Polynomial Regression Using Python)
9 pages
Lab Manual 04
No ratings yet
Lab Manual 04
12 pages
Assignment No.4 - (20-Ele-68)
No ratings yet
Assignment No.4 - (20-Ele-68)
17 pages
Linear Regression - Numpy and Sklearn
No ratings yet
Linear Regression - Numpy and Sklearn
7 pages
ML LN 3
No ratings yet
ML LN 3
44 pages
Regression Models
No ratings yet
Regression Models
5 pages
ML Lab File
No ratings yet
ML Lab File
48 pages
ML Polynomial Regression4
No ratings yet
ML Polynomial Regression4
36 pages
ML Lab Programs
No ratings yet
ML Lab Programs
9 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
23 pages
Lab Mannual of ML
No ratings yet
Lab Mannual of ML
43 pages
Sahil ML
No ratings yet
Sahil ML
21 pages
ML Lab Experiment Shivansh
No ratings yet
ML Lab Experiment Shivansh
29 pages
Machine Learning With Python Algorithms
No ratings yet
Machine Learning With Python Algorithms
28 pages
Assignment 5
No ratings yet
Assignment 5
9 pages
Assigment Regression
No ratings yet
Assigment Regression
9 pages
Experiment1 Explanation
No ratings yet
Experiment1 Explanation
6 pages
Machine Learning Practicals
No ratings yet
Machine Learning Practicals
7 pages
ML0101EN Reg Mulitple Linear Regression Co2 Py v1
No ratings yet
ML0101EN Reg Mulitple Linear Regression Co2 Py v1
5 pages
Integrated System Lab
No ratings yet
Integrated System Lab
25 pages
2 - (9-3) Regression Classifiers
No ratings yet
2 - (9-3) Regression Classifiers
35 pages
Wa0002.
No ratings yet
Wa0002.
5 pages
Lecture-2 Unit 2
No ratings yet
Lecture-2 Unit 2
56 pages
Polynomial Regression
No ratings yet
Polynomial Regression
6 pages
ML Lab Manual
100% (1)
ML Lab Manual
37 pages
PythonForML2023 Laboratory07 08 Regression Classification Update2
No ratings yet
PythonForML2023 Laboratory07 08 Regression Classification Update2
6 pages
LAB5 Regularization
No ratings yet
LAB5 Regularization
6 pages
ML Remaining
No ratings yet
ML Remaining
17 pages
Linear Regression - Cheatsheet
No ratings yet
Linear Regression - Cheatsheet
8 pages
ML WorkSheet Milan
No ratings yet
ML WorkSheet Milan
4 pages
Machine Intelligence
No ratings yet
Machine Intelligence
24 pages
Final Lab Manual
No ratings yet
Final Lab Manual
34 pages
3-Polynomial Regression Using Python
No ratings yet
3-Polynomial Regression Using Python
14 pages
Simple Linear Regression - Assignn5
No ratings yet
Simple Linear Regression - Assignn5
8 pages
Linear Regression Code
No ratings yet
Linear Regression Code
5 pages
Model Deploymnet Cheatshete
No ratings yet
Model Deploymnet Cheatshete
2 pages
Vishal AIML 2.2
No ratings yet
Vishal AIML 2.2
4 pages
Regression Analysis
No ratings yet
Regression Analysis
16 pages
LR LogReg
No ratings yet
LR LogReg
53 pages
Day 3 ML
No ratings yet
Day 3 ML
4 pages
PGM 7
No ratings yet
PGM 7
3 pages
Assignment 2 ML
No ratings yet
Assignment 2 ML
11 pages
ML Manoj
No ratings yet
ML Manoj
51 pages
CL IV Manual
No ratings yet
CL IV Manual
108 pages
ICT Assignment 2
No ratings yet
ICT Assignment 2
7 pages
Machine Learning Lab
No ratings yet
Machine Learning Lab
43 pages
Nikita Prasad Polynomial Regression Basics 1710359781
No ratings yet
Nikita Prasad Polynomial Regression Basics 1710359781
16 pages
03 A Polynomial Linear Regression
No ratings yet
03 A Polynomial Linear Regression
6 pages
Understanding Polynomial Regression Model
No ratings yet
Understanding Polynomial Regression Model
11 pages
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Curve Fitting, An Overview of Some Nice Software - A Presentation by Paulius Gebrauskas
No ratings yet
Curve Fitting, An Overview of Some Nice Software - A Presentation by Paulius Gebrauskas
27 pages
Kaplanlearn - Key Concepts 19
100% (1)
Kaplanlearn - Key Concepts 19
2 pages
Get SPSS Advanced 15 0 Manual 1st Edition Inc. Spss PDF Ebook With Full Chapters Now
100% (4)
Get SPSS Advanced 15 0 Manual 1st Edition Inc. Spss PDF Ebook With Full Chapters Now
55 pages
Ch-3 & 4 Solving System of Equations
No ratings yet
Ch-3 & 4 Solving System of Equations
18 pages
Alglib Man
No ratings yet
Alglib Man
430 pages
Dynamic Anisotropy Gold
No ratings yet
Dynamic Anisotropy Gold
10 pages
Filteredspeakertoolkit
No ratings yet
Filteredspeakertoolkit
27 pages
Mathematics 2009
No ratings yet
Mathematics 2009
20 pages
CE 007 (Numerical Solutions To CE Problems)
No ratings yet
CE 007 (Numerical Solutions To CE Problems)
14 pages
Quiz
100% (1)
Quiz
74 pages
Lab 5
No ratings yet
Lab 5
5 pages
Numerical Report
No ratings yet
Numerical Report
22 pages
Lecture 19 Seas Arima
No ratings yet
Lecture 19 Seas Arima
28 pages
Compressor Performance Map Generation and Testing Per SAE J1723
No ratings yet
Compressor Performance Map Generation and Testing Per SAE J1723
40 pages
Ejercicion Interpolacion Arcmap
No ratings yet
Ejercicion Interpolacion Arcmap
13 pages
Analisis Berganda SPSS Dan Responden
No ratings yet
Analisis Berganda SPSS Dan Responden
3 pages
Basics and Principles of Particle Image Velocimetry (PIV)
No ratings yet
Basics and Principles of Particle Image Velocimetry (PIV)
17 pages
Hansson Soderlund2022SDF
No ratings yet
Hansson Soderlund2022SDF
20 pages
Measuring Relationship Via Regression Analysis and Correlation
No ratings yet
Measuring Relationship Via Regression Analysis and Correlation
9 pages
Intro Regression Modeling
No ratings yet
Intro Regression Modeling
11 pages
Interpolating Between Grids of Meteorological Data For Afps 14.5
No ratings yet
Interpolating Between Grids of Meteorological Data For Afps 14.5
5 pages
13 Regression 06 02 2024
No ratings yet
13 Regression 06 02 2024
16 pages
Solutions To Ch12 Blanchard
No ratings yet
Solutions To Ch12 Blanchard
11 pages
Topic 1: Investigating Relationships Between Two Numerical Variables
No ratings yet
Topic 1: Investigating Relationships Between Two Numerical Variables
8 pages
G10 DLL Fourth-Quarter
No ratings yet
G10 DLL Fourth-Quarter
95 pages
College of Engineering CVE154 Course Guide: Numerical Solutions To Civil Engineering Problems
No ratings yet
College of Engineering CVE154 Course Guide: Numerical Solutions To Civil Engineering Problems
4 pages
Chapter 7 Curve Fitting V1.
No ratings yet
Chapter 7 Curve Fitting V1.
43 pages
(PDF Download) Solutions Manual To Accompany An Introduction To Numerical Methods and Analysis James F. Epperson Fulll Chapter
100% (9)
(PDF Download) Solutions Manual To Accompany An Introduction To Numerical Methods and Analysis James F. Epperson Fulll Chapter
64 pages