
INSY 446 – Winter 2023

Data Mining for Business Analytics

Session 3 – Linear Model Part 2


January 23, 2023
Dongliang Sheng
Linear Regression Revisited

§ The core idea of linear regression models is to find a linear relationship between the predictors and the target variable
§ It works well from both a statistical perspective and a data mining perspective
2
Linear Regression Revisited

§ There is one important issue when we use linear regression models in data mining
§ That is, the model is very sensitive to the training dataset

3
Linear Regression Revisited

§ Suppose we separate the data into training (red) and test (blue) sets
§ The relationship estimated from the training data would differ from the "true" relationship (dark grey)
4
Linear Regression Revisited

§ When the model is used on the test dataset (blue), the performance would be subpar
§ In other words, although the performance on the training dataset is reasonably good, the performance on the test dataset is bad
5
The Issue

§ This issue occurs because the objective of the linear regression model is to minimize the sum of squared errors in the training data
§ So, the model has low bias (it performs well on the training data) and high variance (it can perform very badly on the test data)
§ This is similar to the overfitting issue

6
Regularization

§ The idea of the regularization technique is to add a small amount of bias to the model (i.e., making the model perform worse on the training data)
§ In the process, the variance can be reduced (i.e., the model performs better on the test data)
§ Several models operationalize the regularization technique (a small sketch of the trade-off follows)

7
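A minimal sketch of this bias-variance trade-off on synthetic data; the data-generating process and the alpha value below are assumptions for illustration. The penalized model may fit the training data slightly worse yet fit the test data better.

# Bias-variance sketch on synthetic data (illustrative only)
import numpy
from sklearn.linear_model import LinearRegression, Ridge
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error

rng = numpy.random.RandomState(0)
X = rng.uniform(0, 10, size=(30, 1))
y = 2.5 * X.ravel() + rng.normal(0, 5, size=30)   # noisy linear relationship

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.33, random_state=9)

for name, model in [("Linear", LinearRegression()), ("Ridge", Ridge(alpha=10))]:
    model.fit(X_train, y_train)
    print(name,
          "train MSE:", mean_squared_error(y_train, model.predict(X_train)),
          "test MSE:", mean_squared_error(y_test, model.predict(X_test)))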
Example 1
Linear Regression Issues
# Load libraries
import pandas
from matplotlib import pyplot
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Import data
df = pandas.read_csv("cereals.CSV")

# Construct variables
X = df[['Sodium']]
y = df['Rating']

# Using the whole data
lm1 = LinearRegression()
model1 = lm1.fit(X, y)

# Generate the prediction value
y_pred = model1.predict(X)

# Separate the data
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.33, random_state = 9)

# Run linear regression based on the training data
lm2 = LinearRegression()
model2 = lm2.fit(X_train, y_train)

# Generate the prediction value from the training data
y_train_pred = model2.predict(X_train)
# Generate the prediction value from the test data
y_test_pred = model2.predict(X_test)

# Plot the linear regression line for the whole data
pyplot.plot(X, y_pred, color='black', linewidth=1)
# Plot observations in the training data
pyplot.scatter(X_train, y_train, color='red')
# Plot the linear regression line for the training data
pyplot.plot(X_train, y_train_pred, color='red', linewidth=1)
# Plot observations in the test data
pyplot.scatter(X_test, y_test, color='green')
# Plot the linear regression line for the test data
pyplot.plot(X_test, y_test_pred, color='green', linewidth=1)
pyplot.show()
8
Ridge Regression

§ One of the popular models that utilizes the regularization technique is Ridge Regression
§ Ridge Regression adds bias by changing the objective of the model from minimizing the sum of squared errors (SSE) to minimizing:

SSE + (λ × Slope²)

where λ × Slope² is the additional penalty imposed by Ridge Regression

Note: Slope corresponds to the coefficient of the variable x, which is usually denoted by β in a linear regression model.
9
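To make the objective concrete, here is a tiny hand-computed sketch of SSE + (λ × Slope²); the numbers are made up for illustration. With the same SSE, the steeper line pays the larger penalty.

# Ridge objective on toy numbers (illustrative)
def ridge_objective(y_true, y_pred, slope, lam):
    sse = sum((yt - yp) ** 2 for yt, yp in zip(y_true, y_pred))
    return sse + lam * slope ** 2

print(ridge_objective([10, 20], [12, 18], slope=0.5, lam=1.0))   # 8 + 0.25 = 8.25
print(ridge_objective([10, 20], [12, 18], slope=2.0, lam=1.0))   # 8 + 4.00 = 12.0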
Ridge Regression

§ Intuitively, the slope represents the sensitivity of the target variable to changes in the value of the predictor(s)
§ For a steep line (i.e., a large slope), a change in a predictor leads to a large change in the target variable
§ For a flat line (i.e., a small slope), a change in a predictor leads to a small change in the target variable

10
Ridge Regression

§ The parameter λ controls how much the model is penalized for being sensitive to changes in the value of the predictor(s)
§ λ ranges from zero to infinity
§ If λ = 0, the model becomes the traditional linear regression model
§ As λ increases, the model is penalized more when it is sensitive to changes in the value of the predictor(s)
§ In other words, a higher λ pushes the slope of the ridge regression line closer to 0 (a short sketch follows)
11
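A minimal sketch of this effect on the cereals data used in the examples below; the candidate λ values are assumptions, and note that scikit-learn calls the penalty parameter alpha rather than λ.

# Slope shrinks toward 0 as the penalty grows
from sklearn.linear_model import Ridge
import pandas

df = pandas.read_csv("cereals.CSV")
X = df[['Sodium']]
y = df['Rating']

for alpha in [0.01, 1, 10, 100, 1000]:   # illustrative λ values
    ridge = Ridge(alpha=alpha)
    ridge.fit(X, y)
    print(alpha, ridge.coef_)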
Ridge Regression

§ With this penalty term, the ridge regression line has a smaller slope than the linear regression line
§ The ridge regression line fits the test set better than the linear regression line
12
Ridge Regression

§ With trial and error, we can find the value of λ that optimizes the SSE based on the test set
§ In practice, we use cross validation to find the optimal value of λ (a minimal sketch follows)

13
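A minimal sketch using scikit-learn's built-in RidgeCV, which selects the penalty by cross validation; the candidate grid and cv=5 are assumptions for illustration.

# Cross-validated choice of the penalty parameter
from sklearn.linear_model import RidgeCV
import pandas

df = pandas.read_csv("cereals.CSV")
X = df[['Sodium']]
y = df['Rating']

ridge_cv = RidgeCV(alphas=[0.1, 1.0, 10.0, 100.0], cv=5)
ridge_cv.fit(X, y)
print(ridge_cv.alpha_)   # the selected penalty
print(ridge_cv.coef_)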
Example 2
Ridge Regression

# Load libraries
from sklearn.linear_model import Ridge
import pandas

# Import data
df = pandas.read_csv("cereals.CSV")

# Construct variables
X = df[['Sodium']]
y = df['Rating']

# Separate the data
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.33, random_state = 9)

# Run ridge regression (sklearn calls the penalty parameter alpha;
# alpha=1.0 below is an illustrative value)
ridge = Ridge(alpha=1.0)
model = ridge.fit(X_train, y_train)

# Generate the prediction value from the test data
y_test_pred = model.predict(X_test)

# Calculate the MSE
from sklearn.metrics import mean_squared_error
print(mean_squared_error(y_test, y_test_pred))
14
Example 3
Penalty Parameter

# Load libraries
from sklearn.linear_model import Ridge
import pandas

# Import data
df = pandas.read_csv("cereals.CSV")

# Construct variables
X = df[['Sodium']]
y = df['Rating']

# Separate the data
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.33, random_state = 9)

# Run ridge regression with a larger penalty (alpha=100 is illustrative)
ridge = Ridge(alpha=100)
model = ridge.fit(X_train, y_train)

# Print the coefficients
print(model.intercept_)
print(model.coef_)

# Generate the prediction value from the test data
y_test_pred = model.predict(X_test)

# Calculate the MSE
from sklearn.metrics import mean_squared_error
print(mean_squared_error(y_test, y_test_pred))

15
Example 4
Finding Optimal Alpha

# Load libraries
from sklearn.linear_model import Ridge
import pandas

# Import data
df = pandas.read_csv("cereals.CSV")

# Construct variables
X = df[['Sodium']]
y = df['Rating']

# Separate the data
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.33, random_state = 9)

# Run the loops (the candidate alpha values are illustrative)
from sklearn.metrics import mean_squared_error
for alpha in [0.01, 0.1, 1, 10, 100, 1000]:
    ridge = Ridge(alpha=alpha)
    model = ridge.fit(X_train, y_train)
    y_test_pred = model.predict(X_test)
    print(alpha, mean_squared_error(y_test, y_test_pred))

16
The Idea of Ridge Regression

§ Ridge Regression adds bias to the model to reduce the variance by using the parameter λ
§ The core idea is to manipulate the slope of the regression line so that it fits the training data less and the test data more
17
LASSO Regression

§ LASSO: Least Absolute Shrinkage and Selection Operator
§ It is also a regression model that utilizes the regularization technique
§ In other words, LASSO also introduces bias to reduce variance
§ It is very similar to Ridge Regression, with one important difference

18
LASSO Regression

§ The objective of Ridge Regression is to minimize:

SSE + (λ × Slope²)

§ In LASSO, the objective is to minimize:

SSE + (λ × |Slope|)

(a toy comparison of the two penalties follows)

19
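A side-by-side sketch of the two penalties for the same λ and slope; the numbers are made up for illustration. Note that the squared penalty grows much faster for large slopes.

# Ridge vs. LASSO penalty on toy numbers (illustrative)
lam, slope, sse = 1.0, -3.0, 8.0
ridge_obj = sse + lam * slope ** 2     # SSE + λ × Slope²  = 17.0
lasso_obj = sse + lam * abs(slope)     # SSE + λ × |Slope| = 11.0
print(ridge_obj, lasso_obj)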
LASSO Regression

§ Similar to Ridge Regression, λ is a parameter of the model that ranges from zero to infinity
§ We usually use cross validation to determine the optimal value of λ (i.e., the value of λ that produces the lowest MSE/MAPE based on the test set); a sketch follows

20
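A minimal sketch using scikit-learn's LassoCV, which picks λ by cross validation; the candidate grid and cv=5 are assumptions for illustration.

# Cross-validated choice of the LASSO penalty
from sklearn.linear_model import LassoCV
import pandas

df = pandas.read_csv("cereals.CSV")
X = df[['Sodium']]
y = df['Rating']

lasso_cv = LassoCV(alphas=[0.1, 1.0, 10.0, 100.0], cv=5)
lasso_cv.fit(X, y)
print(lasso_cv.alpha_)   # the selected penalty
print(lasso_cv.coef_)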
The Role of Lambda

§ Intuitively, the parameter λ penalizes the predictor(s) according to their influence on the target variable
§ So, the penalty that λ imposes is different for different predictors
§ When λ increases, the slope (i.e., coefficient) of each predictor naturally decreases
§ This process is called "shrinking" for both Ridge Regression and LASSO

21
Ridge Regression vs. LASSO

§ In Ridge Regression, the shrinking operation can decrease the slope to be asymptotically zero (i.e., very close to zero)
§ In LASSO, the shrinking operation can decrease the slope to exactly zero
§ Therefore, LASSO generally performs better when there are many useless predictors, since it can shrink their coefficients all the way to zero
§ Ridge Regression tends to perform better when most of the predictors are useful (a sketch of this difference follows)
22
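A sketch of this key difference on synthetic data; the data-generating process and alpha values are assumptions. Only the first of five predictors actually matters, and LASSO zeroes out the rest while Ridge only shrinks them.

# Ridge shrinks useless coefficients; LASSO sets them to exactly 0
import numpy
from sklearn.linear_model import Ridge, Lasso

rng = numpy.random.RandomState(0)
X = rng.normal(size=(100, 5))
y = 3 * X[:, 0] + rng.normal(scale=0.5, size=100)   # predictors 2-5 are useless

print(Ridge(alpha=10).fit(X, y).coef_)    # small but nonzero everywhere
print(Lasso(alpha=0.5).fit(X, y).coef_)   # exact zeros on useless predictors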
Example 5
LASSO

# Load libraries
from sklearn.linear_model import Lasso
import pandas

# Import data
df = pandas.read_csv("cereals.CSV")

# Construct variables
X = df[['Sodium']]
y = df['Rating']

# Separate the data
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.33, random_state = 5)

# Run LASSO regression based on the training data (alpha=1.0 below is
# an illustrative value for the penalty parameter)
lasso = Lasso(alpha=1.0)
model = lasso.fit(X_train, y_train)

# Generate the prediction value from the test data
y_test_pred = model.predict(X_test)

# Calculate the MSE
from sklearn.metrics import mean_squared_error
print(mean_squared_error(y_test, y_test_pred))

23
Example 6
Penalty Parameter

# Load libraries
from sklearn.linear_model import Lasso
import pandas

# Import data
df = pandas.read_csv("cereals.CSV")

# Construct variables
X = df[['Sodium']]
y = df['Rating']

# Separate the data
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.33, random_state = 5)

# Run LASSO regression with a larger penalty (alpha=10 is illustrative)
lasso = Lasso(alpha=10)
model = lasso.fit(X_train, y_train)

# Print the coefficients
print(model.intercept_)
print(model.coef_)

# Generate the prediction value from the test data
y_test_pred = model.predict(X_test)

# Calculate the MSE
from sklearn.metrics import mean_squared_error
print(mean_squared_error(y_test, y_test_pred))

24
Example 7
Finding Optimal Alpha

# Load libraries
from sklearn.linear_model import Lasso
import pandas

# Import data
df = pandas.read_csv("cereals.CSV")

# Construct variables
X = df[['Sodium']]
y = df['Rating']

# Separate the data
from sklearn.model_selection import train_test_split
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size = 0.33, random_state = 5)

# Run the loops (the candidate alpha values are illustrative)
from sklearn.metrics import mean_squared_error
for alpha in [0.01, 0.1, 1, 10, 100, 1000]:
    lasso = Lasso(alpha=alpha)
    model = lasso.fit(X_train, y_train)
    y_test_pred = model.predict(X_test)
    print(alpha, mean_squared_error(y_test, y_test_pred))

25
Exercise #1

§ Use the nutrition.csv dataset
§ Use CALORIES as the target variable and all other variables as predictors
§ Construct a ridge regression model
§ Print all coefficients (including the intercept). You do not have to format the results

26
Exercise #2

§ Use the same dataset as in #1
§ Use CALORIES as the target variable and all other variables as predictors
§ Construct a LASSO model
§ Print all coefficients (including the intercept). You do not have to format the results

27
