
UNIVERSITY OF ECONOMICS HO CHI MINH CITY

INTRODUCTION TO DATA SCIENCE AND APPLICATIONS

2023

Instructor: TRAN THI TUAN ANH


4. REGRESSION IN SUPERVISED LEARNING



[Figure: "What is regression?", an introductory illustration (Source: Internet)]



What is regression?
In machine learning, regression refers to the problem of learning the
relationship between some (qualitative or quantitative) input variables
x = [x_1, x_2, ..., x_p] and a quantitative output variable y.
Model:

    y = f(x_1, x_2, \ldots, x_p) + u

where
u: a noise/error term which describes everything that cannot be
captured by the model.
Types of regression:
Linear regression
Nonlinear regression

Linear regression

    y = \underbrace{\beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_k x_k}_{f(x_1, x_2, \ldots, x_k)} + u

β_0, β_1, ..., β_k: parameters
The problem is how to learn the parameters β_0, β_1, ..., β_k from the
training dataset.
The linear regression model can be used for two different purposes:
Classical statistics: describe relationships
Machine learning: predict future outputs


First, learn the model from training data

Learn the unknown parameters β_0, β_1, ..., β_k from a training dataset;
that means finding values such that the model fits the data well.
How?
By OLS: Ordinary Least Squares
By LAD: Least Absolute Deviation
By MLE: Maximum Likelihood Estimation

The OLS method is the most commonly used.


Second, use the trained model to predict the outputs for new data:

    \hat{y} = \hat{\beta}_0 + \hat{\beta}_1 x_1^* + \hat{\beta}_2 x_2^* + \cdots + \hat{\beta}_k x_k^*
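As an illustration (an addition, not from the slides): a minimal NumPy sketch of the OLS step on a small synthetic dataset, assuming k = 2 inputs; the data and variable names here are hypothetical.

    import numpy as np

    # Hypothetical training data: n = 100 observations, k = 2 inputs
    rng = np.random.default_rng(0)
    X = rng.normal(size=(100, 2))
    y = 1.0 + 2.0 * X[:, 0] - 0.5 * X[:, 1] + rng.normal(scale=0.1, size=100)

    # Prepend a column of ones so the intercept beta_0 is estimated too
    X1 = np.column_stack([np.ones(len(X)), X])

    # OLS: solve min ||y - X1 @ beta||^2 (equivalent to (X'X)^{-1} X'y)
    beta_hat, *_ = np.linalg.lstsq(X1, y, rcond=None)
    print(beta_hat)  # close to [1.0, 2.0, -0.5]

    # Predict the output for a new input x* = (0.3, -1.2)
    y_hat = np.array([1.0, 0.3, -1.2]) @ beta_hat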


Some special cases:


The polynomial regression:

    y = \beta_0 + \beta_1 x + \beta_2 x^2 + \cdots + \beta_p x^p + u

Qualitative input variables

Use dummy variables (see the sketch below):
If a qualitative input variable takes only two different values, create
one dummy variable;
If a qualitative input variable can take m different values, create
m − 1 dummy variables.
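A brief illustrative sketch (an addition, not from the slides) of both special cases in Python; the DataFrame and column names here are hypothetical:

    import pandas as pd
    from sklearn.preprocessing import PolynomialFeatures

    # Hypothetical data: one numeric input, one qualitative input with m = 3 values
    df = pd.DataFrame({
        "x": [1.0, 2.0, 3.0, 4.0],
        "city": ["HCMC", "Hanoi", "Danang", "HCMC"],
    })

    # Polynomial regression: expand x into [x, x^2, x^3]
    poly = PolynomialFeatures(degree=3, include_bias=False)
    X_poly = poly.fit_transform(df[["x"]])

    # Qualitative input: m = 3 values -> m - 1 = 2 dummy variables
    dummies = pd.get_dummies(df["city"], drop_first=True)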


The problem of overfitting and regularization


Overfit regression models have too many parameters for the number
of observations.
An overfit model can cause the regression coefficients, p-values, and
R-squared to be misleading.

A useful approach to handle overfitting is regularization.

[Figure: the problem of overfitting, illustrated (Source: Internet)]

Regularization
"Regularization" in regression is a way to give a penalty for each
parameter included into the model;
In regularized regression, the magnitude (size) of coefficients, as well
as the magnitude of the error term, are penalized.
Complex models are discouraged, that help to avoid overfitting.
Two most common types of Regularized Regression are:
Ridge regression
Lasso regression



OLS:

    Loss = \sum_{i=1}^{n} \Big( y_i - \beta_0 - \sum_{j=1}^{k} \beta_j x_{ji} \Big)^2 \to \min

Ridge regression:

    Loss = \sum_{i=1}^{n} \Big( y_i - \beta_0 - \sum_{j=1}^{k} \beta_j x_{ji} \Big)^2 + \lambda \sum_{j=1}^{k} \beta_j^2 \to \min

Lasso regression:

    Loss = \sum_{i=1}^{n} \Big( y_i - \beta_0 - \sum_{j=1}^{k} \beta_j x_{ji} \Big)^2 + \lambda \sum_{j=1}^{k} |\beta_j| \to \min

where λ is the tuning parameter.
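As a side note (an addition, not from the slides): without an intercept and with standardized inputs, ridge regression has the closed-form solution β̂ = (XᵀX + λI)⁻¹Xᵀy; lasso has no closed form and is usually solved by coordinate descent, as scikit-learn does. A minimal NumPy sketch of the ridge solution:

    import numpy as np

    def ridge_fit(X, y, lam):
        """Closed-form ridge estimate (no intercept; X assumed standardized)."""
        k = X.shape[1]
        return np.linalg.solve(X.T @ X + lam * np.eye(k), X.T @ y)

    # lam = 0 reduces to OLS; a large lam shrinks all coefficients toward 0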



LASSO vs Ridge:

[Figure: comparison of LASSO and Ridge regression]



Note:
The tuning parameter λ controls the strength of the penalty term.
When λ = 0, Ridge/Lasso regression equals least squares regression;
As λ → ∞, all parameters are shrunk toward 0;
The ideal penalty therefore lies somewhere between 0 and ∞, and in
practice is chosen by cross-validation (see the sketch below).
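A brief sketch of choosing λ by cross-validation (an addition, not from the slides), using scikit-learn's built-in CV estimators and assuming X_train and y_train have already been created (as in Example 3.6 below); note that scikit-learn calls the tuning parameter alpha:

    import numpy as np
    from sklearn.linear_model import RidgeCV, LassoCV

    # A grid of candidate penalties between (nearly) 0 and large values
    alphas = np.logspace(-4, 2, 50)

    ridge_cv = RidgeCV(alphas=alphas).fit(X_train, y_train)
    lasso_cv = LassoCV(alphas=alphas, cv=5).fit(X_train, y_train)
    print(ridge_cv.alpha_, lasso_cv.alpha_)  # the selected penalties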


Tuning parameter:

[Figure: the tuning parameter, illustrated]



Example 3.6
+ Linear regression in Python:

    from sklearn.linear_model import LinearRegression
    lr = LinearRegression()
    lr.fit(X_train, y_train)

+ Ridge regression:

    from sklearn.linear_model import Ridge
    ridge = Ridge(alpha=0.01)
    ridge.fit(X_train, y_train)

+ Lasso regression:

    from sklearn.linear_model import Lasso
    lasso = Lasso(alpha=0.01)
    lasso.fit(X_train, y_train)
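A short usage note (an addition, not from the slides): in scikit-learn the penalty λ is exposed as the alpha parameter, and predictions for new data come from predict; X_test here is assumed to come from a train/test split:

    y_pred = ridge.predict(X_test)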

Evaluating forecast accuracy


Mean Absolute Error:

    MAE = \frac{1}{n} \sum_{i=1}^{n} |u_i|

    from sklearn.metrics import mean_absolute_error
    mean_absolute_error(y_true, y_pred)

Mean Squared Error:

    MSE = \frac{1}{n} \sum_{i=1}^{n} u_i^2

    from sklearn.metrics import mean_squared_error
    mean_squared_error(y_true, y_pred)


Evaluating forecast accuracy (cont)


Mean Absolute Percentage Error:

    MAPE = \frac{1}{n} \sum_{i=1}^{n} \left| \frac{Y_i - \hat{Y}_i}{Y_i} \right| \times 100 = \frac{1}{n} \sum_{i=1}^{n} \frac{|u_i|}{|Y_i|} \times 100

Root Mean Squared Error:

    RMSE = \sqrt{MSE} = \sqrt{ \frac{1}{n} \sum_{i=1}^{n} u_i^2 }
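For completeness (an addition, not from the slides): recent scikit-learn versions (0.24+) also provide MAPE directly; note it returns a fraction rather than a percentage, and RMSE can be computed from the MSE:

    import numpy as np
    from sklearn.metrics import mean_absolute_percentage_error, mean_squared_error

    mape = mean_absolute_percentage_error(y_true, y_pred) * 100  # as a percentage
    rmse = np.sqrt(mean_squared_error(y_true, y_pred))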


Teamwork 3:
Using the data in file regression.csv, where
Inputs: x1, x2, x3, x4
Output: y
Create a Python code file to do the following tasks:
Loading the required libraries and modules
Loading the data
Creating arrays for the input and output variables
Creating the training and test datasets
Building, predicting with, and evaluating the Ridge and Lasso regressions
Hint: similar to the Python files for the previous algorithms; a minimal skeleton is sketched below.
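A minimal skeleton sketch (an addition, not part of the assignment file), assuming regression.csv has columns x1, x2, x3, x4, and y as stated; the split ratio and random seed are arbitrary choices:

    import numpy as np
    import pandas as pd
    from sklearn.model_selection import train_test_split
    from sklearn.linear_model import Ridge, Lasso
    from sklearn.metrics import mean_squared_error

    # Load the data and build input/output arrays
    data = pd.read_csv("regression.csv")
    X = data[["x1", "x2", "x3", "x4"]].values
    y = data["y"].values

    # Training and test datasets
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.3, random_state=42)

    # Build, predict, and evaluate each regularized model
    for model in (Ridge(alpha=0.01), Lasso(alpha=0.01)):
        model.fit(X_train, y_train)
        y_pred = model.predict(X_test)
        rmse = np.sqrt(mean_squared_error(y_test, y_pred))
        print(type(model).__name__, "test RMSE:", rmse)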

THE END

THANK YOU FOR LISTENING
