Note 13 - Linear Regression
Regression Analysis
Covariance measures the joint variation of two numerical variables; it quantifies the relationship between them.
$$COV(X, Y) = \frac{1}{n}\sum_{i=1}^{n}(X_i - \bar{X})(Y_i - \bar{Y})$$
Correlation of Two Numerical Variables
Correlation is also a measure of the relationship between two numerical variables. Its value always lies between -1 and +1.
$$CORR(X, Y) = \frac{COV(X, Y)}{\sqrt{V(X)\,V(Y)}}$$
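As a quick illustration, here is a minimal Python sketch of both formulas; the data arrays are made-up values, not from these notes.

```python
# Covariance and correlation from first principles; data are made-up.
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

n = len(x)
cov_xy = np.sum((x - x.mean()) * (y - y.mean())) / n   # the 1/n covariance above
corr_xy = cov_xy / (x.std() * y.std())                 # np.std defaults to the 1/n form

print(cov_xy, corr_xy)
print(np.corrcoef(x, y)[0, 1])  # NumPy's built-in Pearson correlation agrees
```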
Correlation vs. Causation
A correlation between variables, however, does not automatically mean that the change in one variable is the
cause of the change in the values of the other variable. Causation indicates that one event is the result of the
occurrence of the other event; i.e. there is a causal relationship between the two events.
Linear Regression
Linear regression is perhaps one of the most well known and well understood algorithms in statistics and
machine learning.
Linear regression is a linear model, e.g. a model that assumes a linear relationship between the input variables
(x) and the single output variable (y). More specifically, that y can be calculated from a linear combination of
the input variables (x).
When there is a single input variable (x), the method is referred to as Simple Linear Regression. When there
are multiple input variables, literature from statistics often refers to the method as Multiple Linear Regression.
Different techniques can be used to prepare or train the linear regression equation from data, the most
common of which is called Ordinary Least Squares. It is common to therefore refer to a model prepared this
way as Ordinary Least Squares Linear Regression or just Least Squares Regression.
Simple Linear Regression
Assume that we want to examine the relationship between the height and the weight of Sri Lankan university students; our objective is to fit a model that predicts weight from height. Since the entire population cannot be accessed, a sample is taken and the model is built from it.
Simple linear regression is useful for finding the relationship between two variables: a predictor (independent) variable and a response (dependent) variable, where the response is quantitative. For example, the relationship between height and weight.
The line is fitted by minimizing the Sum of Squared Errors (SSE),

$$SSE = \sum_{i=1}^{n}(y_i - \hat{y}_i)^2,$$

which is also called the Residual Sum of Squares (RSS). Minimizing the SSE yields the following parameter estimates; the significance of these parameters can also be assessed using hypothesis testing.
$$\hat{\beta}_1 = \frac{\sum_{i=1}^{n}(x_i - \bar{x})(y_i - \bar{y})}{\sum_{i=1}^{n}(x_i - \bar{x})^2}, \qquad \hat{\beta}_0 = \bar{y} - \hat{\beta}_1\bar{x}$$
Then the estimated model is,
$$\hat{y} = \hat{\beta}_0 + \hat{\beta}_1 x$$
The core idea is to obtain the line that best fits the data: the best-fit line is the one for which the total prediction error, over all data points, is as small as possible.
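A minimal NumPy sketch of these estimators, using made-up height and weight values in place of a real student sample:

```python
# Simple linear regression fitted from the closed-form estimators; data are made-up.
import numpy as np

height = np.array([150.0, 160.0, 165.0, 170.0, 180.0])  # x, in cm
weight = np.array([50.0, 56.0, 63.0, 66.0, 74.0])       # y, in kg

x_bar, y_bar = height.mean(), weight.mean()
beta1_hat = np.sum((height - x_bar) * (weight - y_bar)) / np.sum((height - x_bar) ** 2)
beta0_hat = y_bar - beta1_hat * x_bar

y_hat = beta0_hat + beta1_hat * height  # fitted (predicted) weights
print(beta0_hat, beta1_hat)
```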
Multiple Linear Regression
For multiple input variables, the least squares estimate of the coefficient vector has the closed matrix form

$$\hat{\beta} = (X^T X)^{-1} X^T Y$$
Here $X$ is the $n \times k$ matrix of observations on the predictors $X_1, X_2, \dots, X_k$:

$$X = \begin{pmatrix} X_{11} & X_{12} & \cdots & X_{1k} \\ X_{21} & X_{22} & \cdots & X_{2k} \\ \vdots & \vdots & \ddots & \vdots \\ X_{n1} & X_{n2} & \cdots & X_{nk} \end{pmatrix}$$
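A minimal NumPy sketch of the matrix formula; the data are made-up, and the leading column of ones (for the intercept) is an assumption here, since the matrix above shows only the predictor columns:

```python
# Multiple regression coefficients via the normal equations; data are made-up.
import numpy as np

X = np.column_stack([np.ones(5),                    # intercept column (assumption)
                     [1.0, 2.0, 3.0, 4.0, 5.0],     # X1
                     [2.0, 1.0, 4.0, 3.0, 5.0]])    # X2
Y = np.array([3.1, 4.0, 7.2, 7.9, 10.8])

beta_hat = np.linalg.inv(X.T @ X) @ X.T @ Y              # (X'X)^(-1) X'Y
beta_hat_stable, *_ = np.linalg.lstsq(X, Y, rcond=None)  # numerically safer in practice
print(beta_hat, beta_hat_stable)
```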
How is this model visualized? Consider an example with two predictors and one response, where Income is the response variable and Seniority and Years of Education are the independent variables; the fitted model is then a plane over the two predictor axes. Higher dimensions cannot be visualized easily, although several advanced techniques exist for visualizing them.
Qualitative Predictors (Categorical Independent Variables)
To represent categorical variables in a linear regression model, we use dummy variables. Consider the following examples.

Ex: Gender (Female, Male)

Female → 0
Male → 1

Ex: Temperature (High, Medium, Low)

High → 1 0
Medium → 0 1
Low → 0 0

A categorical variable with $m$ levels needs only $m - 1$ dummy variables: since one level can be represented as the case where all the dummy variables are 0, that level can be removed from the coding. It is called the reference level. In the Temperature example above, Low is the reference level.

Ex: Colour (Red, Green, Yellow, Blue)

Following the same pattern, with Blue as the reference level:

Red → 1 0 0
Green → 0 1 0
Yellow → 0 0 1
Blue → 0 0 0

Now assume that a model is to be fitted with a numerical variable ($X$) and the categorical variable Colour discussed above. The model is then

$$y = \beta_0 + \beta_1 X + \beta_2 D_{Red} + \beta_3 D_{Green} + \beta_4 D_{Yellow} + \varepsilon$$

where $D_{Red}$, $D_{Green}$ and $D_{Yellow}$ are the dummy variables for Colour.
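A minimal sketch of dummy coding using pandas; the data frame and values are made-up for illustration:

```python
# Dummy coding of a categorical variable with pandas; the data are made-up.
import pandas as pd

df = pd.DataFrame({
    "X": [10.0, 12.0, 9.0, 15.0, 11.0],
    "Colour": ["Red", "Green", "Yellow", "Blue", "Red"],
})

# drop_first=True removes one level, leaving m - 1 dummy columns; pandas drops
# the first level alphabetically (Blue here), which becomes the reference level.
dummies = pd.get_dummies(df["Colour"], prefix="Colour", drop_first=True)
design = pd.concat([df[["X"]], dummies], axis=1)
print(design)
```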
To measure how well the estimated model fits the data, define the following quantities.

$$\text{Total Sum of Squares:}\quad TSS = \sum_{i=1}^{n}(y_i - \bar{y})^2$$

$$\text{Sum of Squared Errors:}\quad SSE = \sum_{i=1}^{n}(y_i - \hat{y}_i)^2$$

$$\text{Coefficient of Determination:}\quad R^2 = \frac{TSS - SSE}{TSS}$$
This $R^2$ value gives the fraction of the variation in the response that is explained by the estimated model; in simple terms, it tells how much of the variability in the data is captured by the model.
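A minimal sketch of the $R^2$ computation; the observed and fitted values are made-up for illustration:

```python
# TSS, SSE and R-squared; y and y_hat are made-up illustrative values.
import numpy as np

y = np.array([50.0, 56.0, 63.0, 66.0, 74.0])        # observed responses
y_hat = np.array([50.6, 55.8, 61.0, 66.2, 76.4])    # fitted values from some model

tss = np.sum((y - y.mean()) ** 2)
sse = np.sum((y - y_hat) ** 2)
r_squared = (tss - sse) / tss
print(r_squared)
```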
Different samples from the population yield different estimates, so these estimates are themselves random variables.
$$V(\hat{\beta}_0) = \frac{SSE}{n-2}\left(\frac{1}{n} + \frac{\bar{x}^2}{\sum_{i=1}^{n}(x_i - \bar{x})^2}\right)$$

$$V(\hat{\beta}_1) = \frac{SSE}{n-2} \cdot \frac{1}{\sum_{i=1}^{n}(x_i - \bar{x})^2}$$
Hypothesis Testing for Simple Linear Regression
To test $H_0: \beta_1 = 0$ against $H_1: \beta_1 \neq 0$, the test statistic is

$$t_{cal} = \frac{\hat{\beta}_1 - 0}{SE(\hat{\beta}_1)} \sim t_{n-2}$$

where $SE(\hat{\beta}_1) = \sqrt{V(\hat{\beta}_1)}$.
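A minimal sketch of this t-test with NumPy and SciPy, reusing the earlier made-up data; the standard error comes from the $V(\hat{\beta}_1)$ formula above:

```python
# t-test for the slope of a simple linear regression; data are made-up.
import numpy as np
from scipy import stats

x = np.array([150.0, 160.0, 165.0, 170.0, 180.0])
y = np.array([50.0, 56.0, 63.0, 66.0, 74.0])

n = len(x)
sxx = np.sum((x - x.mean()) ** 2)
beta1 = np.sum((x - x.mean()) * (y - y.mean())) / sxx
beta0 = y.mean() - beta1 * x.mean()
sse = np.sum((y - (beta0 + beta1 * x)) ** 2)

se_beta1 = np.sqrt((sse / (n - 2)) / sxx)        # SE(beta1) = sqrt(V(beta1))
t_cal = (beta1 - 0.0) / se_beta1
p_value = 2 * stats.t.sf(abs(t_cal), df=n - 2)   # two-sided p-value under t_{n-2}
print(t_cal, p_value)
```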
Hypothesis Testing for Multiple Linear Regression
The overall significance of the model is tested using

$$H_0: \beta_1 = \beta_2 = \beta_3 = \cdots = \beta_k = 0 \quad \text{vs.} \quad H_1: \text{at least one } \beta_j \neq 0$$
The test is carried out using the following ANOVA table.

Source of variation | Degrees of freedom    | Mean square              | F
--------------------|-----------------------|--------------------------|--------------------
Regression          | k                     | MSR = (TSS - SSE) / k    | F_stat = MSR / MSE
Error               | n - k - 1             | MSE = SSE / (n - k - 1)  |
Total               | n - k - 1 + k = n - 1 |                          |

Under $H_0$, $F_{stat}$ follows an $F$ distribution with $k$ and $n - k - 1$ degrees of freedom, so large values of $F_{stat}$ lead to rejecting $H_0$.
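A minimal sketch of the overall F-test using the statsmodels OLS API; the data here are simulated purely for illustration:

```python
# Overall F-test for a multiple regression; data are simulated.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
X = rng.normal(size=(30, 2))                      # k = 2 predictors
y = 1.0 + 2.0 * X[:, 0] + rng.normal(size=30)

model = sm.OLS(y, sm.add_constant(X)).fit()
print(model.fvalue, model.f_pvalue)               # F statistic and its p-value
print(model.summary())                            # ANOVA-style output, t-tests, R-squared
```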
Linear Regression Assumptions
There are five main assumptions and checks associated with a linear regression model; the diagnostic for each is listed beneath it, and a combined code sketch follows this list:
• Linearity: The relationship between X and the mean of Y is linear.
  • Checked with scatter plots and partial regression plots.
• No multicollinearity: The independent variables should not be highly correlated with one another.
  • Checked with the Variance Inflation Factor (VIF).
• Homoscedasticity: The residuals have homogeneous (constant) variance.
  • Checked with a residuals vs. fitted values plot.
• Independence: Observations are independent of each other.
  • Checked with a residuals vs. fitted values plot.
• Normality: The residuals are normally distributed.
  • Checked with the Shapiro–Wilk test.
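A minimal combined sketch of these diagnostics, assuming statsmodels, SciPy and matplotlib are available; the data are simulated purely for illustration:

```python
# Assumption checks: VIF, Shapiro-Wilk, residuals vs fitted; data are simulated.
import numpy as np
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor
from scipy.stats import shapiro
import matplotlib.pyplot as plt

rng = np.random.default_rng(1)
X = sm.add_constant(rng.normal(size=(50, 2)))     # constant + 2 predictors
y = 1.0 + X[:, 1] + rng.normal(size=50)
fit = sm.OLS(y, X).fit()

# Multicollinearity: VIF for each predictor column (skipping the constant)
print("VIF:", [variance_inflation_factor(X, i) for i in range(1, X.shape[1])])

# Normality of residuals: Shapiro-Wilk test (small p-value suggests non-normality)
print("Shapiro-Wilk:", shapiro(fit.resid))

# Homoscedasticity / independence: residuals vs fitted values plot
plt.scatter(fit.fittedvalues, fit.resid)
plt.xlabel("Fitted values"); plt.ylabel("Residuals")
plt.show()
```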
Predictions
After fitting the model, the goal is to predict the response for new values of the independent variables.
Prediction Error

The error in such a prediction has two parts: a reducible part, due to how well the estimated model approximates the true relationship, and an irreducible part, due to the random error term $\varepsilon$.
Model Evaluation – Validation Set Approach
Train the model using the training dataset, then check its accuracy using the testing dataset: the MSE of the regression model is calculated on the test data and used to evaluate the model.
$$\text{Mean Squared Error} = \frac{\sum_{i=1}^{n}(y_i - \hat{y}_i)^2}{n}$$
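A minimal sketch of the validation set approach using scikit-learn; the data are simulated purely for illustration:

```python
# Validation-set evaluation: fit on a train split, compute MSE on a test split.
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(2)
X = rng.normal(size=(100, 2))
y = 1.0 + 2.0 * X[:, 0] - 1.0 * X[:, 1] + rng.normal(size=100)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)
model = LinearRegression().fit(X_train, y_train)
print("Test MSE:", mean_squared_error(y_test, model.predict(X_test)))
```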