Least Square Method Definition
The least-squares method is a key statistical technique used to find a regression line, or best-fit line, for a given pattern of data. The method is described by an equation with specific parameters and is widely used in evaluation and regression. In regression analysis it is the standard approach for approximating the solution of overdetermined systems, i.e. sets of equations with more equations than unknowns.
The method of least squares defines the solution that minimizes the sum of the squared deviations, or errors, across all equations. The sum of squared errors is the sum over all observations of (observed value - fitted value) squared, and it measures the variation of the observed data around the fit.
The least-squares method is often applied in data fitting. The best-fit result is the one that minimizes the sum of squared errors, or residuals, which are the differences between the observed or experimental values and the corresponding fitted values given by the model.
Least-squares problems fall into two categories depending on whether the residuals are linear or nonlinear in the unknowns. Linear problems arise frequently in statistical regression analysis, while nonlinear problems are usually solved by an iterative method of refinement in which the model is approximated by a linear one at each iteration.
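As an illustration, here is a minimal sketch with made-up sample data that fits a best-fit line for simple linear regression using the closed-form least-squares slope and intercept, and reports the sum of squared errors being minimized.

import numpy as np

# Hypothetical sample data (x = input, y = observed output)
x = np.array([1, 2, 3, 4, 5], dtype=float)
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8], dtype=float)

# Least-squares slope and intercept for the line y = m*x + c
m = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
c = y.mean() - m * x.mean()

# Sum of squared errors (residuals) that the method minimizes
sse = np.sum((y - (m * x + c)) ** 2)
print("slope:", m, "intercept:", c, "SSE:", sse)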
Mean Absolute Error (MAE)
MAE is a very simple metric that calculates the average absolute difference between the actual and predicted values.
To understand it better, suppose you have input data and output (actual) data, and your model produces predicted values. The MAE of your model measures the mistakes the model makes, known as errors. For each observation, take the difference between the actual value and the predicted value; its absolute value is the absolute error. To get the mean absolute error, sum all the absolute errors and divide by the total number of observations: MAE = (1/n) * sum of |actual - predicted|. A small example follows below.
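As a small illustration (the y_test and y_pred arrays here are made-up values, not from the text), MAE can be computed manually or with scikit-learn's mean_absolute_error:

import numpy as np
from sklearn.metrics import mean_absolute_error

y_test = np.array([3.0, 5.0, 2.5, 7.0])   # actual values (hypothetical)
y_pred = np.array([2.5, 5.0, 3.0, 8.0])   # predicted values (hypothetical)

# MAE = mean of |actual - predicted|
mae_manual = np.mean(np.abs(y_test - y_pred))
print("MAE (manual):", mae_manual)
print("MAE (sklearn):", mean_absolute_error(y_test, y_pred))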
Advantages of MAE
The MAE you get is in the same unit as the output variable.
Disadvantages of MAE
The graph of MAE is not differentiable everywhere (it has a kink at zero), so to use it as a loss function we have to apply optimizers that can handle this, such as sub-gradient methods.
Mean Squared Error (MSE)
MSE is one of the most used and simplest metrics, differing from mean absolute error only slightly. Mean squared error is the average of the squared differences between the actual and predicted values.
So, above we took the absolute difference, and here we take the squared difference.
What does the MSE actually represent? It represents the average squared distance between the actual and predicted values.
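A minimal sketch of the same idea with squared errors, using the same hypothetical y_test and y_pred as above:

import numpy as np
from sklearn.metrics import mean_squared_error

y_test = np.array([3.0, 5.0, 2.5, 7.0])   # actual values (hypothetical)
y_pred = np.array([2.5, 5.0, 3.0, 8.0])   # predicted values (hypothetical)

# MSE = mean of (actual - predicted)^2
mse_manual = np.mean((y_test - y_pred) ** 2)
print("MSE (manual):", mse_manual)
print("MSE (sklearn):", mean_squared_error(y_test, y_pred))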
Advantages of MSE
The graph of MSE is differentiable, so you can easily use it as a loss function.
Disadvantages of MSE
The value you get after calculating MSE is a squared unit of output.
If you have outliers in the dataset then MSE penalizes them the most, since their errors are squared.
Root Mean Squared Error (RMSE)
As the name itself makes clear, RMSE is simply the square root of the mean squared error.
Advantages of RMSE
The output value you get is in the same unit as the required output variable, which makes interpretation easier.
Disadvantages of RMSE
To compute RMSE we have to apply the NumPy square root function over the MSE:

import numpy as np
from sklearn.metrics import mean_squared_error

print("RMSE", np.sqrt(mean_squared_error(y_test, y_pred)))
Most of the time people use RMSE as an evaluation metric, and when you are working with deep learning techniques it is the most preferred metric.
R Squared (R2)
The R2 score is a metric that tells you the performance of your model rather than a loss value; it conveys in an absolute sense how well your model performed.
In contrast, MAE and MSE depend on the context of the data, as we have seen, whereas the R2 score is independent of context. So with the help of R squared we have a baseline (mean) model to compare against, which none of the other metrics provides; a similar idea of a baseline exists in classification problems.
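Concretely, R2 = 1 - (sum of squared errors of the regression line) / (sum of squared errors of the mean line). A minimal sketch with the same hypothetical y_test and y_pred as above, computing it both manually and with scikit-learn's r2_score:

import numpy as np
from sklearn.metrics import r2_score

y_test = np.array([3.0, 5.0, 2.5, 7.0])   # actual values (hypothetical)
y_pred = np.array([2.5, 5.0, 3.0, 8.0])   # predicted values (hypothetical)

ss_res = np.sum((y_test - y_pred) ** 2)          # error of the regression line
ss_tot = np.sum((y_test - y_test.mean()) ** 2)   # error of the mean line
print("R2 (manual):", 1 - ss_res / ss_tot)
print("R2 (sklearn):", r2_score(y_test, y_pred))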
Now, how do you interpret the R2 score? Suppose the R2 score is zero: then the ratio of the regression line's error to the mean line's error equals 1, so 1 - 1 = 0. In this case the regression line is no better than the mean line (the two effectively overlap), which means the model performance is at its worst; it is not capable of explaining any of the variance in the target.
The second case is when the R2 score is 1. This happens when the ratio term is zero, i.e. when the regression line makes no mistakes and fits the data perfectly.
So we can conclude that as our regression line moves towards perfection, the R2 score moves towards one.
The normal case is when the R2 score is between zero and one, for example 0.8, which means your model is capable of explaining 80 per cent of the variance in the data.
Adjusted R Squared
The disadvantage of the R2 score is that when new features are added to the data, the R2 score either increases or remains constant; it never decreases. The problem is that when we add an irrelevant feature to the dataset, the R2 score can still increase, which is misleading. Adjusted R squared corrects for this: adjusted R2 = 1 - ((1 - R2) * (n - 1) / (n - k - 1)), where n is the number of observations and k is the number of independent variables (features).
Now, as k increases when we add features, the denominator n - k - 1 decreases while n - 1 remains constant. If the added feature is irrelevant, the R2 score remains constant or increases only slightly, so the whole fraction increases, and when we subtract it from one the resulting adjusted score decreases. This is what happens when we add an irrelevant feature to the dataset.
If instead we add a relevant feature, the R2 score increases and 1 - R2 decreases heavily; even though the denominator also decreases, the whole fraction decreases, so the adjusted R2 score increases.
from sklearn.metrics import r2_score

r2 = r2_score(y_test, y_pred)   # R2 score of the model
n = 40                          # number of observations
k = 2                           # number of independent variables (features)
adj_r2_score = 1 - ((1 - r2) * (n - 1) / (n - k - 1))
print(adj_r2_score)
Hence, this metric becomes one of the most important metrics to use during the evaluation of a regression model.