0% found this document useful (0 votes)
21 views15 pages

MTH613 Lecture 4 - Topic 5

The document discusses multiple linear regression. It defines multiple linear regression as using two or more explanatory variables to predict an outcome, unlike simple linear regression which uses only one. An example of using age and mileage to predict car price is provided. The document also covers estimating parameter values, model validation using F-statistics and ANOVA tables, and using the regression model to make predictions.

Uploaded by

MAYLYN SINAVA
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views15 pages

MTH613 Lecture 4 - Topic 5

The document discusses multiple linear regression. It defines multiple linear regression as using two or more explanatory variables to predict an outcome, unlike simple linear regression which uses only one. An example of using age and mileage to predict car price is provided. The document also covers estimating parameter values, model validation using F-statistics and ANOVA tables, and using the regression model to make predictions.

Uploaded by

MAYLYN SINAVA
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

OBJECTIVES

By the end of this topic, we should learn:


MULTIPLE  Basics about multiple linear regression
LINEAR  Interpret the estimated parameter values
 Use the model for predictions
REGRESSION

Prepared & Compiled by Brian. Q. Inomea


SIMPLE LINEAR REGRESSION

 When we have only one explanatory variable in our model, it is


called linear regression.

MULTIPLE
LINEAR
REGRESSION  When we have two or more explanatory variables in our model,
it is called linear regression.

Prepared & Compiled by Brian. Q. Inomea


MULTIPLE LINEAR REGRESSION

EXAMPLE
 Let’s say that we would like to predict the price of a car, and use
age and mileage as explanatory variables. Since we use more
MULTIPLE than one explanatory variable, it is called multiple linear
LINEAR regression.

REGRESSION

Prepared & Compiled by Brian. Q. Inomea


STANDARD ERROR OF ESTIMATE

In multiple regression, the standard error of estimate is defined as


follows:

MULTIPLE
LINEAR
REGRESSION

Prepared & Compiled by Brian. Q. Inomea


COEFFICIENT OF DETERMINATION
Coefficient of Determination is defined as follows:

Notice that Excel and Minitab print a second R2 statistic, called the coefficient of
determination adjusted for degrees of freedom, which has been adjusted to take into
account the sample size and the number of independent variables.
MULTIPLE The rationale for this statistic is that, if the number of independent variables k is large
relative to the sample size n, the unadjusted R2 value may be unrealistically high.
LINEAR To understand this point, consider what would happen if the sample size is 2 in a
simple linear regression model.
REGRESSION The line would fit the data perfectly resulting in R2 = 1 when, in fact, there may be no
linear relationship.
To avoid creating a false impression, the adjusted R2 is often calculated. Its formula is
as follows:

Prepared & Compiled by Brian. Q. Inomea


TESTING THE VALIDITY OF THE MODEL
To test the validity of the regression model, we specify the following hypotheses:

 If the null hypothesis is true, none of the independent variables x1, x2, . . . , xk is
linearly related to y, and therefore the model is invalid. If at least one βi is not equal
to 0, the model does have some validity.
MULTIPLE  The total variation in the dependent variable [measured by a ( yi − y)2] can be
decomposed into two parts: the explained variation (measured by SSR) and the
LINEAR unexplained variation (measured by SSE). That is:
Total variation in y = SSR + SSE
REGRESSION
 If SSR is large relative to SSE, the coefficient of determination will be high—
signifying a good model.
 On the other hand, if SSE is large, most of the variation will be unexplained, which
indicates that the model provides a poor fit and consequently has little validity.
 To judge whether SSR is large enough relative to SSE to allow us to infer that at least
one coefficient is not equal to 0, we compute the ratio of the two mean squares.
 The calculation of the test statistic is summarized in an analysis of variance (ANOVA)
Prepared & Compiled by Brian. Q. Inomea table in excel.
TESTING THE VALIDITY OF THE MODEL

Analysis of Variance Table for Regression Analysis

MULTIPLE
Sample calculation of the test statistic summarized in an analysis of
LINEAR variance (ANOVA) table in excel.
REGRESSION

Prepared & Compiled by Brian. Q. Inomea


TESTING THE VALIDITY OF THE MODEL
F-Statistic
A large value of F indicates that most of the variation in y is explained by the regression
equation and that the model is valid.
A small value of F indicates that most of the variation in y is unexplained.
The rejection region allows us to determine whether F is large enough to justify rejecting
the null hypothesis.

MULTIPLE
LINEAR
REGRESSION

Prepared & Compiled by Brian. Q. Inomea


TESTING THE VALIDITY OF THE MODEL
The standard error

Coefficient of determination

MULTIPLE
LINEAR
The relationship among sε, R2, and F is summarized in Table below
REGRESSION

Prepared & Compiled by Brian. Q. Inomea


TESTING THE COEFFICIENTS

MULTIPLE
LINEAR
REGRESSION

Prepared & Compiled by Brian. Q. Inomea


MULTIPLE LINEAR REGRESSION

EXAMPLE
 Using multiple linear regression, create a model from the data
given such that both age and mileage are included as
explanatory variables.

MULTIPLE
LINEAR
REGRESSION

Using Multiple linear regression, predict the price of a car that is


two years old and that has been driven 50 thousands miles.
Prepared & Compiled by Brian. Q. Inomea
MULTIPLE LINEAR REGRESSION

EXAMPLE
 Solution
Using excel to generate the regression analysis
MULTIPLE
LINEAR
REGRESSION

Prepared & Compiled by Brian. Q. Inomea


MULTIPLE LINEAR REGRESSION

EXAMPLE
 Solution
Using excel to generate the regression analysis

MULTIPLE
LINEAR
REGRESSION

Prepared & Compiled by Brian. Q. Inomea


MULTIPLE LINEAR REGRESSION

EXAMPLE
 Solution
We are asked to predict the price of a car that is two years old
and that has been driven 50 thousands miles.

MULTIPLE
LINEAR
REGRESSION
From the analysis report;
Intercept = 32.45
Age = -1.54
Mileage = -0.15
Standard error = 0.79
Prepared & Compiled by Brian. Q. Inomea
MULTIPLE LINEAR REGRESSION

EXAMPLE
 Solution
Now our least square equation for this problem is as follows,

MULTIPLE Y = 32.45 – 1.54X1 – 0.15X2 + 0.79

LINEAR
Inserting the known variables into the equation to find the
REGRESSION corresponding price. Given the car is two years old and that has
been driven 50 thousands miles.
Y = 32.45 – 1.54(2) – 0.15(50) + 0.79
Y=?

Prepared & Compiled by Brian. Q. Inomea

You might also like