Multiple Regression Slides Mod-Ed
Multiple Regression Slides Mod-Ed
Regression Model
We will discuss:
◼ How to develop a multiple regression model
Three-category approach to
quantitative methods
◼ Quantitative methods has 3 distinctive categories or
functions:
◼ Descriptive statistics
◼ Analysis of differences in the same variables measured (or
Regression Models
◼ A regression model can be applied to estimate the
parameters of many relationships in economics, business,
and the social sciences. The term regression was coined by
Francis Galton (1886).
◼ regression toward the mean (or regression to the
mean), ”on average, move towards the middle”. ◼ Any time
you ask how much a change in one variable will affect
another variable, regression analysis is a potential tool
◼ Similarly, any time you wish to predict the value of one
variable given the value of another then least squares
regression is a tool to consider The Simple Linear Regression Model 3
Principles of Econometrics, 5e
Introduction (2 of 2)
◼ Variable to be predicted is called the dependent
variable or response variable
◼ Value depends on the value of the
independent variable(s)
◼ Explanatory or predictor variable
Copyrigh
t ©2015 Pearson Education, Ltd. All rights reserved. Copyright © 2018, 2015, 2012 Pearson Education, Inc. All Rights
Reserved
ˆ
i011i22ikkiY = b + b X + b X + ⋅ ⋅ ⋅
+bX
We will use soft to obtain the regression slope coefficients
and other regression summary measures.
Copyrig
ht ©2015 Pearson Education, Ltd. All rights reserved.
◼ corr Y X
◼ reg Y X1 X2
◼ predict yhat
R Square 0.52148
Adjusted R Square 0.44172
Standard Error 47.46341 Observations 15 Sales = 306.526 - 24.975(Pri ce) + 74.131(Adv
ertising)
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993
57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732
2.85478 0.01449 17.55303 130.70888
Copyright ©2015 Pearson Education, Ltd. All rights reserved.
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993
57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732
2.85478 0.01449 17.55303 130.70888
Copyright ©2015 Pearson Education, Ltd. All rights reserved.
Multiple Coefficient of
Determination
Regression Statistics
SSR 29460.0
Multiple R 0.72213
r2= = = .52148
R Square 0.52148
SST 56493.3
Adjusted R Square 0.44172 Standard Error 47.46341
explained by the variation in price
Observations 15
and advertising
52.1% of the variation in pie sales is
ANOVA df SS MS F Significance F Regression 2 29460.027 14730.013 6.53861 0.01201 Residual 12
27033.306 2252.776
Total 14 56493.333
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993
57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732
2.85478 0.01449 17.55303 130.70888
2
Adjusted r
◼ r2 never decreases when a new X variable is
added to the model
This can be a disadvantage when comparing
◼
models
◼ What is the net effect of adding a new variable?
variable is added
◼ Did the new X variable add enough
2(continued)
Adjusted r
◼ Shows the proportion of variation in Y explained
by all X variables adjusted for the number of X
variables used
⎢ ⎡⎟ ⎥
⎣ ⎠⎞ ⎦⎤
=−−
11
n −
22 r r adj ⎜ ⎛
⎝ −−nk
1 (1 )
(where n = sample size, k = number of independent variables)
◼ Penalizes excessive use of unimportant independent
variables
2
◼ Smaller than r
2
Adjusted r in Excel
Regression Statistics
44.2% of the variation in pie sales is
Multiple R 0.72213 R Square 0.52148 Adjusted R Square
explained by the variation in price
0.44172 Standard Error 47.46341 Observations 15
and advertising, taking into account
r .44172 2adj = the sample size and number of
independent variables
ANOVA df SS MS F Significance F Regression 2 29460.027 14730.013 6.53861 0.01201 Residual 12
27033.306 2252.776
Total 14 56493.333
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993
57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732
2.85478 0.01449 17.55303 130.70888
Test Statistic:
t− 0
= b j
j
STAT b
S (df = n – k – 1)
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 306.52619 114.25389 2.68285 0.01993
57.58835 555.46404 Price -24.97509 10.83213 -2.30565 0.03979 -48.57626 -1.37392 Advertising 74.13096 25.96732
2.85478 0.01449 17.55303 130.70888
ˆ
Yi = + +
Residual = observation Yi
<
x2i x1i
X2