Chapter 14 MR
Chapter 14 MR
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
where
Sales is in number of pies per week
Price is in $
Advertising is in $100’s.
b1 = -24.975: sales b2 = 74.131: sales will
will decrease, on increase, on average,
average, by 24.975 by 74.131 pies per
pies per week for week for each $100
each $1 increase in increase in
selling price, net of advertising, net of the
the effects of changes effects of changes
due to advertising due to price
Using The Equation to Make
Predictions
Predict sales for a week in which the selling
price is $5.50 and advertising is $350:
Check the
“confidence and
prediction interval
estimates” box
Statistics for Managers Using Microsoft Excel, 4e © 2004 Prentice-Hall, Inc. Chap 13-10
Predictions in PHStat
(continued)
Input values
<
Predicted Y value
Confidence interval for the
<
mean Y value, given
these X’s
<
individual Y value, given
these X’s
Coefficient of
Multiple Determination
▪ Reports the proportion of total variation in Y explained by all X
variables taken together
Multiple Coefficient of
Determination
(continued)
Regression Statistics
Multiple R 0.72213
R Square 0.52148
Adjusted R Square 0.44172
Standard Error 47.46341
52.1% of the variation in pie sales
Observations 15 is explained by the variation in
price and advertising
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
<
ei = (Yi – Yi)
<
Yi
x2i
X2
<
x1i The best fit equation, Y ,
is found by minimizing the
X1 sum of squared errors, Σe2
Multiple Regression Assumptions
<
ei = (Yi – Yi)
Assumptions:
▪ The errors are normally distributed
▪ Errors have a constant variance
▪ The model errors are independent
Is the Model Significant?
▪ F-Test for Overall Significance of the Model
▪ Shows if there is a linear relationship between all
of the X variables considered together and Y
▪ Use F test statistic
▪ Hypotheses:
H0: β1 = β2 = … = βk = 0 (no linear relationship)
H1: at least one βi ≠ 0 (at least one independent
variable affects Y)
F-Test for Overall Significance
▪ Test statistic:
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
Test Statistic:
(df = n – k – 1)
Are Individual Variables
Significant?
(continued)
Regression Statistics
t-value for Price is t = -2.306, with
Multiple R 0.72213
p-value .0398
R Square 0.52148
Adjusted R Square 0.44172
Standard Error 47.46341 t-value for Advertising is t = 2.855,
Observations 15 with p-value .0145
ANOVA df SS MS F Significance F
Regression 2 29460.027 14730.013 6.53861 0.01201
Residual 12 27033.306 2252.776
Total 14 56493.333
where t has
(n – k – 1) d.f.
Let:
Y = pie sales
X1 = price
X2 = holiday (X2 = 1 if a holiday occurred during the week)
(X2 = 0 if there was no holiday that week)
Dummy-Variable Example
(with 2 Levels)
(continued)
Holiday
No Holiday
Different Same
intercept slope
Y (sales)
If H0: β2 = 0 is
b0 + b 2 rejected, then
Holi
day
b0 (X = “Holiday” has a
No H 2 1)
olida significant effect
y (X on pie sales
2 = 0)
X1 (Price)
Interpreting the Dummy Variable
Coefficient (with 2 Levels)
Example:
Y = house price
X1 = square feet
X2 = 1 if ranch, 0 otherwise
X3 = 1 if split level, 0 otherwise
For a condo: X2 = X3 = 0
With the same square feet, a
ranch will have an estimated
average price of 23.53
For a ranch: X2 = 1; X3 = 0 thousand dollars more than a
condo