Chapter 14 Solutions
Chapter 14 Solutions
Chapter 14 Solutions
14-1
CHAPTER 14Multiple Regression and Model Building
14.1 SSE
14.2 Insert x values into the least squares equation and solve for
y .
14.3 a. b
0
= 29.347, b
l
= 5.6128, b
2
= 3.8344
b
0
= meaningless
b
1
= 5.6128 implies that we estimate that mean sales price increases by $5,612.80 for
each increase of 100 square feet in house size, when the niceness rating stays constant.
b
2
= 3.8344 implies that we estimate that mean sales price increases by $3,834.40 for
each increase in niceness rating of 1, when the square footage remains constant.
b. 172.28. From y
= 8.41099, y
\
|
+
|
.
|
\
|
=
|
|
.
|
\
|
+
|
.
|
\
|
=
k n
n
n
k
R R
87 . 350
7 / 6 . 73
2 / 7374
)) 1 2 ( 10 /( 6 . 73
2 / 7374
)) 1 ( /( ) variation d Unexplaine (
/ ) variation Explained (
= =
+
=
+
=
k n
k
8818 .
) 1 3 ( 30
1 30
1 30
3
894 .
) 1 (
1
1
2 2
=
|
|
.
|
\
|
+
|
.
|
\
|
=
|
|
.
|
\
|
+
|
.
|
\
|
=
k n
n
n
k
R R
80 . 72
)) 1 3 ( 30 /( 4318 . 1
3 / 0268 . 12
)) 1 ( /( ) variation d Unexplaine (
/ ) variation Explained (
=
+
=
+
=
k n
k
2
R
2
R
Chapter 14 - Multiple Regression and Model Building
14-3
(4) F(model)
(5) Based on 3 and 26 degrees of freedom, F
.05
= 2.98. Since F(model) = 72.80 > 2.98, we
reject 0 :
3 2 1 0
= = = | | | H by setting . 05 . = o
(6) Based on 3 and 26 degrees of freedom, F
.01
= 4.64. Since F(model) = 72.80 > 4.64, we
reject 0 :
3 2 1 0
= = = | | | H by setting . 01 . = o
(7) p-value is less than .001. Since this p-value is less than o = .10, .05, .01, and .001, we
have extremely strong evidence that 0 :
3 2 1 0
= = = | | | H is false. That is, we have
extremely strong evidence that at least one of x
1
, x
2
, x
3
is significantly related to y.
14.11 (1) SSE = 1,798,712.2179
s
2
=
) 1 k ( n
SSE
+
=
) 1 3 ( 16
2179 . 712 , 798 , 1
+
= 149892.68483
s = 68483 . 149892 = 387.15977
(2) total variation = 464,126,601.6
unexplained = 1,798,712.2179
explained = 462,327,889.39
(3) R
2
=
6 . 601 , 126 , 464
39 . 889 , 327 , 462
= .9961
9952 .
) 1 3 ( 16
1 16
1 16
3
9961 .
) 1 k ( n
1 n
1 n
k
R R
2 2
=
|
|
.
|
\
|
+
|
.
|
\
|
=
|
|
.
|
\
|
+
|
.
|
\
|
=
2
R and close to each other and 1.
(4) F =
)) 1 k ( n ( / ) variation d Unexplaine (
k / ) variation Explained (
+
=
)) 1 3 ( 16 ( / 2179 . 712 , 798 , 1
3 / 39 . 889 , 327 , 462
+
=
1028.131
2
R
Chapter 14 - Multiple Regression and Model Building
14-4
(5) Based on 3 and 12 degrees of freedom, F
.05
= 3.49
F = 1028.131 > F
.05
= 3.49. Reject H
0
: 0
3 2 1
= = = | | | at o = .05
(6) Based on 3 and 12 degrees of freedom, F
.01
= 5.95
F = 1028.131 > F
.01
= 5.95. Reject H
0
: 0
3 2 1
= = = | | | at o = .01
(7) p-value = .0001. Reject H
0
at o = .05, .01, and .001.
14.12
j
x
significantly related to y with strong (a) or very strong (b) evidence
14.13 Explanations will vary.
14.14 We first consider the intercept
0
|
(1) b
0
= 29.347,
0
b
s = 4.891, t = 6.00
where t = b
0
/
0
b
s = 29.347/4.891 = 6.00
(2) We reject H
0
:
0
| = 0 (and conclude that the intercept is significant) with o = .05
if t > t
.05/2
= t
.025
Since t
.025
= 2.365 (with n (k + 1) = 10 (2 + 1) = 7 degrees of freedom), we have t
= 6.00 > t
.025
= 2.365.
We reject H
0
:
0
| = 0 with o = .05 and conclude that the intercept is significant at the
.05 level.
(3) We reject H
0
:
0
| = 0 with o = .01 if t > t
.01/2
= t
.005
Since t
.005
= 3.499 (with 7 degrees of freedom), we have t = 6.00 > t
.005
= 3.499.
We reject H
0
:
0
| = 0 with o = .01 and conclude that the intercept is significant at the
.01 level.
(4) The Minitab output tells us that the p-value for testing H
0
:
0
| = 0 is 0.000. Since this
p-value is less than each given value of o, we reject H
0
:
0
| = 0 at each of these
values of o. We can conclude that the intercept
0
| is significant at the .10, .05, .01,
and .001 levels of significance.
(5) A 95% confidence interval for
0
| is
[b
0
t
o/2
0
b
s ] = [b
0
t
.025
0
b
s ]
= [29.347 2.365(4.891)]
= [17.780, 40.914]
This interval has no practical interpretation since
0
| is meaningless.
Chapter 14 - Multiple Regression and Model Building
14-5
(6) A 99% confidence interval for
0
| is
[b
0
t
.005
0
b
s ] = [29.347 3.499(4.891)]
= [12.233, 46.461]
We next consider
1
| .
(1) b
1
= 5.6128,
1
b
s = .2285, t = 24.56
where t = b
1
/
1
b
s = 5.6128/.2285 = 24.56
(2), (3), and (4):
We reject H
0
:
1
| = 0 (and conclude that the independent variable x
1
is significant) at
level of significance o if t > t
o/2
. Here t
o/2
is based on n (k + 1) = 10 3 = 7 d.f.
For o = .05, t
o/2
= t
.025
= 2.365, and for o = .01, t
o/2
= t
.005
= 3.499.
Since t = 24.56 > t
.025
= 2.365, we reject H
0
:
1
| = 0 with o = .05.
Since t = 24.56 > t
.005
= 3.499, we reject H
0
:
1
| = 0 with o = .01.
Further, the Minitab output tells us that the p-value related to testing H
0
:
1
| = 0 is
0.000. Since this p-value is less than each given value of o, we reject H
0
at each of
these values of o (.10, .05, .01, and .001).
The rejection points and p-values tell us to reject H
0
:
1
| = 0 with o = .10, o = .05, o
= .01, and o = .001. We conclude that the independent variable x
1
(home size) is
significant at the .10, .05, .01, and .001 levels of significance.
(5) and (6):
95% interval for
1
| :
[b
1
t
.025
1
b
s ] = [5.6128 2.365(.2285)]
= [5.072, 6.153]
99% interval for
1
| :
[b
1
t
.005
1
b
s ] = [5.6128 3.499(.2285)]
= [4.813, 6.412]
For instance, we are 95% confident that the mean sales price increases by between
$5072 and $6153 for each increase of 100 square feet in home size, when the rating
stays constant.
Last, we consider
2
| .
(1) b
2
= 3.8344,
2
b
s = .4332, t = 8.85
where t = b
2
/
2
b
s = 3.8344/.4332 = 8.85
(2), (3), and (4):
We reject H
0
:
2
| = 0 (and conclude that the independent variable x
2
is significant) at
level of significance o if t > t
o/2
. Here, t
o/2
is based on n (k +1) = 10 3 = 7 d.f.
For o = .05, t
o/2
= t
.025
= 2.365, and for o = .01, t
o/2
= t
.005
= 3.499.
Since t = 8.85 > t
.025
= 2.365, we reject H
0
:
2
| = 0 with o = .05.
Since t = 8.85 > t
.005
= 3.499, we reject H
0
:
2
| = 0 with o = .01.
Chapter 14 - Multiple Regression and Model Building
14-6
Further, the Minitab output tells us that the p-value related to testing H
0
:
2
| = 0 is
0.000. Since this p-value is less than each given value of o, we reject H
0
at each of
these values of o (.10, .05, .01, and .001).
The rejection points and p-values tell us to reject H
0
:
2
| = 0 with o = .10, o = .05, o
= .01, and o = .001.
We conclude that the independent variable x
2
(niceness rating) is significant at the .10,
.05, .01, and .001 levels of significance.
(5) and (6):
95% interval for
2
| :
[b
2
t
.025
2
b
s ] = [3.8344 2.365(.4332)]
= [2.810, 4.860]
99% interval for
2
| :
[b
2
t
.005
2
b
s ] = [3.8344 3.499(.4332)]
= [2.319, 5.350]
For instance, we are 95% confident that the mean sales price increases by between
$2810 and $4860 for each increase of one rating point, when the home size remains
constant.
14.15 Works like Exercise 14.14
y = c | | | | + + + +
3 3 2 2 1 1 0
x x x
n (k + 1) = 30 (3 + 1) = 26
Rejection points:
t
.025
= 2.056 t
.005
= 2.779
H
0
:
0
| = 0 t =
4450 . 2
5891 . 7
= 3.104; Reject H
0
at o = .05, o = .01
H
0
:
1
| = 0 t =
6379 .
3577 . 2
= 3.696; Reject H
0
at o = .05, not .01
H
0
:
2
| = 0 t =
2954 .
6122 . 1
= 5.459; Reject H
0
at o = .05, o = .01
H
0
:
3
| = 0 t =
1259 .
5012 .
= 3.981; Reject H
0
at o = .05, o = .01
p-value for testing H
0
:
1
| = 0 is .001; Reject H
0
at o = .01
H
0
:
2
| = 0 is less than .001; Reject H
0
at o = .001
H
0
:
3
| = 0 is .0005; Reject H
0
at o = .001
95% C.I.: [b
j
2.056
j
b
s ]
99% C.I.: [b
j
2.779
j
b
s ]
Chapter 14 - Multiple Regression and Model Building
14-7
14.16 Works like Exercise 14.14
y = c | | | | + + + +
3 3 2 2 1 1 0
x x x
n (k + 1) = 16 (3 + 1) = 12
Rejection points:
t
.025
= 2.179 t
.005
= 3.055
H
0
:
0
| = 0 t =
1819 . 504
8020 . 1946
= 3.861; Reject H
0
at o = .05, o = .01
H
0
:
1
| = 0 t =
0130 .
0386 .
= 2.958; Reject H
0
at o = .05, not .01
H
0
:
2
| = 0 t =
0676 .
0394 . 1
= 15.386; Reject H
0
at o = .05, o = .01
H
0
:
3
| = 0 t =
5983 . 98
7578 . 413
= 4.196; Reject H
0
at o = .05, o = .01
p-value for testing H
0
:
1
| = 0 is .0120; Reject H
0
at o = .05
H
0
:
2
| = 0 is .0001; Reject H
0
at o = .001
H
0
:
3
| = 0 is .0012; Reject H
0
at o = .01
95% C.I.: [b
j
2.179
j
b
s ]
99% C.I.: [b
j
3.055
j
b
s ]
14.17 You can be x% confident that a confidence interval contains the true average value of y
given particular values of the xs while you can be x% confident that a prediction interval
contains an individual value of y given particular values of the xs.
14.18 The midpoint of both the confidence interval and the prediction interval is the value of y-
hat given by the least squares model.
14.19 a. Point estimate is y = 172.28 ($172,280)
95% confidence interval is [168.56, 175.99]
b. Point prediction is y = 172.28
95% prediction interval is [163.76, 180.80]
c. Stdev Fit = value Distance s = 1.57
This implies that Distance value = (1.57 / s)
2
= (1.57 / 3.242)
2
= 0.2345
The 99% confidence interval for mean sales price is
[ y t
.005
value Distance s ] with t
.005
based on 7 degrees of freedom
Chapter 14 - Multiple Regression and Model Building
14-8
= [172.28 3.499(1.57)]
= [172.28 5.49]
= [166.79, 177.77]
The 99% prediction interval for an individual sales price is
[ y t
.005
value Distance 1 s + ]
= [172.28 3.499(3.242) 2345 . 0 1+ ]
= [172.28 12.60]
= [159.68, 184.88]
14.20 a. 95% PI | | 90255 . 8 , 91876 . 7
890,255 bottles
791,876($3.70) = $2,929,941.20
b. 99% PI; t
.005
= 2.779
[8.41065 2.779(.235) 04 . 1+ ] = [7.74465,9.07665]
14.21 y = 17207.31 is above the upper limit of the interval [14906.2, 16886.3]; this y-value is
unusually high.
14.22 An independent variable that is measured on a categorical scale.
14.23 There is one dummy variable for each value a categorical variable can take on. You use m-
1 dummy variables to model a categorical variable that can take on m values.
14.24 The difference in the average of the dependent variable when the dummy variable equals 1
and when the dummy variable equals 0.
14.25 a. Different y-intercept for the different types of firms.
b.
2
| equals the difference between the mean innovation adoption times of stock
companies and mutual companies.
c. p-value is less than .001; Reject H
0
at both levels of o.
2
| is significant at both levels of o.
95% CI for
2
| : [4.9770, 11.1339]; 95% confident that for any size of insurance firm,
the mean speed at which an insurance innovation is adopted is between 4.9770 and
11.1339 months faster if the firm is a stock company rather than a mutual company.
d. No interaction
14.26 a. The pool coefficient is $25,862.30. Since the cost of the pool is $35,000 you expect to
recoup $25,862.3 / $35,000 = 74%.
Chapter 14 - Multiple Regression and Model Building
14-9
b. There is not an interaction between pool and any other independent variable.
14.27 a.
B
=
B T M B
| | | | = + + ) 0 ( ) 0 (
M
=
M B T M B
| | | | | + = + + ) 0 ( ) 1 (
T
=
T B T M B
| | | | | + = + + ) 1 ( ) 0 (
b. F = 184.57, p-value < .001:
Reject H
0
; conclude there are differences.
c.
95% C.I. for
M
: ( ) | | 454 . 24 , 346 . 18 ] 433 . 1 131 . 2 4 . 21 [ =
95% C.I. for
T
: | | | | 246 . 1 , 354 . 7 433 . 1 ) 131 . 2 ( 300 . 4 =
d. 77.20, [75.040, 79.360], [71.486, 82.914]
e. ] 754 . 28 , 646 . 22 [ )] 433 . 1 ( 131 . 2 700 . 25 [ =
p-value . t significan is , 001 .
T m
<
14.28 a. The point estimate of the effect on the mean of campaign B compared to campaign A
is b
4
= 0.2695.
The 95% confidence interval = [0.1262, 0.4128]
The point estimate of the effect on mean of campaign C compared to campaign A is b
5
= 0.4396.
The 95% confidence interval = [0.2944, 0.5847]
Campaign C is probably most effective even though intervals overlap.
b. y
20
= b
0
+ b
1
(20) +b
Q2
(0) + b
Q3
(0) + b
Q4
(1) = 8.75 + 0.5(20) + 4.5 = 23.250
14.42 ln y
133
= 4.69618 + .0103075 (133) + .01903 = 6.086108
Point estimate: e
6.0861
= 439.7
95% prediction interval: [e
5.96593
, e
6.20627
] = [389.92, 495.85]
14.43 a. b
0
= 25.7152, b
1
= 4.9762, b
2
= 1.01905, y