CH 16
CH 16
JOHN S. LOUCKS
St. Edward’s University
x
E ( y )
We can transform this nonlinear
0 1 model to a linear
model by taking the logarithm of both sides.
F Test
To test whether the addition of x2 to a
model involving x1 (or the deletion of x2 from a
model involving x1and x2) is statistically
(SSE(reduced)-SSE(full))/ number of extra terms
significant
F
MSE(full)
Note: Rows 2-9 are hidden and rows 18-26 not shown.
A B C D E F
36
37 ANOVA
38 df SS MS F Significance F
39 Regression 4 177260 44315 21.06027 6.1385E-07
40 Residual 20 42083.98 2104.199
41 Total 24 219344
42
A B C D E
42
43 Coeffic. Std. Err. t Stat P-value
44 Intercept -59.416 54.6072 -1.0881 0.28951
45 House Size 6.50587 3.24687 2.0037 0.05883
46 Bedrooms 29.1013 26.2148 1.1101 0.28012
47 Bathrooms 26.4004 18.8077 1.4037 0.17574
48 Cars -10.803 27.329 -0.3953 0.6968
49
A B C D E F
36
37 ANOVA
38 df SS MS F Significance F
39 Regression 4 177260 44315 21.06027 6.1385E-07
40 Residual 20 42083.98 2104.199
41 Total 24 219344
42
A B C D E
42
43 Coeffic. Std. Err. t Stat P-value
44 Intercept -47.342 44.3467 -1.0675 0.29785
45 House Size 6.02021 2.94446 2.0446 0.05363
46 Bedrooms 23.0353 20.8229 1.1062 0.28113
47 Bathrooms 27.0286 18.3601 1.4721 0.15581
48
49
A B C D E F
36
37 ANOVA
38 df SS MS F Significance F
39 Regression 2 174459.6 87229.79 42.7555 2.63432E-08
40 Residual 22 44884.42 2040.201
41 Total 24 219344
42
A B C D E
42
43 Coeffic. Std. Err. t Stat P-value
44 Intercept -12.349 31.2392 -0.3953 0.69642
45 House Size 7.94652 2.38644 3.3299 0.00304
46 Bathrooms 30.3444 18.2056 1.6668 0.10974
47
48
49
A B C D E F
36
37 ANOVA
38 df SS MS F Significance F
39 Regression 1 168791.7 168791.7 76.79599 8.67454E-09
40 Residual 23 50552.25 2197.924
41 Total 24 219344
42
A B C D E
42
43 Coeffic. Std. Err. t Stat P-value
44 Intercept -9.8669 32.3874 -0.3047 0.76337
45 House Size 11.3383 1.29384 8.7633 8.7E-09
46
47
48
49
Best-Subsets Regression
• The three preceding procedures are one-
variable-at-a-time methods offering no
guarantee that the best model for a given
number of variables will be found.
• Some statistical software packages include
best-subsets regression that enables the user to
find, given a specified number of independent
variables, the best regression model.
• Typical output identifies the two best one-
variable estimated regression equations, the
two best two-variable regression equations, and
so on.
Sample Data
Analysis of Variance
SOURCE DF SS MS
F P
Regression 4 3.79469 .94867
13.10 .000
Error 20 1.44865 .07243
Total 24 5.24334