Assignment 6 Regression Solution PDF
Assignment 6 Regression Solution PDF
Problem 1 (Exams)
Students in a management science class have just received their grades on the midterm test. The
instructor has provided information about the midterm grades in some previous classes as well
as the final average for the same students. Some of these grades have been sampled and are as
follows:
Student 1 2 3 4 5 6 7 8 9
Midterm 98 77 88 78 96 61 66 95 69
Final average 93 78 84 72 84 64 64 95 76
(a) Develop a linear regression model that could be used to predict the final average in the
course based on the midterm grade.
(b) Predict the final average of a student who made an 83 on the midterm.
(c) Give the values of r and r2 for this model. Interpret the value of r2 in the context of this
problem.
Solution
(a)
ANOVA
df SS MS F ignificance F
Regression 1 863.3629 863.3629 40.96594 0.000367
Residual 7 147.526 21.07514
Total 8 1010.889
(a) Graph these data to see whether a linear equation might describe the relationship between
the group’s television shows and bass drum sales.
(b) Using the equations presented in this chapter, compute the SST, SSE, and SSR. Find the
regression equation for these data.
(c) Based on the available information, what is your estimate for bass drum sales if the Green
Shades performed on TV six times last month?
Solution:
(a)
10
0
0 2 4 6 8 10
Green Shades TV Appearance
(b)
ANOVA
df SS MS F ignificance F
Regression 1 22.92215 22.92215 16.87404 0.009284
Residual 5 6.792135 1.358427
Total 6 29.71429
Solution
(a) Number of days on the road: X1 = 5; Distance traveled: X2 = 300 miles
The amount he may be expected to claim is
𝑌𝑌� = 90.00 + 48.50(5) + $0.40(300) = $452.50
(b) The reimbursement request, according to the model, appears to be too high. However, this
does not mean that it is not justified. The accountant should question Thomas Williams
about his expenses to see if there are other explanations for the high cost.
(c) A number of other variables should be included, such as the type of travel (air or car),
conference fees if any, and expenses for entertainment of customers, and other
transportation (cab and limousine) expenses. In addition, the coefficient of correlation is
only 0.68 and r2 = (0.68)2 = 0.46. Thus, about 46% of the variability in the cost of the trip is
explained by this model; the other 54% is due to other factors.
Problem 4 (Salary)
The following data give the starting salary for students who recently graduated from a local
university and accepted jobs soon after graduation. The starting salary, grade-point average
(GPA), and major (business or other) are provided.
Salary 29500 38000 44800 36500 42000 31500 36200 30000
GPA 3.1 3.5 3.8 2.9 3.4 2.1 2.5 3.6
Major Other Busines Busines Other Busines Other Busines Other
s s s s
(a) Develop a linear model that could be used to predict starting salary based on GPA and major.
Comment on the effectiveness of the model.
(b) What does the model say about the starting salary for a business major compared to a
nonbusiness major --- specifically, does the business major have a significant impact on the
starting salary?
Solution
(a) The estimated prediction equation is
� = 26667 + 1780.5 × 𝐺𝐺𝐺𝐺𝐺𝐺 + 7707.3 × 𝑍𝑍,
𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆
where Z = 1 if major is business and 0 if otherwise. The adjusted r squared is 0.55 and the p-
value for the F test is 0.058, which means that the model is not significant at 5% level. Hence,
the model is not very useful for predicting the starting salary.
(b) The t test for Z is significant at 5% level and the p-value is 0.04. Thus, business major does
have a significantly different starting salary in comparison to other majors.