Yy 1 Xy
Yy 1 Xy
I3–TD6
( Linear Regression Models)
2. The following are midterm and final examination test scores for 10 students from a
calculus class, where x denotes the midterm score and y denotes the final score for
each student.
x 68 87 75 91 82 77 86 82 75 79
y 74 79 80 93 88 79 97 95 89 92
3. (a) Show that the least-squares estimates of β0 and β1 of a line can be expressed as
Pn
(x − x̄)(yi − ȳ)
ˆ Pn i
β0 = ȳ − β̂1 x̄, β̂1 = i=1 2
.
i=1 (xi − x̄)
(b) Using part (a), show that the line fitted by the method of least squares passes
through the point (x, y).
4. Show that the mle’s of β0 and β1 are indeed the least squares estimates. [Hint: The
pdf of Yi is normal with mean µi = β0 + β1 xi and variance σ 2 ; the likelihood is the
product of the n pdf’s.]
5. A farmer collected the following data, which show crop yields for various amounts of
fertilizer used.
6. The accompanying data table gives observations on total acidity of coal samples of
three different types, with determinations made using three different concentrations of
ethanolic NaOH (“Chemistry of Brown Coals,” Australian J. Applied Science, 1958:
375-379).
x 38 26 48 22 40 15 30 33
y 10 11 16 8 12 5 10 11
(b) Plot the points and graph the line as a check on your calculations.
(c) Calculate the 95% confidence intervals for β0 and β1 , respectively.
7. Show that Y and βˆ1 are independent, under the usual assumptions of a simple linear
regression model.
8. The following data represent survival time in days after a heart transplant and patient
age in years at the time of transplant for 10 randomly selected patients.
Age at transplant 28 41 46 53 39 36 47 29 48 44
Survival time, in days 7 278 44 48 406 382 1995 176 323 1846
9. The following are midterm and final examination test scores for 10 calculus students,
where x denotes the midterm score and y denotes the final score for each student.
x 68 87 75 91 82 77 86 82 75 79
y 74 89 80 93 88 79 97 95 89 92
10. The following data give the annual incomes (in thousands of dollars) and amounts (in
thousands of dollars) of life insurance policies for eight persons.
Annual income 42 58 27 36 70 24 53 37
Life insurance 150 175 25 75 250 50 250 100
11. The Turbine Oil Oxidation Test (TOST) and the Rotating Bomb Oxidation Test
(RBOT) are two different procedures for evaluating the oxidation stability of steam
turbine oils. The article “Dependence of Oxidation Stability of Steam Turbine Oil
on Base Oil Composition” (J. Soc. Tribologists Lubricat. Engrs., Oct. 1997: 19-24)
reported the accompanying observations on x = TOST time (hr) and y = RBOT time
(min) for 12 oil specimens.
e. Carry out a test of hypotheses to decide whether RBOT time and TOST time are
linearly related.
12. Verify that the t ratio for testing H0 : β1 = 0 is identical to t for testing H0 : ρ = 0.
13. The following are midterm and final examination test scores for 10 calculus students,
where x denotes the midterm score and y denotes the final score for each student.
x 68 87 75 91 82 77 86 82 75 79
y 74 89 80 93 88 79 97 95 89 92
14. The following data give the annual incomes (in thousands of dollars) and amounts (in
thousands of dollars) of life insurance policies for eight persons.
Annual income 42 58 27 36 70 24 53 37
Life insurance 150 175 25 75 250 50 250 100
(a) At the 98% confidence level, test whether annual income and the amount of life
insurance policies are independent.
(b) Find the attained significance level.
(c) State any assumptions you have made in solving the problem.
15. A new drug is tested for serum cholesterol-lowering properties on six randomly selected
volunteers. The serum cholesterol values are given in the following table.
X1 X2 y
3 1 4
2 5 3
3 3 6
1 2 5
17. The following is a random sample of height (in inches) and weight (in pounds) of seven
basketball players.
Height 73 83 77 80 85 71 80
Weight 186 234 208 237 265 190 220
Calculate the least-squares regression line for these data using matrix operations.
x1 x2 y
-1 -1 1
-1 1 1
1 -1 0
1 1 4