Revision Questions On Regression
Revision Questions On Regression
MULTIPLE-CHOICE
1 When using a least squares line to model a relationship displayed in a
scatter plot, one key assumption is that:
A there are two variables
B the variables are related
C the variables are linearly related
D r 2 > 0.5
E the correlation coefficient is positive
7 Using a least squares regression line, the predicted value of a data value is
78.6. The residual value is −5.4.The actual data value is:
A 73.2
B 84.0
C 88.6
D 94.6
E 424.4
8 The equation of the least squares line plotted on the scatter plot opposite is
closest to:
A y = 8.7 − 0.9x
B y = 8.7 + 0.9x
C y = 0.9 − 8.7x
D y = 0.9 − 8.7x
E y = 8.7 − 0.1x
9 Which of the following statements that relate to the regression line are
false?
A The slope of the regression line is 0.95.
B The independent variable in the regression equation is height.
C The least squares line does not pass through the origin.
D The intercept is 96.
E The equation predicts that a person who is 180 cm tall will weigh 75 kg.
11 Noting that the value of the correlation coefficient is r = 0.79, we can say
that:
A 62% of the variation in weight can be explained by the variation in height
B 79% of the variation in weight can be explained by the variation in height
C 88% of the variation in weight can be explained by the variation in height
D 79% of the variation in height can be explained by the variation in weight
E 95% of the variation in height can be explained by the variation in weight
13 If a three median line is fitted to the scatter plot shown, then its slope is
closest to:
A 0.2
B 0.4
C 0.6
D 0.8
E 1.03
EXTENDED RESPONSE
1 In an investigation of the relationship between the hours of sunshine (per
year) and days of rain (per year) for 25 cities, the least squares regression
line was found to be:
Hours of sunshine = 2847 − 6.88 × Days of rain, with r2 = 0.4838
Use this information to complete the following sentences.
a In this regression equation, the independent variable is ______ .
b The slope is ______ and the intercept is ________ .
c The regression equation predicts that a city that has 120 days of rain per
year will have ________ hours of sunshine per year.
d The slope of the regression line predicts that the hours of sunshine per year
will _________by ___________ hours for each additional day of rain.
e r = ______ , correct to three decimal places.
f _______% of the variation in sunshine hours can be explained by the
variation in_________ .
g One of the cities used to determine the regression equation had 142 days of
rain and 1390 hours of sunshine.
i The regression equation predicts its hours of sunshine to be ______ hours.
ii The residual value for this city is _________ hours.
h Using a regression line to make predictions within the range of data used
to determine the regression equation is called .
2 We wish to find the equation of the least squares regression line that will
enable height (in cm) to be predicted from femur (thigh bone) length
(in cm).
a Which is the DV and which is the IV?
b Use the following summary statistics to determine the equation of the least
squares regression line that will enable height (y) to be predicted from femur
length (x).
4 Can the weight of a mouse’s heart be reliably predicted from its body
weight? The body weights (in g) and the heart weights (in mg) of a random
sample of 12 laboratory mice are shown below.
Extended-response questions
1 a Days of rain
b −6.88, 2847
c 2021
d decrease, 6.88
e −0.696
f 48.4, days of rain
g i 1870 ii −480
h interpolation Male rate