Regression
Let L1 be the regression line of x on y. The equation of the line L1 can be written in
the form x = ay + b.
Let L2 be the regression line of y on x. The lines L1 and L2 pass through the same
point with coordinates (p, q).
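A least-squares regression line always passes through the mean point of its data, so the line of y on x and the line of x on y meet at (x̄, ȳ). The sketch below checks this numerically. The five data pairs and the helper name `fit_line` are made up purely for illustration and are not taken from the question.

```python
from statistics import mean

def fit_line(u, v):
    """Least-squares line v = slope * u + intercept (regression of v on u)."""
    u_bar, v_bar = mean(u), mean(v)
    s_uv = sum((ui - u_bar) * (vi - v_bar) for ui, vi in zip(u, v))
    s_uu = sum((ui - u_bar) ** 2 for ui in u)
    slope = s_uv / s_uu
    return slope, v_bar - slope * u_bar

# Hypothetical data, not from the question.
xs = [1.0, 2.0, 4.0, 5.0, 8.0]
ys = [2.0, 3.0, 3.5, 6.0, 7.5]

m2, c2 = fit_line(xs, ys)  # L2: y = m2 * x + c2
a, b = fit_line(ys, xs)    # L1: x = a * y + b

# Intersection of the two lines: substitute y from L2 into L1 and solve for x.
p = (a * c2 + b) / (1 - a * m2)
q = m2 * p + c2

print("intersection:", (p, q))
print("mean point:  ", (mean(xs), mean(ys)))
```

The denominator `1 - a * m2` is zero only when the data are perfectly correlated, in which case the two lines coincide.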
(a) State the name for this type of sampling technique. [1]
(b.i) Show that 3 students will be selected from grade 12. [3]
(b.ii) Calculate the number of students in each grade in the sample. [2]
In order to select the 3 students from grade 12, the principal lists their names in
alphabetical order and selects the 28th, 56th and 84th student on the list.
(c) State the name for this type of sampling technique. [1]
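The selection rule described above, taking every 28th name from an ordered list, can be sketched in a few lines. The roster below is a placeholder: the real names and the actual size of grade 12 are not given, so 84 students is an assumption chosen only so that the 84th position exists. The function name `every_kth` is also illustrative.

```python
def every_kth(ordered_names, k):
    """Return the names at positions k, 2k, 3k, ... (1-based) of an ordered list."""
    return ordered_names[k - 1::k]

# Hypothetical roster, already in alphabetical order (zero-padded placeholders).
roster = [f"Student {i:02d}" for i in range(1, 85)]

selected = every_kth(roster, 28)
print(selected)
```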
Once the principal has obtained the names of the 9 students in the random
sample, she surveys each student to find out how long they used social media
the previous day and measures their self-esteem using the Rosenberg scale. The
Rosenberg scale is a number between 10 and 40, where a high number
represents high self-esteem.
(a) Use this model to estimate the number of children in the park
on a day when the highest temperature is 25 °C. [2]
An ice cream vendor investigates the relationship between the total number of
children visiting the park and the number of ice creams sold, x. The following
table shows the data collected on five different days.
Total number of children (y)    81   175   202   346   360
Ice creams sold (x)             15    27    23    35    46
(b) Find an appropriate regression equation that will allow the
vendor to predict the number of ice creams sold on a day when
there are y children in the park. [3]
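Since the vendor wants to predict x from y, the appropriate line is the regression of x on y, with y as the independent variable. A minimal sketch of that calculation, using the five data pairs from the table and the standard least-squares formulas, is shown below. In an exam this would normally be done on a graphic display calculator; the variable names here are illustrative.

```python
from statistics import mean

children = [81, 175, 202, 346, 360]  # y, total number of children
ice_creams = [15, 27, 23, 35, 46]    # x, ice creams sold

y_bar = mean(children)
x_bar = mean(ice_creams)

# Regression of x on y: slope = S_xy / S_yy, intercept = x_bar - slope * y_bar.
s_xy = sum((y - y_bar) * (x - x_bar) for y, x in zip(children, ice_creams))
s_yy = sum((y - y_bar) ** 2 for y in children)

a = s_xy / s_yy
b = x_bar - a * y_bar

print(f"x = {a:.4f}y + {b:.2f}")
```

Swapping the roles of the two lists would give the regression of y on x instead, which is a different line and should not be used to predict x.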
The regression line of y on x for this data can be written in the form
y = ax + b.
(c) Use the equation of your regression line to predict the Science
test score for a student who has a score of 78 on the
Mathematics test. Express your answer to the nearest integer. [2]
6. [Maximum mark: 7] 22M.1.SL.TZ1.3
A survey at a swimming pool is given to one adult in each family. The age of
the adult, a years old, and of their eldest child, c years old, are recorded.
The ages of the eldest child are summarized in the following box and whisker
diagram.
(…/4)c + 20. The regression line of c on a is c = (1/2)a − 9.
(b.i) One of the adults surveyed is 42 years old. Estimate the age of
their eldest child. [2]
(b.ii) Find the mean age of all the adults surveyed. [2]
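For part (b.i), the estimate comes from substituting the adult's age into the regression line of c on a, c = (1/2)a − 9, since that is the line that predicts c from a. A minimal sketch of the substitution follows; the function name `predict_child_age` is illustrative.

```python
def predict_child_age(adult_age):
    """Regression line of c on a from the question: c = (1/2)a - 9."""
    return 0.5 * adult_age - 9

estimate = predict_child_age(42)
print(estimate)
```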
(c) One of these eight students was disappointed with her result
and wished she had practised more. Based on the given data,
determine how her score could have been expected to alter had
she practised an extra five hours per week. [2]
(d) Lucy asserts that the number of hours a student practises has a
direct effect on their final diploma result. Comment on the
validity of Lucy’s assertion. [1]
(e) Lucy suspected that each student had not been practising as
much as they reported. In order to compensate for this, Lucy
deducted a fixed number of hours per week from each of
the students’ recorded hours.
(b) Use this model to predict the value of y when x = 18. [2]
(c) Write down the value of x̄ and the value of ȳ. [1]
(d) Draw the line of best fit on the scatter diagram. [2]
Sarah, a regular customer, visited the café on five consecutive days. The
following table shows the number of customers, x, ahead of Sarah who have
already ordered and are waiting to receive their coffee and Sarah’s waiting time,
y minutes.
The relationship between x and y can be modelled by the regression line of y
on x with equation y = ax + b.
(a.i) Find the value of a and the value of b. [2]
(c) On another day, Sarah visits the café to order a coffee. Seven
customers have already ordered their coffee and are waiting to
receive it.
Use the result from part (a)(i) to estimate Sarah’s waiting time to
receive her coffee. [2]
Jill is doing a 1000-piece jigsaw puzzle. She started by sorting the edge pieces
from the interior pieces. Six times she stopped and counted how many of each
type she had found. The following table indicates this information.
Jill models the relationship between these variables using the regression
equation y = ax + b.
(a) Write down the value of a and of b. [3]
(b) Use the model to predict how many edge pieces she had found
when she had sorted a total of 750 pieces. [3]
(b.i) Write down, for this set of data, the mean temperature
difference from 37 °C, x̄. [1]
(b.ii) Write down, for this set of data, the mean number of heartbeats
per minute, ȳ. [1]
(c) Plot and label the point M(x̄, ȳ) on the scatter diagram. [2]
(a.ii) Use your graphic display calculator to write down ȳ, the mean
examination score. [1]
(b.i) Find the exact value of m and of c for these data. [2]
(b.ii) Show that the point M(x̄, ȳ) lies on the regression line of y on x. [2]
(b) Another athlete on this sports team has a hand length of 21.5
cm. Use the regression equation to estimate the height of this
athlete. [2]
(a.ii) For these data, find the equation of the regression line of y on x. [2]
(a) Plot and label the point M(m̄, p̄) on the scatter diagram. [2]
(b) Draw the line of best fit, by eye, on the scatter diagram. [2]
(c) Using your line of best fit, estimate the physics test score for a
student with a score of 20 in their mathematics test. [2]