WWSL Paper2
WWSL Paper2
The manager of a folder factory recorded the number of folders produced by the factory (in thousands) and the production costs (in thousand Euros),
Every month the factory sells all the folders produced. Each folder is sold for 2.99 Euros.
a. Draw a scatter diagram for this data. Use a scale of 2 cm for 5000 folders on the horizontal axis and 2 cm for 10 000 Euros on the vertical axis. [4]
b.i. Write down, for this set of data the mean number of folders produced, ; [1]
b.ii.Write down, for this set of data the mean production cost, . [1]
d. Use your graphic display calculator to find the Pearson’s product–moment correlation coefficient, . [2]
e. State a reason why the regression line on is appropriate to model the relationship between these variables. [1]
f. Use your graphic display calculator to find the equation of the regression line on . [2]
h. Use the equation of the regression line to estimate the least number of folders that the factory needs to sell in a month to exceed its production [4]
The weights of these boxes are normally distributed with a mean of 502 grams and a standard deviation of 2 grams.
All boxes of cereal with a weight between 497.5 grams and 505 grams are sold. The manufacturer’s income from the sale of each box of cereal is
$2.00.
The manufacturer recycles any box of cereal with a weight not between 497.5 grams and 505 grams. The manufacturer’s recycling cost is $0.16 per
box.
A different manufacturer produces boxes of cereal with weights that are normally distributed with a mean of 350 grams and a standard deviation of
1.8 grams.
This manufacturer sells all boxes of cereal that are above a minimum weight, .
b. (i) Find the probability that a box of cereal, chosen at random, is sold. [4]
(ii) Calculate the manufacturer’s expected daily income from these sales.
As part of his IB Biology field work, Barry was asked to measure the circumference of trees, in centimetres, that were growing at different
distances, in metres, from a river bank. His results are summarized in the following table.
a. State whether distance from the river bank is a continuous or discrete variable. [1]
b. On graph paper, draw a scatter diagram to show Barry’s results. Use a scale of 1 cm to represent 5 m on the x-axis and 1 cm to represent 10 [4]
cm on the y-axis.
(i) the mean distance, , of the trees from the river bank;
g. Use the equation of the regression line on to estimate the circumference of a tree that is 40 m from the river bank. [2]
The following table shows the number of bicycles, , produced daily by a factory and their total production cost, , in US dollars (USD). The table
a. (i) Write down the Pearson’s product–moment correlation coefficient, , for these data. [4]
b. Write down the equation of the regression line on for these data, in the form . [2]
c. Estimate the total cost, to the nearest USD, of producing bicycles on a particular day. [3]
d. All the bicycles that are produced are sold. The bicycles are sold for 304 USD each. [2]
Explain why the factory does not make a profit when producing bicycles on a particular day.
e. All the bicycles that are produced are sold. The bicycles are sold for 304 USD each. [5]
(i) Write down an expression for the total selling price of bicycles.
(ii) Write down an expression for the profit the factory makes when producing bicycles on a particular day.
(iii) Find the least number of bicycles that the factory should produce, on a particular day, in order to make a profit.
The following table shows the average body weight, , and the average weight of the brain, , of seven species of mammal. Both measured in
kilograms (kg).
a. Find the range of the average body weights for these seven species of mammal. [2]
b.i. For the data from these seven species calculate , the Pearson’s product–moment correlation coefficient; [2]
b.ii.For the data from these seven species describe the correlation between the average body weight and the average weight of the brain. [2]
c. Write down the equation of the regression line on , in the form . [2]
d. Use your regression line to estimate the average weight of the brain of grey wolves. [2]
f. State whether it is valid to use the regression line to estimate the average weight of the brain of mice. Give a reason for your answer. [2]
A group of candidates sat a Chemistry examination and a Physics examination. The candidates’ marks in the Chemistry examination are normally
b. Write down the probability that a randomly chosen candidate who sat the Chemistry examination scored at most 60 marks. [1]
Find the probability that a randomly chosen candidate who sat the Chemistry examination scored more than Hee Jin.
d. The candidates’ marks in the Physics examination are normally distributed with a mean of and a standard deviation of . Hee Jin also [2]
Find the probability that a randomly chosen candidate who sat the Physics examination scored less than Hee Jin.
e. The candidates’ marks in the Physics examination are normally distributed with a mean of and a standard deviation of . Hee Jin also [2]
Determine whether Hee Jin’s Physics mark, compared to the other candidates, is better than her mark in Chemistry. Give a reason for your
answer.
f. To obtain a “grade A” a candidate must be in the top of the candidates who sat the Physics examination. [3]
Find the minimum possible mark to obtain a “grade A”. Give your answer correct to the nearest integer.
The table below shows the distribution of test grades for 50 IB students at Greendale School.
A student is chosen at random from these 50 students.
The number of minutes that the 50 students spent preparing for the test was normally distributed with a mean of 105 minutes and a standard
deviation of 20 minutes.
d. Find the probability that this student scored a grade 5 or higher. [2]
e. Given that the first student chosen at random scored a grade 5 or higher, find the probability that both students scored a grade 6. [3]
f.i. Calculate the probability that a student chosen at random spent at least 90 minutes preparing for the test. [2]
f.ii. Calculate the expected number of students that spent at least 90 minutes preparing for the test. [2]
In a school, all Mathematical Studies SL students were given a test. The test contained four questions, each one on a different topic from the syllabus.
The quality of each response was classified as satisfactory or not satisfactory. Each student answered only three of the four questions, each on a
The table below shows the number of satisfactory and not satisfactory responses for each question.
A test is carried out at the 5% significance level for the data in the table.
a.ii.If the teacher chooses a response at random, find the probability that it is a satisfactory response to the Calculus question; [2]
a.iii.If the teacher chooses a response at random, find the probability that it is a satisfactory response, given that it is a response to the Calculus [2]
question.
b. The teacher groups the responses by topic, and chooses two responses to the Logic question. Find the probability that both are not [3]
satisfactory.
d. Show that the expected frequency of satisfactory Calculus responses is 12. [1]
e. Write down the number of degrees of freedom for this test. [1]
f. Use your graphic display calculator to find the statistic for this data. [2]
g. State the conclusion of this test. Give a reason for your answer. [2]
In the month before their IB Diploma examinations, eight male students recorded the number of hours they spent on social media.
For each student, the number of hours spent on social media ( ) and the number of IB Diploma points obtained ( ) are shown in the following table.
Ten female students also recorded the number of hours they spent on social media in the month before their IB Diploma examinations. Each of these
The equation of the regression line y on x for these ten female students is
An eleventh girl spent 34 hours on social media in the month before her IB Diploma examinations.
a. On graph paper, draw a scatter diagram for these data. Use a scale of 2 cm to represent 5 hours on the -axis and 2 cm to represent 10 points [4]
on the -axis.
c. Plot the point on your scatter diagram and label this point M. [2]
d. Write down the value of , the Pearson’s product–moment correlation coefficient, for these data. [2]
e. Write down the equation of the regression line on for these eight male students. [2]
f. Draw the regression line, from part (e), on your scatter diagram. [2]
g. Use the given equation of the regression line to estimate the number of IB Diploma points that this girl obtained. [2]
The table below shows the scores for 12 golfers for their first two rounds in a local golf tournament.
(iii) Find the number of these golfers that had a score of more than one standard deviation above the mean in Round 1.
Determine whether you can use the equation of the regression line to estimate his score in Round 2. Give a reason for your answer.
In a mountain region there appears to be a relationship between the number of trees growing in the region and the depth of snow in winter. A set
of 10 areas was chosen, and in each area the number of trees was counted and the depth of snow measured. The results are given in the table
below.
In a study on students there seemed to be a difference between males and females in their choice of favourite car colour. The results are given
A, a,
Usei. your graphic display calculator to find the mean number of trees. [1]
A, a,
Useiii. your graphic display calculator to find the mean depth of snow. [1]
A, a,
Useiv. your graphic display calculator to find the standard deviation of the depth of snow. [1]
A, d.
If the number of trees in an area is 55, estimate the depth of snow. [2]
A, e,
Usei. the equation of the regression line to estimate the depth of snow in an area with 100 trees. [1]
A, e,
Decide
ii. whether the answer in (e)(i) is a valid estimate of the depth of snow in the area. Give a reason for your answer. [2]
B, a.
Write down the total number of male students. [1]
B, b.
Show that the expected frequency for males, whose favourite car colour is blue, is 12.6. [2]
B, c,
Thei. calculated value of is and the critical value of is at the significance level. [1]
B, c,
Theii. calculated value of is and the critical value of is at the significance level. [1]
B, c,
Theiv. calculated value of is and the critical value of is at the significance level. [2]
Determine whether the null hypothesis should be accepted at the significance level. Give a reason for your answer.
A group of 800 students answered 40 questions on a category of their choice out of History, Science and Literature.
For each student the category and the number of correct answers, , was recorded. The results obtained are represented in the following table.
A test at the 5% significance level is carried out on the results. The critical value for this test is 12.592.
b.ii.Write down, for , the mid-interval value of the modal class. [1]
c.i. Use your graphic display calculator to estimate the mean of ; [2]
c.ii.Use your graphic display calculator to estimate the standard deviation of . [1]
d. Find the expected frequency of students choosing the Science category and obtaining 31 to 40 correct answers. [2]
e.i. Write down the null hypothesis for this test; [1]
g. State the result of the test. Give a reason for your answer. [2]
Part A
100 students are asked what they had for breakfast on a particular morning. There were three choices: cereal (X) , bread (Y) and fruit (Z). It is
found that
10 students had all three
17 students had bread and fruit only
15 students had cereal and fruit only
12 students had cereal and bread only
13 students had only bread
8 students had only cereal
9 students had only fruit
Part B
The same 100 students are also asked how many meals on average they have per day. The data collected is organized in the following table.
A.b.Find the number of students who had none of the three choices for breakfast. [2]
A.c.Write down the percentage of students who had fruit for breakfast. [2]
A.d.Describe in words what the students in the set had for breakfast. [2]
A.e.Find the probability that a student had at least two of the three choices for breakfast. [2]
A.f.Two students are chosen at random. Find the probability that both students had all three choices for breakfast. [3]
B.a.Write down the null hypothesis, H0, for this test. [1]
B.b.Write down the number of degrees of freedom for this test. [1]
B.d.Show that the expected number of females that have more than 5 meals per day is 13, correct to the nearest integer. [2]
B.e.Use your graphic display calculator to find the for this data. [2]
The table shows the distance, in km, of eight regional railway stations from a city centre terminus and the price, in , of a return ticket from each
Use the equation of the regression line to estimate the price of a return ticket to the city centre terminus from this regional station. Give your
answer correct to the nearest .
g. Give a reason why it is valid to use your regression line to estimate the price of this return ticket. [1]
Using your answer to part (f), calculate the percentage error in the estimated price of the ticket.
In a debate on voting, a survey was conducted. The survey asked people’s opinion on whether or not the minimum voting age should be reduced to
A test at the 1% significance level was conducted. The critical value of the test is 9.21.
a. State [2]
significant figures.
d. Find [3]
The seniors from Gulf High School are required to participate in exactly one after-school sport. Data were gathered from a sample of 120 students
A test was carried out at the 5 % significance level to analyse the relationship between gender and choice of after-school sport.
a. Write down the null hypothesis, H0, for this test. [1]
(i) is male;
h. Two students are chosen at random from the 120 students. [5]
He measures the diameters of the apples to the nearest cm. The following table shows the distribution of the diameters.
a. Using your graphic display calculator, write down the value of [3]
b. Daniel assumes that the diameters of all of the apples from his harvest are normally distributed with a mean of 7 cm and a standard deviation of [3]
1.2 cm. He classifies the apples according to their diameters as shown in the following table.
c. Daniel assumes that the diameters of all of the apples from his harvest are normally distributed with a mean of 7 cm and a standard deviation of [2]
1.2 cm. He classifies the apples according to their diameters as shown in the following table.
d. Daniel assumes that the diameters of all of the apples from his harvest are normally distributed with a mean of 7 cm and a standard deviation of [2]
1.2 cm. He classifies the apples according to their diameters as shown in the following table.
1.2 cm. He classifies the apples according to their diameters as shown in the following table.
Estimate the number of large apples that Daniel will grow this year.
An agricultural cooperative uses three brands of fertilizer, A, B and C, on 120 different crops. The crop yields are classified as High, Medium or
Low.
The agricultural cooperative decides to conduct a chi-squared test at the 1 % significance level using the data.
d. Show that the expected number of Medium Yield crops using Fertilizer C is 17, correct to the nearest integer. [2]
e. Use your graphic display calculator to find for the data [3]
f. State the conclusion of the test. Give a reason for your decision. [2]
The number of bottles of water sold at a railway station on each day is given in the following table.
a. Write down [2]
b. Write down the correlation coefficient, , for the variables and . [1]
(ii) Estimate how many bottles of water will be sold when the temperature is .
e. On a day when the temperature was Peter calculates that bottles would be sold. Give one reason why his answer might be unreliable. [1]
a. A speed camera on Peterson Road records the speed of each passing vehicle. The speeds are found to be normally distributed with a mean of [2]
Sketch a diagram of this normal distribution and shade the region representing the probability that the speed of a vehicle is between and
.
c. It is found that of the vehicles are exceeding the speed limit of . [2]
d. There is a fine of for exceeding the speed limit on Peterson Road. On a particular day the total value of fines issued was . [4]
(i) Calculate the number of fines that were issued on this day.
(ii) Estimate the total number of vehicles that passed the speed camera on Peterson Road on this day.
One day the numbers of customers at three cafés, “Alan’s Diner” ( ), “Sarah’s Snackbar” ( ) and “Pete’s Eats” ( ), were recorded and are
given below.
17 were customers of Pete’s Eats only
27 were customers of Sarah’s Snackbar only
15 were customers of Alan’s Diner only
10 were customers of Pete’s Eats and Sarah’s Snackbar but not Alan’s Diner
8 were customers of Pete’s Eats and Alan’s Diner but not Sarah’s Snackbar
Some of the customers in each café were given survey forms to complete to find out if they were satisfied with the standard of service they
received.
A.a.Draw a Venn Diagram, using sets labelled , and , that shows this information. [3]
A.b.There were 48 customers of Pete’s Eats that day. Calculate the number of people who were customers of all three cafés. [2]
A.c.There were 50 customers of Sarah’s Snackbar that day. Calculate the total number of people who were customers of Alan’s Diner. [3]
A.d.Write down the number of customers of Alan’s Diner that were also customers of Pete’s Eats. [1]
A.e.Find . [2]
B.a.One of the survey forms was chosen at random, find the probability that the form showed “Dissatisfied”; [2]
B.b.One of the survey forms was chosen at random, find the probability that the form showed “Satisfied” and was completed at Sarah’s Snackbar; [2]
B.c.One of the survey forms was chosen at random, find the probability that the form showed “Dissatisfied”, given that it was completed at Alan’s [2]
Diner.
B.d.A test at the significance level was carried out to determine whether there was any difference in the level of customer satisfaction in each [1]
of the cafés.
B.e.A test at the significance level was carried out to determine whether there was any difference in the level of customer satisfaction in each [1]
of the cafés.
B.f.A test at the significance level was carried out to determine whether there was any difference in the level of customer satisfaction in each [2]
of the cafés.
B.g.A test at the significance level was carried out to determine whether there was any difference in the level of customer satisfaction in each [2]
of the cafés.
State, giving a reason, the conclusion to the test.
The Brahma chicken produces eggs with weights in grams that are normally distributed about a mean of with a standard deviation of . The
eggs are classified as small, medium, large or extra large according to their weight, as shown in the table below.
a. Sketch a diagram of the distribution of the weight of Brahma chicken eggs. On your diagram, show clearly the boundaries for the classification [3]
of the eggs.
b. An egg is chosen at random. Find the probability that the egg is [4]
(i) medium;
c. There is a probability of that a randomly chosen egg weighs more than grams. [2]
Find .
d. The probability that a Brahma chicken produces a large size egg is . Frank’s Brahma chickens produce eggs each month. [2]
Calculate an estimate of the number of large size eggs produced by Frank’s chickens each month.
e. The selling price, in US dollars (USD), of each size is shown in the table below. [3]
Estimate the monthly income, in USD, earned by selling the eggs. Give your answer correct to two decimal places.
Alex and Kris are riding their bicycles together along a bicycle trail and note the following distance markers at the given times.
a. Draw a scatter diagram of the data. Use 1 cm to represent 1 hour and 1 cm to represent 10 km. [3]
b.i. Write down for this set of data the mean time, . [1]
b.ii.Write down for this set of data the mean distance, . [1]
e. Using your graph, estimate the time when Alex and Kris pass the 85 km distance marker. Give your answer correct to one decimal place. [2]
f. Write down the equation of the regression line for the data given. [2]
g.i. Using your equation calculate the distance marker passed by the cyclists at 10.3 hours. [2]
g.ii.Is this estimate of the distance reliable? Give a reason for your answer. [2]
A store recorded their sales of televisions during the 2010 football World Cup. They looked at the numbers of televisions bought by gender and
This information is shown in the table below; S represents the size of the television screen in inches.
The store wants to use this information to predict the probability of selling these sizes of televisions for the 2014 football World Cup.
(iv) a television with a screen size greater than 46 inches will be bought, given that it is bought by a male.
b. The manager of the store wants to determine whether the screen size is independent of gender. A Chi-squared test is performed at the 1 % [1]
significance level.
significance level.
Show that the expected frequency for females who bought a screen size of 32 < S ≤ 46, is 79, correct to the nearest integer.
d. The manager of the store wants to determine whether the screen size is independent of gender. A Chi-squared test is performed at the 1 % [1]
significance level.
e. The manager of the store wants to determine whether the screen size is independent of gender. A Chi-squared test is performed at the 1 % [2]
significance level.
f. The manager of the store wants to determine whether the screen size is independent of gender. A Chi-squared test is performed at the 1 % [1]
significance level.
g. The manager of the store wants to determine whether the screen size is independent of gender. A Chi-squared test is performed at the 1 % [2]
significance level.
Determine if the null hypothesis should be accepted. Give a reason for your answer.
Pam has collected data from a group of 400 IB Diploma students about the Mathematics course they studied and the language in which they were
examined (English, Spanish or French). The summary of her data is given below.
a. A student is chosen at random from the group. Find the probability that the student [8]
(iv) did not study Mathematics SL and was not examined in English;
(v) studied Mathematical Studies SL given that the student was examined in Spanish.
b. Pam believes that the Mathematics course a student chooses is independent of the language in which the student is examined. [2]
Using your answers to parts (a) (i), (ii) and (iii) above, state whether there is any evidence for Pam’s belief. Give a reason for your answer.
c. Pam decides to test her belief using a Chi-squared test at the level of significance. [3]
(ii) Show that the expected number of Mathematical Studies SL students who took the examination in Spanish is , correct to 3 significant
figures.
e. State, giving a reason, whether there is sufficient evidence at the level of significance that Pam’s belief is correct. [2]
a. Jorge applies a test at the level to investigate whether wearing a seat belt is associated with the time a driver has had their licence. [8]
(iii) Show that the expected number of drivers that wear a seat belt and have had their driving licence for more than years is , correct to
the nearest whole number.
b. Consider the drivers surveyed. One driver is chosen at random. Calculate the probability that [4]
(ii) the driver does not wear a seat belt, given that the driver has held a licence for more than years.
c. Two drivers are chosen at random. Calculate the probability that [6]
c. (i) Write down an estimate for the standard deviation of the lengths. [3]
(ii) How many fish (if any) have length greater than three standard deviations above the mean?
d. The fishing company must pay a fine if more than 10% of the catch have lengths less than 40cm. [2]
e. A sample of 15 of the fish was weighed. The weight, W was plotted against length, L as shown below. [2]
Exactly two of the following statements about the plot could be correct. Identify the two correct statements.
On one day 180 flights arrived at a particular airport. The distance travelled and the arrival status for each incoming flight was recorded. The flight was
A χ2 test is carried out at the 10 % significance level to determine whether the arrival status of incoming flights is independent of the distance
travelled.
b. Calculate the expected frequency of flights travelling at most 500 km and arriving slightly delayed. [2]
e. State, with a reason, whether you would reject the null hypothesis. [2]
f. Write down the probability that this flight arrived on time. [2]
g. Given that this flight was not heavily delayed, find the probability that it travelled between 500 km and 5000 km. [2]
h. Two flights are chosen at random from those which were slightly delayed. [3]
Find the probability that each of these flights travelled at least 5000 km.
A random sample of 167 people who own mobile phones was used to collect data on the amount of time they spent per day using their phones.
Manuel conducts a survey on a random sample of 751 people to see which television programme type they watch most from the following:
Manuel decides to ignore the ages and to test at the 5 % level of significance whether the most watched programme type is independent of gender.
i.b. Use your graphic display calculator to calculate approximate values of the mean and standard deviation of the time spent per day on these [3]
mobile phones.
i.c. On graph paper, draw a fully labelled histogram to represent the data. [4]
ii.a.Draw a table with 2 rows and 4 columns of data so that Manuel can perform a chi-squared test. [3]
ii.c.Find the expected frequency for the number of females who had ‘Comedy’ as their most-watched programme type. Give your answer to the [2]
ii.d.Using your graphic display calculator, or otherwise, find the chi-squared statistic for Manuel’s data. [3]
ii.e.(i) State the number of degrees of freedom available for this calculation. [3]
Francesca is a chef in a restaurant. She cooks eight chickens and records their masses and cooking times. The mass m of each chicken, in kg, and
e. Using your line of best fit, estimate the cooking time, in minutes, for a 1.7 kg chicken. [2]
h. The cooking time of an additional 2.0 kg chicken is recorded. If the mass and cooking time of this chicken is included in the data, the correlation [2]
is weak.
(i) Explain how the cooking time of this additional chicken might differ from that of the other eight chickens.
(ii) Explain how a new line of best fit might differ from that drawn in part (d).
A survey of people is carried out by a market research organization in two different cities, Buenos Aires and Montevideo. The people are
asked which brand of cereal they prefer out of Chocos, Zucos or Fruti. The table below summarizes their responses.
The following table shows the cost in of seven paperback books chosen at random, together with the number of pages in each book.
i.a. One person is chosen at random from those surveyed. Find the probability that this person [4]
i.b. Two people are chosen at random from those surveyed. Find the probability that they both prefer Fruti. [3]
i.c. The market research organization tests the survey data to determine whether the brand of cereal preferred is associated with a city. A chi- [1]
i.d. The market research organization tests the survey data to determine whether the brand of cereal preferred is associated with a city. A chi- [1]
i.e. The market research organization tests the survey data to determine whether the brand of cereal preferred is associated with a city. A chi- [2]
Show that the expected frequency for the number of people who live in Montevideo and prefer Zucos is .
i.f. The market research organization tests the survey data to determine whether the brand of cereal preferred is associated with a city. A chi- [2]
i.g. The market research organization tests the survey data to determine whether the brand of cereal preferred is associated with a city. A chi- [2]
State whether the market research organization would accept the null hypothesis. Clearly justify your answer.
ii.a.Plot these pairs of values on a scatter diagram. Use a scale of to represent pages on the horizontal axis and to represent [3]
ii.b.Write down the linear correlation coefficient, , for the data. [2]
ii.c.Stephen wishes to buy a paperback book which has pages in it. He plans to draw a line of best fit to determine the price. State whether or [2]
not this is an appropriate method in this case and justify your answer.
Part A
A university required all Science students to study one language for one year. A survey was carried out at the university amongst the 150 Science
students. These students all studied one of either French, Spanish or Russian. The results of the survey are shown below.
Ludmila decides to use the test at the level of significance to determine whether the choice of language is independent of gender.
At the end of the year, only seven of the female Science students sat examinations in Science and French. The marks for these seven students are
A.c.Find the expected frequency for the females studying Spanish. [2]
A.d.Use your graphic display calculator to find the test statistic for this data. [2]
A.e.State whether Ludmila accepts the null hypothesis. Give a reason for your answer. [2]
B.a.Draw a labelled scatter diagram for this data. Use a scale of to represent on the -axis ( ) and on the -axis ( ). [4]
B.d.Use your graphic display calculator to find the equation of the regression line of on . [2]
B.f.Carletta’s mark on the Science examination was . She did not sit the French examination. [2]
B.g.Monique’s mark on the Science examination was 85. She did not sit the French examination. Her French teacher wants to use the regression [2]
State whether the mark obtained from the regression line for Monique’s French examination is reliable. Justify your answer.
a. For an ecological study, Ernesto measured the average concentration of the fine dust, , in the air at different distances from a [2]
power plant. His data are represented on the following scatter diagram. The concentration of is measured in micrograms per cubic metre
Use the scatter diagram to find the value of and of in the table.
b. Calculate [4]
d. Ernesto’s school is located from the power plant. He uses the equation of the regression line to estimate the concentration of in [4]
A manufacturer claims that fertilizer has an effect on the height of rice plants. He measures the height of fertilized and unfertilized plants. The
A chi-squared test is performed to decide if the manufacturer’s claim is justified at the 1 % level of significance.
i, a.Write down the null and alternative hypotheses for this test. [2]
i, b.For the number of fertilized plants with height greater than 75 cm, show that the expected value is 97.5. [3]
i, f. Is the manufacturer’s claim justified? Give a reason for your answer. [2]
ii, a,Write
i. down the value of p. [1]
ii, a,Write
ii. down the value of q. [2]
ii, b.Using the values in the table above, draw the graph of N for 0 ≤ t ≤ 20. Use 1 cm to represent 2 days on the horizontal axis and 1 cm to [6]
ii, c.Use your graph to estimate the number of days for the population of fleas to reach 55. [2]
The weight, W, of basketball players in a tournament is found to be normally distributed with a mean of 65 kg and a standard deviation of 5 kg.
The probability that a basketball player has a weight that is within 1.5 standard deviations of the mean is q.
A basketball coach observed 60 of her players to determine whether their performance and their weight were independent of each other. Her
She decided to conduct a χ 2 test for independence at the 5% significance level.
a.i. Find the probability that a basketball player has a weight that is less than 61 kg. [2]
Find the expected number of players with a weight less than 61 kg in this training session.
The heat output in thermal units from burning of wood changes according to the wood’s percentage moisture content. The moisture content
and heat output of blocks of the same type of wood each weighing were measured. These are shown in the table.
a. Draw a scatter diagram to show the above data. Use a scale of to represent on the x-axis and a scale of to represent [4]
c. Plot the point on your scatter diagram and label this point M . [2]
e. The equation of the regression line on is . Draw the regression line on on your scatter diagram. [2]
f. The equation of the regression line on is . Estimate the heat output in thermal units of a block of wood that has [2]
moisture content.
g. The equation of the regression line on is . State, with a reason, whether it is appropriate to use the regression line [2]
In an environmental study of plant diversity around a lake, a biologist collected data about the number of different plant species (y) that were
a. Draw a scatter diagram to show the data. Use a scale of 2 cm to represent 10 metres on the x-axis and 2 cm to represent 10 plant species on [4]
the y-axis.
b. Using your scatter diagram, describe the correlation between the number of different plant species and the distance from the lake shore. [1]
c.i. Use your graphic display calculator to write down , the mean of the distances from the lake shore. [1]
c.ii.Use your graphic display calculator to write down , the mean number of plant species. [1]
d. Plot the point ( , ) on your scatter diagram. Label this point M. [2]
e. Write down the equation of the regression line y on x for the above data. [2]
g. Estimate the number of plant species growing 30 metres from the lake shore. [2]
George leaves a cup of hot coffee to cool and measures its temperature every minute. His results are shown in the table below.
a. Write down the decrease in the temperature of the coffee [3]
b. Assuming the pattern in the answers to part (a) continues, show that . [2]
c. Use the seven results in the table to draw a graph that shows how the temperature of the coffee changes during the first six minutes. [4]
Use a scale of 2 cm to represent 1 minute on the horizontal axis and 1 cm to represent 10 °C on the vertical axis.
d. The function that models the change in temperature of the coffee is y = p (2−t )+ q. [2]
e. Solve the equations found in part (d) to find the value of p and the value of q. [2]
g. George decides to model the change in temperature of the coffee with a linear function using correlation and linear regression. [4]
h. Use the equation of the regression line to estimate the temperature of the coffee at t = 3. [2]
i. Find the percentage error in this estimate of the temperature of the coffee at t = 3. [2]
A biologist is studying the relationship between the number of chirps of the Snowy Tree cricket and the air temperature. He records the chirp rate, ,
a. Draw the scatter diagram for the above data. Use a scale of 2 cm for 20 chirps on the horizontal axis and 2 cm for 4°C on the vertical axis. [4]
b. Use your graphic display calculator to write down the Pearson’s product–moment correlation coefficient, , between and . [2]
e. Calculate the air temperature when the cricket’s chirp rate is . [2]
f. Given that , draw the regression line on on your scatter diagram. [2]
g. A forest ranger uses her own formula for estimating the air temperature. She counts the number of chirps in 15 seconds, , multiplies this [1]
Write down the formula that the forest ranger uses for estimating the temperature, .
(i) calculate an estimate for the temperature, , using the forest ranger’s formula;
(ii) determine the actual temperature recorded by the biologist, using the table above;
(iii) calculate the percentage error in the forest ranger’s estimate for the temperature, compared to the actual temperature recorded by the
biologist.