Extra Math QN
Extra Math QN
1. TPJC/2009/2/6
The number of official working hours (x hrs) per week of a sample of 50 teachers are summarised as follows: ( x 20) = 30 , x 2 = 21300 Find the unbiased estimates for the mean and variance of working hours of a teacher. [4] Find the probability that the sample mean differs from the population mean working hours per week of 50 teachers by less than 0.2. [3]
2. 2008TJC/2/10 (Modified) A secondary school has a student population consisting of 600 boys and 720 girls. A study on the amount of time spent reading newspapers in a week is to be conducted by randomly choosing 60 male students and 72 female students from the class lists and obtaining necessary information from them. The data (in hours) collected from the 60 boys (denoted by x) and the 72 girls (denoted by y) is summarised as follows: , , , . (i) Find unbiased estimates of the means and variances of x and y. [4] (ii) Find the probability that, on average, male students in the school spend more time reading newspapers in a week than female students in the school. [4] 3. 2008MJC/2/12 The length, in cm, of a floor tile follows a normal distribution with mean 30 and standard deviation 0.2. The length, in cm, of a wall tile follows a normal distribution with mean 10 and standard deviation 0.15. (i) Find the probability that the sum of the lengths of five randomly chosen floor tiles is between 150.5 cm and 151.5 cm. [3]
(ii) The random variable S is the sum of the lengths of six randomly chosen wall tiles and the random variable W is twice the length of a randomly chosen floor tile. Find P(S < W + 1). value of a such that P F 30 a = 0.02 . 4. 2009/AJC/2/10 [4] (iii) The average length of 25 randomly chosen floor tiles is denoted by F . Find the
[4]
Customers at a supermarket pay for their purchases either by cash or credit card. The probability of a randomly chosen customer paying by credit card is 0.3.
(i)
Find the probability that, on a randomly chosen day, there are more than 6 customers among the first 20 customers who pay by credit card and the fifth customer is the second customer who does so. [3] Fifty samples of 15 customers each were taken. If there is a 95% chance that the average number of customers who pay by credit card per sample lies in the interval (4, a), find the value of a, giving your answer to 3 significant figures. [3]
(ii)
5. 2009/MJC/2/12 (a) The height, X cm, of boys in a school may be assumed to follow a normal distribution with mean and variance 2 . Given that P ( X < 155 ) = P ( X > 185 ) = 0.025 , find the values of and 2 . [4] Hence, find the probability that (i) the total height of two randomly chosen boys exceeds twice the height of a third randomly chosen boy by at least 5 cm. [2] (ii) the mean height of 50 randomly chosen boys is less than 172 cm. [2] (b) The mass of a randomly chosen box of hamster food has mean g and standard deviation 5 g. A large sample of n boxes is taken. Find the least value of n such that P X < 1 > 0.99 where X is the sample mean.
[4]
Hypothesis Testing
1. [MI/08/Prelims/2/8] A newspaper claims that newly-graduated Economics students have an average starting salary of $2600. The director of a recruitment company believes that the figure stated in the newspaper is incorrect, and does a survey with a group of 80 newly-graduated Economics students. The data the director obtained is as follows, with $ x representing the monthly salaries of these graduates:
( x 2500 ) = 6400
and
Calculate the unbiased estimates of the mean and variance of the monthly salary of a newlygraduated Economics student. [3] Carry out an appropriate test at the 1% significance level to determine whether the directors belief is justified. [4] 2. [NJC/08/Prelims/2/11 (modified)] Yau Yan slimming centre offers a particular slimming package for ladies. Based on past records, the mean weight of a lady after treatment was 50 kg. The weight of a lady may be assumed to be normally distributed.
( x 2500 )
= 780 000
In January 2008, a sample of 20 customers was randomly chosen and the weight, x kg, of each customer after treatment was recorded. The data were summarised by
( x 50 ) = 54
and
( x 50 )
= 1226
Find the unbiased estimate of the population variance. [1] The centre manager claimed that the mean weight after treatment had changed. Test the managers claim at 10% level of significance. [4] Explain, in context of the question, the meaning of 10% level of significance. [1]
In June 2008, the centre manager found that the standard deviation of the weight of a customer after treatment was 6 kg. (iv) She analysed another random sample of 20 and when a test was conducted at 5% level of significance, there was sufficient evidence to justify that the weight after treatment had reduced. Find an estimate of the maximum total weight of this sample. Give your answer to the nearest integer. [3]
3. [IJC/07/Prelims/II/Q11(modified)] (a) The mean height of a mustard plant was found to be 5.1 mm after 4 days of growth. A random sample of 10 mustard plants had the following heights, in mm, after 4 days of growth. 5.0, 4.5, 4.8, 5.2, 4.3, 5.1, 5.2, 4.9, 5.1, 5.0 Using a 6% significance level, test whether the mean height of these plants is less than 5.1 mm. State any assumption made. [5] Explain the meaning of 6% significance level in the context of this question. (b) [1] In general, the marks obtained by students in their Mathematics examination may be assumed to be a random variable with mean 65 and variance 32. After the introduction of a new program of revision classes, it is found that there are improvements to the marks of some of the students. A random sample of the marks of 50 students is taken and the mean is found to be 66.4. A test is conducted and it shows that there is insufficient evidence that the new program of revision classes is effective. Find an inequality satisfied by the significance level of the test. [3] State, with a reason, whether it is necessary to assume that the Mathematics examination marks follow a normal distribution. [1] 4. [CJC/09/Prelims/2/6]
In July 2009, Company J introduced the new jPhone 3GS, the fastest, most powerful jPhone, packed with improved performance up to twice as fast as the previous model with longer battery life. The battery life in a randomly chosen jPhone 3GS has a normal distribution and the battery life of the phone is supposed to be 120 hours. A random sample of 90 jPhones 3GS is taken, and the battery life of each phone, x hours, is recorded. The data are summarised by
( x 120 ) = 29 ,
( x 120 )
= 419
Test, at the 5% significance level, whether the mean battery life of a jPhone 3GS is less than 120 hours. Explain, in the context of the question, the meaning of at the 5% significance level.
5. [ACJC/08/Prelims/2/12 (modified)] A marmalade manufacturer produces thousands of jars of marmalade each week. The mass, x grams, of marmalade in a randomly chosen jar has mean 25g. Following a slight adjustment to the filling machine, a random sample of 50 jars is taken and the mass of marmalade in the jars is measured. The masses are summarised by
71 . 49
[2]
The manufacturer wishes to test whether there has been a change in the mean mass of the marmalade at the 5% significance level. (a)(i) Carry out the test above. [5] (ii) State giving a reason whether it is necessary to assume a normal distribution for the test to be valid. [1] (b) Assuming that the mass of the marmalade in a randomly chosen jar is normally distributed, explain whether a z-test or t-test should be used when a random sample of 10 jars is used. [2] 6. [PJC/09/Prelims/2/8]
A machine is designed to produce screws of length 8 millimeters. A random sample of nine screws were selected and the lengths, in millimeters, are found to be as follows: 7.99, 8.01, 8.00, 8.02, 8.03, 7.99, 8.00, 8.01, 8.01
(a) It is required to test whether the machine is producing screws of length more than 8 millimeters. (i) State suitable null and alternative hypotheses, and the assumption required for this test. [2] (ii) Determine the least level of significance at which the test would result in the rejection of the null hypothesis. [3] (b) It is known that the standard deviation of the length of screws produced by the machine is 0.013 millimeters. Test, at the 10% level of significance, whether the machine is faulty. [3]
7. [SAJC/09/Prelims/2/10]
The Operations Manager of HardWork Company claims that each worker works an average of 8 hours a day. However, most staff feedback indicates that this figure is an underestimation. He attempts to verify this information by surveying 75 randomly chosen workers. Denoting the time spend by a worker in the office in a day by x hrs, the sample data for the 75 workers are summarized by x = 634.2, x 2 = 5653.5 . (i) (ii) Calculate the unbiased estimate of the mean and variance of the time spent by each worker in the office in a day. [2]
Test at the 5% significance level whether the Operation Managers claim of 8 hours is an under estimation of the mean number of hours worked for each worker. [4] (iii) Let 0 denote the companys claim of the mean number of hours spend by a worker in a day. The companys objective is to conduct a test at the 5% significance level such that the null hypothesis is not rejected. Using the same sample data given above, calculate the range of values of 0 that will allow the company to achieve its objective. [3]
x y (i)
4 0.8
5 2
6 0.65
9 0.6
11 0
14 0.65
15 0.55
19 0.35
21 0.40
22 0.38
Calc ulate the correlation coefficient for the data. [2] (ii) Give a sketch of the scatter diagram for the data, as shown on your calculator, and comment on your answer in (i). [3] (iii) Identi fy the two data pairs which should be removed, and calculate the correlation coefficient, and the regression line of y on x, for the revised data. [3]
(iv)
Use the regression line of y on x in (iii) to demonstrate that it is unwise to extrapolate beyond the range of the data. [2] (v) A new data pair ( a, b ) is now obtained at an eleventh site, where 4 a 22 . Using the revised data in (iii) and the new data pair ( a, b ) , the regression equation of y on x is calculated and found to be identical to that using only the revised data. Find a possible data pair ( a, b ) . [2] 2. [HCI/09/2/10]
The table below shows the Certificates of Entitlement Quota Premiums for eight bidding exercises in 2009: Small cars premium x (in thousands of dollars) Big cars premium y (in thousands of dollars) (i) 4.89 0 5.10 1 5.11 6 5.00 1 7.09 0 7.50 1 7.58 9 7.49 0 8.48 9 7.55 2 9.88 9 9.18 0 11.69 0 11.88 9 12.899 14.840
Obtain the value of the linear (product moment) correlation coefficient r for the data. Explain whether we can conclude that the rise in the big car premiums is due to the rise in the small car premiums. [2] Plot a scatter diagram for the data and explain how its shape is related to the value of r obtained in (i). [2]
(ii)
(iii) A motorist wishes to use y = ax + b, y = abx or y = axb to establish a model for the relationship between x and y. Identify the most suitable model and justify your answer. [2] (iv) Obtain the values of a and b for the model chosen in (iii). [2]
3. SAJC/09/2/11 Ms Chan recorded the length of time, y minutes, taken to travel to college when leaving home x minutes after 6.40 a.m. on seven selected mornings. The results are as follows: x y (i) 0 16 10 27 20 28 30 39 40 39 50 48 60 51
Calculate the product moment correlation coefficient for the above set of data. Comment on your result. [2] (ii) Calculate the equation of the least squares regression line of y on x, in the form of y = a + bx. [1] (iii) Ms Chan took 30 minutes to travel to College. Find the time after 6.40 a.m., at which she left her house, using your result in (ii). Explain whether your estimate is likely to be reasonable. [3] Now, Ms Chan needs to arrive at College no later than 7.30 a.m. The number of minutes by which she arrives early at school, when leaving home x minutes after 6.40 a.m., is denoted by z.
(iv) Deduce an equation for z in terms of x. [2] (v) Hence estimate, to the nearest minute, the latest time that Ms Chan can leave home without then arriving late at College. [2]
4. PJC/09/2/9 Singapore College wishes to determine the relationship between the Mathematics performance and Physics performance of their students. The college issued a test for each of the subjects and recorded the Mathematics score (x) and the Physics score (y) of a random sample of 7 students. The following set of data was obtained: Math score, x 98 84 75 66 50 43 31 Physics score, y 96 91 86 85 73 66 54 (i) Give a sketch of the scatter diagram for the data. [2] The following models were proposed. A: y = a + b ln( x) B: y 2 = a + bx b C: y = a + x (ii) State, with reasons, which one of the above models is the most appropriate for this set of data. [3] (iii) For the most appropriate model, calculate the least squares estimates of a and b. [1] (iv) Estimate the Physics score of a student scoring 10 marks for Mathematics using the model in (ii). Comment on the validity of this estimate. [2] (v) The Head of Mathematics Department used the model in (ii) to estimate the Mathematics score of a student scoring 75 marks for Physics. Comment on the suitability of this method. [1] 5. YJC/09/2/11 (a) With the aid of suitable diagrams, describe the difference between the least squares linear regression line of y on x and that of x on y. [2] State the point at which the estimated regression line of y on x and that of x on y will both pass through. [1] The equation of the estimated least squares regression line of y on x for a set of bivariate data is y = a +bx and the corresponding equation of x on y is x = c + dy . Show that the square of the linear (product moment) correlation coefficient is bd. [1] (b) A study is conducted with a group of dieters to see if the number of grams (x) of fat consumes per day is related to cholesterol level (y). The results were as follows. x y 5 160 9 13 229 7 11 225 3 136
(i) Given that the estimated least squares regression line of y on x is 7 y = 68 x + 786 , and that the linear (product moment) correlation
coefficient is 0.985. Show that the values of and are 205 and 185 respectively. (ii) Predict the value of x when y = 210 and comment on the result. (iii) Sketch a scatter diagram to illustrate all the 6 pairs of data and the regression line y on x.
Functions
1. [ACJC/08PrelimP2/Q4] Functions f, g and h are defined by f : x a x2 6 x + 5 g : x a ln ( x 2 ) h:xa (i) (ii) 1 2 x for x , 2 x 6 , for x , x 0 , for x , x < 2 .
Give a reason why f does not have an inverse. State the largest possible domain of f in the form [ a, b ] , a, b , for which the inverse function f exists. Hence, define f -1 in similar form. (iii) Explain why the composite function hg does not exist. 2 (iv) Another function r is defined by r : x a ln ( x ) for x , k < x < k , x 0 . Given that k , state the maximum value of k, for which the composite function hr exist. Hence, state the range of hr. 2. [HCI/08PrelimP1/Q8] The functions f and g are defined by f : x e 3 x , g : x a ( x 1) 2 , x , x and a is a constant.
Sketch the graph of y = f(x), and show that f does not have an inverse. (i) The function f has an inverse if its domain is restricted to x b . State the smallest possible value of b and define, in similar form, the inverse function f 1 corresponding to this domain for f. (ii) Find the largest value of a such that the composite function f 1g exists and state the range of f 1g for this value of a. 3. 2007/NJC/Promos/3 The function g is defined as follows: g: x a 3e x ,
2
x 0.
The graph for the function f has asymptote at x = 1 and stationary point at (3, 6) as shown below. y y = f(x)
(3,6) x
0 (i)
(ii)
A one-one function f1 has the same rule as f but its domain is of the form [k , ) . Find the least integer value for k . Hence, find the set of values of x for which f1f11 ( x ) = f11f1 (x) . Show that the composite function f1 g exists and find the range of f1 g.
g : x ln( x + 2) , h : x 2 + e x ,
(i) Show that f is not a one-one function. If the domain of f is restricted to x a, find the least value of a for which f is one-one. (ii) Define, in a similar form, the inverse function f 1 corresponding to this new domain of f. (iii) The composite function gh is well defined and the range of gh is given as (ln 4 , ln 6] . Find the exact value of b. 5. AJC/2006/PrelimP1/5 The functions f and g are defined as follows: f : x a ln ( x + k + 1) , g : x a x + 2 x 1,
2
x k , k > 0 , xR.
(i) (ii)
Find the values of k such that the composite function fg exists. Find the value of k so that the range of fg is given by [ ln 3, ) .
6. NYJC/2008/PrelimsP2/ The functions f and g are defined by f : x ln(x + 3), x > 3 1 2 g:x x +x, 2<x<2 4 (i) Show by means of a graphical argument that g is one-one, and find g 1 in similar form. (ii) Show that fg exists, and state its rule and domain. (iii) By finding the range of fg, or otherwise, determine if f 2 g exists.
Answers: Sampling 1. 20.6 , 1.67 , 0.726 2. (i) , s2x = 2.304, , s2y = 2.95
3. (iii) 4. (ii) 0.486 (i) 0.131 (ii) 0.967 0.00931 (i) 0.0598 (ii) 4.98 5. 170, 58.6 (ai) 0.394 (ii) 0.968 (b) 166 Hypothesis Testing
4. p-value = 0.0771 5. p-value = 0.0188 6. (aii) 8.46% (b) p-value = 0.122 7. (i) 8.46, 3.93
(iii) 20.4
(iv) z = 31.5 1.564x (v) 7am 4. (ii) Model A (iii) -71.4, 36.7 (iv) 13.1 5. (bii) 10.1 Functions
-1 1. f ( x ) = 3 + 4 x , 0 x 4 , Rhr = ( 0, )
R f 1g = [ 3, )