Set 1 2023 Review Outline Solutions
Set 1 2023 Review Outline Solutions
NAME………………………………………………..
STUDENT NUMBER…………..……………………
TIME ALLOWED: Two (2) hours plus fifteen minutes reading time
REMEMBER TO WRITE YOUR NAME AND STUDENT NUMBER AT THE TOP OF THIS PAGE
(1) The method involving the collection, presentation, and characterization of a set of
data in order to properly describe the various features of that set of data is called
(a) scientific method
(b) statistical inference
(c) descriptive statistics
(d) none of the above
(3) The portion of the universe that has been selected for analysis is called
(a) a sample
(b) a statistic
(c) a population
(d) a parameter.
2
Part B
7 5 8 2 8 10 6
(a) Calculate the mean number of electricity provider changes per staff member
employed per day.
X
X
46
6.57(2 dec. places )
n 7
1 2 ( X ) 2
S X
n 1 n
1 (46) 2
S 342 6.619 2.57 (2 dec. places)
7 1 7
(d) What does the Z-score measure? Using the information in (a) and (c), compute
the Z-score of 9.
X X 9 6.57
Z = 0.95(2 dec. places)
S 2.57
3
Part C
Identifying managers who are both talented and motivated is difficult. A human resources
manager assessed 110 (112) staff and constructed a table with nine combinations of talent and
motivation levels.
Talent
Motivation Level High Medium Low Total
High 6 18 6 30
Medium 21 35 6 62
Low 12 6 2 20
Total 39 59 14 112
(b) A managerial prospect with “high motivation” and “high talent” is an example
of a joint event. True or False (please circle)
(c) Suppose the human resource manager has decided to hire a new manager, what
is the probability that the new manager
i. is in the high motivation category?
(d) Are high motivation and high talent independent? Justify your answer.
4
Question 2 (5 + 10 +10 = 25 Marks)
(2). All the events in the sample space that are not part of the specified event are called
(a) the sample space.
(b) the complement of the event.
(c) joint events.
(d) simple events.
(5). If two equally likely events A and B are mutually exclusive and collectively
exhaustive, what is the probability that event A occurs?
(a) 0
(b) 0.50
(c) 1.00
(d) Cannot be determined from the information given.
5
Part B
Adrian sells cars for Honest Joe’s car dealership. Adrian has never sold more than three cars
in a given week. Given X is the number of cars sold by Adrian in a week, the probability
distribution of X is summarized in the following table:
X 0 1 2 3
P(X) 0.25 0.45 0.25 0.05
(c) What is the probability of Adrian selling no more than one car?
0.5
0.45
0.4
0.35
0.3
P(X)
0.25
0.2
0.15
0.1
0.05
0
0 1 2 3
X
(e) Adrian gets paid a bonus of $500 if he sells one car per week. For each
additional car sold per week Adrian receives $300. Calculate his mean bonus.
(a) What is the expected number of applicants who fail? Interpret this value.
{If we repeat the experiment of sampling 20 applicants from the humanitarian scheme, the
average number of applicants who will fail the test will tend towards the true population mean
of 3.2}
(b) What is the probability that exactly 3 applicants will fail the test?
P(X=3) = 0.241
(c) What is the probability that fewer than 2 applicants will fail the test?
P(X<2) = 0.1471
(d) What is the probability that exactly 16 applicants will pass the test?
7
Binomial Probabilities
Data
Sample size 20
Probability of success 0.16
8
Question 3 (10 + 10 + 5 = 25 marks)
Part A
Time spent using email per session is normally distributed with a mean of 20 minutes and a
standard deviation of 3 minutes.
(a) What is the probability that time spent using email per session is greater than 21
minutes?
(b) What is the probability that time spent using email per session is between 18.3
minutes and 22.6 minutes?
P(18.3 < X < 22.6) = P(-0.57 < Z < 0.87) = P(Z<0.87) – P(Z<-0.57)
= 0.8078 – 0.2843 = 0.5235
(c) What is the probability that a random sample of 64 email users will spend an
average time greater than 21 minutes?
As X ~ N, then the sampling distribution is also normal
P( X >21) = P(Z> 2.67) = 1-P(Z<2.67) = 1 – 0.9962 = 0.0038
(d) Explain the difference between the results in part (a) and part (c).
9
Part B
The owner of a Greek restaurant wants to know the mean amount spent per customer in the
restaurant. A sample of 60 customers is taken. The sample mean is $38.54 and the standard
deviation is $7.26.
(a) Construct a 95% confidence interval for the mean amount spent per customer.
Give the confidence interval limits correct to two decimal places.
7.262
is unknown but the sample size is large so by CLT we have X ~ N , .
60
Our 95% confidence interval for is given by:
X Z 2 S n
Given the sample data, we have:
38.54 1.96 7.26 / 60
38.54 1.84
($36.70, $40.38)
(b) Given the customer base remains the same the owner’s accountant has stated that
the mean amount spent per customer needs to be $45.00 in order to ensure that the
business remains profitable. Should the owner be concerned about the profitability
of the restaurant?
Yes the owner should be worried, because we are confident that the mean
amount spent per customer is between $36.70 and $40.38. Thus the mean
amount spent per customer is below $45.00.
(c) What sample size would need to be taken to obtain a result which is within $1.00
of the true mean with a probability of 0.95?
2
Z 2
2
1.96*7.26
n n 202.48 = 203
e 1
10
Part C
(ii) The distribution of the sample mean has the same central location as the
True/False
distribution of the parental population.
(iv) Other things being equal, the confidence interval for the mean will be
True/False
wider for 95% confidence than for 90% confidence.
11
Question 4 (12 + 13 = 25 marks)
Part A
A company is trying out a new marketing plan. Prior to the new campaign, store sales per
week were $5000. The new method is trialled in 15 randomly selected stores. The sample
provided sales figures with a sample mean of $5500 and a standard deviation of $800. Can the
company conclude that the new marketing plan works? Use = 0.01 and the following
template to answer the question.
HO: ≤ 5000
H1: > 5000
Level of Significance: 0.01
X
~ t n1
S
n
At 1%, t0.01,14 = 2.62. Since 2.42 < 2.62, we do not reject H0.
Conclusion:
We do not have sufficient evidence to conclude that the new marketing plan
works.
Assume X ~ N
Indicate the test statistic and p-value on the small sketch below.
12
Part B
A sample of ten developed countries is selected. Information about the percentage labour
force participation in the agricultural sector and the per capita income (in thousands of US
dollars) is obtained. An analysis using EXCEL yielded the information below.
Regression Analysis
Regression Statistics
Multiple R 0.585728146
R Square 0.34307746
Adjusted R Square 0.260962143
Standard Error 1.728871758
Observations 10
ANOVA
df SS MS F Significance F
Regression 1 12.48801956 12.48801956 4.17799591 0.075203745
Residual 8 23.91198044 2.988997555
Total 9 36.4
13
(d) Test whether the regression coefficient β1 is significant at the 10% level of
significance.
HO: ß1 = 0
H1: ß1 0
0.55 0
t 2.04
0.27
t0.05,8 = -1.8595
Since -2.04 < -1.8595, we reject H0. β1 is significant.
p-value = 0.075.
0.075 < 0.10, we reject Ho.
(e) If the percentage of the labour force in agriculture was 10%, what would you
predict the level of per capita income for that country to be?
The prediction is not good because this regression only explains 34.31% of variation in
per capital income.
14
FORMULAE
1 2 (X) 2
X
X
n
S2 X
n 1 n
or S2
1
n 1
X X 2
XX
Z
S
P(A') = 1 P(A)
x P x 2 x 2 P(x) 2
= np
2 np1 p
n!
P X p X 1 p
nX
X! (n X)!
X
Z
X X ps p
~ N(0 , 12 ) ~ t n1 ~ N(0 , 12 )
S p(1 p) /n
n n
S p s (1 p s )
X Z / 2 X t( n1) / 2 ps Z / 2
n n n
b1 1 b0 0
~ t n 2, ~ tn2
Sb1 Sb0
15