Homework Solutions2
Homework Solutions2
Page | 1
UNIT 5 – BASIC PROBABILITY .................................................................................................. 47
Page | 2
Lesson 38 – Confidence Intervals for Means, σ Unknown ............................................................................. 94
Page | 3
Unit 1 – Introductory Topics in
Statistics
Lesson 2 – Some Introductory Statistical Definitions
A. True and False
1. T 6. T 11. T 16. F
2. T 7. T 12. F 17. F
3. T 8. F 13. F 18. F
4. F 9. T 14. F 19. T
5. T 10. T 15. F 20. T
B. Statistical Definitions
1. There were 850 potential graduates at SIAST Palliser Campus in 1999.
2. The average weekly earnings in Saskatchewan reached $920 in March 2012.
3. A statistical study was undertaken to determine the effects of increases in the costs of
labour on consumer prices in the housing market.
4. Descriptive
5. Descriptive
6. Inferential
7. a) All single parent families in urban Saskatchewan.
b) Not realistically. Families moving in and out of a region would make identification
difficult.
c) By drawing an inference from sample data.
8. The number of wheels is a constant while weight fluctuates from car to car.
9. a) The variable is the type of computer operating system.
b) The data is qualitative.
c) The level of measurement is nominal.
10. The symbol X is used to represent any value in the collection and the symbol Xi is used the
represent a particular observation.
Page | 4
Table 1 – 4: GEC Pension Fund1
Asset Allocation2
Fund Type
Assets ($million) Per Cent of Total Assets
Money Market 16.3 12.9
Bond 20.3 16.1
Fixed Income 25.6 20.3
Equity 45.6 36.2
Balanced 18.1 14.4
Total 125.9 100
1
For Canadian Employees
2
As of December 31, 2002
Source: GEC 2002 Annual Report
Page | 5
Unit 2 – Organization and Presentation
of Data
Lesson 5 – Organizing Quantitative Data for Analysis
1. a) A raw data format would be used because of the small number of values.
b) An ungrouped frequency table would be used because there are many repetitions of a
few distinct (discrete) observations.
c) A grouped frequency table would be used because there are many distinct observations.
2. An ungrouped frequency table would be used because there are many repetitions of a few
distinct observations.
3. a) A grouped frequency table because there are many observations are very little
repetition.
b) X = the cost of heating a three bedroom bungalow in Moose Jaw.
c) n = 78, the number of observations
d) R = X78 – X1 = 123.14 – 39.08 = $84.06
log 78
e) N = 1 + = 7.29 classes (~ 7 or 8 classes)
log 2
R 84.06 $
f) C=N= 7.29
= 11.53 class
g) The closest even value to 11.53 is $12.00/class, although $10.00/class would be
reasonable too. We will use $12.00/class so that we have 7 or 8 classes.
h) The lower limit of the class should start at an even number for even-sized class intervals.
Any of 28, 30, 32, 34, 36 or 38 would be reasonable as long as the smallest observation
falls within the first class. We will start at $38.00 in this solution.
i) See the table that follows. The tally has been left out.
Page | 6
Cost of Heating a Three Bedroom Bungalow in Moose Jaw in January1
Cost ($) Frequency
38.00 – 50.00 7
50.00 – 62.00 6
62.00 – 74.00 10
74.00 – 86.00 31
86.00 – 98.00 12
98.00 – 110.00 7
110.00 – 122.00 4
122.00 – 134.00 1
Σf = 78
1
2009
Page | 7
Monthly Cell Phone Payments by a Group of Salepersons
Payment ($) Frequency
60 – 75 7
75 – 90 13
90 – 105 22
105 – 120 35
120 – 135 38
135 – 150 23
150 – 165 5
165 – 180 7
Σf = 150
Page | 8
Lesson 6 – Frequency Distribution Tables Extended
1. a) ΣX = 149.60
b)
X (g) f fX
451 2 902
467 7 3269
472 9 4248
475 2 950
Σf = 20 ΣfX = 9369
c)
Wage ($/hour) f X fX
10.00 – 20.00 5 15.00 75.00
20.00 – 30.00 12 25.00 300.00
30.00 – 40.00 15 35.00 525.00
40.00 – 50.00 8 45.00 360.00
Σf = 40 ΣfX = 1260
2.
a) 0.36 or 36%
b) 45 people
c) 33 people
3.
a) 0.034 or 3.4%
Page | 9
b) 42
c) 143
4. Proportion can be used to compare when the sample sizes are different.
Within the range, national rates tend to be slightly higher than local wages since 56.448% of
the national wages fall in the range of $10 to $14 while 61% of the local wages fall in the
range of $8 to $12.
f) <Cf4 = 14,839. 14,839 births occurred where the age of the father was 34 years or
younger.
g) No. The first and last class are open ended. Even if they were not, the total age would
only be an estimate because the individual ages are lost in the grouping.
Page | 10
7.
8. ΣfX = $549,700
Page | 11
Lesson 7 – Statistical Graphs: Histograms and Frequency Polygons
1. a)
25
20
f 15
10
Weight (g)
2. a)
35
30
25
20
15
10
5
0
Score (%)
Page | 12
3. a)
0.25
Relative Frequency
0.2
0.15
0.1
0.05
Time (minutes)
4. a)
0.25
Relative Frequency
0.2
0.15
0.1
0.05
Noise (dB)
Page | 13
5. a)
20
15
f
10
Amount ($)
Page | 14
Lesson 8 – Statistical Graphs: Cumulative Frequency Polygons
1.
40 60
Cf 50 %Cf
30 40
20 30
20
10
10
0 0
218.0 222.0 226.0 230.0 234.0 238.0 242.0
Weight (g)
a) ~228.5 g
b) ~225.0 g
c) ~235.0 g
d) ~57%
2.
Page | 15
Aptitude Test Scores
140 100
130 90
120
110 80
100 70
90
80 60
Cf 70 50 %Cf
60 40
50
40 30
30 20
20
10 10
0 0
30 40 50 60 70 80 90 100
Score (%)
62
a) 78 failed, 140 = 0.442 857 142 ≈ 44.3% 𝑝𝑎𝑠𝑠𝑒𝑑
b) ~72%
c) 36
d) 50% to 69%, therefore range = 19%.
Page | 16
Lesson 9 – Measures of Position
1. a) Position(Q1 ) = 0.25(60 + 1) = 15.25
Q1 = X15.25 = X15 + 0.25(X16 − X15 ) = 6.50 + 0.25(6.50 − 6.50) = $6.50/hr
b) Position(P50 ) = 0.5(152) = 76
76 − 16
median = 40 + 20 = 56.438 356 16 ≈ 56.4%
73
a) Position(P50 ) = 0.5(9 + 1) = 5
Median = X 5 = 0.65 ≈ $0.65
Unit 1 – 2 Review
1. a) The weight of a four-liter can of paint.
b) All the paint cans in the current shipment.
c) 50 paints randomly selected from the current shipment.
Page | 17
d) The retailer wants to determine if the paint cans in the current shipment are under
filled.
2. a) Sample.
b) Population.
c) Population.
d) Sample.
3. a) Quantitative continuous.
b) Qualitative multinomial.
c) Qualitative binomial.
d) Quantitative discrete.
4. a) Ratio.
b) Nominal.
c) Ordinal.
d) Ratio.
e) Interval.
5. R = 𝑋50 − 𝑋1 = 28 − 3 = 25 hours
log 50
N=1+ = 6.64 classes
log 2
R 25 hours
C= = = 3.77
N 6.64 class
Some options include using a class width of 4 hours/class or 5 hours/class. If using 4, the
first class should start at 0 or 2. If using 5, the first class should start at 0.
6. a) The population is all of the readings of the concentration of sulfur dioxide in the city.
b) The data represents a sample.
Page | 18
c) The variable is the concentration of sulfur dioxide in the city.
d) The variable is quantitative.
e) The variable is continuous.
f) The data presented is raw data.
1
g) Median position = 2 (n + 1) = 0.5(18 + 1) = 9.5
1 1
Median value = x9.5 = (x9 + x10 ) = (0.19 + 0.20) = 0.195 ≈ 0.195 ppm
2 2
1
h) Q1 position = 4 (n + 1) = 0.25(18 + 1) = 4.75
Q1 = X 4.75 = X 4 + 0.75(X5 − 𝑋4 ) = 0.17 + 0.75(0.17 − 0.17) = 0.17 ≈ 0.170 ppm
3
Q 3 position = (n + 1) = 0.75(18 + 1) = 14.25
4
Q 3 = 𝑋14.25 = X14 + 0.25(X15 − X14 ) = 0.23 + 0.25(0.24 − 0.23) = 0.2325
≈ 0.233 ppm
IQR = Q 3 − Q1 = 0.2325 − 0.17 = 0.0625 ≈ 0.063 ppm
7.
Number of Instructions f P <Cf
18 4 0.0500 4
19 12 0.1500 16
20 15 0.1875 31
21 20 0.2500 51
22 13 0.1625 64
23 8 0.1000 72
24 6 0.0750 78
25 2 0.0250 80
Σf = 80 ΣP = 1.0000
a) The population is all of the observations of the number of instructions the computer
system can perform in 1 nanosecond.
b) The data represents a sample.
c) The variable is the number of instructions a computer system can perform in 1
nanosecond.
d) The variable is quantitative.
e) The variable is discrete.
f) The data presented as an ungrouped frequency distribution.
g) See the table for the answer.
1
h) Median position = 2 (∑ f + 1) = 0.5(80 + 1) = 40.5
Page | 19
1 1
Median value = X 40.5 = (X 40 + X 41 ) = (21 + 21) = 21 ≈ 21.0 instructions
2 2
i) P80 position = 0.8(∑ f + 1) = 0.8(80 + 1) = 64.8
P80 = X 64.8 = X 64 + 0.8(X65 − X 64 ) = 22 + 0.8(23 − 22) = 22.8 ≈ 22.8 instructions
8.
Stopping Distance (m) f <Cf
45 – 50 2 2
50 – 55 15 17
55 – 60 32 49
60 – 65 36 85
65 – 70 24 109
70 – 75 10 119
75 – 80 1 120
Σf = 120
a) The population is all of the observations of the braking distance required to stop a car
from 110 km/h.
b) The data represents a sample.
c) The variable is the braking distance required to stop a car from 110 km/h.
d) The variable is quantitative.
e) The variable is continuous.
f) The data is presented as a grouped frequency distribution.
g) See the graph for the answer.
Mode = 62.5 m; Range = 80 – 45 = 35 m; Midrange = 62.5 m
The histogram/frequency polygon is approximately symmetrical.
Page | 20
Distance Required to Stop a Car from 110 km/h
40
35
30
25
Frequency 20
15
10
5
0
90
100
80
70
80
60
Cf 60 50 %Cf
40
40
30
20
20
10
0 0
45 50 55 60 65 70 75 80
Stopping Distance (m)
1
i) Median position = 2 ∑ f = 0.5(120) = 60
60 − 49
Median value = 60 + 5 = 61.527̅ ≈ 61.5 m
36
j) 1st decile position = 0.1(120) = 12
12 − 2
1st decile value = 50 + 5 = 53. 3̅ ≈ 53.3 m
15
Page | 21
9th decile position = 0.9(120) = 108
108 − 85
9th decile value = 65 + 5 = 69.7916̅ ≈ 69.8 m
24
IDR = D9 − D1 = 69.7916̅ − 53. 3̅ = 16.4583̅ = 16.5 m
Page | 22
Unit 3 – Measures of Central Tendency
Lesson 10 – Measures of Central Tendency
1. Mode
2. The median would best represent the salaries of the employees because the data set is
skewed by the one salary that is much larger than the other salaries.
∑ X 3678
̅=
X = = 613 ≈ 613.0 g
n 6
Mode is 456 g
2.
∑ X 205 $1000
𝜇= = = 41 ≈ 41.0 ( )
N 5 annum
Position(median) = 0.5(5 + 1) = 3
$1000
Median = X 3 = 25 ≈ 25.0 ( )
annum
There is no mode.
Page | 23
3.
Expense ($) Frequency fX <Cf
123 5 615 5
198 11 2178 16
200 7 1400 23
220 2 440 25
Σf = 25 ΣfX = 4633
∑ fX 4633
̅=
X = = 185.32 ≈ $185.32
∑f 25
Position(median) = 0.5(25 + 1) = 13
Median = X13 = 198 ≈ $198.00
Mode is $198.00
4.
∑ fX 37
μ= = = 1.23̇ ≈ 1.2 mistakes
∑f 30
Mode is 0 mistakes.
Page | 24
Lesson 12 – Calculation of Averages for Grouped Data
1.
Age (years) Frequency X <cf fX
30 – 35 8 32.5 8 260.0
35 – 40 21 37.5 29 787.5
40 – 45 24 42.5 53 1020.0
45 – 50 32 47.5 85 1520.0
50 – 55 40 52.5 125 2100.0
55 – 60 23 57.5 148 1322.5
60 – 65 8 62.5 156 500.0
Σf = 156 ΣfX = 7510
∑ fX 7510
𝜇= = = 48.141 025 64 ≈ 48.1 years
∑f 156
1
Position(median) = ∑ 𝑓 = 0.5(156) = 78
2
1
∑ 𝑓 − 𝑐𝑓𝑖−1 78 − 53
Median = L + [2 ] C = 45 + [ ] 5 = 48.906 25 ≈ 48.9 years
𝑓𝑖 32
2.
Warranty Repair Jobs per Month Number of Months X <cf fX
95 – 105 4 100 4 400
105 – 115 14 110 18 1540
115 – 125 18 120 36 2160
125 – 135 15 130 51 1950
135 – 145 10 140 61 1400
145 – 155 7 150 68 1050
155 – 165 2 160 70 320
Σf = 70 ΣfX = 8820
∑ fX 8820
̅
X= = = 126 ≈ 126.0 warranty repair jobs per month
∑f 70
1
Position(median) = ∑ 𝑓 = 0.5(70) = 35
2
1
∑ 𝑓 − 𝑐𝑓𝑖−1 35 − 18
Median = L + [2 ] C = 115 + [ ] 10 = 124. 4̇
𝑓𝑖 18
∑ fX 6900
̅
X= = = 69 ≈ 69.0 acceptable pieces produced
∑f 100
1
Position(median) = ∑ 𝑓 = 0.5(100) = 50
2
1
∑ 𝑓 − 𝑐𝑓𝑖−1 50 − 35
Median = L + [2 ] C = 60 + [ ] 15 = 68.653 846 15
𝑓𝑖 26
≈ 68.7 acceptable pieces produced
4.
Account Balance ($) Frequency X <cf
0 – 100 10 50 10
100 – 200 15 150 25
200 – 300 40 250 65
300 – 400 22 350 87
400 – 500 10 450 97
500 and above 3 100
Σf = 100
a) Mode = $250.00
1
Position(median) = ∑ 𝑓 = 0.5(100) = 50
2
1
∑ 𝑓 − 𝑐𝑓𝑖−1 50 − 25
Median = L + [2 ] C = 200 + [ ] 100 = 262.5 ≈ $262.50
𝑓𝑖 40
b) The last class is open-ended, therefore there is no midpoint available to calculate the
mean.
Page | 26
Lesson 13 – Other Measures of Central Tendency: The Weighted Mean
1.
∑ wX 4680
̅w =
X = = 80.689 655 17 ≈ 80.7%
∑w 58
̅ = 65(280) = $18,200
2. ∑ X = nX
3.
∑ wX 5,579,300
̅w =
X = = 50,720.909 ≈ $50,720
∑w 110
4.
∑ wX 49 764.26
̅
Xw = = = 9.162 663 868 ≈ 9.16%
∑w 5431.2
Page | 27
Lesson 14 – Other Measures of Central Tendency: The Geometric
Mean
7
1. G. M. = [ √(1 + 0.15)(1 + 0.02)(1 + 0.13)(1 + 0.10)(1 + 0.06)(1 + 0.07)(1 + 0.01) −
1] 100 = 7.603 336 228 ≈ 7.60%/annum
4
2. G. M. = [ √(1 + 0.20)(1 − 0.08)(1 + 0.12)(1 − 0.05) − 1] 100 = 4.106 459 091 ≈
4.11%/annum
6 18,000
3. G. M. = [√13,500 − 1] 100 = 4.911 506 342 ≈ 4.91%/annum
8 12,500
4. G. M. = [√ − 1] 100 = 10.074 127 68 ≈ 10.07%/annum
5800
4 60,000,000
6. G. M. = [√45,000,000 − 1] 100 = 7.456 993 182 ≈ 7.46%/quarter
Page | 28
Unit 4 – Measures of Dispersion
Lesson 15 – Measuring Dispersion in a Distribution
1. There is more variation in the ages of workers in the iron processing mills than the
automobile industry.
2.
X ($1000) X–µ |X – µ| (X – µ)2
15 -10 10 100
21 -4 4 16
28 3 3 9
36 11 11 121
ΣX = 100 Σ(X – µ) = 0 Σ|X – µ| = 28 Σ(X – µ)2 = 246
∑ X 100
μ= = = 25 ≈ 25.0 ($1000)
N 4
a) R = 36 − 15 = 21 ≈ 21.0 ($1000)
28
b) M. A. D. = = 7 ≈ 7.0 ($1000)
4
246
c) σ2 = 4
= 61.5 ≈ 61.5 ($1000)2
d) σ = √61.5 = 7.842 193 571 ≈ 7.8 ($1000)
Page | 29
Lesson 16 – The Computing Formula for the Standard Deviation
1.
X (g) X–µ (X – µ)2 X2
34.5 −11.483̅ 131.866 944 4 1190.25
67.8 21.816 ̅ 475.966 944 4 4596.84
23.9 −22.083̅ 487.673 611 1 571.21
45.4 −0.583̅ 0.340 277 777 2061.16
54.6 8.616̅ 74.246 944 44 2981.16
49.7 3.716̅ 13.813 611 11 2470.09
ΣX = 275.9 Σ(X – µ) = 0 Σ(X – µ)2 = 1183.908 333 ΣX2 = 13,870.71
∑ X 275.9
μ= = = 45.983̅ ≈ 45.98 g
N 6
∑ X2 ∑X 2 13,870.71 275.9 2
b) σ = √ −[ ] =√ −[ ] = 14.046 994 54 ≈ 14.05 g
N N 6 6
c) The computing formula will be most convenient.
∑ X = 275.9 + 42.0 = 317.9
2.
X (%) f fX fX2
76 5 380 28,880
82 12 984 80,688
91 3 273 24,843
Σf = 20 ΣfX = 1637 ΣfX2 = 134,411
Page | 30
3.
Age (years) Frequency X fX fX2
30 – 35 8 32.5 260.0 8450.00
35 – 40 21 37.5 1087.5 29 531.25
40 – 45 24 42.5 1020.0 43 350.00
45 – 50 32 47.5 1520.0 72 200.00
50 – 55 40 52.5 2100.0 110 250.00
55 – 60 23 57.5 1322.5 76 043.75
60 – 65 8 62.5 500.0 31 250.00
Σf = 156 ΣfX = 7510 ∑fX2 = 371 075
2
∑ fX 2 ∑ fX 371 075 7510 2
σ= √ −[ ] = √ −[ ] = 7.818 410 815 ≈ 7.8 years
∑f ∑f 156 156
σ2 = 7.818 410 8152 = 61.127 547 67 ≈ 61.1 (years)2
Page | 31
Lesson 17 – The Sample Standard Deviation
1.
a) Day 2
∑ 𝑋 170
𝑋̅ = = = 34 ≈ 34.0 𝑚𝑖𝑛𝑢𝑡𝑒𝑠
𝑛 5
∑(𝑋 − 𝑋̅)2 194
𝑠2 = = = 48.5 ≈ 48.5 (𝑚𝑖𝑛𝑢𝑡𝑒𝑠)2
𝑛−1 4
b) Day 3
∑ 𝑋 210
𝑋̅ = = = 35 ≈ 35.0 𝑚𝑖𝑛𝑢𝑡𝑒𝑠
𝑛 6
(∑ 𝑋)2 2102
∑ 𝑋2 − 7446 −
𝑠2 = 𝑛 = 6 = 19.2 ≈ 19.2 (𝑚𝑖𝑛𝑢𝑡𝑒𝑠)2
𝑛−1 5
Page | 32
c)
Day 1 Day 4 Day 5
Mean (min.) 33.0 37.0 38.0
Standard Deviation (min.) 12.8 1.6 0.8
Variance (min.)2 164.7 2.7 0.7
d) Day 1 has the minimum completion time with a mean of 33.0 minutes.
e) Day 5 has the least variability in completion times with a standard deviation of 0.8
minutes (or variance of 0.7 minutes2).
2.
X (%) f fX fX2
42 3 126 5,292
58 6 348 20,184
75 12 900 67,500
88 9 792 69,696
2
Σf = 30 ΣfX = 2166 ΣfX = 162,672
(∑ fX)2 21662
∑ fX2 − 162,672−
a) s = √ ∑ f−1
∑f
=√ 29
30
= 14.723 661 46 ≈ 14.7%
b) Same as a).
3. No. As the sample size increases, the difference between n and n – 1 becomes
proportionally less significant. For example, when n = 300, we will divide by 300 for the
population and 299 for the sample – a very small difference. If n = 5, we will divide by 5 for
the population versus 4 for the sample – a remarkable difference.
Page | 33
5.
Warranty Repair Jobs per Month Number of X fX fX2
Months
95 – 105 4 100 400 40 000
105 – 115 14 110 1540 169 400
115 – 125 18 120 2160 259 200
125 – 135 15 130 1950 253 500
135 – 145 10 140 1400 196 000
145 – 155 7 150 1050 157 500
155 – 165 2 160 320 51 200
2
Σf = 70 ΣfX = 8820 ∑fX = 1 126 800
(∑ fX)2 88202
∑ fX 2 − 1 126 800 −
√ ∑f √ 70 = 14.978 245 09
s= =
∑f − 1 69
≈ 15.0 warranty jobs per month
𝑠 = 14.978 245 092 = 224.347 826 1 ≈ 224.3 (warranty jobs per month)2
2
6.
Number of Acceptable Pieces Number of X fX fX2
Produced Workers
15 – 30 3 22.5 67.5 1 518.75
30 – 45 14 37.5 525.0 19 687.50
45 – 60 18 52.5 945.0 49 612.50
60 – 75 26 67.5 1755 118 462.50
75 – 90 20 82.5 1650.0 136 125.00
90 – 105 12 97.5 1170.0 114 075.00
105 – 120 7 112.5 787.5 88 593.75
Σf = 100 ΣfX = 6900 ∑fX2 = 528 075
(∑ fX)2 69002
∑ fX 2 −
√ ∑f √528 075 − 100
s= = = 22.912 878 47
∑f − 1 99
≈ 22.9 acceptable pieces produced
𝑠 = 22.912 878 472 = 525 ≈ 525.0 (acceptable pieces produced)2
2
Page | 34
Lesson 18 – Application of the Mean and Standard Deviation
1. a)
Income in a Community
X−μ 85−72.5
b) Z = σ
= 13
= 0.96
X−μ 42−72.5
c) Z = σ
= 13
= −2.35
X−μ 125−72.5
d) Z = σ
= 13
= 4.04
It is possible, but it is not very common since it lays 4.04 standard deviations above the
mean.
e) Approximately 95% of all wage earners in the community earn between $46,500 and
$98,500 since the Z scores of these two wages are -2 and +2 respectively.
f) Approximately 97.5% of all wage earners in the community earn more than $46,500.
a) ̅
X = 45.435 ≈ $45.435; s = 4.289 045 848 ≈ $4.289
X 4 + X 5 44.50 + 45.55
Position(median) = 0.5(8 + 1) = 4.5; median = = = 45.025
2 2
≈ $45.025
s 4.289 045 848
b) C. V. = X̅ ∙ 100 = 45.435
∙ 100 = 9.44%
Page | 35
̅ −median)
3(X 3(45.435−45.025)
c) Sk = s
= 4.289 045 848
= 0.29
d) The coefficient of variation indicates that there is a small amount of variation in the
price of an oil change in the community. The coefficient of skewness indicates that the
shape of the distribution of the price of an oil change in the community is approximately
symmetrical.
3.
X (%) Frequency <Cf
52 5 5
64 12 17
72 25 42
83 14 46
96 7 63
Σf = 63
4.
Value ($) Frequency <Cf
0.00 – 500.00 10 10
500.00 – 1000.00 20 30
1000.00 – 1500.00 9 39
1500.00 – 2000.00 6 45
2000.00 – 2500.00 4 49
2500.00 – 3000.00 2 51
Σf = 51
Page | 36
Unit 3 – 4 Review
1. a) All people in Saskatchewan who take headache medication.
b) 500 people from Saskatchewan who take headache medication.
c) The rating of effectiveness of the new medication.
d) Qualitative data.
e) Ordinal data.
f) This is an example of inferential statistics.
∑X 5600
g) μ = N
= 8
= 700 ≈ $700.00/week
∑|X−μ| 400
h) M. A. D. = N
= 8
= 50 ≈ $50.00/week
∑(X−μ)2 35000
i) σ2 = = = 4375 ≈ 4,375.00 ($)2
N 8
σ= 2
√σ = √4375 = 66.14378278 ≈ $66.14/week
3. a) The population is all of the readings of the concentration of sulfur dioxide in the city.
b) The data represents a sample.
c) The variable is the concentration of sulfur dioxide in the city.
d) The variable is quantitative.
e) The variable is continuous.
f) The data presented is raw data.
Page | 37
Concentration of Sulfur Dioxide (ppm) X2
0.14 0.019 6
0.15 0.022 5
0.15 0.022 5
0.17 0.028 9
0.17 0.028 9
0.18 0.032 4
0.18 0.032 4
0.18 0.032 4
0.19 0.036 1
0.20 0.040 0
0.22 0.048 4
0.22 0.048 4
0.23 0.052 9
0.23 0.052 9
0.24 0.057 6
0.26 0.067 6
0.27 0.072 9
0.32 0.102 4
ΣX = 3.70 ΣX2 = 0.798 8
∑X 3.70
g) x̅ = n
= 18
= 0.205̅ ≈ 0.206 ppm
(∑ X)2 3.702
∑ X2 − 0.7988−
2
h) s = n−1
n
= 17
18
= 0.002249673 ≈ 0.002 (ppm)2
Page | 38
4.
Number of Instructions f fX fX2 <Cf
18 4 72 1296 4
19 12 228 4332 16
20 15 300 6000 31
21 20 420 8820 51
22 13 286 6292 64
23 8 184 4232 72
24 6 144 3456 78
25 2 50 1250 80
2
Σf = 80 ΣfX = 1684 ΣfX = 35,678
a) The population is the number of instructions the computer system can perform in 1
nanosecond.
b) The data represents a sample.
c) The variable is the number of instructions a computer system can perform in 1 nanosecond.
d) The variable is quantitative.
e) The variable is discrete.
f) The data presented as an ungrouped frequency distribution.
g) Mode = 21 instructions
1
h) Median position = 2 (∑ f + 1) = 0.5(80 + 1) = 40.5
1 1
Median value = X 40.5 = (X 40 + X 41 ) = (21 + 21) = 21 ≈ 21.0 instructions
2 2
∑ fX 1684
i) ̅
X= ∑ = = 21.05 ≈ 21.1 instructions
f 80
(∑ fX)2 16842
∑ fX2 − 35678−
2 ∑f
j) s = ∑ f−1
= 79
80
= 2.908860759 ≈ 2.9 (instructions)2
Page | 39
5.
Stopping Distance (m) Midpoint fX fX2 <Cf
45 – 50 47.5 95.0 4,512.50 2
50 – 55 52.5 787.5 41,343.75 17
55 – 60 57.5 1840.0 105,800.00 49
60 – 65 62.5 2250.0 140,625.00 85
65 – 70 67.5 1620.0 109,350.00 109
70 – 75 72.5 725.0 52,562.50 119
75 – 80 77.5 77.5 6,006.25 120
2
ΣfX = 7395.0 ΣfX = 460,200
a) The population is the braking distance required to stop a car from 110 km/h.
b) The data represents a sample.
c) The variable is the braking distance required to stop a car from 110 km/h.
d) The variable is quantitative.
e) The variable is continuous.
f) The data is presented as a grouped frequency distribution.
g) Mode = 62.5 m
1
h) Median position = 2 ∑ f = 0.5(120) = 60
60 − 49
Median value = 60 + 5 = 61.527̅ ≈ 61.5 m
36
∑ fX 7395
i) x̅ = ∑f
= 120
= 61.625 ≈ 61.6 m
(∑ fX)2 73952
∑ fX2 − 460200−
2 ∑f
j) s = ∑ f−1
= 119
120
= 37.67331933 ≈ 37.7 m2
Page | 40
6.
∑𝑋 4557
a) 𝜇 = 𝑁
= 6
= 759.5 ≈ 759.5 ($𝑏𝑖𝑙𝑙𝑖𝑜𝑛)
b) 𝑀𝑒𝑑𝑖𝑎𝑛 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛 = 0.5(𝑛 + 1) = 0.5(6 + 1) = 3.5
𝑋3 + 𝑋4 715 + 721
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑋3.5 = = = 718 ≈ 718.0 ($𝑏𝑖𝑙𝑙𝑖𝑜𝑛)
2 2
c) There is no mode.
(𝑋−𝜇)2 51 159.5
d) 𝜎 = √ =√ = 92.339 500 4 ≈ 92.3 ($𝑏𝑖𝑙𝑙𝑖𝑜𝑛)
𝑁 6
2
∑ 𝑋2 ∑𝑋 3 512 201 4557 2
𝜎= √ −[ ] = √ −[ ] = 92.339 500 4 ≈ 92.3 ($𝑏𝑖𝑙𝑙𝑖𝑜𝑛)
𝑁 𝑁 6 6
e) 𝜎 2 = (92.339 500 4)2 = 8 526.583 33 ≈ 8 526.6 ($𝑏𝑖𝑙𝑙𝑖𝑜𝑛)2
f) Answers do verify.
𝜎 92.339 500 4
g) 𝐶. 𝑉. = 𝜇 ∙ 100 = 759.5
∙ 100 = 12.157 932 9 ≈ 12.2%
There is a moderate spread in the data.
3(𝜇−𝑀𝑒𝑑𝑖𝑎𝑛) 3(759.5−718)
h) 𝑆𝑘 = 𝜎
= 92.339 500 4
= 1.348 285 398 ≈ 1.35
The data is positively skewed.
i) 𝑄3 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛 = 0.75(𝑛 + 1) = 0.75(6 + 1) = 5.25
𝑄3 = 𝑋5.25 = 𝑋5 + 0.25(𝑋6 − 𝑋5 ) = 873 + 0.25(901 − 873) = 880
≈ 880.0 ($𝑏𝑖𝑙𝑙𝑖𝑜𝑛)
j) 𝐷8 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛 = 0.8(𝑛 + 1) = 0.8(6 + 1) = 5.6
𝐷8 = 𝑋5.6 = 𝑋5 + 0.6(𝑋6 − 𝑋5 ) = 873 + 0.6(901 − 873) = 889.8 ≈ 889.8 ($𝑏𝑖𝑙𝑙𝑖𝑜𝑛)
Page | 41
7.
∑𝑋 311
a) 𝜇 = 𝑁
= 30
= 10.36̇ ≈ 10.4 𝑦𝑒𝑎𝑟𝑠
b) Mode = 10 years
c) 𝑀𝑒𝑑𝑖𝑎𝑛 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛 = 0.5(∑ 𝑓 + 1) = 0.5(30 + 1) = 15.5
𝑋15 + 𝑋16 10 + 10
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑋15.5 = = = 10 ≈ 10.0 𝑦𝑒𝑎𝑟𝑠
2 2
∑ 𝑓𝑋 2 ∑ 𝑓𝑋 2 3247 311 2
d) 𝜎 = √ ∑𝑓
−[∑ ] =√ −[ ] = 0.874 960 316 ≈ 0.9 𝑦𝑒𝑎𝑟𝑠
𝑓 30 30
e) 𝜎 2 = (0.874 960 316)2 = 0.765̇ ≈ 0.8 (𝑦𝑒𝑎𝑟𝑠)2
f) Answers do verify.
𝜎 0.874 960 316
g) 𝐶. 𝑉. = 𝜇 ∙ 100 = 10.36̇
∙ 100 = 8.440 131 671 ≈ 8.4%
There is a slight spread in the data.
3(𝜇−𝑀𝑒𝑑𝑖𝑎𝑛) 3(10.36̇−10)
h) 𝑆𝑘 = = = 1.257 199 874 ≈ 1.26
𝜎 0.874 960 316
The data is positively skewed.
i) 𝑃70 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛 = 0.7(∑ 𝑓 + 1) = 0.7(30 + 1) = 21.7
𝑃70 = 𝑋21.7 = 𝑋21 + 0.7(𝑋22 − 𝑋21 ) = 11 + 0.7(11 − 11) = 11 ≈ 11.0 𝑦𝑒𝑎𝑟𝑠
j) 𝑄1 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛 = 0.25(∑ 𝑓 + 1) = 0.25(30 + 1) = 7.75
𝑄1 = 𝑋7.75 = 𝑋7 + 0.75(𝑋8 − 𝑋7 ) = 10 + 0.75(10 − 10) = 10 ≈ 10.0 𝑦𝑒𝑎𝑟𝑠
Page | 42
8.
∑ 𝑓𝑋 7600
a) 𝑋̅ = ∑ = = 76 ≈ 76.0%
𝑓 100
b) Mode = 75%
c) 𝑀𝑒𝑑𝑖𝑎𝑛 𝑝𝑜𝑠𝑖𝑡𝑖𝑜𝑛 = 0.5 ∑ 𝑓 = 0.5(100) = 50
0.5 ∑ 𝑓 − 𝐶𝑓𝑖−1 50 − 28
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + [ ] 𝐶 = 70 + [ ] 10 = 76.285 714 ≈ 76.3%
𝑓𝑖 35
(∑ 𝑓𝑋)2 76002
∑ 𝑓𝑋 2 − 591 500−
d) 𝑠 = √ =√
∑𝑓 100
∑ 𝑓−1
= 11.849 221 09 ≈ 11.8%
99
9.
County Per Cent Unemployed Size of Workforce wX
Wood 4.5 15,300 68,850
Ottawa 3.0 10,400 31,200
Lucas 10.2 150,600 1,536,120
Σw = 176,300 ΣwX = 1,636,170
∑ 𝑤𝑋 1 636 170
𝑋̅𝑤 = = = 9.280601248 ≈ 9.28%
∑𝑤 176 300
Page | 43
4
10. G. M. = [ √(1 − 0.20)(1 + 0.10)(1 + 0.02)(1 + 0.40) − 1] 100 = 5.877 266 397 ≈
5.88%/quarter
11.
Amount ($) Interest Rate (Annual %) wX (interest earned)
5,000 7 350
12,000 12 1440
14,000 15 2100
Σw = 31,000 ΣwX = 3890
∑ 𝑤𝑋 3,890
𝑋̅𝑤 = = = 0.125 483 871 ≈ 12.55%/annum
∑𝑤 31,0000
12.
Expense Proportion of Student’s Budget Expected Per Cent wX
(%) Increase
Housing 30 2 0.60
Food 25 8 2.00
Tuition 10 10 1.00
Books 5 15 0.75
Transportation 3 8 0.24
Recreation 27 10 2.70
Σw = 100 ΣwX =
7.29
∑ 𝑤𝑋 7.29
𝑋̅𝑤 = = = 0.0729 ≈ 7.2%/annum
∑𝑤 100
4 1800
13. G. M. = [√1200 − 1] 100 = 10.668 191 97 ≈ 10.67%/annum
4
14. G. M. = [ √(1 + 0.05)(1 + 0.08)(1 + 0.09)(1 − 0.02) − 1] 100 = 4.909 884 054 ≈ 4.91%
Page | 44
15.
∑ wX 752
̅w =
X = = 75.2 ≈ 75.2%
∑w 10
16.
4 8 12 16 20 24 28
Value ($)
a) R = 28 − 4 = $24.00
b) $12.00 and $20.00
c) $8.00 and $24.00
d) Below $8.00
e) 16%
f) $16.00 and $16.00
7.50−16.00
g) Z = = −2.13
4.00
25.00−16.00
h) Z = 4.00
= 2.25
17. a) Median. The distribution is skewed negatively. The median is less affected by skews than
the mean.
b) -1.0 (something that indicates a negative skew).
Page | 45
18. A partially labeled curve follows. See your instructor for a fully labeled curve.
15 18 21 24 27 30 33
Income ($1000)
Page | 46
Unit 5 – Basic Probability
Lesson 19 – Introduction to Probability
1.
Outcome Person 1 Person 2
1 Favor Favor
2 Favor Against
3 Against Favor
4 Against Against
6
2. a) 𝑃(𝑀𝑎𝑛𝑎𝑔𝑒𝑚𝑒𝑛𝑡) = 34
b) Empirical probability.
3. a) Empirical probability.
b) Classical probability.
c) Classical probability.
d) Subjective.
5. a) Any four-digit code from 0000 to 9999. For example, 1234, 2345, 3456 and 4567.
1
b) 𝑃(𝑠𝑎𝑚𝑒 𝑃𝐼𝑁) = 10 000
Page | 47
Lesson 20 – Counting Outcomes
1. 3 × 2 × 2 = 12 arrangements
Attached
Model Floor Plan
Garage
Yes
A
No
1
Yes
B
No
Yes
A
No
Start 2
Yes
B
No
Yes
A
No
3
Yes
B
No
10!
6. 10C4 = 4!(10−4)! = 210 groups
7. 5! = 120 routes
15!
8. 15P10 = (15−10)! = 1.089 728 64 × 1010
900! 900×899
9. 900C898 = 898!(900−898)! = 2×1
= 450 × 899 = 404,550
Page | 48
Lesson 21 – Events and Venn Diagrams
1. a) n(Sixes) = 4
b) n(-Sixes) = 48
Spade Six
London Rome
Page | 49
Lesson 22 – Rules Governing Probability Calculations
1. E = event that a transaction is in error.
3
a) 𝑃(𝐸) = = 0.075
40
b) 𝑃(−𝐸) = 1 − 𝑃(𝐸) = 1 − 0.075 = 0.925
2. S = event that a student takes Statistics. E = event that a student takes Economics.
a) P(S) = 0.6; P(E) = 0.5; P(S and E) = 0.1
S E
Page | 50
Lesson 23 – Conditional Probability and the Multiplication Rules
1. a) Independent
b) Dependent
c) Independent
2. Ci = event that a carton drawn at a stage. These are dependent events because there is
sampling without replacement.
6 5 1
P(C1 and C2 ) = P(C1 ) ∙ P(C2 |C1 ) = ∙ = = 0.05
25 24 20
4. A tree diagram or contingency table will help to organize the information. The latter is
shown.
Shift 1 (One) Shift 2 (Two) Total
Correct Bin (C) 1800 3250 5050
Wrong Bin (W) 200 250 450
Total 2000 3500 5500
1800 18
̅̅̅̅ ≈ 0.327 3
a) P(One and C) = 5500 = 55 = 0.327
250 1
̅̅̅̅ ≈ 0.045 5
b) P(Two and W) = 5500 = 22 = 0.045
n(W and One) 200 1
c) P(W|One) = n(One)
= 2000 = 10 ≈ 0.100 0
n(Two and C) 3250 65
d) P(Two|C) = n(C)
= 5050 = 101 = 0.643 564 356 ≈ 0.643 6
Page | 51
Lesson 24 – Combining the Rules of Probability
1. Ci = event of a carton with cracked eggs at a stage. The tree diagram follows the solutions.
a) This probability occurs on branches 4, 6 and 7. The carton with cracked eggs may be
6
chosen first, second or third. Each of them occurs with the same likelihood, that is, 25 ∙
19 18
∙ ,
24 23
therefore we can multiply this probability of branch 4 by three to get the answer.
Note that the probability statement would be quite lengthy so a statement will be used.
6 19 18
P(1 carton with cracked eggs) = 3 ( ∙ ∙ ) = 0.446 086 956 ≈ 0.446 1
25 24 23
C3
-C1 and C2 and C3
C2 5
6
-C3
-C1 and C2 and -C3
-C1 18
19
C3
-C1 and -C2 and C3
-C2 6
18
-C3
-C1 and -C2 and -C3
17
25 24 23
Page | 52
2.
Test Problem
0.50 S
0.60 P
0.50 -S
0.40
F
Page | 53
Lesson 25 – Bayes’ Rule for Revising Probabilities
A tree diagram or contingency table may be used to aid the solution to all of these problems. The
contingency table is shown for the problems.
1.
Purchases Cash or Cheque (Cash) Credit Card (CC) Debit Card (DC) Total
At most $50 (-M) 30(0.8) = 24 30(0.1) = 3 40(0.4) = 16 43
More than $50 (M) 30(0.2) = 6 30(0.9) = 27 40(0.6) = 24 57
Total 30 30 40 100
P(Cash and M) 6
P(Cash|M) = = = 0.105 263 157 ≈ 0.105 3
P(M) 57
2.
Door Open (O) Door Closed (-O) Total
Not Robbed (-S) 0.25(0.95) = 0.2375 0.75(0.99) = 0.7425 0.98
Robbed (S) 0.25(0.05) = 0.0125 0.75(0.01) = 0.0075 0.02
Total 0.25 0.75 1.00
3.
Regular Homework (H) Irregular Homework (-H) Total
Pass 0.55(0.8) = 0.44 0.45(0.3) = 0.135 0.575
Fail 0.55(0.2) = 0.11 0.45(0.7) = 0.315 0.425
Total 0.55 0.45 1.000
a) P(Pass) = 0.575
P(H and Pass) 0.44
b) P(H|Pass) = P(Pass)
= 0.575 = 0.765 217 391 ≈ 0.765 2
4.
Firm X Firm Y Total
Good 0.8(0.85) = 0.68 0.2(0.95) = 0.19 0.87
Poor 0.8(0.15) = 0.12 0.2(0.05) = 0.01 0.13
Total 0.8 0.2 1.00
Page | 54
5.
TICKET SELLER
Customer Betty John Jean Joe Total
Receives Change (C) 0.3(0.98) = 0.3(0.97) = 0.2(0.95) = 0.2(0.96) = 0.967
0.294 0.291 0.19 0.192
Doesn’t Receive 0.3(0.02) = 0.3(0.03) = 0.2(0.05) = 0.2(0.04) = 0.033
Change (-C) 0.006 0.009 0.01 0.008
Total 0.3 0.3 0.2 0.2 1.00
Page | 55
Unit 5 Review
1. Subjective probability.
Page | 56
Cones Flavor Nuts
With
Chocolate
Without
With
Waffle Vanilla
Without
With
Strawberry
Without
With
Chocolate
Without
With
Regular Vanilla
Without
With
Strawberry
Without
Page | 57
20!
3. 20P7 = (20−7)! = 390 700 800 arrangements
20!
4. 20C12 = 12!(20−12)! = 125 970 ways
16!
5. 16C3 = 3!(16−3)! = 560 groups
6.
Draw 1 Draw 2 Draw 3
D3
This branch is not possible.
D2 0
4
1
5 -D3
2 defective.
D1 4
4
2
D3
6 2 defective.
-D2 1
4
4
-D3
5
1 defective.
3
4
D3
2 defective.
D2 1
4
2
6-Feb -D3
5
1 defective.
-D1 3
4
4
D3
6
1 defective.
-D2 2
4
3
-D3
5
0 defective.
2
4
Page | 58
2 4 3 4 2 3 4 3 2 3
𝑃(𝐸𝑥𝑎𝑐𝑡𝑙𝑦 1 𝑑𝑒𝑓𝑒𝑐𝑡𝑖𝑣𝑒) = ∙ ∙ + ∙ ∙ + ∙ ∙ =
6 5 4 6 5 4 6 5 4 5
20!
7. 20C6 = 6!(20−6)! = 38 760 groups
8.
LOAN ASSESSMENT
INDUSTRY Low Risk (L) Moderate to High Total
Risk (-L)
Agriculture (A) 6 22 28
Food Retailers (F) 15 17 32
Total 21 39 60
28
a) P(A) = 60 = 0.46̅ ≈ 0.466 7; This is the probability that the industry chosen will be
agricultural.
32
b) P(F) = = 0.53̅ ≈ 0.533 3; This is the probability that the industry chosen will be
60
food retailers.
6
c) P(A and L) = = 0.1 ≈ 0.100 0; This is the probability that the industry chosen will
60
be agricultural and low risk.
n(A and L) 6
d) P(A|L) = ̅̅̅̅̅̅̅̅̅̅̅
= 21 = 0. 285 714 ≈ 0.285 7; This is the probability that the
n(L)
industry chosen will be agricultural given that the industry is known to be low risk.
17
e) P(F and − L) = 60 = 0.283̅ ≈ 0.283 3; This the probability that the industry chosen is
food retailers and moderate to high risk.
n(A and L) 6
f) P(L|A) = = ̅̅̅̅̅̅̅̅̅̅̅
= 0. 214 285 ≈ 0.214 3; This is the probability that the
n(A) 28
industry chosen is low risk given that the industry is known to be agricultural.
n(A and−L) 22
g) P(A|−L) = n(−L)
= 39 = 0.564 102 564 ≈ 0.564 1; This is the probability that the
industry chosen is agricultural given that it is known the industry is moderate to high
risk.
26 4 2 28 7
9. P(Black or Six) = P(Black) + P(Six) − P(Black and Six) = 52 + 52 − 52 = 52 = 13
10. Fi = event that a face card is drawn at a stage. The events are independent because the
sampling is completed with replacement.
12 12 9
P(F1 and F2 ) = P(F1 ) ∙ P(F2 ) = ∙ = = 0.053 254 437 ≈ 0.053 3
52 52 169
Page | 59
11. Fi = event that a face card is drawn at a stage. The events are dependent because the
sampling is completed without replacement.
12 11 11
P(F1 and F2 ) = P(F1 ) ∙ P(F2 |F1 ) = ∙ = = 0.049 773 755 ≈ 0.049 8
52 51 221
12. Fi = event that a face card is drawn at a stage. We will assume that the sampling is
completed without replacement like typical card games.
12 11 10 11
P(F1 and F2 and F3 ) = P(F1 ) ∙ P(F2 |F1 ) ∙ P(F3 |(F1 and F2 )) = ∙ ∙ =
52 51 50 1105
= 0.009 954 751 ≈ 0.010 0
14. GP = event that a merchant favors greater powers of arrest be given to police. C = event
that a merchant favors a curfew for persons under 16. P(GP) = 0.8; P(C) = 0.6; and P(GP and
C) = 0.5.
GC C
a) P(GP or C) = P(GP) + P(C) − P(GP and C) = 0.8 + 0.6 − 0.5 = 0.9 ≈ 0.900 0
b) P(−GP and − C) = 1 − P(GP or C) = 1 − 0.9 = 0.1 ≈ 0.100 0
Page | 60
15. Pi = event that an applicant passes a test at a stage. Fi = event that an applicant fails a test at
a stage.
Fail 0.7
0.56 Not Hired
1
Fail 0.2
0.20 Not Hired
16.
Credit Rating
Loan Status Good (G) Not Good (-G) Total
Default (D) 0.2(0.45) = 0.09 0.2(0.55) = 0.11 0.2
Don’t Default (-D) 0.8(0.8) = 0.64 0.8(0.2) = 0.16 0.8
Total 0.73 0.27 1.00
17.
Accident
Seat Belt Fatal (F) Not Fatal (-F) Total
Worn (W) 0.6(0.08) = 0.048 0.6(0.92) = 0.552 0.6
Not Worn (-W) 0.4(0.2) = 0.080 0.4(0.8) = 0.320 0.4
Total 0.128 0.872 1.00
Page | 61
18. Assume that the flights are independent. Li = event that a flight is late at a stage.
L3
2
L2 0.2
0.2
-L3
1
-L1 0.8
0.8
L3
1
-L2 0.2
0.8
-L3
0
0.8
Page | 62
12.
SOURCE OF ERROR
System Status Disk Drive Computer Memory Operating System Total
(DD) (CM) (OS)
Failure (F) 0.6(0.5) = 0.30 0.7(0.3) = 0.21 0.4(0.2) = 0.08 0.59
No Failure (-F) 0.4(0.5) = 0.20 0.3(0.3) = 0.09 0.6(0.2) = 0.12 0.41
Total 0.5 0.3 0.2 1
Page | 63
Unit 6 – Probability Distributions for
Discrete Random Variables
Lesson 26 – An Introduction to Probability Distributions
1.
0.5
0.4
P 0.3
0.2
0.1
0
0 1 2 3 4 5
Number of Pairs of Shoes Purchased
Page | 64
2.
3.
Color X – Payoff ($) P(X) X∙P(X)
Red 1.00 0.5 0.5
Green -5.00 0.4 -2
Blue 10.00 0.1 1
ΣP(X) = 1.0 ΣX∙P(X) = -0.5
a) μ = ∑ X ∙ P(X) = −0.5 ≈ −$0.50; The players lose $0.50 per play, so the operator
makes $0.50 per play.
Page | 65
Color X – Payoff ($) P(X) X∙P(X)
Red 1.00 0.5 0.5
Green XG 0.4 0.4XG
Blue 10.00 0.1 1
ΣP(X) = 1.0 ΣX∙P(X) = 1.5 + XG
Page | 66
Lesson 27 – The Binomial Probability Distribution
1. X = the number of correct answers; p = 0.5; n = 15
a) P(X = 0) = 15C0 ∙ (0.5)0 (0.5)15 = 0.000 030 517 ≈ 0.000 0
b) P(X = 7) = 15C7 ∙ (0.5)7 (0.5)8 = 0.196 380 615 ≈ 0.196 4
c) P(X ≥ 9) = 0.153 + 0.092 + 0.042 + 0.014 + 0.003 + 0.000 + 0.000 = 0.304 ≈
0.304
d) μ = np = 15(0.5) = 7.5 ≈ 7.5 correct
e) σ = √np(1 − p) = √15(0.5)(0.5) = 1.936 491 673 ≈ 1.9 correct
3. a)
X – Number of Drivers Wearing Seat Belts P(X)
0 0.008
1 0.076
2 0.265
3 0.412
4 0.240
ΣP(X) = 1.001
4. X = the number of vehicles requiring warranty service within the first year; p = 0.12; n = 12.
a) P(X = 0) = 12C0 ∙ (0.12)0 (0.88)12 = 0.215 671 155 ≈ 0.215 7
b) P(X = 1) = 12C1 ∙ (0.12)1 (0.88)11 = 0.352 916 436 ≈ 0.352 9
c) P(X = 2) = 12C2 ∙ (0.12)2 (0.88)10 = 0.264 687 327 ≈ 0.264 7
d) P(X < 3) = P(X = 0) + P(X = 1) + P(X = 2) = 0.833 274 92 ≈ 0.833 3
e) μ = np = 12(0.12) = 1.44 ≈ 1.4 vehicles
σ = √np(1 − p) = √12(0.12)(0.88) = 1.125 699 782 ≈ 1.1 vehicles
Page | 67
6. X = the number of customer problems solved; p = 0.7; n = 15.
a) μ = np = 15(0.7) = 10.5 ≈ 10.5 problems solved
σ = √np(1 − p) = √15(0.7)(0.3) = 1.774 823 935 ≈ 1.8 problems solved
b) P(X = 10) = 0.206
c) P(X = 10 or X = 11) = P(10 ≤ X ≤ 11) = 0.206 + 0.219 = 0.425
d) P(X > 10) = P(X = 11) + P(X = 12) + … + P(X = 15) = 0.219 + 0.170 + 0.092 + 0.031 + 0.005 =
0.517
Page | 68
Lesson 28 – The Hypergeometric Probability Distribution
1. X = the number of males chosen for the committee; N = 18; n = 6; S = 11.
11C0∙7C6
a) P(X = 0) = 18C6
= 0.000 377 073 ≈ 0.000 4
11C3∙7C3
b) P(X = 3) = 18C6
= 0.311 085 972 ≈ 0.311 1
X = the number of committee members chosen who receive social assistance; n = 10; p =
80
1000
= 0.08.
P(X = 2) = 10C2(0.08)2 (0.92)8 = 0.147 807 035 ≈ 0.147 8
4. X = the number of boxes of bolts chosen from Supplier A; N = 40; n = 10; S = 10.
10C5 ∙ 30C5
P(X = 5) = = 0.042 365 44 ≈ 0.042 4
40C10
Page | 69
6. a) X = the number of bolts with stripped threads chosen; n = 5; p = 0.5.
10C2∙10C3
P(X = 2) = 20C5
= 0.348 297 213 ≈ 0.348 3
7. In theory this is a hypergeometric problem. However, the combinations are too large for
the calculator to handle. Since n < 0.05N, the binomial approximation may be used to
calculate the probability. X = the number of unemployed workers chosen in the sample; n =
2,000 1
25; p = = .
12,000 6
1 2 5 23
P(X = 2) = 25C2 (6) (6) = 0.125 791 152 ≈ 0.125 8
Page | 70
Lesson 29 – The Poisson Probability Distribution
1. X = the number of typesetting errors observed.
1
a) μ = np = 18,000 (2000) = 9 ≈ 9.0 errors
90 ∙e−9
b) P(X = 0) = 0!
= 0.000 123 409 ≈ 0.000 1
10
9 ∙e −9
c) P(X = 10) = 10! = 0.118 580 076 ≈ 0.118 6
d) P(X < 10) = P(X = 0) + P(X = 1) + P(X = 2) + ⋯ + P(X = 9)
P(X < 10) = 0.0001 + 0.0011 + 0.0050 + 0.0150 + 0.0337 + 0.0607 + 0.0911
+ 0.1171 + 0.1318 + 0.1318 = 0.5874
e) P(X ≥ 15) = P(X = 15) + P(X = 16) + P(X = 17) + ⋯
P(X ≥ 15) = 0.0194 + 0.0109 + 0.0058 + 0.0029 + 0.0014 + 0.0006 + 0.0003
+ 0.0001 = 0.0414
Or
P(X ≥ 15) = 1 − P(X < 15) = P(X = 0) + P(X = 1) + P(X = 2) + ⋯ + P(X = 14)
P(X < 15) = 1 − (0.0001 + 0.0011 + 0.0050 + 0.0150 + 0.0337 + 0.0607 + 0.0911
+ 0.1171 + 0.1318 + 0.1318 + 0.1186 + 0.0970 + 0.0728 + 0.0504
+ 0.0324) = 0.0414
Page | 71
Discrete Probability Distribution Exercises
1.
2. X = the number of members who have loans with the credit union; n = 6; p = 0.2.
a) P(X = 0) = 6C0(0.2)0 (0.8)6 = 0.262 144 ≈ 0.262 1
b) P(X = 6) = 6C6(0.2)6 (0.8)0 = 0.000 064 ≈ 0.000 1
c) P(X ≥ 1) = 1 − P(X = 0) = 1 − 0.262 144 = 0.737 856 ≈ 0.737 9
4. X = number of transactions where proper procedure was not followed; n = 30; p = 0.03.
μ = np = 300(0.03) = 9 ≈ 9.0 transactions
93 ∙e−9
a) P(X = 3) = 3!
= 0.014 994 291 ≈ 0.015 0
b) P(X < 5) = 0.000 1 + 0.001 1 + 0.005 0 + 0.015 0 + 0.033 7 = 0.054 9 ≈ 0.054 9
5. X = the number of cartons that do not have 100 bolts; n = 30; p = 0.01.
μ = np = 30(0.01) = 0.3 ≈ 0.3 cartons
0.35 ∙ e−0.3
P(X = 5) = = 0.000 015 001 ≈ 0.000 0
5!
Page | 72
Unit 7 – The Normal Probability
Distribution
Lesson 31 – The Standard Normal Probability Distribution
1. a)
b)
c)
Page | 73
d)
e)
f)
Page | 74
2.
0.02
0.48
Z1 ≈ −2.05
3.
0.01
0.49
Z1 ≈ 2.33
4.
0.3
Z1 ≈ 0.84
Page | 75
5.
0.49 0.49
Z ≈ ±2.33
6.
0.45 0.45
Z ≈ ±1.645
7.
0.495 0.495
Z ≈ ±2.575
Page | 76
Lesson 32 – Application of the Normal Probability Distribution
1. a)
b)
c)
X − μ 180 − 172
P(X > 180 𝑐𝑚) = P ( > ) = P(Z > 0.80) = 0.5 − 0.2881 = 0.2119
σ 10
Page | 77
2. a)
61 66 71 76 81 86 91
Score
X − μ 76 − 76
P(X > 76) = P ( > ) = P(Z > 0) = 0.5
σ 5
b)
61 66 71 76 81 86 91
Score
65 − 76 X − μ 85 − 76
P(65 < 𝑋 < 85) = P ( < < ) = P(−2.20 < 𝑍 < 1.80)
5 σ 5
= 0.4861 + 0.4641 = 0.9502
c)
0.1
0.4
61 66 71 76 81 86 91
Score
𝑋−76
Z ≈ −1.28; −1.28 = 5
𝑋 = 76 + (−1.28)(5) = 69.6 ≈ 69.6
Page | 78
d)
0.15
0.35
61 66 71 76 81 86 91
Score
𝑋−76
Z ≈ 1.04; 1.04 = 5
𝑋 = 76 + (1.04)(5) = 81.2 ≈ 81.2
e)
61 66 71 76 81 86 91
Score
X − μ 80 − 76
P(X > 80) = P ( > ) = P(Z > 0.80) = 0.5 − 0.2881 = 0.2119
σ 5
0.2119(1000) = 211.9 ≈ 212 people
3. a)
X − μ 250 − 308.45
P(X > $250) = P ( > ) = P(Z > −1.36) = 0.4131 + 0.5
σ 43.09
= 0.9131
Page | 79
b)
X − μ 400 − 308.45
P(X < $400) = P ( < ) = P(Z < 2.12) = 0.5 + 0.4830 = 0.9830
σ 43.09
c)
4.
0.05
35 45 55 65 75 85 95
Score
𝑋 − 65
Z ≈ 1.645; 1.645 =
10
𝑋 = 65 + (1.645)(10) = 81.45 ≈ 81.5
Page | 80
5.
0.4 0.4
𝑋 − 10
Z ≈ ±1.28; ±1.28 =
0.2
𝑋 = 10 + (±1.28)(0.2) = 9.744, 10.256 ≈ (9.74 cm, 10.26 cm)
6.
0.12
40 50 60 70 80 90 100
Score
𝑋 − 70
Z ≈ 0.31; 0.31 =
10
𝑋 = 70 + (0.31)(10) = 73.1 ≈ 73.1
If the scores must be whole numbers then the actual score would be 73.
Page | 81
Unit 6 – 7 Review
1. a)
Price per Bushel ($) P(X) X∙P(X) X2∙P(X)
2.20 0.05 0.110 0.2420
2.30 0.25 0.575 1.3225
2.40 0.35 0.840 2.0160
2.50 0.20 0.500 1.2500
2.60 0.15 0.390 1.0140
ΣP(X) = 1.00 ΣX∙P(X) = 2.415 Σ X2∙P(X) = 5.8445
Page | 82
6. a)
-3 -2 -1 0 1 2 3
Z
b)
-3 -2 -1 0 1 2 3
Z
c)
-3 -2 -1 0 1 2 3
Z
d)
Page | 83
-3 -2 -1 0 1 2 3
Z
7. a)
6 13 20 27 34 41 48
Monthly Interest Paid ($)
X − μ 30 − 27
P(X > $30) = P ( > ) = P(Z > 0.43) = 0.5 − 0.1664 = 0.3336
σ 7
b)
0.20
6 13 20 27 34 41 48
Monthly Interest Paid ($)
X − 27
Z ≈ 0.84; 0.84 =
7
X = 27 + 0.84(7) = 32.88 ≈ $32.88
Page | 84
8.
0.4
X − 100
Z ≈ 1.28; 1.28 =
10
X = 100 + 1.28(10) = 112.8 ≈ 112.8 (1000 cans) ≈ 112,800 cans
9.
0.01
300 − 𝜇
𝑍 ≈ 2.33; 2.33 =
3.1
μ = 300 − 2.33(3.1) = 292.777 ≈ 292.8 mL
Page | 85
Unit 8 – Sampling Distributions
Lesson 35 – The Central Limit Theorem
640
1. The shape of the sampling distribution is normal since n > 30, µ = $9,600 and 𝜎𝑋̅ = =
√1024
20 ≈ $20. The FCF does not need to be applied because n < 0.05N.
a)
b)
̅
X − μ 9660 − 9600
̅ > $9,660) = P (
P(X > ) = P(Z > 3.00) = 0.5 − 0.4987
σX̅ 20
= 0.0013
Page | 86
700
2. The shape of the sampling distribution is normal since n > 30, µ = 2500 kg and 𝜎𝑋̅ = =
√40
110.679 718 1 ≈ 110 𝑘𝑔.
a)
2300 − 2500 ̅
X−μ 2700 − 2500
̅ < 2700 𝑘𝑔) = P (
P(2300 kg < X < < )
110.679 718 1 σX̅ 110.679 718 1
= P(−1.81 < 𝑍 < 1.81) = 0.4649 + 0.4649 = 0.9298
b)
̅−μ
X 500
̅ − μ) > |500|kg] = P [
P[(X >| |] = P[Z > |4.52|]
σX̅ 110.679 718 1
= P(Z < −4.52) + P(Z > 4.52) = 0.0000
Page | 87
3. Effectively, we are being asked to determine the two test scores that will mark the
boundaries of the middle 95% of the scores. The Z score that corresponds to these
boundaries may be found by dividing 0.95 by 2. This is 0.4750 and the corresponding Z
score is ±1.96.
The shape of the sampling distribution is normal since n > 30, µ = 75% and 𝜎𝑋̅ =
10 500−100
√ = 0.895 322 962 ≈ 0.9%.
√100 500−1
72 73 74 75 76 77 78
Test Score
𝑋̅ − 75
±1.96 =
0.895 322 962
̅
𝑋𝐿 = 75 − 1.96(0.895 322 962) = 73.245 166 99 ≈ 73.2%
𝑋̅𝐿 = 75 + 1.96(0.895 322 962) = 76.754 833 01 ≈ 76.8%
The average would fall between 73.2% and 76.8% in 95 cases out of 100.
This type of problem is known as a confidence interval. We will study confidence intervals in
the next unit.
4.
Population Distribution
𝑋−𝜇 3000−2940
a) 𝑃(𝑋 > $3000) = 𝑃 ( 𝜎
> 60
) = 𝑃(𝑍 > 1.00) = 0.5 − 0.3413 = 0.1587
Page | 88
60
b) 𝜎𝑋̅ = = 10 ≈ $10
√36
c)
Sampling Distribution
𝑋̅ − 𝜇 3000 − 2940
𝑃(𝑋̅ > $3000) = 𝑃 ( > ) = 𝑃(𝑍 > 6.00) = 0.5 − 0.5 = 0.0000
𝜎𝑋̅ 10
Page | 89
Lesson 36 – The Sampling Distribution of the Proportion
1. X = the number of defective batteries chosen. The shape of the sampling distribution is
0.2(0.8)
normal since np > 5 and n(1 – p) > 5. The mean proportion, p = 0.2, and 𝜎𝑝̅ = √ 30
=
0.073 029 674 ≈ 0.07.
3 X 6
P(3 batteries < 𝑋 < 6 𝑏𝑎𝑡𝑡𝑒𝑟𝑖𝑒𝑠) = P ( < < ) = P(0.1 < p ̅ < 0.2)
30 n 30
0.1 − 0.2 p
̅ −p 0.2 − 0.2
= P( < < ) = P(−1.37 < 𝑍 < 0)
0.073 029 674 σp̅ 0.073 029 674
= 0.4147
2. X = the number of voters who support the ruling party. The shape of the sampling
distribution is normal since np > 5 and n(1 – p) > 5. The mean proportion, p = 0.45, and
0.45(0.55)
𝜎𝑝̅ = √ 1024
= 0.015 546 678 ≈ 0.015 5.
Page | 90
3. X = the number of nonconforming components (for both a) and b).
a) The shape of the sampling distribution is normal since np > 5 and n(1 – p) > 5. The
0.05(0.95)
mean proportion, p = 0.05, and 𝜎𝑝̅ = √ 200
= 0.015 411 035 ≈ 0.015 4.
0 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 0.1
Proportion of Nonconforming Components
p
̅−p 0.04 − 0.05
P(p
̅ < 0.04) = P ( < ) = P(𝑍 < −0.65) = 0.5 − 0.2422
σp̅ 0.015 411 035
= 0.2578
b) The shape of the sampling distribution is normal since np > 5 and n(1 – p) > 5. The
0.025(0.975)
mean proportion, p = 0.025, and 𝜎𝑝̅ = √ = 0.011 039 701 ≈ 0.011 0.
200
p
̅−p 0.04 − 0.025
P(p
̅ ≥ 0.04) = P ( < ) = P(𝑍 ≥ 1.36) = 0.5 − 0.4131
σp̅ 0.011 039 701
= 0.0869
Page | 91
Unit 9 – Confidence Intervals
Lesson 37 – Point Estimates and Confidence Intervals
1. The sampling distribution is normal since n > 30. The standard error of the mean is 𝜎𝑋̅ =
𝜎 58
= = 5.8 ≈ $5.80. The Z score associated with 90% confidence is Z = ±1.645.
√𝑛 √100
We are 90% confident that the true mean amount owing in a large journal of receivables will
be between $502.46 and $521.54.
2. The sampling distribution is normal since n > 30. The standard error of the mean is 𝜎𝑋̅ =
𝜎 𝑁−𝑛 15 500−80
√ = √ = 1.538 581 967 ≈ $1.54. The Z score associated with 95%
√𝑛 𝑁−1 √80 500−1
confidence is Z = ±1.96.
Page | 92
We are 95% confident that the true mean amount spent per household winterizing a
dwelling in the community will be between $81.98 and $88.02.
3. The sample size must be at least 30 to consider the sampling distribution to be normal if the
shape of the population distribution is unknown.
4. The sampling distribution is normal since n > 30. The standard error of the mean is 𝜎𝑋̅ =
𝜎 6
= = 0.447 213 595 ≈ 0.4%. The Z score associated with 98% confidence is Z =
√𝑛 √180
±2.33.
We are 98% confident that the true mean test score of employees at the company will be
between 81.0% and 83.0%.
Page | 93
Lesson 38 – Confidence Intervals for Means, σ Unknown
X = 2.029 16̅ ≈ $2.029; and s =
1. The sample statistics for the set of data are: n = 12; ̅
1.274 558 45 ≈ $1.275.
a) The sampling distribution is normal since the population distribution is normal. The
s 1.274 558 45
standard error of the mean is sX̅ = = = 0.367 933 332 ≈ $0.368. The t
√n √12
score associated with 90% confidence and 11 degrees of freedom is t = ±1.796.
We are 90% confident that the true mean purchase amount at the snack food outlet will
be between $1.368 and $2.690.
b) 800(1.368 358 402) = 1094.486 722 ≈ $1,094.69 and 800(2.689 974 931) = 2151.979 945
≈ $2,151.98.
The total of all 800 purchases should lie between $1,094.69 and $2,151.98.
2. a) The value of the population mean is unknown. The best estimate of this value is the
sample mean of 20 eggs per month.
b) We need to use the Student t distribution because the population standard deviation is
unknown. We need to assume that the population distribution of eggs laid per chicken
per month is normal.
s 2
c) The standard error of the mean is … sX̅ = = = 0.447 213 595 ≈
√n √20
0.4 eggs per chicken per month. The t score associated with 95% confidence and 19
degrees of freedom is t = ±2.093.
Page | 94
18 18.5 19 19.5 20 20.5 21 21.5 22
Eggs Laid (per month per chicken)
E = tsX̅ = 2.093(0.447 213 595) = 0.936 018 055 ≈ 0.9 eggs per chicken per month
̅ − E) < 𝜇 < (X
P[(X ̅ + E)] = 0.95
P[(20 − 0.936 018 055) < 𝜇 < (20 + 0.936 018 055)] = 0.95
P[19.063 981 94 < 𝜇 < 20.936 018 06] = 0.95
We are 95% confident that the true mean number of eggs laid per chicken per month
will be between 19.1 and 20.9.
We are 90% confident that the true mean weekly child care cost will be between $95.39 and
$101.81.
Page | 95
Lesson 39 – Confidence Intervals for Proportions
1. X = the number of vouchers that are incorrect. The shape of the sampling distribution is
𝑋 9
normal since 𝑛𝑝̅ > 5 and 𝑛(1 − 𝑝̅ ) > 5. The mean proportion, 𝑝̅ = = = 0.25, and
𝑛 36
0.25(0.75)
𝜎𝑝̅ = √ = 0.072 168 783 ≈ 0.07. The Z score associated with 98% confidence is Z
36
= ±2.33.
We are 98% confident that the true proportion of incorrect vouchers will be between 0.08
and 0.42.
2. X = the number of employees with 2 or more dependents. The shape of the sampling
𝑋
distribution is normal since 𝑛𝑝̅ > 5 and 𝑛(1 − 𝑝̅ ) > 5. The mean proportion, 𝑝̅ = 𝑛 =
48 ̇
0.3(0.6) ̇
144
= 0. 3̇, and 𝜎𝑝̅ = √ 144 = 0.039 283 71 ≈ 0.039. The Z score associated with 95%
confidence is Z = ±1.96.
Page | 96
̅ − E) < 𝜇 < (p
P[(p ̅ + E)] = 0.95
P[(0. 3̇ − 0.076 996 071) < 𝑝 < (0. 3̇ + 0.076 996 071)] = 0.95
P[0.256 337 261 < 𝑝 < 0.410 329 405] = 0.95
We are 95% confident that the true proportion of employees with two or more dependents
will be between 0.256 and 0.410.
3. X = the number of defective batteries. The shape of the sampling distribution is normal
𝑋 84
since 𝑛𝑝̅ > 5 and 𝑛(1 − 𝑝̅ ) > 5. The mean proportion, 𝑝̅ = 𝑛 = 600 = 0.14, and 𝜎𝑝̅ =
0.14(0.86) 10 000−600
√ √ = 0.013 734 829 ≈ 0014. The Z score associated with 99%
600 10 000−1
confidence is Z = ±2.575.
We are 99% confident that the true proportion of defective batteries will be between 0.104
and 0.175. This confidence interval is not compatible with a 10% defective rate.
Page | 97
Lesson 40 – Sample Size Determination
Zσ 2 2.575(12) 2
1. n = [ E ] = [ 4
] = 59.675 625; ∴ n = 60 cereal products
Z 2 1.96 2
2. a) n = p(1 − p) [E] = 0.5(0.5) [0.04] = 600.25; ∴ n = 601 burial plots
6
b) p = 20 = 0.3
Z2 1.96 2
n = p(1 − p) [ ] = 0.3(0.7) [ ] = 504.21; ∴ n = 505 burial plots
E 0.04
Page | 98
Unit 8 – 9 Review
1. a)
0 4 8 12 16 20 24
Weeks Unemployed
X − μ 18 − 12
P(X > 18 𝑤𝑒𝑒𝑘𝑠) = P ( > ) = P(Z > 1.50) = 0.5 − 0.4332 = 0.0668
σ 4
b) The sampling distribution is normal since the population distribution is normal. The
𝜎 4
standard error of the mean is 𝜎𝑋̅ = = = 0.4 ≈ 0.4 weeks. The finite correction
√𝑛 √100
factor does not need to be applied since n < 0.05N (it is almost 1).
̅
̅ > 18 𝑤𝑒𝑒𝑘𝑠) = P (X−μ > 18−12) = P(Z > 15.00) = 0.5 − 0.5 ≈ 0.0000
P(X σ 0.4̅
𝑋
Notice the horizontal scale difference from the population distribution in part a) to the
sampling distribution in part b). If the two graphs were plotted on the same horizontal scale
they would look like this:
0 2 4 6 8 10 12 14 16 18 20 22 24
Weeks Unemployed
Page | 99
c) The sampling distribution is normal since the population distribution is normal. The
𝑠 3.75
standard error of the mean is 𝑠𝑋̅ = = = 0.118 585 412 ≈ 0.1 weeks. The
√𝑛 √1000
finite correction factor does not need to be applied since n < 0.05N (It is almost 1). The t
score associated with 99% confidence and 999 (relatively infinite) degrees of freedom is
t = ±2.576.
We are 99% confident that the true mean time a person in Canada is unemployed will
be between 11.5 and 12.1 weeks.
d) The sampling distribution is normal since the population distribution is normal. The
𝑠 3.75
standard error of the mean is 𝑠𝑋̅ = = = 0.838 525 491 ≈ 0.8 weeks. The finite
√𝑛 √20
correction factor does not need to be applied since n < 0.05N. The t score associated
with 99% confidence and 19 degrees of freedom is t = ±2.861.
Page | 100
We are 99% confident that the true mean time a person in Canada is unemployed will
be between 9.4 and 14.2 weeks.
e) The sampling distribution is normal since the population distribution is normal. The
𝑠 3.75 1,200,000−150,000
standard error of the mean is 𝑠𝑋̅ = = √ 1,200,000−1 = 0.009 057 114 ≈
√𝑛 √150 000
0.009 weeks. The t score associated with 99% confidence and 149,999 (relatively
infinite) degrees of freedom is t = ±2.576.
We are 99% confident that the true mean time a person in Canada is unemployed will
be between 11.78 and 11.82 weeks.
Zσ 2 1.75(4.2) 2
f) n=[E] =[ 0.5
] = 216.09; ∴ n = 217 unemployed labourers
2. The sampling distribution is normal since n > 30. The standard error of the mean is sX̅ =
s 2500
= = 176.776 695 3 ≈ 180 km. The finite correction factor does not need to be
√n √200
applied since n < 0.05N. The t score associated with 98% confidence and 199 degrees of
freedom is t = ±2.345.
Page | 101
E = tsX̅ = 2.345(176.776 695 3) = 414.541 350 5 ≈ 410 km
̅ − E) < 𝜇 < (X
P[(X ̅ + E)] = 0.98
P[(35,000 − 414.541 350 5) < 𝜇 < (35,000 + 414.541 350 5)] = 0.98
P[34,585.458 65 < 𝜇 < 35,414.541 35] = 0.98
We are 98% confident that the true mean distance business travel for the company’s
salespeople will be between 34,590 and 35,410 km.
3. We must assume that the population distribution of fuel economy is normal so that the
sampling distribution can be assumed to be normal for the small sample of 10.
s 1.2 L
The standard error of the mean is sX̅ = = = 0.379 473 319 ≈ 0.38 100 km. The t
√n √10
score associated with 90% confidence and 9 degrees of freedom is t = ±1.833.
L
E = tsX̅ = 1.833(0.379 473 319) = 0.695 574 594 ≈ 0.70
100 km
̅ − E) < 𝜇 < (X
P[(X ̅ + E)] = 0.90
P[(7.2 − 0.695 574 594) < 𝜇 < (7.2 + 0.695 574 594)] = 0.90
P[6.504 425 406 < 𝜇 < 7.895 574 594] = 0.90
We are 90% confident that the true mean fuel consumption for the compact vehicle will be
between 6.50 and 7.90 L/100 km.
4. X = the number of women employed in the Alberta public sector. The shape of the sampling
𝑋
distribution is normal since 𝑛𝑝̅ > 5 and 𝑛(1 − 𝑝̅ ) > 5. The mean proportion, 𝑝̅ = 𝑛 =
51 0.34(0.66)
= 0.34, and 𝑠𝑝̅ = √ = 0.038 678 159 ≈ 0.039. The Z score associated with
150 150
90% confidence is Z = ±1.645.
Page | 102
0.223 0.262 0.301 0.34 0.379 0.418 0.457
Proportion of Women Employed
We are 90% confident that the true proportion of women employed in Alberta’s public
sector will be between 0.276 and 0.404.
Z 2 1.96 2
5. n = p(1 − p) [E] = 0.5(0.5) [0.02] = 2401; ∴ 2401 voters
Zσ 2 2.575(800) 2
6. n = [ ] = [ ] = 2652.25; ∴ 2653 entries
E 40
Page | 103
Unit 10 – Tests of Hypothesis
Lesson 41 – The Logic of Hypothesis Testing
1. H0 : μ = 10 hours/week H1 : μ > 10 ℎ𝑜𝑢𝑟𝑠/𝑤𝑒𝑒𝑘
b) H0 : μ = 142 mL H1 : μ ≠ 142 mL
Page | 104
Lesson 42 – Tests of Hypothesis for a Population Mean
1. Step 1: Formulate the null and alternative hypothesis.
H0 : μ = $60.00 H1 : μ ≠ $60.00
Step 2: State the level of significance, α.
α = 0.02
Step 3: Determine the test distribution to use.
The sample size used to test the null hypothesis is large, n > 30, and the population standard
deviation is known, therefore the Z distribution can be used. The sampling distribution of
𝜎 12
the mean has 𝜇𝑋̅ = 𝜇 = $60.00 and 𝜎𝑋̅ = = = 0.8 ≈ $0.80.
√𝑛 √225
Step 4: Establish a decision rule.
The decision rule is two-tailed. The critical values are Z = ±2.33. The decision rule could be
stated as we will not reject H0 if -2.33 ≤ Z ≤ 2.33, otherwise we will reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
6.25. Since 6.25 > 2.33, we reject H0. The sample evidence indicates that the mean credit
card sales of the retailer are significantly different from $60.00 at the 2% level of
significance.
2. a)
Step 1: Formulate the null and alternative hypothesis.
H0 : μ = $560.00 H1 : μ ≠ $560.00
Step 2: State the level of significance, α.
α = 0.10
Step 3: Determine the test distribution to use.
The population standard deviation is unknown and the sample size is small, n <30. The
population distribution of journal entries is normal so the t distribution can be used.
𝑠 80
The sampling distribution of the mean has 𝜇𝑋̅ = 𝜇 = $560.00 and 𝑠𝑋̅ = = =
√𝑛 √20
17.888 543 82 ≈ $17.89.
Step 4: Establish a decision rule.
Page | 105
The decision rule is two-tailed. The critical values are t = ±1.729 for 19 degrees of
freedom. The decision rule could be stated as we will not reject H0 if -1.729 ≤ t ≤ 1.729,
otherwise we will reject H0.
Reject
-4 -3 -2 -1 0 1 2 3 4
t
b) α = 0.10
c) A type 2 error occurs when we do not reject H0 when it should be rejected. In this
problem this would have occurred if we had decided not to reject H0. We would have
concluded that the mean value of a journal entry is not significantly different from
$560.00 when in fact it is.
Page | 106
Reject
-4 -3 -2 -1 0 1 2 3 4
t
Reject
-3 -2 -1 0 1 2 3
Z
Page | 107
that the mean wage of the group of minority workers is significantly less than $24,500 at the
5% level of significance.
Reject
-4 -3 -2 -1 0 1 2 3 4
t
evidence indicates that the mean amount of chemical produced per day for the past week is
not significantly less than 800 tonnes at the 5% level of significance.
Page | 108
Lesson 43 – Test of Hypothesis for a Population Proportion
1. Step 1: Formulate the null and alternative hypothesis.
H0 : p = 0.01 H1 : p > 0.01
Step 2: State the level of significance, α.
α = 0.05
Step 3: Determine the test distribution to use.
Since 1234(0.01) > 5 and 1234(1 − 0.01) > 5, the Z distribution can be used. The
𝑝(1−𝑝) 0.01(0.99)
sampling distribution of the mean has 𝜇𝑝̅ = 𝑝 = 0.01 and 𝜎𝑝̅ = √ =√ =
𝑛 1234
0.002 832 435 ≈ 0.002 8.
Step 4: Establish a decision rule.
The decision rule is one-tailed (right tail). The critical value is Z = 1.645. The decision rule
could be stated as we will reject H0 if Z > 1.645, otherwise we will not reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
The sample evidence indicates that the mean proportion of overcharges by UPC scanners is
significantly greater than 1% at the 5% level of significance.
Page | 109
Step 4: Establish a decision rule.
The decision rule is one-tailed (right tail). The critical value is Z = 1.645. The decision rule
could be stated as we will reject H0 if Z > 1.645, otherwise we will not reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
Reject
-3 -2 -1 0 1 2 3
Z
Page | 110
Step 5: Evaluate the evidence.
𝑋 22
The sample evidence indicates a proportion of 𝑝̅ = 𝑛 = 118 = 0.186 440 678 ≈ 0.186. The
𝑝̅ −𝑝 0.186 440 678−0.32
test statistic is 𝑍 = 𝜎𝑝
= 0.042 942 61
= −3.11. Since -3.11 < -2.05, we reject H0. The
̅
sample evidence indicates that the mean proportion of late deliveries since the just-in-time
system was installed is significantly less than 32% at the 2% level of significance. The
proportion of late deliveries was reduced significantly.
Reject
-3 -2 -1 0 1 2 3
Z
reject H0. If the sample has 8 nonconforming cans, the mean proportion of nonconforming
cans in the shipment would not be considered significantly greater than 2.5% at the 5% level
of significance. The shipment should not be rejected in this situation.
Page | 111
Unit 11 – Two Sample Tests of
Hypothesis
Lesson 44 – Comparing Two Population Means, σ Known
1. (O – Old bolts and N – New bolts)
a) 𝐻0 : μN = μO or 𝐻0 : μN − μO = 0
𝐻1 : μN > μO or 𝐻1 : μN − μO > 0
b) We are dealing with known population standard deviations so we use the Z distribution.
Reject
-3 -2 -1 0 1 2 3
Z
𝜎2 𝜎2 800 400
d) The standard error is 𝜎𝑋̅1 −𝑋̅2 = √𝑛1 + 𝑛2 = √ 40 + 40
= 5.477 225 575
1 2
Since 5.48 > 2.05, we reject H0. The sample evidence indicates that the mean yield point
of the new bolts is significantly greater than that of the old bolts at the 2% level of
significance.
Page | 112
2. Step 1: Formulate the null and alternative hypothesis.
(B – Bibs’ brand and C – Competitor’s brand)
𝐻0 : μB = μC or 𝐻0 : μB − μC = 0
𝐻1 : μB < μC or 𝐻1 : μB − μC < 0
Step 2: State the level of significance, α.
α = 0.05
Step 3: Determine the test distribution to use.
We have large samples with known population standard deviations, so the Z distribution
may be applied. The mean of the sampling distribution is 0. The standard error will be
calculated at step 5.
Step 4: Establish a decision rule.
The decision rule is one-tailed (left tail). The critical value is Z = -1.645. The decision rule
could be stated as we will reject H0 if Z < -1.645, otherwise we will not reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
Since -0.81 > -1.645, we do not reject H0. The sample evidence indicates that the mean
weight gain by the babies eating Bibs is not significantly less than the weight gain by the
babies eating the competitor’s brand at the 5% level of significance.
Page | 113
3. Step 1: Formulate the null and alternative hypothesis.
(1 – Oil related stocks and 2 – other stocks)
𝐻0 : μ1 = μ2 or 𝐻1 : μ1 − μ2 = 0
𝐻1 : μ1 ≠ μ2 or 𝐻1 : μ1 − μ2 ≠ 0
Step 2: State the level of significance, α.
α = 0.01
Step 3: Determine the test distribution to use.
We have large samples with known population standard deviations, so the Z distribution
may be applied. The mean of the sampling distribution is 0. The standard error will be
calculated at step 5.
Step 4: Establish a decision rule.
The decision rule is two-tailed. The critical values are Z = ±2.575. The decision rule could be
stated as we will not reject H0 if -2.575 ≤ Z ≤ 2.575, otherwise we will reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
Since -2.66 < -2.575, we reject H0. The sample evidence indicates that there is a significant
difference in the turnover rates of the two types of stock at the 1% level of significance.
Page | 114
Lesson 45 – Comparing Two Population Means, σ Unknown
1. Step 1: Formulate the null and alternative hypothesis.
(B – Buffalo Pound and W – Wakamow)
𝐻0 : μB = μW or 𝐻0 : μB − μW = 0
𝐻1 : μB ≠ μW or 𝐻1 : μB − μW ≠ 0
Step 2: State the level of significance, α.
α = 0.05
Step 3: Determine the test distribution to use.
We will have to assume that the population distributions of sales for the Burger Chalet are
normally distributed to proceed. Since we are dealing with unknown population standard
deviations, we must use the t distribution. The mean of the sampling distribution is 0. The
standard error will be calculated at step 5.
Step 4: Establish a decision rule.
The decision rule is two-tailed. The critical values are t = ±2.086 for 20 degrees of freedom.
The decision rule could be stated as we will not reject H0 if -2.086 ≤ t ≤ 2.086, otherwise we
will reject H0.
Reject
-4 -3 -2 -1 0 1 2 3 4
Since 0.881 < 2.086, we do not reject H0. The sample evidence indicates that there is not a
significant difference in sales at the two locations at the 5% level of significance.
Page | 115
2. (S – Sales and A – Audit)
Sample Size Sample Mean Sample Variance
Sales 6 142.5 149.9
Audit 7 130.285 714 249.238 095
Step 1: Formulate the null and alternative hypothesis.
𝐻0 : μS = μA or 𝐻0 : μS − μA = 0
𝐻1 : μS > μA or 𝐻1 : μS − μA > 0
Step 2: State the level of significance, α.
α = 0.10
Step 3: Determine the test distribution to use.
We will have to assume that the population distributions of sales for the travel expenses are
normally distributed to proceed. Since we are dealing with unknown population standard
deviations, we must use the t distribution. The mean of the sampling distribution is 0. The
standard error will be calculated at step 5.
Step 4: Establish a decision rule.
The decision rule is one-tailed (right tail). The critical value is t = 1.363 for 11 degrees of
freedom. The decision rule could be stated as we will reject H0 if t > 1.363, otherwise we
will not reject H0.
Reject
-4 -3 -2 -1 0 1 2 3 4
Page | 116
Since 1.537 > 1.363, we reject H0. The sample evidence indicates that the mean daily
expenses for the sales staff are significantly greater than the mean daily expenses for the
audit staff at the 10% level of significance.
Reject
-4 -3 -2 -1 0 1 2 3 4
2
(𝑛𝐶𝐹 − 1)𝑠𝐶𝐹 + (𝑛𝑁 − 1)𝑠𝑁2 1 1
𝑠𝑋̅𝐶𝐹 −𝑋̅𝑁 = √ [ + ]
𝑛𝐶𝐹 + 𝑛𝑁 − 2 𝑛𝐶𝐹 𝑛𝑁
Since -1.907 > -2.381, we do not reject H0. The sample evidence indicates that there is not a
significant difference in the two methods at the 2% level of significance.
Page | 117
Lesson 46 – Comparing Two Population Proportions
1. Step 1: Formulate the null and alternative hypothesis.
(A – Loan Officer A and B – Loan Officer B)
H0 : pA = pB or H0 : pA − pB = 0
H1 : pA ≠ pB or H1 : pA − pB ≠ 0
Step 2: State the level of significance, α.
α = 0.05
Step 3: Determine the test distribution to use.
The sample size used for each group is sufficiently large such that 𝑛𝑝̅ ≥ 5 and 𝑛(1 − 𝑝̅ ) ≥ 5,
therefore the Z distribution can be used. The mean of the sampling distribution is 0. The
standard error will be calculated at step 5.
Step 4: Establish a decision rule.
The decision rule is two-tailed. The critical values are Z = ±1.96. The decision rule could be
stated as we will not reject H0 if -1.96 ≤ Z ≤ 1.96, otherwise we will reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
𝑋𝐴 + 𝑋𝐵 9+6 15
𝑝𝑐 = = = = 0.107 142 857
𝑛𝐴 + 𝑛𝐵 60 + 80 140
pc (1 − pc ) pc (1 − pc )
σp̅A −p̅B = √ +
nA nB
0.107 142 857(1 − 0.107 142 857) 0.107 142 857(1 − 0.107 142 857)
=√ +
60 80
= 0.052 822 214 ≈ 0.053
Page | 118
(𝑝̅𝐴 − 𝑝̅𝐵 ) − 0 0.15 − 0.075
𝑍= = = 1.42
σp̅A −p̅B 0.052 822 214
Since 1.42 < 1.96, we do not reject H0. The sample evidence indicates that there is not a
significant difference in the default rates between the two loan officers at the 5% level of
significance.
Reject
-3 -2 -1 0 1 2 3
Z
𝑋𝐴 + 𝑋𝐵 63 + 60 123
𝑝𝑐 = = = = 0.350 428 571
𝑛𝐴 + 𝑛𝐵 150 + 200 350
Page | 119
pc (1 − pc ) pc (1 − pc )
σp̅A −p̅B = √ +
nA nB
0.350 428 571(1 − 0.350 428 571) 0.350 428 571(1 − 0.350 428 571)
=√ +
150 200
= 0.051 566 877 ≈ 0.052
Since 2.327 < 2.33, we do not reject H0. The sample evidence indicates that there is not a
significant difference in the recall proportions for the two commercials at the 2% level of
significance.
Reject
-3 -2 -1 0 1 2 3
Z
Page | 120
The pooled estimate of the population proportion is …
𝑋𝐴 + 𝑋𝐵 13 + 7 20
𝑝𝑐 = = = = 0.080 971 659
𝑛𝐴 + 𝑛𝐵 180 + 67 247
pc (1 − pc ) pc (1 − pc )
σp̅A −p̅B = √ +
nA nB
0.080 971 659(1 − 0.080 971 659) 0.080 971 659(1 − 0.080 971 659)
=√ +
180 67
= 0.039 039 639 ≈ 0.039
Since -0.83 > -1.645, we do not reject H0. The sample evidence indicates that there is not a
significant difference in the performance between the two machines at the 5% level of
significance.
Page | 121
Unit 10 – 11 Review
1. Step 1: Formulate the null and alternative hypothesis.
H0 : μ = 178 cm H1 : μ > 178 𝑐𝑚
Step 2: State the level of significance, α.
α = 0.05
Step 3: Determine the test distribution to use.
The sample size used to test the null hypothesis is large, n > 30, and the population standard
deviation is known, therefore the Z distribution can be used. The sampling distribution of
𝜎 5
the mean has 𝜇𝑋̅ = 𝜇 = 178 𝑐𝑚 and 𝜎𝑋̅ = = ̅̅̅̅ ≈ 0.5 𝑐𝑚.
= 0. 45
√𝑛 √121
Step 4: Establish a decision rule.
The decision rule is one-tailed (right tail). The critical value is Z = 1.645. The decision rule
could be stated as we will reject H0 if Z > 1.645, otherwise we will not reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
6.60. Since 6.60 > 1.645, we reject H0. The sample evidence indicates that the mean height
of adult males in North America is significantly greater than 178 cm at the 5% level of
significance.
Page | 122
Step 3: Determine the test distribution to use.
The sample size used for each group is sufficiently large such that 𝑛𝑝̅ ≥ 5 and 𝑛(1 − 𝑝̅ ) ≥ 5,
therefore the Z distribution can be used. The mean of the sampling distribution is 0. The
standard error will be calculated at step 5.
Step 4: Establish a decision rule.
The decision rule is two-tailed. The critical values are Z = ±2.575. The decision rule could be
stated as we will not reject H0 if -2.575 ≤ Z ≤ 2.575, otherwise we will reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
380 120
𝑝̅>45 = = 0.327 586 206 𝑎𝑛𝑑 𝑝̅≤45 = = 0.255 319 148
1160 470
pc (1 − pc ) pc (1 − pc )
σp̅>45 −p̅≤45 = √ +
n>45 n≤45
0.306 748 466(1 − 0.306 748 466) 0.306 748 466(1 − 0.306 748 466)
=√ +
1160 470
= 0.025 214 624 ≈ 0.025 2
Since 2.87 > 2.575, we reject H0. The sample evidence indicates that there is a significant
difference between the two groups in the proportion who prefer an accountant for financial
planning at the 1% level of significance.
Page | 123
3. Step 1: Formulate the null and alternative hypothesis.
H0 : p = 0.5 H1 : p < 0.5
Step 2: State the level of significance, α.
α = 0.10
Step 3: Determine the test distribution to use.
Since 100(0.5) > 5 and 100(1 − 0.5) > 5, the Z distribution can be used. The sampling
𝑝(1−𝑝) 0.5(0.5)
distribution of the mean has 𝜇𝑝̅ = 𝑝 = 0.5 and 𝜎𝑝̅ = √ 𝑛
=√ 100
= 0.05 ≈ 0.050.
Step 4: Establish a decision rule.
The decision rule is one-tailed (left tail). The critical value is Z = -1.28. The decision rule
could be stated as we will reject H0 if Z < -1.28, otherwise we will not reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
evidence indicates that the mean proportion of males winning prizes is significantly less than
half at the 10% level of significance. THEY’RE RIGGED!
Page | 124
Reject
-4 -3 -2 -1 0 1 2 3 4
t
Reject
-4 -3 -2 -1 0 1 2 3 4
t
Page | 125
Step 5: Evaluate the evidence.
(𝑋̅𝐼 − 𝑋̅𝐸 ) − 0 45 − 48
𝑡= = = −1.884
𝑠𝑋̅𝐼 −𝑋̅𝐸 1.592 137 545
Since -1.884 > -2.457, we do not reject H0. The sample evidence indicates that firm’s
employees do not earn significantly less than similar professionals at other firms at the 1%
level of significance.
Reject
-4 -3 -2 -1 0 1 2 3 4
t
Page | 126
(𝑛𝐷 − 1)𝑠𝐷2 + (𝑛𝐹 − 1)𝑠𝐹2 1 1
𝑠𝑋̅𝐷 −𝑋̅𝐹 = √ [ + ]
𝑛𝐷 + 𝑛𝐹 − 2 𝑛𝐷 𝑛𝐹
Since -2.947 < -1.660, we reject H0. The sample evidence indicates mean cost to repair a
foreign car is significantly greater than the cost to repair a domestic car at the 5% level of
significance.
Reject
-3 -2 -1 0 1 2 3
Z
𝜎12 𝜎22 42 42
𝜎𝑋̅1 −𝑋̅2 = √ + =√ + = 0.8 ≈ 0.8 𝑢𝑛𝑖𝑡𝑠 𝑝𝑒𝑟 𝑤𝑒𝑒𝑘
𝑛1 𝑛2 50 50
Page | 127
The test statistic is …
(𝑋̅1 − 𝑋̅2 ) − 0 50 − 48
𝑍= = = 2.50
𝜎𝑋̅1 −𝑋̅2 0.8
Since 2.50 > 2.05, we reject H0. The sample evidence indicates that the mean weekly sales
at eye level are significantly greater than the mean weekly sales not at eye level at the 2%
level of significance.
Page | 128
Unit 12 – Simple Linear Regression and
Correlation
Lesson 47 – Introduction to Simple Linear Regression
1. There appears to be a curve in the data.
50.0
40.0
Sales ($1000)
30.0
20.0
10.0
0.0
0.0 5.0 10.0 15.0 20.0 25.0
Advertising Expenditures ($1000)
120
100
80
60
40
20
0
0 1 2 3 4 5 6 7
Family Size
Page | 129
3. There appears to be a linear relationship.
35,000
30,000
Number of Beers Sold
25,000
20,000
15,000
10,000
5,000
0
15 20 25 30 35
Temperature (°C)
5.00
4.00
3.00
2.00
1.00
0.00
0 5 10 15 20 25 30 35 40 45
Weekly Usage (hours)
Page | 130
5. There appears to be a linear relationship.
3.5
3.0
Sales ($1000)
2.5
2.0
1.5
1.0
0.5
0.0
0 1 2 3 4 5 6
Number of Competitors Within 1 Mile
Page | 131
Lesson 48 – The Simple Linear Regression Equation
1.
Ad. Exp. ($1000) Sales ($1000) X2 XY Y2
1.0 19.0 1 19 361
2.0 32.0 4 64 1024
4.0 44.0 16 176 1936
6.0 40.0 36 240 1600
10.0 52.0 100 520 2704
14.0 53.0 196 742 2809
20.0 54.0 400 1080 2916
2 2
ΣX = 57 ΣY = 294 ΣX = 753 ΣXY = 2841 ΣY = 13 350
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌 7(2841)−57(294)
a) 𝑏 = 𝑛 ∑ 𝑋 2 −(∑ 𝑋)2
= 7(753)−(57)2
= 1.547 477 745 ≈ 1.55
∑𝑌 ∑ 𝑋 294 57
𝑎= −𝑏 = − 1.547 477 745 = 29.399 109 79 ≈ 29.40
𝑛 𝑛 7 7
𝑌𝑝 = 𝑎 + 𝑏𝑋 = 29.40 + 1.55𝑋
For each additional $1000 of advertising expenditures, the sales will increase
approximately 1.55 ($1000). For an advertising expenditure of $0, the sales will be
approximately 29.40 ($1000).
2.
Family Size Amt. Spent on Food ($) X2 XY Y2
3 99 9 297 9 801
6 104 36 624 10 816
5 151 25 755 22 801
6 129 36 774 16 641
6 142 36 852 20 164
3 111 9 333 12 321
4 74 16 296 5 476
4 91 16 364 8 281
5 119 25 595 14 161
3 91 9 273 8 281
2 2
ΣX = 45 ΣY = 1111 ΣX = 217 ΣXY = 5163 ΣY = 128 743
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌 10(5163)−45(1111)
a) 𝑏 = 𝑛 ∑ 𝑋 2 −(∑ 𝑋)2
= 10(217)−(45)2
= 11.275 862 07 ≈ 11.28
∑𝑌 ∑ 𝑋 1111 45
𝑎= −𝑏 = − 11.275 862 07 = 60.358 620 69 ≈ 60.36
𝑛 𝑛 10 10
𝑌𝑝 = 𝑎 + 𝑏𝑋 = 60.36 + 11.28𝑋
Page | 132
The amount spent on food will increase approximately $11.28 for each additional family
member. It is not meaningful to interpret the vertical intercept because it would
represent the amount spent on food for a family of 0 members.
3.
Temperature Number of Beers X2 XY Y2
(˚C) Sold
27 20,533 729 554,391 421,604,089
20 1,439 400 28,780 2,070,721
26 13,829 676 359,554 191,241,241
26 21,286 676 553,436 453,093,796
31 30,985 961 960,535 960,070,225
23 17,187 529 395,301 295,392,969
30 30,240 900 907,200 914,457,600
33 37,596 1089 1,240,668 1,413,459,216
25 9,610 625 240,250 92,352,100
29 28,742 841 833,518 826,102,564
ΣX = 270 ΣY = 211,447 ΣX2 = 7426 ΣXY = 6,073,633 ΣY2 = 5,569,844,521
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌 10(6,073,633)−270(211,447)
a) 𝑏 = 𝑛 ∑ 𝑋 2 −(∑ 𝑋)2
= 10(7426)−(270)2
= 2,680.617 647 ≈ 2,680.6
∑𝑌 ∑ 𝑋 211,447 270
𝑎= −𝑏 = − 2,680.617 647 = −51,231.976 47 ≈ −51,232.0
𝑛 𝑛 10 10
𝑌𝑝 = 𝑎 + 𝑏𝑋 = −51,232.0 + 2,680.6𝑋
For each 1˚C increase in temperature, the number of beers sold will increase by
approximately 2,680.6 beers. For a temperature of 0˚C, the numbers of beers sold will
be -51,232.0. Although a temperature of 0˚C is possible, the lowest temperature
observed was 20˚C and the lowest amount that can be sold is 0 beers.
Page | 133
4.
Weekly Usage Annual Maintenance X2 XY Y2
(hours) Expense ($1000)
13 1.70 169 22.10 2.8900
10 2.20 100 22.00 4.8400
20 3.00 400 60.00 9.0000
28 3.70 784 10.36 13.6900
32 4.70 1024 150.40 22.0900
17 3.05 289 51.85 9.3025
24 3.25 576 78.00 10.5625
31 3.90 961 120.90 15.2100
40 5.15 1600 206.00 26.5225
38 4.00 1444 152.00 16.0000
2 2
ΣX = 253 ΣY = 34.65 ΣX = 7347 ΣXY = 966.85 ΣY = 130.1075
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌 10(966.85)−253(34.65)
a) 𝑏 = 𝑛 ∑ 𝑋 2 −(∑ 𝑋)2
= 10(7347)−(253)2
= 0.095 344 043 ≈ 0.095
∑𝑌 ∑ 𝑋 34.65 253
𝑎= −𝑏 = − 0.095 344 043 = 1.052 795 688 ≈ 1.053
𝑛 𝑛 10 10
𝑌𝑝 = 𝑎 + 𝑏𝑋 = 1.053 + 0.095𝑋
For each additional hour of weekly usage, the annual maintenance expense increases
approximately 0.095 ($1000). For a weekly usage of 0 hours, the annual maintenance
expense will be approximately 1.053 ($1000).
b) 𝑌𝑝 (25) = 1.052 795 688 + 0.095 344 043(25) = 3.436 396 787 ≈ 3.436 ($1000)
5.
Number of Sales ($1000) X2 XY Y2
Competitors
Within 1 Mile
1 3.6 1 3.6 12.96
1 3.3 1 3.3 10.89
2 3.1 4 6.2 9.61
3 2.9 9 8.7 8.41
3 2.7 9 8.1 7.29
4 2.5 16 10.0 6.25
5 2.3 25 11.5 5.29
5 2.0 25 10.0 4.00
2 2
ΣX = 24 ΣY = 22.4 ΣX = 90 ΣXY = 61.4 ΣY = 64.7
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌 8(61.4)−24(22.4)
a) 𝑏 = = = −0.322 222 222 ≈ −0.32
𝑛 ∑ 𝑋 2 −(∑ 𝑋)2 8(90)−(24)2
∑𝑌 ∑ 𝑋 22.4 24
𝑎= −𝑏 = − (−0.322 222 222) = 3.766 666 667 ≈ 3.77
𝑛 𝑛 8 8
𝑌𝑝 = 𝑎 + 𝑏𝑋 = 3.77 − 0.32𝑋
Page | 134
For each additional competitor, the sales will decrease approximately 0.32 ($1000). If
there are no competitors within 1 mile, the sales will be approximately 3.77 ($1000).
c) 𝑌𝑝 (3) = 3.766 666 667 − 0.322 222 222(3) = 2.8 ≈ 2.80 ($1000)
Page | 135
Lesson 49 – Correlation Analysis
1.
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌
a) 𝑟 =
√𝑛 ∑ 𝑋 2 −(∑ 𝑋)2 √𝑛 ∑ 𝑌 2 −(∑ 𝑌)2
7(2841) − 57(294)
𝑟=
√7(753) − (57)2 √7(13 350) − (294)2
𝑟 = 0.830 868 141 ≈ 0.83
There is a very strong positive correlation between the advertising expenditures and
sales.
2.
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌
a)
√𝑛 ∑ 𝑋 2 −(∑ 𝑋)2 √𝑛 ∑ 𝑌 2 −(∑ 𝑌)2
10(5163) − 45(1111)
𝑟=
√10(217) − (45)2 √10(128 743) − (1111)2
𝑟 = 0.589 182 303 ≈ 0.59
There is a moderate correlation between family size and amount spent on food.
3.
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌
a)
√𝑛 ∑ 𝑋 2 −(∑ 𝑋)2 √𝑛 ∑ 𝑌 2 −(∑ 𝑌)2
10(6,073,633) − 270(211,447)
𝑟=
√10(7426) − (270)2 √10(5,569,844,521) − (211,447)2
𝑟 = 0.943 046 09 ≈ 0.94
There is an extremely strong positive correlation between the temperature and number
of beers sold.
Page | 136
4.
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌
a)
√𝑛 ∑ 𝑋 2 −(∑ 𝑋)2 √𝑛 ∑ 𝑌 2 −(∑ 𝑌)2
10(966.85) − 253(34.65)
𝑟=
√10(7347) − (253)2 √10(130.1075) − (34.65)2
𝑟 = 0.925 298 194 ≈ 0.93
There is an extremely strong positive correlation between the weekly usage and annual
maintenance expense.
5.
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌
a)
√𝑛 ∑ 𝑋 2 −(∑ 𝑋)2 √𝑛 ∑ 𝑌 2 −(∑ 𝑌)2
8(61.4) − 24(22.4)
𝑟=
√8(90) − (24)2 √8(64.7) − (22.4)2
𝑟 = −0.971 536 554 ≈ −0.97
There is an extremely strong negative correlation between the number of competitors
within 1 mile and sales.
Page | 137
Cumulative Review
Descriptive Statistics
1.
Amount Spent on Groceries ($) X2
47.63 2 268.616 9
51.70 2 672.890 0
58.49 3 421. 080 1
60.47 3 656. 620 9
64.15 4 115. 222 5
73.25 5 365.562 5
81.14 6 583.699 6
83.22 6 925.568 4
ΣX = 520.05 ΣX2 = 35 009.260 9
∑𝑋 520.05
a) 𝑋̅ = 𝑛 = 8 = 65.006 25 ≈ $65.01
b) 𝑃𝑜𝑠𝑖𝑡𝑖𝑜𝑛 = 0.5(𝑛 + 1) = 0.5(8 + 1) = 4.5
𝑋4 + 𝑋5 60.47 + 64.15
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝑋4.5 = = = 62.31 ≈ $62.31
2 2
(∑ 𝑋)2 520.052
∑ 𝑋2− 35 009.260 9−
2
c) 𝑠 = 𝑛−1
𝑛
= 7
8
= 171.822 941 1 ≈ 171.82 ($)2
(∑ 𝑋)2 520.052
∑ 𝑋2− 35 009.260 9−
d) 𝑠 = √ 𝑛−1
𝑛
=√ 7
8
= 13.108 125 ≈ $13.11
𝑠 13.108 125
e) 𝐶. 𝑉. = ∙ 100 = ∙ 100 = 20.164 407 27 ≈ 20.2%
𝑋̅ 65.006 25
2.
a) Range = X50 – X1 = 89 – 35 = 54 years
log n log 50
N=1+ =1+ = 6.64
log 2 log 2
R 54
C= = = 8.13
N 6.64
Page | 138
Forbes’ Top 50 Wealthiest People
Ages Midpoint f <cf fX fX2
30 – 40 35 2 2 70 2 450
40 – 50 45 6 8 270 12 150
50 – 60 55 10 18 550 30 250
60 – 70 65 13 31 845 54 925
70 – 80 75 8 39 600 45 000
80 – 90 85 11 50 935 79 475
2
Σf = 50 ΣfX = 3270 ΣfX = 224 250
∑ fX 3270
b) μ = ∑f
= 50
= 65.4 ≈ 65.4 years
0.5 ∑ f − Cfi−1 25 − 18
Median = L + [ ] C = 60 + [ ] 10 = 65.384 615 38 ≈ 65.4 years
fi 13
d)
Ages (years)
Page | 139
3.
a) Range = X45 – X1 = 97 – 31 = 66%
log n log 45
N=1+ =1+ = 6.49
log 2 log 2
R 66
C= = = 10.17
N 6.49
Final Grades
Grades Midpoint f <cf fX fX2
30 – 40 35 2 2 70 2 450
40 – 50 45 3 5 135 6 075
50 – 60 55 5 10 275 15 125
60 – 70 65 10 20 650 42 250
70 – 80 75 13 33 975 73 125
80 – 90 85 8 41 680 57 800
90 – 100 95 4 45 380 36 100
2
Σf = 45 ΣfX = 3165 ΣfX = 232 925
∑ fX 3165
b) μ = ∑f
= 45
= 70. 3̇ ≈ 70.3%
Mode = 75 ≈ 75.0%
d)
Page | 140
Final Grades in Introductory Statistics
14
12
10
8
f
6
4
2
0
Grade (%)
4. a)
∑ fX 292 L
X = ∑ f = 30 = 9.73̇ ≈ 9.7
b) ̅ 100 km
X15 + X16 9 + 10 L
Median = X15.5 = = = 9.5 ≈ 9.5
2 2 100 km
L
Mode = 9 ≈ 9.0
100 km
Page | 141
(∑ fX)2 2922
∑ fX2 − 2924− L
c) s = √ ∑ f−1
∑f
=√ 29
30
= 1.680 175 141 ≈ 1.7 100 km
d)
̅ −Median)
3(X 3(9.73̇−9.5)
e) Sk = s
= 1.680 175 141 = 0.42
5.
∑ fX 26,625
a) μ = ∑f
= 13
= 2048.076 923 ≈ $2048.08/month
∑ fX2 ∑ fX 2 59,328,125 26,625 2
b) σ = √ ∑f
− [ ∑f ] = √ 13
− [ 13 ] = 607.521 884 6 ≈ $607.52/month
c) Position = 0.5(∑ f + 1) = 0.5(13 + 1) = 7
Median = X 7 = 1750 ≈ $1750.00/month
d) Position = 0.25(∑ f + 1) = 0.25(13 + 1) = 3.5
Page | 142
X 3 + X 4 1750 + 1750
Q1 = X 3.5 = = = 1750 ≈ $1750.00/month
2 2
Position = 0.75 (∑ f + 1) = 0.75(13 + 1) = 10.5
X10 + X11 2000 + 2500 $2250.00
Q 3 = X10.5 = = = 2250 ≈
2 2 month
IQR = Q 3 − Q1 = 2250 − 1750 = 500 ≈ $500.00/month
6.
∑ 𝑤𝑋 260.5
𝑋̅𝑤 = = = 74.428 571 ≈ 74.4%
∑𝑤 3.5
7.
(∑ fX)2 317.52
∑ fX2 − 4656.25−
b) s√ =√
∑f 25
∑ f−1 24
= 5.099 019 514 ≈ 5.1 days
8. G. M. = [ 𝑛√𝑋1 𝑋2 𝑋3 𝑋4 − 1]100
4
𝐺. 𝑀. = [ √(1 − 0.3)(1 − 0.25)(1 − 0.15)(1 − 0.1) − 1] 100
Page | 143
𝑋−𝜇 2.44−2.60
9. a) 𝑍 = 𝜎
= 0.08
= −2.00
𝑋−𝜇 2.76−2.60
b) 𝑍 = = = 2.00
𝜎 0.08
c) ~95% according to the empirical rule.
Probability
11.
12.
NS
F
Page | 144
b) P(F or NS) = P(F) + P(NS) − P(F and NS) = 0.67 + 0.54 − 0.25 = 0.96
P(F and NS) 0.25 25
c) P(F|NS) = P(NS)
= 0.67 = 67 = 0.373 134 328 ≈ 0.373
13.
13 1
a) P(Spade) = =
52 4
12 3
b) P(Face Card) = 52 = 13
3
c) P(Face Card and Spade) = 52
d) P(Face Card or Spade) = P(Face Card) + P(Spade) − P(Face Card and Spade)
12 13 3 22 11
P(Face Card or Spade) = + − = =
52 52 52 52 26
14.
a)
15.
338
a) P(Social Media) = 530 = 0.637 735 849 ≈ 0.638
140
b) P(Ages 45 − 54) = 530 = 0.264 150 943 ≈ 0.264
89
c) P(Ages 35 − 44 and Social Media) = 530 = 0.167 924 528 ≈ 0.168
Page | 145
Bicycle Make Probability
Bianshe 0.17
Cannonhill 0.22
Fishim 0.03
Giante 0.29
Trec 0.21
HT 0.08
ΣP(X) = 1.00
17. The events in this problem are mutually exclusive. Let X = the number of complaints.
18.
115
a) P(Female) = 197 = 0.583 756 345 ≈ 0.584
Page | 146
21
b) P(1 Ticket) = 197 = 0.106 598 984 ≈ 0.107
14
c) P(Female and 1 Ticket) = 197 = 0.071 428 571 ≈ 0.071
115
d) P(Female or 1 Ticket) = P(Female) + P(1 Ticket) − P(Female and 1 Ticket) = 197 +
21 14 122
197
− 197 = 197 = 0.619 289 34 ≈ 0.619
19.
20.
TYPE OF FAILURE
System Hardware (H) Software (S) Power (P) No Failure Total
Shutdown
Yes 0.73(0.01) = 0.0073 0.12(0.05) = 0.006 0.88(0.02) = 0.0176 0 0.0309
No 0.27(0.01) = 0.0027 0.88(0.05) = 0.044 0.12(0.02) = 0.0024 0.92 0.9691
Total 0.01 0.05 0.02 0.92 1.00
*Note that we are only considering these 3 types of failures. We assume then that the “No”
outcome is the complement of the “Yes” outcome.
Page | 147
21. X = the number of students who are bussed; n = 10; p = 0.2.
23. X = the number of males chosen for the committee; N = 22; n = 5; S = 10.
10C2∙12C3
a) P(X = 2) = = 0.375 939 849 ≈ 0.375 9
22C5
10C0∙12C5
b) P(X ≥ 1) = 1 − P(X = 0) = 1 − 22C5
= 0.969 924 812 ≈ 0.9699
24.
a)
b)
Page | 148
53.75 60.00 66.25 72.50 78.75 85.00 91.25
Annual Sales ($1000)
X − μ 82,000 − 72,500
P(X > $82,000) = P ( > ) = P(Z > 1.52) = 0.5 − 0.4357
σ 6250
= 0.0643
c)
Inferential Statistics
25. See page 206 for the answer to this question.
𝑋−𝜇 70−69
26. a) 𝑃(𝑥 > 70 𝑖𝑛𝑐ℎ𝑒𝑠) = 𝑃 ( > ) = 𝑃(𝑍 > 0.25) = 0.5 − 0.0987 = 0.4013
𝜎 4
b) Since n ≥ 30, the sampling distribution is normal. One could also conclude that the
shape of the sampling distribution is normal because the population distribution itself is
normal.
c) The mean of the sampling distribution is 69 inches.
𝜎 4
d) The standard error of the mean is 𝜎𝑥̅ = = = 0. 6̇ 𝑖𝑛𝑐ℎ. The FCF does not need to
√𝑛 √36
𝑛 36
be applied because 𝑁
= 1200
= 0.03 < 0.05.
Page | 149
𝑋−𝜇 ̅
70−69
e) 𝑃(𝑋̅ > 70 𝑖𝑛𝑐ℎ𝑒𝑠) = 𝑃 ( 𝜎 > 0.6̇ ) = 𝑃(𝑍 > 1.50) = 0.5 − 0.4332 = 0.0668
̅
𝑥
27. a) Although n < 30, the sampling distribution is normal because the population distribution
is normally distributed.
𝑠 8
b) The standard error of the mean is 𝜎𝑥̅ = = = 2 ≈ 2.0 seconds.
√𝑛 √16
28. a) Since n ≥ 30, the sampling distribution is normal. It does not matter what the shape of
the population distribution is because the sample size is large.
𝜎 40 000
b) The standard error of the mean is 𝜎𝑥̅ = = = 5656.854 249 ≈ $5700.
√𝑛 √50
29. Since n ≥ 30, the sampling distribution is normal. The sample mean is 680 hours and the
𝜎 120
standard error of the mean is 𝜎𝑥̅ = = = 10 ≈ 10 ℎ𝑜𝑢𝑟𝑠.
√𝑛 √144
𝑋̅ − 𝜇 680 − 700
𝑃(𝑋̅ < 680 ℎ𝑜𝑢𝑟𝑠) = 𝑃 ( < ) = 𝑃(𝑍 < −2.00) = 0.5 − 0.4772
𝜎𝑥̅ 10
= 0.0228
30. Since n ≥ 30, the sampling distribution is normal. The t distribution should be used because
the population standard deviation is unknown. The sample mean is 60 000 km and the
𝑠 𝑁−𝑛 3300 640−100
standard error of the mean is 𝑠𝑥̅ = √ = √ = 303.361 449 5 𝑘𝑚 ≈
√𝑛 𝑁−1 √100 640−1
300 𝑘𝑚. Note that the finite correction factor must be applied with N = 160(4) = 640 tires.
𝑃[(60 000 − 601.869 115 7) < 𝜇 < (60 000 + 601.869 115 7)] = 0.95
We are 95% confident that true mean mileage that the utility can expect from its tires will
fall between 59,400 and 60,600 km.
𝑍𝜎 2 2.575(6) 2
31. 𝑛=[ ] =[ ] =59.675 625
𝐸 2
Page | 150
𝑋 84
32. The point estimate for this discrete random variable is 𝑝̅ = 𝑛 = 120 = 0.7. Since n𝑝̅ ≥ 0.05,
120(0.7) = 84 and n(1 – ̅𝑝) ≥ 0.05, 120(0.3) = 36, the sampling distribution is normal. The
𝑝̅ (1−𝑝̅ ) 0.7(0.3)
standard error of the proportion is 𝜎𝑝̅ = √ =√ = 0.041 833 001 ≈ 0.042.
𝑛 120
The FCF is not applied since N is unknown.
𝑃[(0.7 − 0.081 992 682) < 𝑝 < (0.7 + 0.081 992 682)] = 0.95
We are 95% confident that the true proportion of shoppers at the Town and Country Mall
making at least one purchase will fall between 0.618 and 0.782.
𝑍 2 1 2 2.33 2
33. a) 𝑛 = 𝑝(1 − 𝑝) [𝐸 ] = 3 (3) [0.03] = 1340.469 136
𝑍 2 1 1 2.33 2
b) 𝑛 = 𝑝(1 − 𝑝) [𝐸 ] = 2 (2) [0.03] = 1508.027 778
34. The population distribution is normally distributed so the sampling distribution is normal too
regardless of the sample size. We will have to use the Student t distribution. The sample
statistics are 𝑋̅ = 15.85 ounces and 𝑠 = 0.242 899 156 ≈ 0.24 ounces. The standard error
𝑠 0.242 899 156
of the mean is 𝑠𝑥̅ = = = 0.099 163 165 ≈ 0.10 ounces.
√𝑛 √6
𝑃[(15.85 − 0.199 813 777) < 𝜇 < (15.85 + 0.199 813 777)] = 0.90
We are 90% confident that the true mean weight of a package of sunflower seeds from the
current shipment will be between 15.65 and 16.05 ounces.
Page | 151
35. a) 𝐻0 : 𝜇 = 35 𝑝𝑝𝑚 and 𝐻1 : 𝜇 < 35 𝑝𝑝𝑚
b) 𝐻0 : 𝜇 = 5 𝑘𝑚 and 𝐻1 : 𝜇 > 5 𝑘𝑚
Reject
-3 -2 -1 0 1 2 3
Z
Page | 152
37. Step 1: Formulate the null and alternative hypothesis.
H0 : μ = 4.9% H1 : μ < 4.9%
Step 2: State the level of significance, α.
α = 0.01
Step 3: Determine the test distribution to use.
The population distribution is normally distributed, so the sampling distribution is normal
too regardless of the sample size. Since the population standard deviation is unknown, we
will have to use the t distribution. The sample statistics are 𝑥̅ = 3.441 6̇ ≈ 3.4% and 𝑠 =
𝑠 0.652 907 251
0.652 907 251 ≈ 0.7%. The standard error of the mean is 𝑠𝑥̅ = = =
√𝑛 √12
0.188 478 088 ≈ 0.2%.
Step 4: Establish a decision rule.
The decision rule is one-tailed (left tail). The critical value is t = -2.718. The decision rule
could be stated as we will reject H0 if t < -2.718, otherwise do not reject H0.
Reject
-4 -3 -2 -1 0 1 2 3 4
t
The sample evidence indicates that the mean carbon monoxide level is significantly less than
4.9% at the 1% level of significance.
Page | 153
38. Step 1: Formulate the null and alternative hypothesis.
H0 : p = 0.005 H1 : p > 0.005
Step 2: State the level of significance, α.
α = 0.02
Step 3: Determine the test distribution to use.
The point estimate for this discrete random variable is p = 0.005. Since np ≥ 0.05,
1000(0.005) = 5 and n(1 – p) ≥ 0.05, 1000(0.995) = 995, the sampling distribution is normal.
𝑝̅ (1−𝑝̅)
The Z distribution may be used. The standard error of the proportion is 𝜎𝑝̅ = √ 𝑛
=
0.005(0.995)
√ = 0.002 230 47 ≈ 0.0022.
1000
Step 4: Establish a decision rule.
The decision rule is one-tailed (right tail). The critical value is Z = 2.05. The decision rule
could be stated as we will reject H0 if Z > 2.05, otherwise do not reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
Page | 154
39. Step 1: Formulate the null and alternative hypothesis.
(D – Day shift; A – Afternoon shift)
H0 : μD = μA or H0 : μD − μA = 0
H1 : μD < μA or H1 : μD − μA < 0
Step 2: State the level of significance, α.
α = 0.05
Step 3: Determine the test distribution to use.
Since nD and nA are both ≥ 30, the sampling distribution is normal. The population standard
deviations are known, so the Z distribution can be used. The mean of the sampling
𝜎 2 𝜎𝐴 2
distribution is 0 sales/week and the standard error of the mean is 𝜎𝑋̅𝐷 −𝑋̅𝐴 = √ 𝑛𝐷 + 𝑛𝐴
=
𝐷
212 282
√ + = 4.607 964 12 ≈ 4.6 𝑢𝑛𝑖𝑡𝑠.
54 60
Step 4: Establish a decision rule.
The decision rule is one-tailed (left tail). The critical values is Z = -1.645. The decision rule
could be stated as we will reject H0 if Z < -1.645, otherwise do not reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
H0. The sample evidence indicates that the number of units produced by afternoon shift is
not significantly greater than the number of units produced by the day shift at the 5% level
of significance.
Page | 155
40. Step 1: Formulate the null and alternative hypothesis.
(1 – Sydney to Melbourne; 2 – Melbourne to Sydney)
𝐻0 : 𝜇1 = 𝜇2 or 𝐻0 : 𝜇1 − 𝜇2 = 0
𝐻1 : 𝜇1 > 𝜇2 or 𝐻1 : 𝜇1 − 𝜇2 > 0
Step 2: State the level of significance, α.
α = 0.05
Step 3: Determine the test distribution to use.
Since the populations distributions are normal, the sampling distribution is normal. Since
the population standard deviations are unknown, the Student t distribution must be used.
The mean of the sampling distribution is 0 minutes and the standard error of the mean is
(𝑛1 −1)𝑠1 2 +(𝑛2 −1)𝑠2 2 1 1 9(225)+5(185) 1 1
𝑠𝑥̅1 −𝑥̅2 = √ [𝑛 + 𝑛 ] = √ [10 + 6] = 7.496 030 696 ≈
𝑛1 +𝑛2 −2 1 2 10+6−2
7.5 𝑚𝑖𝑛. Note that the variances were provided in the table.
Step 4: Establish a decision rule.
The decision rule is one-tailed (right tail). The critical value is t = 1.761 for 14 degrees of
freedom. The decision rule could be stated as we will reject H0 if t > 1.761, otherwise we
will not reject H0.
Reject
-4 -3 -2 -1 0 1 2 3 4
t
The sample evidence indicates that the mean flight time from Sydney to Melbourne is
significantly greater than the mean flight time from Melbourne to Sydney at the 5% level of
significance.
Page | 156
41. Step 1: Formulate the null and alternative hypothesis.
(n – new campaign; c – current campaign)
H0 : pn = pc or H0 : pn − pc = 0
H1 : pn > pc or H1 : pn − pc > 0
Step 2: State the level of significance, α.
α = 0.10
Step 3: Determine the test distribution to use.
Since n𝑝̅ ≥ 5 and n(1 – 𝑝̅ ) ≥ 5 for both samples, the sampling distribution is normal. The Z
𝑥 +𝑥 124+35
distribution can be used. The pooled estimate is 𝑝̅𝑐 = 𝑛𝑛 +𝑛𝑐 = 400+100 = 0.318 where
𝑛 𝑐
𝑋𝑛 = 𝑛𝑝̅ = 400(0.31) = 124 and 𝑋𝑐 = 𝑛𝑝̅ = 100(0.35) = 35. The standard error of the
𝑝̅ (1−𝑝̅ ) 𝑝̅ (1−𝑝̅) 0.318(0.682) 0.318(0.682)
proportion is 𝑠𝑝̅𝑛 −𝑝̅𝑐 = √ + =√ + = 0.052 066 784 ≈
𝑛𝑛 𝑛𝑐 400 100
0.052.
Step 4: Establish a decision rule.
The decision rule is one-tailed (right tail). The critical value is Z = 1.28. The decision rule
could be stated as we will reject H0 if Z > 1.28, otherwise do not reject H0.
Reject
-3 -2 -1 0 1 2 3
Z
The sample evidence indicates that the proportion of viewers who find the new ad
campaign highly attractive is not significantly greater than the proportion of viewers who
find the current campaign highly attractive at the 10% level of significance.
Page | 157
42.
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌 12(5560)−28(2360)
b) 𝑏 = = = 6.153 846 154 ≈ 6.2
𝑛 ∑ 𝑋 2 −(∑ 𝑋)2 12(74)−(28)2
∑𝑌 ∑ 𝑋 2360 28
𝑎= −𝑏 = − 6.153 846 154 = 182.307 692 3 ≈ 182.3
𝑛 𝑛 12 12
𝑌𝑝 = 𝑎 + 𝑏𝑋 = 182.3 + 6.2𝑋
c) The selling price of a condominium increases ~$6200 for each increase of 1 bedroom. The
interpretation of the intercept is not relevant. It may be interpreted that the selling price of
a condominium with no bedrooms is ~182,300.
𝑛 ∑ 𝑋𝑌−∑ 𝑋 ∑ 𝑌
d) 𝑟 =
√𝑛 ∑ 𝑋 2 −(∑ 𝑋)2 √𝑛 ∑ 𝑌 2 −(∑ 𝑌)2
12(5560) − 28(2360)
𝑟=
√12(74) − (28)2 √12(466 850) − (2360)2
𝑟 = 0.347 579 78 ≈ 0.35
There is a weak positive correlation between the number of bedrooms and the selling price
of a condominium.
Page | 158