RP ch06
RP ch06
The following problems have been reserved for your use in assignments and testing and do not
appear in student versions of the text.
An independent research was made asking 500 people about their bank deposits. Using the data
in the table, calculate the deposit sample mean and deposit sample standard deviation.
SOLUTION
First of all, we need to multiple each deposit amount by the number of depositors:
Thus,
Therefore, the deposit sample standard deviation is
A consignment of 100 half-inch nuts was released from the factory. The diameters of 10
randomly taken nuts were measured. Calculate the sample mean and sample standard deviation
using the data in the table.
Number 1 2 3 4 5 6 7 8 9 10
of nut
Diameter, 0.4983 0.5030 0.4982 0.4986 0.5010 0.5009 0.4984 0.4993 0.5014 0.5010
inch
SOLUTION
Sample average:
Sample variance:
During the second part of April Chicago Bulls had 6 games with Boston Celtics. The number of
points, scored in each game is in the table. Calculate the sample mean and sample standard
deviation using the data in the table.
No 1 2 3 4 5 6
Number of 90 95 87 95 97 83
points
SOLUTION
Sample average:
Sample variance:
An independent research was made in a dog shelter to measure the heights of dogs of different
breeds. Eight dogs were evaluated for this experiment. The results are shown in the table below.
Calculate the sample mean and sample standard deviation using the data in the table.
Number 1 2 3 4 5 6 7 8
of dog
Dog’s 0.75 0.15 0.35 0.60 0.62 0.58 0.75 0.76
height, m
SOLUTION
Sample average:
Sample variance:
Sample standard deviation:
Suppose that you add 2 to all observations in a sample. How does this change the sample mean?
How does it change the sample standard deviation?
SOLUTION
The United States has an aging infrastructure as witnessed by several recent disasters, including
the I-35 bridge failure in Minnesota. Most states inspect their bridges regularly and report their
condition (on a scale from 1–7) to the public.
5.08 5.44 6.66 5.07 6.04 5.43 4.83 4.00 4.41 3.61
7.00 5.72 3.48 6.43 3.97 4.19 6.26 5.27 5.26 5.48
4.95 5.76 4.93 5.61 4.66 7.00 5.57 3.42 5.91 4.54
Find the sample mean and the sample standard deviation of these condition numbers.
SOLUTION
Sample mean:
Sample variance:
Sample standard deviation:
In an attempt to measure the effects of acid rain, researchers measured the pH (7 is neutral and
values below 7 are acidic) of water collected from rain in Ingham County, Michigan.
5.47 5.37 5.38 4.63 4.34 5.02 3.71 4.96 4.64 5.94
5.54 5.39 4.16 5.62 4.57 5.90 5.48 4.57 4.57 4.51
4.62 4.56 4.61 4.32 3.98 5.56 4.15 3.98 5.65 3.10
5.04 4.62 4.88 4.34 4.16 4.64 5.12 3.71 4.80 5.59
Find the sample mean and sample standard deviation of these measurements.
SOLUTION
Sample mean:
Sample variance:
An independent research was made to calculate the mean weight of elder people in Russia. 50
men and women were interviewed for this research. Construct a stem-and-leaf diagram for the
data. Calculate the median and quartiles of these data. All values are in kg.
74 89 84 85 57
81 78 79 77 83
94 82 57 75 82
77 60 79 76 97
99 70 97 57 69
84 92 82 91 85
76 92 76 63 69
58 64 59 89 91
96 85 80 59 58
94 57 79 89 87
SOLUTION
Leaf
Stem
77778899
5
03499
6
045666778999
7
012223445557999
8
1122446779
9
An independent research was made to calculate the mean weight of elder people in Russia. 50
men and women were interviewed for this research. Calculate the sample median, mode, and
mean of the data. Explain how these three measures of location describe different features of the
data. All values are in kg.
74 89 84 85 57
81 78 79 77 83
94 82 57 75 82
77 60 79 76 97
99 70 97 57 69
84 92 82 91 85
76 92 76 63 69
58 64 59 89 91
94 85 80 59 58
94 57 79 89 87
SOLUTION
The median is halfway between 25th and 26th, so it is the average of the 25th and 26th values and it
equals 79.5.
The table represents the answers of 80 respondents to the survey “How much sports trainings do
you have every year?” carried out among college students. Construct a stem-and-leaf diagram for
the data. Calculate the median and quartiles of these data.
SOLUTION
(a) Construct a stem-and-leaf diagram.
0112268889
12
11345
13
02223345667888999
14
34455677778
15
016678999
16
3378
17
01223347788889999
18
001279
19
0
20
The table represents the answers of 80 respondents to the survey “How much sports trainings do
you have every year?” carried out among college students. Calculate the sample median, mode,
and mean of the data. Explain how these three measures of location describe different features of
the data.
SOLUTION
The median is halfway between 40th and 41th, so it is the average of the 40th and 41th values and it
equals 157.
Modes are 157, 188 and 189 as the most frequent data value.
Sample mean:
The United States has an aging infrastructure as witnessed by several recent disasters, including
the I-35 bridge failure in Minnesota. Most states inspect their bridges regularly and report their
condition (on a scale from 1–7) to the public.
Here are the condition numbers from a sample of 30 bridges in New York State:
5.08 5.44 6.66 5.07 6.80 5.43 4.83 4.00 4.41 4.38
7.00 5.72 4.53 6.43 3.97 4.19 6.26 6.72 5.26 5.48
4.95 6.33 4.93 5.61 4.66 7.00 5.57 3.42 5.18 4.54
N = 30
Do any of the bridges appear to have unusually good or poor ratings? If so, compute the mean
with and without these bridges and comment.
SOLUTION
(a) N = 30
1 34
2 39
6 4 0134
12 4 556899
(7) 5 0012444
11 5 567
8 6 234
5 6 678
2 7 00
In an attempt to measure the effects of acid rain, researchers measured the pH (7 is neutral and
values below 7 are acidic) of water collected from rain in Ingham County, Michigan.
5.47 5.37 5.38 4.63 5.37 3.74 3.71 4.96 4.64 5.11
5.65 5.39 4.16 5.62 4.57 4.64 5.48 4.57 4.57 4.51
4.86 4.56 4.61 4.32 3.98 5.70 4.15 3.98 5.65 3.10
5.04 4.62 4.51 4.34 4.16 4.64 5.12 3.71 4.64 5.59
1 3 1
4 3 777
6 3 99
9 4 111
11 4 33
17 4 555555
(7) 4 6666666
16 4 89
14 5 011
11 5 3333
7 5 445
4 5 6667
N=40. Leaf Unit = 0.010
1 31
4 3 777
6 3 99
9 4 111
11 4 33
B 17 4 555555
(7) 4 6666666
16 4 89
14 5 011
11 5 3333
7 5 445
4 5 6667
N=40. Leaf Unit = 0.10
1 31
4 3 777
6 3 99
9 4 111
11 4 33
C 17 4 555555
(6) 4 666666
16 4 89
14 5 011
11 5 3333
7 5 445
4 5 6667
D N=40. Leaf Unit = 0.10
1 3 1
3 3 77
6 3 999
9 4 111
11 4 33
17 4 555555
(6) 4 666666
16 4 89
14 5 0111
10 5 333
7 5 445
4 5 6667
Many scientists consider rain with pH below 5.3 to be acid rain. What percentage of these
samples could be consider as acid rain?
SOLUTION
The following data represent the yield on 90 consecutive batches of ceramic substrate to which a
metal coating has been applied by a vapor-deposition process.
94.1 86.1 95.3 84.9 88.8 84.6 94.4 84.1
93.2 90.4 94.1 78.3 86.4 83.6 96.1 83.7
90.6 89.1 97.8 89.6 85.1 85.4 98.0 82.9
91.4 87.3 93.1 90.3 84.0 89.7 85.4 87.3
88.2 84.1 86.4 93.1 93.7 87.6 86.6 86.4
86.1 90.1 87.6 94.6 87.7 85.1 91.7 84.5
95.1 95.2 94.1 96.3 90.6 89.6 87.5
90.0 86.1 92.1 94.7 89.4 90.0 84.2
92.4 94.3 96.4 91.1 88.6 90.1 85.1
87.3 93.2 88.2 92.4 84.1 94.3 90.5
86.6 86.7 86.4 90.6 82.6 97.3 95.6
91.2 83.0 85.0 89.1 83.1 96.8 88.3
1 7e|8
7 8t|223333
21 8f|44444444555555
38 8s|66666666667777777
A (11) 8e|88888999999
41 9z|00000000001111
27 9t|22223333
19 9f|444444445555
7 9s|666677
1 9e|8
B N=90. Leaf Unit = 10
1 7e|8
7 8t|223333
21 8f|44444444555555
38 8s|66666666667777777
(13) 8e|88888999999
41 9z|00000000001111
27 9t|22223333
19 9f|444444445555
7 9s|666677
1 9e|8
N=90. Leaf Unit = 1
1 7e|8
6 8t|223333
14 8f|44444444555555
17 8s|66666666667777777
C (11) 8e|88888999999
14 9z|00000000001111
8 9t|22223333
12 9f|444444445555
6 9s|666677
1 9e|8
N=90. Leaf Unit = 10
1 7e|8
7 8t|223333
20 8f|4444444555555
37 8s|6666666667777777
D (11) 8e|88888999999
39 9z|000000000111
27 9t|22223333
18 9f|44444445555
7 9s|666677
1 9e|8
Calculate the median of these data. Calculate the quartiles of these data.
The net energy consumption (in billions of kilowatt-hours) for countries in Asia in 2003 was as
follows
Billions of Kilowatt-Hours
Afghanistan 1.04
Australia 200.66
Bangladesh 16.20
Burma 6.88
China 1671.23
Hong Kong 38.43
India 519.04
Indonesia 101.80
Japan 946.27
Korea, North 17.43
Korea, South 303.33
Laos 3.30
Malaysia 73.63
Mongolia 2.91
Nepal 2.30
New Zealand 37.03
Pakistan 71.54
Philippines 44.48
Singapore 30.89
Sri Lanka 6.80
Taiwan 154.34
Thailand 107.34
Vietnam 36.92
Total 4393.8
(18) 0|000000000000000111
A 5 0|23
3 0|5
2 0|9
1 1|6
N=24, Leaf Unit = 100
(18) 0|000000000000000111
6 0|23
B 4 0|5
3 0|9
2 1|6
1 4|3
N=23, Leaf Unit = 100
(17) 0|00000000000000111
C 5 0|23
3 0|5
2 0|9
1 1|6
D N=24, Leaf Unit = 100
(19) 0|0000000000000000111
5 0|23
3 0|5
2 0|9
1 1|6
Compute the sample mean, sample standard deviation, and sample median:
SOLUTION
Do not use the total as an observation. There are 23 observations. (N=23, Leaf Unit = 100)
Sample mean:
Sample median is at 12th = 38.43, or, rounded to the nearest integer, M = 38 bln kW-h.
The United States Golf Association tests golf balls to ensure that they conform to the rules of
golf. Balls are tested for weight, diameter, roundness, and overall distance. The overall distance
test is conducted by hitting balls with a driver swung by a mechanical device nicknamed “Iron
Byron” after the legendary great Byron Nelson, whose swing the machine is said to emulate.
Following are 100 distances (in yards) achieved by a particular brand of golf ball in the overall
distance test.
1 22|6
5 23|2334
8 23|677
16 24|00112444
20 24|5578
A 33 25|0111122334444
46 25|5555556677899
(15) 26|000011123334444
39 26|56677888
31 27|0000112222233333444
12 27|66788999
4 28|003
1 28|5
N=100, Leaf Unit = 1
1 22|6
5 23|2334
8 23|677
16 24|00112444
20 24|5578
B 33 25|0111122334444
46 25|5555556677899
(14) 26|00011123334444
39 26|56677888
31 27|0000112222233333444
12 27|66788999
4 28|003
1 28|5
C N=100, Leaf Unit = 1
1 22|6
5 23|2334
8 23|677
16 24|00112444
20 24|5578
33 25|0111122334444
46 25|5555556677899
(16) 26|0000111123334444
39 26|56677888
31 27|0000112222233333444
12 27|66788999
4 28|003
1 28|5
N=100, Leaf Unit = 1
1 22|6
5 23|2334
8 23|677
16 24|00112444
20 24|5578
D 34 25|01111122334444
45 25|555556677899
(15) 26|000011123334444
40 26|566778888
30 27|000011222233333444
12 27|66788999
4 28|003
1 28|5
Compute the sample mean, sample standard deviation, and the sample median.
What is the 90th percentile of distances?
SOLUTION
Sample mean:
(yards)
Sample standard deviation:
(yards)
Sample median: M = 260.85 yards
2 63|47
7 64|24899
16 65|223566899
35 66|0000001233455788899
48 67|002245567899
A (17) 68|00001111233333458
55 69|0000112345555677889
36 70|011223444556
24 71|0057889
17 72|000012234447
5 73|59
3 74|68
1 76|3
N=120, Leaf Unit = 1.0
2 63|47
7 64|24899
16 65|223566899
35 66|0000001233455788899
48 67|002245567899
B (19) 68|0000111112333333458
55 69|0000112345555677889
36 70|011223444556
24 71|0057889
17 72|000012234447
5 73|59
3 74|68
1 76|3
C N=120, Leaf Unit = 1.0
2 63|47
7 64|24899
16 65|223566899
34 66|000001233455788899
48 67|002245567899
(17) 68|00001111233333458
55 69|0000112345555677889
36 70|011223444556
24 71|0057889
16 72|00012234447
5 73|59
3 74|68
1 76|3
N=120, Leaf Unit = 10
2 63|47
7 64|24899
16 65|223566899
35 66|0000001233455788899
48 67|002245567899
D (17) 68|00001111233333458
55 69|0000112345555677889
36 70|011223444556
24 71|0057889
17 72|000012234447
5 73|59
3 74|68
1 76|3
Compute the sample mean, the sample standard deviation, and the sample median.
SOLUTION
Sample mean:
(MHz)
Sample standard deviation:
(MHz)
Sample median M = 683 MHz
The following times (in seconds) were captured for athletes running the 100 meter race.
12.1 13.2 11.8 14.7 15.2 16.4 14.9 14.8 14.8 14.7
13.2 15.1 16.2 11.9 11.7 14.0 15.0 15.1 17.0 16.2
14.2 14.6 14.6 14.8 15.1 15.3 13.6 14.2 16.2 12.2
16.0 12.7 12.9 13.7 16.1 14.4 16.7 12.5 13.4 12.9
A
B
D
E
SOLUTION
In the data provided we can observe seven results between 16.0 and 17.0 seconds
, and one result of 17.0 seconds. It allows us to exclude diagrams (b), (c), (d),
and (e).
So, the correct answer is (a). Try to build the histogram yourself to check it out.
A
B
D
E
SOLUTION
Now, one can notice that in histogram (d) mean is greater, than the median, due to lots of values in the box just right to the middle one.
The distribution presented on histogram (e) has two different modes, but it cannot have two means, so this option is not correct as well.
The following table contains frequency and relative frequency distribution for resistance of different specimens. X is the resistance measured in Ohms.
Class
Frequency 5 6 12 22 17 12 6
Relative
frequency 0.0625 0.0750 0.1500 0.2750 0.2125 0.1500 0.0750
Determine the cumulative relative frequency for the specimens with the resistance between 250 and 300 Ohms.
SOLUTION
First, calculate the cumulative relative frequency for the specimens with the resistance between 250 and 300 – it equals the sum of all relative
frequencies for speciments with the resistance less than 300 Ohms.
Therefore, .
Reserve Problems Chapter 6 Section 3 Problem 4
According to the data, obtained from the daily temperature observation, we obtained the
following information, shown in the table.
Temperature,
Frequency
°C
12.1 7
13.2 9
15.1 12
15.6 15
15.7 17
16.0 31
16.3 22
16.9 42
17.1 51
17.7 44
18.2 42
18.6 27
18.8 32
19.0 13
19.1 4
A
B
D
E
SOLUTION
(a)
(b)
(c)
(d)
(e)
First, consider histogram (b). It shows about 60 observations between 15.5 and 16.5 degrees, but
the sum of frequencies from 15.6°C to 16.3°C is 85, so the histogram is incorrect.
Histogram (c) has an outstanding value above 20°C, and histogram (d) has one below 10°C, so
they do not correspond to our data as well.
The sum of frequencies of values between 16.5°C and 17.5°C is 93, but histogram (e) represents
less than 60.
The following data represent the yield on 90 consecutive batches for ceramic substrate to which
a metal coating has been applied by a vapor-deposition process.
B
C
E
SOLUTION
Class
Frequency 1 0 11 19 14
Class
Frequency 18 11 12 4
The net energy consumption (in billions of kilowatt-hours) for countries in Asia in 2003 was as
follows.
Billions of Kilowatt-Hours
Afghanistan 1.04
Australia 200.66
Bangladesh 16.20
Burma 6.88
China 1671.23
Hong Kong 38.43
India 519.04
Indonesia 101.80
Japan 946.27
Korea, North 17.43
Korea, South 303.33
Laos 3.30
Malaysia 73.63
Mongolia 2.91
Nepal 2.30
New Zealand 37.03
Pakistan 71.54
Philippines 44.48
Singapore 30.89
Sri Lanka 6.80
Taiwan 154.34
Thailand 107.34
Vietnam 36.92
Total 4393.8
B
C
D
E
SOLUTION
Class
Frequency 15 4 1 1 0
Class
Frequency 1 0 0 1
The united States Golf Association tests golf balls to ensure that they conform to the rules of
golf. Balls are tested for weight, diameter, roundness and overall distance. The overall distance
test is conducted by hitting balls with a driver swung by a mechanical device nicknamed "Iron
Byron" after the legendary great Byron Nelson, whose swing the machine is said to emulate.
Following are 100 distances (in yards) achieved by a particular brand of golf ball in the overall
distance test.
A
B
C
D
Does it convey the same information as the prsented stem-and-leaf display for this data?
SOLUTION
Class Frequency
1
1
4
6
6
9
14
13
12
11
13
8
2
Stem-and-leaf display. Yard: unit = 1.0
1 22 | 6
5 23 | 2334
8 23 | 677
16 24 | 00112444
20 24 | 5578
33 25 | 0111122334444
46 25 | 5555556677899
(15) 26 | 000011123334444
39 26 | 56677888
31 27 | 0000112222233333444
12 27 | 66788999
4 28 | 003
1 28 | 5
Yes, the histogram shows the same shape as the stem-and-leaf display.
680 669 719 699 670 710 722 663 658 634 720 690
677 669 700 718 690 681 702 696 692 690 694 660
679 675 701 721 683 735 688 763 672 698 659 704
681 679 691 683 705 746 706 649 668 672 690 724
652 720 660 695 701 724 668 698 668 660 680 739
717 727 653 637 660 693 679 682 724 642 704 695
704 652 664 702 661 720 695 670 656 718 660 648
683 723 710 680 684 705 681 748 697 703 660 722
662 644 683 695 678 674 656 667 683 691 680 685
681 715 665 676 665 675 655 659 720 675 697 663
Construct a frequence distribution for these data. Use 14 bins.
B
C
Does it convey the same information as the prsented stem-and-leaf display for this data?
SOLUTION
Class Frequency
1
3
5
17
13
24
11
19
5
16
1
2
2
1
2 63|47
7 64|24899
16 65|223566899
35 66|0000001233455788899
48 67|0022455567899
(17) 68|00001111233333458
55 69|0000112345555677889
36 70|011223444556
24 71|0057889
17 72|000012234447
5 73|59
3 74|68
1 75|
1 76|3
A Yes, the histogram shows the same shape as the stem-and-leaf display.
The United States has an aging infrastructure as witnessed by several resent disasters, including
the I-35 bridge failure in Minnesota. Most states inspect their bridges regularly and report their
condition (on a scale from 1-7) to the public. Here are condition numbers from a sample of 30
bridges in New York State:
5.08 5.44 6.66 5.07 6.80 5.43 4.83 4.00 4.41 4.38
7.00 5.72 4.53 6.43 3.97 4.19 6.26 6.72 5.26 5.48
4.95 6.33 4.93 5.61 4.66 7.00 5.57 3.42 5.18 4.54
A
B
C
D
SOLUTION
Class Frequency
1
5
13
7
4
In an attempt to measure the effects of acid rain, researchers measured the pH (7 is neutral and
values below 7 are acidic) of water collected from rain in Ingham County, Michigan.
5.47 5.37 5.38 4.63 5.37 3.74 3.71 4.96 4.64 5.11
5.65 5.39 4.16 5.62 4.57 4.64 5.48 4.57 4.57 4.51
4.86 4.56 4.61 4.32 3.98 5.70 4.15 3.98 5.65 3.10
5.04 4.62 4.51 4.34 4.16 4.64 5.12 3.71 4.64 5.59
B
C
D
E
SOLUTION
Class Frequency
1
5
5
15
9
5
Reserve Problems Chapter 6 Section 3 Problem 11
Cloud seeding, a process in which chemicals such as silver iodide and frozen carbon dioxide are
introduced by aircraft into clouds to promote rainfall was widely used in the 20-th century.
Recent research has questioned its effectiveness [Journal of Atmospheric Research (2010, Vol.
97 (2), pp. 513-525)]. An experiment was performed by randomly assigning 52 clouds to be
seeded or not. The amount of rain generated was then measured in acre-feet. Here are the data
for the unseeded and seeded clouds:
Unseeded:
81.2 26.1 95.0 41.1 28.6 21.7 11.5 68.5 345.5 321.2
1202.6 1.0 4.9 163.0 372.4 244.3 47.3 87.0 26.3 24.4
830.1 4.9 36.6 147.8 17.3 29.0
Seeded:
Construct a frequence distribution for the combined cloud-seeding rain measurements. Use 6
bins.
B
C
SOLUTION
Class Frequency
35
11
3
2
0
1
In the 2000 Sydney Olympics, a special program initiated by IOC president Juan Antonio
Samaranch allowed developing countries to send athletes to the Olympics without the usual
qualifying procedure. Here are the 71 times for the first round of the 100 meter men's swim (in
seconds).
B
C
SOLUTION
Class Frequency
41
27
2
0
0
0
0
0
1
The driver's reaction time in response to a particular potential traffic hazard is the time required
from the point of initial detection of the hazard in one’s field of view to the time that vehicle
control components are actuated (such as movement of one’s foot to the brake pedal). The
following data represents the measurements of the driver’s reaction time in seconds for the male
and female participants of the experiment:
(a) Calculate the sample mean of the data. Calculate the sample variance of the data. Calculate
the corresponding standard deviation.
SOLUTION
Sample variance:
(b)
(c)
The weather forecast for New York for the period from 1 May 2017 to 15 May 2017 represents
the following set of maximal temperatures in °F: 75, 71, 61, 59, 64, 61, 62, 59, 62, 65, 65, 65, 68,
72, 72.
(a) Calculate the sample mean, sample variance, and standard deviation.
(c) Construct a box plot of the data. Find the interquartile range and determine the number of
outliers.
SOLUTION
Sample variance:
Sample standard deviation:
(b)
(c)
(a) Calculate the sample mean, sample variance, and standard deviation.
(c) Construct a box plot of the data. Find the interquartile range.
SOLUTION
Before analyzing the results, they should be converted to minutes. Therefore, the set of results in
minutes is: 55.23, 55.35, 56.00, 56.20, 56.33, 56.38, 56.60, 56.62, 56.83, 57.13, 58.68, 58.73,
58.80, 58.88, 59.23, 59.50, 59.50, 60.25.
Sample variance:
(b)
(c)
Therefore, there is no outliers and the box plot is
The data below represents the 14 greatest net capacities in GW of currently operated nuclear
power plants with the current net capacities over 1000 MW for three different countries.
(a) Calculate the sample mean, sample variance, and standard deviation.
(c) Construct a box plot of the data. Find the interquartile range and determine the number of
outliers.
SOLUTION
Sample variance:
Sample standard deviation:
(b)
USA:
France:
Japan:
(c)
USA:
The outliers are 3.300 and 4.214. , therefore, 3.300 is an outlier and 4.214 is
an extreme outlier.
France:
There is no outliers.
Japan:
The United States has an aging infrastructure as witnessed by several resent disasters, including
the I-35 bridge failure in Minnesota. Most states inspect their bridges regularly and report their
condition (on a scale from 1-7) to the public. Here are condition numbers from a sample of 30
bridges in New York State:
5.08 5.44 6.66 5.07 6.80 5.43 4.83 4.00 4.41 4.38
7.00 5.72 4.53 6.43 3.97 4.19 6.26 6.72 5.26 5.48
4.95 6.33 4.93 5.61 4.66 7.00 5.57 3.42 5.18 4.54
B
C
SOLUTION
(a)
(b)
(c)
In an attempt to measure the effects of acid rain, researchers measured the pH (7 is neutral and
values below 7 are acidic) of water collected from rain in Ingham County, Michigan.
5.47 5.37 5.38 4.63 5.37 3.74 3.71 4.96 4.64 5.11
5.65 5.39 4.16 5.62 4.57 4.64 5.48 4.57 4.57 4.51
4.86 4.56 4.61 4.32 3.98 5.70 4.15 3.98 5.65 3.10
5.04 4.62 4.51 4.34 4.16 4.64 5.12 3.71 4.64 5.59
B
C
SOLUTION
(a)
Median = 4.635.
(b)
(c)
An article in the Journal of Aircraft (1988) described the computation of drag coefficients for the
NASA 0012 airfoil. Different computational algorithms were used at with the
following results (drag coefficients are in units of drag counts; that is, one count is equivalent to
a drag coefficient of 0.0001): 79, 100, 74, 83, 81, 85, 82, 80, and 84.
(a) Find the median and the upper and lower quartiles of the
B
C
A
B
C
D
(c) Find the median and the upper and lower quartiles.
SOLUTION
(a)
(b)
The net energy consumption (in billions of kilowatt-hours) for countries in Asia in 2003 was as
follows.
Billions of Kilowatt-Hours
Afghanistan 1.04
Australia 200.66
Bangladesh 16.20
Burma 6.88
China 1671.23
Hong Kong 38.43
India 519.04
Indonesia 101.80
Japan 946.27
Korea, North 17.43
Korea, South 303.33
Laos 3.30
Malaysia 73.63
Mongolia 2.91
Nepal 2.30
New Zealand 37.03
Pakistan 71.54
Philippines 44.48
Singapore 30.89
Sri Lanka 6.80
Taiwan 154.34
Thailand 107.34
Vietnam 36.92
Total 4393.8
A
B
C
D
How does the boxplot compare in interpretive value to the original stem-and-leaf diagram?
SOLUTION
The boxplot shows the same basic information as the stem-and-leaf plot but in a different format.
Reserve Problems Chapter 6 Section 4 Problem 9
680 669 719 699 670 710 722 663 658 634 720 690
677 669 700 718 690 681 702 696 692 690 694 660
679 675 701 721 683 735 688 763 672 698 659 704
681 679 691 683 705 746 706 649 668 672 690 724
652 720 660 695 701 724 668 698 668 660 680 739
717 727 653 637 660 693 679 682 724 642 704 695
704 652 664 702 661 720 695 670 656 718 660 648
683 723 710 680 684 705 681 748 697 703 660 722
662 644 683 695 678 674 656 667 683 691 680 685
681 715 665 676 665 675 655 659 720 675 697 663
A
B
C
D
How does the boxplot compare in interpretive value to the original stem-and-leaf diagram?
SOLUTION
The boxplot shows the same basic information as the stem-and-leaf plot but in a different format.
B
C
(b) Comment on the suspicion that rating is related to yards per attempt.
(c) What is the simple correlation coefficient between these two variables?
SOLUTION
(a)
(b)
(c)
B
C
(b) Comment on the widely held belief that price is related to taxes paid.
(c) What is the simple correlation coefficient between these two variables?
SOLUTION
(a)
(b)
(c)
The following data are the numbers of cakes sold by a bakery on different days: 234, 246, 214,
267, 253, 220, 259, 264, 240, 251, 230, 247, 255.
B
C
Does it seem reasonable to assume that the number of cakes sold is normally distributed?
SOLUTION
According to the normal probability plot, it seems reasonable to assume that the number of cakes
is normally distributed.
The following data are the measurements of October rainfall (mm) in Kiev from 2010 to 2016:
44.9, 78.8, 50.6, 14.2, 21.3, 39, 103.1
A
B
C
D
SOLUTION
According to the normal probability plot, it seems reasonable to assume that the rainfall is
normally distributed.
B
C
Does it seem reasonable to assume that the duration of treatment is normally distributed?
SOLUTION
According to the normal probability plot, it seems reasonable to assume that the duration of
treatment is normally distributed. However, the distribution seems to have a positive (or right)
skew.
Reserve Problems Chapter 6 Section 7 Problem 4
The following data are the results of 20 measurements of patient’s systolic blood pressure during
the period of illness: 98, 160, 136, 128, 130, 114, 123, 134, 128, 107, 123, 125, 129, 132, 154,
115, 126, 132, 136, 130.
A
B
C
D
Does it seem reasonable to assume that the systolic blood pressure is normally distributed?
SOLUTION
According to the normal probability plot, it seems reasonable to assume that the systolic blood
pressure is normally distributed. However, the distribution seems to be heavy-tailed.
Reserve Problems Chapter 6 Section 7 Problem 5
The following data are the joint temperatures of the O-rings (°F) for each test firing or actual
launch of the space shuttle rocket motor (from Presidential Commission on the Space Shuttle
Challenger Accident, Vol. 1, pp. 129–131): 84, 49, 61, 40, 83, 67, 45, 66, 70, 69, 80, 58, 68, 60,
67, 72, 73, 70, 57, 63, 70, 78, 52, 67, 53, 67, 75, 61, 70, 81, 76, 79, 75, 76, 58, 31.
Select the correct normal probability plot of the O-ring joint temperature data.
B
C
Does it seem reasonable to assume that O-ring joint temperature is normally distributed?
SOLUTION
Yes, O-ring joint temperature is approximately normally distributed with some departures from
the line at the ends of the distribution.
The National Oceanic and Atmospheric Administration provided the monthly absolute estimates
of global (land and ocean combined) temperature index (degrees C) from 2000. See the table
below. Construct and interpret either a digidot plot or a separate stem-and-leaf and time series
plot of these data.
In each year the temperature has a _____ distribution. In each year, the temperature ______ until
the mid year and then it starts to ______.
SOLUTION
5 12 23444
29 12 555566666666677777888999
42 13 1222233344444
48 13 555566
(11) 14 12222333344
58 14 55566667
50 15 22333334444
39 15 5555566679
29 16 00000011111112222222333334444
An article in Quality Engineering (1992, Vol. 4, pp. 487–495) presents viscosity data from a
batch chemical process. A sample of these data is in the table below.
Reading left to right and up to down, draw a time series plot of all the data and comment on any
features of the data that are revealed by this plot.
Consider the notion that the first 40 observations (the first 4 columns) were generated from a
specific process, whereas the last 40 observations were generated from a different process.
Does the plot indicate that the two processes generate similar results?
Calculate the sample mean and sample variance of the first 40 and the second 40 observations
SOLUTION
No, the mean of the second set of 40 data points appears to be slightly greater
than the mean of the first set of 40.
14.875
The first 40 observations:
0.899
There is a slight difference in the mean levels and the sample variances.
An article in Quality Engineering (1992, Vol. 4, pp. 487–495) presents viscosity data from a
batch chemical process. A sample of these data is in the table below.
Use the data above to calculate the descriptive statistics for two groups of observations: the first
40 observations (the first 4 columns) and the last 40.
The median of the first set of observations is _____ the median of the second set. The ___ half of
the data seems to be concentrated a little more tightly. ____ data point(s) appear(s) as outliers in
the first half of the data, ____ data point(s) appear(s) as outliers in the second half of the data.
SOLUTION
The data shown in the table are monthly champagne sales in France (1962–1969) in thousands of
bottles.
Construct a time series plot of the data and comment on any features of the data that reveals by
this plot.
There appears to be a ____ variation in the data with the highest value of the year _____. The
high values are during the _______.
Speculate on how you would use a graphical procedure to forecast monthly champagne sales for
the year 1970.
SOLUTION
There appears to be a cyclic variation in the data with the high value of the cycle generally
increasing. The high values are during the winter holiday months.
We might draw another cycle, with the peak similar to the last year’s data (1969) at about 12.7
thousand bottles.
112 33 2 02 01
Fill the cells of the stem-and-leaf plot of the data with the number of zeros in the sequences. Leaf
unit is equal to 0.10.
Construct a time series plot of the data. Is there evidence that there was an increase or decrease
in the number of errors in a string?
SOLUTION
Stem-and-leaf of errors N = 20
Leaf Unit = 0.10
3 0 000
9 1 000000
9 2 0000
7 3 000000
140
Sample Average
3.5
2.5
errors
1.5
0.5
0
0 5 10 15 20
Index
The time series plot indicates a slight decrease in the number of errors for strings 16 - 20.
The January 1990 issue of Arizona Trend contains a supplement describing the 12 “best” golf
courses in the state. The yardages (lengths) of these courses are as follows: 6981, 7099, 6930,
6992, 7518, 7100, 6935, 7518, 7013, 6800, 7041, and 6890.
Use the data above to calculate the descriptive statistics for the golf course yardage data.
Construct a box plot of the yardages and fill the blanks in the interpretation of the plot.
The golf course yardage data appear to be _____. Also, there ____ outlying data point(s) ____.
SOLUTION
The golf course yardage data appear to be skewed. Also, there is an outlying data point (double)
above 7500 yards.
An article in Quality Engineering (1992, Vol. 4, pp. 487–495) presents viscosity data from a
batch chemical process. A sample of these data is in the table below.
13.3 14.3 14.9 15.2 15.8 14.2 16.0 14.0
14.5 16.1 13.7 15.2 13.7 16.9 14.9 14.4
15.3 13.1 15.2 15.9 15.1 14.9 13.6 13.7
15.3 15.5 14.5 16.5 13.4 15.2 15.3 13.8
14.3 12.6 15.3 14.8 14.1 14.4 14.3 15.6
14.8 14.6 15.6 15.1 14.8 15.2 15.6 14.5
15.2 14.3 15.8 17.0 14.3 14.6 16.1 12.8
14.5 15.4 13.3 14.9 14.3 16.4 13.9 16.1
14.6 15.2 14.1 14.8 16.4 14.2 15.2 16.6
14.1 16.8 15.4 14.0 16.9 15.7 14.4 15.6
Construct normal probability plots for two groups of the data: the first 40 observations (the first
4 columns) and the last 40 observations. Construct both plots on the same axes. What tentative
conclusions can you draw?
Both sets of data appear ____ normally distributed and with roughly the same ____ value. The
difference in slopes for the two lines indicates that a change in ____ might have occurred. This
could have been the result of a change in processing conditions, the quality of the raw material or
some other factor.
SOLUTION
Both sets of data appear to be normally distributed and with roughly the same mean value. The
difference in slopes for the two lines indicates that a change in variance might have occurred.
This could have been the result of a change in processing conditions, the quality of the raw
material or some other factor.
The following data are the temperatures of effluent at discharge from a sewage treatment facility
on consecutive days:
43 47 51 48 52 50 46 49
45 52 46 51 44 49 46 51
49 45 44 50 48 50 49 50
Construct a normal probability plot of the data. Based on the plot, what tentative conclusions can
you draw?
A normal distribution is _____ for these data. There _____ repeated values in the data so the plot
_____ off the line.
SOLUTION
A normal distribution is reasonable for these data. There are some repeated values in the data
that cause some points to fall off the line.
Although we do not have sufficient data points to really see a pattern, there seem to ____
significant deviations from normality for either sample. The ____ difference in slopes indicates
that the _____ of the populations are very _______.
SOLUTION
Although we do not have sufficient data points to really see a pattern, there seem to be no
significant deviations from normality for either sample. The large difference in slopes indicates
that the variances of the populations are very different.
The United States Golf Association tests golf balls to ensure that they conform to the rules of
golf. Balls are tested for weight, diameter, roundness, and overall distance. The overall distance
test is conducted by hitting balls with a driver swung by a mechanical device nicknamed “Iron
Byron” after the legendary great Byron Nelson, whose swing the machine is said to emulate.
Following are 100 distances (in yards) achieved by a particular brand of golf ball in the overall
distance test.
Use the data above to calculate the descriptive statistics for the yardage distance.
Construct a box plot of the yardage distance and complete the interpretation of the plot. How
does the box plot compare in interpretive value to the original stem-and-leaf diagram?
The plot indicates that most balls (between first and third quartiles) will fall somewhere in the
range from ____ to _____. This type of information _____ be obtained from the stem-and-leaf
graph.
SOLUTION
First quartile
= 251.85
Third quartile
= 271.95
The plot indicates that most balls will fall somewhere in the 250-275 range. The same type of
information could have been obtained from the stem-and-leaf graph.
Chest pain 8
Difficulty breathing 7
Numbness in extremities 3
Broken bones 11
Abrasions 16
Cuts 21
Stab wounds 9
Gunshot wounds 4
Blunt force trauma 10
Fainting, loss of consciousness 4
Other 10
Calculate the numerical summaries of these data and fill the table.
SOLUTION
Sort the categories by the number of instances in each category. Bars are used to indicate the counts and this sorted bar chart is known as a Pareto chart
(discussed in Chapter 15).
The force needed to remove the cap from a medicine bottle is an important feature of the product
because requiring too much force may cause difficulty for elderly patients or patients with
arthritis or similar conditions. The data below presents the results of testing a sample of 68 caps
attached to bottles for the force (in pounds) required for removing the cap.
14 18 27 24 24 28 22 21 16
17 22 16 16 18 30 16 14 15
25 15 16 15 15 19 19 10 22
17 15 17 20 17 20 15 17 20
24 27 17 32 31 27 21 21 26
31 34 32 24 16 37 36 34 20
19 21 14 14 19 15 30 24 15
17 17 21 34 24
Stem-and-leaf of Force N = 68
Leaf Unit = 1.0
1 10
1 1
13 1 444455555555
27 1 66666677777777
33 1 889999
(9) 2 000011111
26 2 222
23 2 4444445
16 2 6777
12 2 8
11 3 0011
7 3 22
5 3 444
2 3 67
What are the average and the standard deviation of the force?
If the upper specification on required force is 30 pounds, what proportion of the caps do not meet
this requirement? Round your answer to three decimal places (eg. 98.765).
What proportion of the caps exceeds the average force plus 2 standard deviations? Round your
answer to three decimal places (eg. 98.765).
Suppose that the first 36 observations in the table come from one machine and the remaining
come from a second machine (read across the rows and the down). Construct box plots of the
data and calculate the descriptive statistics for two groups of observations. Round your answers
to two decimal places (eg. 98.76).
Plot the first 36 observations in the table on a normal probability plot and the remaining
observations on another normal probability plot. Compare the results with the single normal
probability plot constructed for all of the data.
A separate normal distribution for each group of caps fits the data ____ the distribution for all of
the data. This is to be expected when the mean and standard deviations of the groups ____ as
they do here.
SOLUTION
The sample mean is 21.265 and the sample standard deviation is 6.422.
There is a number of repeated values for force that are seen as points stacked vertically on the
plot. This is probably due to rounding of the force value to two digits. There are fewer lower
force values than are expected from a normal distribution.
From the stem-and-leaf display, 9 caps exceed the force limit of 30. This is 9/68 = 0.132.
The mean plus two standard deviations equals 21.265 + 2(6.422) = 34.109. From the stem-and-
leaf display, 2 caps exceed this force limit. This is 2/68 = 0.029.
Third quartile = 31
In the following box plots Force1 denotes the subset of the first 36 observations and Force2
denotes the remaining observation. The mean and variability of force is greater for the second set
of data in Force2.
Yes.
Plot the first 36 observations in the table on a normal probability plot and the remaining
observations on another normal probability plot. Compare the results with the single normal
probability plot constructed for all of the data.
A separate normal distribution for each group of caps fits the data better. This is to be expected
when the mean and standard deviations of the groups differ as they do here.