0% found this document useful (0 votes)
290 views121 pages

RP ch06

The stem-and-leaf diagram that is constructed correctly based on the data provided is B. It has the correct leaf unit of 0.010 to display the data to one decimal place, as the data is measuring bridge conditions on a scale from 1 to 7 with decimals shown to one place. Diagrams A and C have incorrect leaf units that would not accurately represent the measurement scale of the data.

Uploaded by

alallaq
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
290 views121 pages

RP ch06

The stem-and-leaf diagram that is constructed correctly based on the data provided is B. It has the correct leaf unit of 0.010 to display the data to one decimal place, as the data is measuring bridge conditions on a scale from 1 to 7 with decimals shown to one place. Diagrams A and C have incorrect leaf units that would not accurately represent the measurement scale of the data.

Uploaded by

alallaq
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 121

CHAPTER 6 RESERVE PROBLEMS

The following problems have been reserved for your use in assignments and testing and do not
appear in student versions of the text.

Reserve Problems Chapter 6 Section 1 Problem 1

An independent research was made asking 500 people about their bank deposits. Using the data
in the table, calculate the deposit sample mean and deposit sample standard deviation.

Amount of deposit, $ 1000 3000 3500 7000 15000

Number of depositors 96 107 130 89 78

SOLUTION
First of all, we need to multiple each deposit amount by the number of depositors:

Now calculate the deposit mean:

where , is one of the 500 elements of the sample ( ) and is the


deposit of the i category ( ).

And the deposit variance:

Thus,
Therefore, the deposit sample standard deviation is

Reserve Problems Chapter 6 Section 1 Problem 2

A consignment of 100 half-inch nuts was released from the factory. The diameters of 10
randomly taken nuts were measured. Calculate the sample mean and sample standard deviation
using the data in the table.

Number 1 2 3 4 5 6 7 8 9 10
of nut
Diameter, 0.4983 0.5030 0.4982 0.4986 0.5010 0.5009 0.4984 0.4993 0.5014 0.5010
inch

SOLUTION

Sample average:

Sample variance:

Sample standard deviation:

Reserve Problems Chapter 6 Section 1 Problem 3

During the second part of April Chicago Bulls had 6 games with Boston Celtics. The number of
points, scored in each game is in the table. Calculate the sample mean and sample standard
deviation using the data in the table.

No 1 2 3 4 5 6
Number of 90 95 87 95 97 83
points

SOLUTION

Sample average:

Sample variance:

Sample standard deviation:

Reserve Problems Chapter 6 Section 1 Problem 4

An independent research was made in a dog shelter to measure the heights of dogs of different
breeds. Eight dogs were evaluated for this experiment. The results are shown in the table below.
Calculate the sample mean and sample standard deviation using the data in the table.

Number 1 2 3 4 5 6 7 8
of dog
Dog’s 0.75 0.15 0.35 0.60 0.62 0.58 0.75 0.76
height, m

SOLUTION

Sample average:

Sample variance:
Sample standard deviation:

Reserve Problems Chapter 6 Section 1 Problem 5

Suppose that you add 2 to all observations in a sample. How does this change the sample mean?
How does it change the sample standard deviation?

SOLUTION

Let be the original observations, and be the shifted ones, then

Reserve Problems Chapter 6 Section 1 Problem 6

The United States has an aging infrastructure as witnessed by several recent disasters, including
the I-35 bridge failure in Minnesota. Most states inspect their bridges regularly and report their
condition (on a scale from 1–7) to the public.

Here are the condition numbers from a sample of 30 bridges

5.08 5.44 6.66 5.07 6.04 5.43 4.83 4.00 4.41 3.61
7.00 5.72 3.48 6.43 3.97 4.19 6.26 5.27 5.26 5.48
4.95 5.76 4.93 5.61 4.66 7.00 5.57 3.42 5.91 4.54
Find the sample mean and the sample standard deviation of these condition numbers.

SOLUTION

Sample mean:

Sample variance:
Sample standard deviation:

Reserve Problems Chapter 6 Section 1 Problem 7

In an attempt to measure the effects of acid rain, researchers measured the pH (7 is neutral and
values below 7 are acidic) of water collected from rain in Ingham County, Michigan.

5.47 5.37 5.38 4.63 4.34 5.02 3.71 4.96 4.64 5.94
5.54 5.39 4.16 5.62 4.57 5.90 5.48 4.57 4.57 4.51
4.62 4.56 4.61 4.32 3.98 5.56 4.15 3.98 5.65 3.10
5.04 4.62 4.88 4.34 4.16 4.64 5.12 3.71 4.80 5.59

Find the sample mean and sample standard deviation of these measurements.

SOLUTION

Sample mean:

Sample variance:

Sample standard deviation:

Reserve Problems Chapter 6 Section 2 Problem 1

An independent research was made to calculate the mean weight of elder people in Russia. 50
men and women were interviewed for this research. Construct a stem-and-leaf diagram for the
data. Calculate the median and quartiles of these data. All values are in kg.

74 89 84 85 57
81 78 79 77 83
94 82 57 75 82
77 60 79 76 97
99 70 97 57 69
84 92 82 91 85
76 92 76 63 69
58 64 59 89 91
96 85 80 59 58
94 57 79 89 87

SOLUTION

(a) Construct a stem-and-leaf diagram.

Leaf
Stem
77778899
5
03499
6
045666778999
7
012223445557999
8
1122446779
9

Stem: tens digits (kg); Leaf: ones digits (kg).

(b) Find median, q1, q3.


Median: 79.5, q1: 69; q3: 89

Reserve Problems Chapter 6 Section 2 Problem 2

An independent research was made to calculate the mean weight of elder people in Russia. 50
men and women were interviewed for this research. Calculate the sample median, mode, and
mean of the data. Explain how these three measures of location describe different features of the
data. All values are in kg.

74 89 84 85 57
81 78 79 77 83
94 82 57 75 82
77 60 79 76 97
99 70 97 57 69
84 92 82 91 85
76 92 76 63 69
58 64 59 89 91
94 85 80 59 58
94 57 79 89 87

SOLUTION

The median is halfway between 25th and 26th, so it is the average of the 25th and 26th values and it
equals 79.5.

Mode is 57 as the most frequent data value.


Sample mean:

Reserve Problems Chapter 6 Section 2 Problem 3

The table represents the answers of 80 respondents to the survey “How much sports trainings do
you have every year?” carried out among college students. Construct a stem-and-leaf diagram for
the data. Calculate the median and quartiles of these data.

129 157 154 191 192 142 188 126


128 180 190 166 157 147 155 154
200 128 167 143 131 156 153 168
149 144 155 188 149 142 160 149
184 187 169 161 157 134 122 173
188 183 178 148 135 188 187 166
121 177 169 182 158 169 146 173
133 189 183 143 148 121 181 145
189 120 122 189 146 190 128 142
189 131 199 182 197 148 157 140

SOLUTION
(a) Construct a stem-and-leaf diagram.

0112268889
12
11345
13
02223345667888999
14
34455677778
15
016678999
16
3378
17
01223347788889999
18
001279
19
0
20

(b) Find median, q1 and q3.


Median: 157, q1: 143; q3:182.75

Reserve Problems Chapter 6 Section 2 Problem 4

The table represents the answers of 80 respondents to the survey “How much sports trainings do
you have every year?” carried out among college students. Calculate the sample median, mode,
and mean of the data. Explain how these three measures of location describe different features of
the data.

129 157 154 191 192 142 188 126


128 180 190 166 157 147 155 154
200 128 167 143 131 156 153 168
149 144 155 188 149 142 160 149
184 187 169 161 157 134 122 173
188 183 178 148 135 188 187 166
121 177 169 182 158 169 146 173
138 189 183 143 148 121 181 145
189 120 122 189 146 190 128 142
189 131 199 182 197 148 157 140

SOLUTION

The median is halfway between 40th and 41th, so it is the average of the 40th and 41th values and it
equals 157.

Modes are 157, 188 and 189 as the most frequent data value.

Sample mean:

Reserve Problems Chapter 6 Section 2 Problem 5

The United States has an aging infrastructure as witnessed by several recent disasters, including
the I-35 bridge failure in Minnesota. Most states inspect their bridges regularly and report their
condition (on a scale from 1–7) to the public.

Here are the condition numbers from a sample of 30 bridges in New York State:

5.08 5.44 6.66 5.07 6.80 5.43 4.83 4.00 4.41 4.38
7.00 5.72 4.53 6.43 3.97 4.19 6.26 6.72 5.26 5.48
4.95 6.33 4.93 5.61 4.66 7.00 5.57 3.42 5.18 4.54

Choose the correct steam-and-leaf diagram.

N = 30

Leaf Unit = 0.10


1 34
2 39
6 4 0134
A 12 4 556899
(7) 5 0012444
11 5 567
8 6 234
5 6 678
2 7 00
N = 30

Leaf Unit = 0.010


1 34
2 39
6 4 0134
B 12 4 556899
(7) 5 0012444
11 5 567
8 6 234
5 6 678
2 7 00
N = 30

Leaf Unit = 0.10


1 34
2 39
6 4 0134
C 13 4 556899
(5) 5 0012444
11 5 567
8 6 234
5 6 678
2 7 00
N = 30

Leaf Unit = 0.010


1 34
2 39
6 4 0134
D 12 4 556899
(5) 5 0012444
11 5 567
8 6 234
5 6 678
1 7 00

Do any of the bridges appear to have unusually good or poor ratings? If so, compute the mean
with and without these bridges and comment.

SOLUTION
(a) N = 30

Leaf Unit = 0.10

1 34
2 39
6 4 0134
12 4 556899
(7) 5 0012444
11 5 567
8 6 234
5 6 678
2 7 00

(b) One particular bridge has poor rating of 3.4.

The mean before removing:

The mean after removing the bridge mentioned before:

Reserve Problems Chapter 6 Section 2 Problem 6

In an attempt to measure the effects of acid rain, researchers measured the pH (7 is neutral and
values below 7 are acidic) of water collected from rain in Ingham County, Michigan.

5.47 5.37 5.38 4.63 5.37 3.74 3.71 4.96 4.64 5.11
5.65 5.39 4.16 5.62 4.57 4.64 5.48 4.57 4.57 4.51
4.86 4.56 4.61 4.32 3.98 5.70 4.15 3.98 5.65 3.10
5.04 4.62 4.51 4.34 4.16 4.64 5.12 3.71 4.64 5.59

Select the correct stem-and-leaf diagram.

A N=40. Leaf Unit = 0.10

1 3 1
4 3 777
6 3 99
9 4 111
11 4 33
17 4 555555
(7) 4 6666666
16 4 89
14 5 011
11 5 3333
7 5 445
4 5 6667
N=40. Leaf Unit = 0.010

1 31
4 3 777
6 3 99
9 4 111
11 4 33
B 17 4 555555
(7) 4 6666666
16 4 89
14 5 011
11 5 3333
7 5 445
4 5 6667
N=40. Leaf Unit = 0.10

1 31
4 3 777
6 3 99
9 4 111
11 4 33
C 17 4 555555
(6) 4 666666
16 4 89
14 5 011
11 5 3333
7 5 445
4 5 6667
D N=40. Leaf Unit = 0.10

1 3 1
3 3 77
6 3 999
9 4 111
11 4 33
17 4 555555
(6) 4 666666
16 4 89
14 5 0111
10 5 333
7 5 445
4 5 6667

Many scientists consider rain with pH below 5.3 to be acid rain. What percentage of these
samples could be consider as acid rain?

SOLUTION

The number of observations below 5.3 is 17 + 7 + 5 = 29 observations.


Therefore, the percentage of observations which are considered acid rain is 72.5%.

Reserve Problems Chapter 6 Section 2 Problem 7

The following data represent the yield on 90 consecutive batches of ceramic substrate to which a
metal coating has been applied by a vapor-deposition process.
94.1 86.1 95.3 84.9 88.8 84.6 94.4 84.1
93.2 90.4 94.1 78.3 86.4 83.6 96.1 83.7
90.6 89.1 97.8 89.6 85.1 85.4 98.0 82.9
91.4 87.3 93.1 90.3 84.0 89.7 85.4 87.3
88.2 84.1 86.4 93.1 93.7 87.6 86.6 86.4
86.1 90.1 87.6 94.6 87.7 85.1 91.7 84.5
95.1 95.2 94.1 96.3 90.6 89.6 87.5
90.0 86.1 92.1 94.7 89.4 90.0 84.2
92.4 94.3 96.4 91.1 88.6 90.1 85.1
87.3 93.2 88.2 92.4 84.1 94.3 90.5
86.6 86.7 86.4 90.6 82.6 97.3 95.6
91.2 83.0 85.0 89.1 83.1 96.8 88.3

Select the correct stem-and-leaf display for these data.

N=90. Leaf Unit = 1

1 7e|8
7 8t|223333
21 8f|44444444555555
38 8s|66666666667777777
A (11) 8e|88888999999
41 9z|00000000001111
27 9t|22223333
19 9f|444444445555
7 9s|666677
1 9e|8
B N=90. Leaf Unit = 10

1 7e|8
7 8t|223333
21 8f|44444444555555
38 8s|66666666667777777
(13) 8e|88888999999
41 9z|00000000001111
27 9t|22223333
19 9f|444444445555
7 9s|666677
1 9e|8
N=90. Leaf Unit = 1

1 7e|8
6 8t|223333
14 8f|44444444555555
17 8s|66666666667777777
C (11) 8e|88888999999
14 9z|00000000001111
8 9t|22223333
12 9f|444444445555
6 9s|666677
1 9e|8
N=90. Leaf Unit = 10

1 7e|8
7 8t|223333
20 8f|4444444555555
37 8s|6666666667777777
D (11) 8e|88888999999
39 9z|000000000111
27 9t|22223333
18 9f|44444445555
7 9s|666677
1 9e|8

Calculate the median of these data. Calculate the quartiles of these data.

Reserve Problems Chapter 6 Section 2 Problem 8

The net energy consumption (in billions of kilowatt-hours) for countries in Asia in 2003 was as
follows
Billions of Kilowatt-Hours
Afghanistan 1.04
Australia 200.66
Bangladesh 16.20
Burma 6.88
China 1671.23
Hong Kong 38.43
India 519.04
Indonesia 101.80
Japan 946.27
Korea, North 17.43
Korea, South 303.33
Laos 3.30
Malaysia 73.63
Mongolia 2.91
Nepal 2.30
New Zealand 37.03
Pakistan 71.54
Philippines 44.48
Singapore 30.89
Sri Lanka 6.80
Taiwan 154.34
Thailand 107.34
Vietnam 36.92
Total 4393.8

Select the correct stem-and-leaf display of these data.

N=23, Leaf Unit = 100

(18) 0|000000000000000111
A 5 0|23
3 0|5
2 0|9
1 1|6
N=24, Leaf Unit = 100

(18) 0|000000000000000111
6 0|23
B 4 0|5
3 0|9
2 1|6
1 4|3
N=23, Leaf Unit = 100

(17) 0|00000000000000111
C 5 0|23
3 0|5
2 0|9
1 1|6
D N=24, Leaf Unit = 100

(19) 0|0000000000000000111
5 0|23
3 0|5
2 0|9
1 1|6

Compute the sample mean, sample standard deviation, and sample median:

SOLUTION

Do not use the total as an observation. There are 23 observations. (N=23, Leaf Unit = 100)
Sample mean:

Rounded to the nearest integer,


Sample variance:

Sample standard deviation:

Sample median is at 12th = 38.43, or, rounded to the nearest integer, M = 38 bln kW-h.

Reserve Problems Chapter 6 Section 2 Problem 9

The United States Golf Association tests golf balls to ensure that they conform to the rules of
golf. Balls are tested for weight, diameter, roundness, and overall distance. The overall distance
test is conducted by hitting balls with a driver swung by a mechanical device nicknamed “Iron
Byron” after the legendary great Byron Nelson, whose swing the machine is said to emulate.
Following are 100 distances (in yards) achieved by a particular brand of golf ball in the overall
distance test.

261.3 259.4 265.7 270.6 274.2 261.4 254.5 283.7


258.1 270.5 255.1 268.9 267.4 253.6 234.3 263.2
254.2 270.7 233.7 263.5 244.5 251.8 259.5 257.5
257.7 272.6 253.7 262.2 252.0 280.3 274.9 233.7
237.9 274.0 264.5 244.8 264.0 268.3 272.1 260.2
255.8 260.7 245.5 279.6 237.8 278.5 273.3 263.7
241.4 260.6 230.3 272.7 261.0 260.0 279.3 252.1
244.3 272.2 248.3 278.7 236.0 271.2 279.8 245.6
241.2 251.1 267.0 273.4 247.7 254.8 272.8 270.5
254.4 232.1 271.5 242.9 273.6 256.1 251.6
256.8 273.0 240.8 276.6 264.5 264.5 226.8
255.3 266.6 250.2 255.8 285.3 255.4 240.5
255.0 273.2 251.4 276.1 277.8 266.8 268.5
Select the correct stem-and-leaf diagram for these data.

N=100, Leaf Unit = 1

1 22|6
5 23|2334
8 23|677
16 24|00112444
20 24|5578
A 33 25|0111122334444
46 25|5555556677899
(15) 26|000011123334444
39 26|56677888
31 27|0000112222233333444
12 27|66788999
4 28|003
1 28|5
N=100, Leaf Unit = 1

1 22|6
5 23|2334
8 23|677
16 24|00112444
20 24|5578
B 33 25|0111122334444
46 25|5555556677899
(14) 26|00011123334444
39 26|56677888
31 27|0000112222233333444
12 27|66788999
4 28|003
1 28|5
C N=100, Leaf Unit = 1

1 22|6
5 23|2334
8 23|677
16 24|00112444
20 24|5578
33 25|0111122334444
46 25|5555556677899
(16) 26|0000111123334444
39 26|56677888
31 27|0000112222233333444
12 27|66788999
4 28|003
1 28|5
N=100, Leaf Unit = 1

1 22|6
5 23|2334
8 23|677
16 24|00112444
20 24|5578
D 34 25|01111122334444
45 25|555556677899
(15) 26|000011123334444
40 26|566778888
30 27|000011222233333444
12 27|66788999
4 28|003
1 28|5

Compute the sample mean, sample standard deviation, and the sample median.
What is the 90th percentile of distances?

SOLUTION

Sample mean:

(yards)
Sample standard deviation:

(yards)
Sample median: M = 260.85 yards

90th percentile is 277.2 yards.

Reserve Problems Chapter 6 Section 2 Problem 10

A semiconductor manufacturer produces devices used as central processing units in personal


computers. The speed of the devices (in megahertz) is important because it determines the price
that the manufacturer can charge for the devices. The following table contains measurements on
120 devices.
680 669 719 699 670 710 722 663 658 634 720 690
677 669 700 718 690 681 702 696 692 690 694 660
649 675 701 721 683 735 688 763 672 698 659 704
681 679 691 683 705 746 706 649 668 672 690 724
652 720 660 695 701 724 668 698 668 660 680 739
717 727 653 637 660 693 679 682 724 642 704 695
704 652 664 702 661 720 695 670 656 718 660 648
683 723 710 680 684 705 681 748 697 703 660 722
662 644 683 695 678 674 656 667 683 691 680 685
681 715 665 676 665 675 655 659 720 675 697 663

Select the correct stem-and-leaf diagram for these data

N=120, Leaf Unit = 1.0

2 63|47
7 64|24899
16 65|223566899
35 66|0000001233455788899
48 67|002245567899
A (17) 68|00001111233333458
55 69|0000112345555677889
36 70|011223444556
24 71|0057889
17 72|000012234447
5 73|59
3 74|68
1 76|3
N=120, Leaf Unit = 1.0

2 63|47
7 64|24899
16 65|223566899
35 66|0000001233455788899
48 67|002245567899
B (19) 68|0000111112333333458
55 69|0000112345555677889
36 70|011223444556
24 71|0057889
17 72|000012234447
5 73|59
3 74|68
1 76|3
C N=120, Leaf Unit = 1.0
2 63|47
7 64|24899
16 65|223566899
34 66|000001233455788899
48 67|002245567899
(17) 68|00001111233333458
55 69|0000112345555677889
36 70|011223444556
24 71|0057889
16 72|00012234447
5 73|59
3 74|68
1 76|3
N=120, Leaf Unit = 10

2 63|47
7 64|24899
16 65|223566899
35 66|0000001233455788899
48 67|002245567899
D (17) 68|00001111233333458
55 69|0000112345555677889
36 70|011223444556
24 71|0057889
17 72|000012234447
5 73|59
3 74|68
1 76|3

Compute the sample mean, the sample standard deviation, and the sample median.

What percentage of the devices has a speed exceeding 700 MHz?

SOLUTION

Sample mean:

(MHz)
Sample standard deviation:
(MHz)
Sample median M = 683 MHz

exceed 700 MHz.

Reserve Problems Chapter 6 Section 3 Problem 1

The following times (in seconds) were captured for athletes running the 100 meter race.

12.1 13.2 11.8 14.7 15.2 16.4 14.9 14.8 14.8 14.7
13.2 15.1 16.2 11.9 11.7 14.0 15.0 15.1 17.0 16.2
14.2 14.6 14.6 14.8 15.1 15.3 13.6 14.2 16.2 12.2
16.0 12.7 12.9 13.7 16.1 14.4 16.7 12.5 13.4 12.9

Choose the correct histogram for the data.

A
B

D
E

SOLUTION

Let mark the histograms with letters as follows.

In the data provided we can observe seven results between 16.0 and 17.0 seconds
, and one result of 17.0 seconds. It allows us to exclude diagrams (b), (c), (d),
and (e).

So, the correct answer is (a). Try to build the histogram yourself to check it out.

Reserve Problems Chapter 6 Section 3 Problem 2

In which histogram do mode, median, and mean most likely coincide?

A
B

D
E
SOLUTION

Let mark the histograms with letters as follows.


Take a look at the distributions represented by histograms (b) and (c). They are positive and negative skewed, respectively.

Now, one can notice that in histogram (d) mean is greater, than the median, due to lots of values in the box just right to the middle one.

The distribution presented on histogram (e) has two different modes, but it cannot have two means, so this option is not correct as well.

So, the correct answer is (a).

Reserve Problems Chapter 6 Section 3 Problem 3

The following table contains frequency and relative frequency distribution for resistance of different specimens. X is the resistance measured in Ohms.

Class
Frequency 5 6 12 22 17 12 6
Relative
frequency 0.0625 0.0750 0.1500 0.2750 0.2125 0.1500 0.0750

Determine the cumulative relative frequency for the specimens with the resistance between 250 and 300 Ohms.

SOLUTION

First, calculate the cumulative relative frequency for the specimens with the resistance between 250 and 300 – it equals the sum of all relative
frequencies for speciments with the resistance less than 300 Ohms.

Therefore, .
Reserve Problems Chapter 6 Section 3 Problem 4

According to the data, obtained from the daily temperature observation, we obtained the
following information, shown in the table.

Temperature,
Frequency
°C
12.1 7
13.2 9
15.1 12
15.6 15
15.7 17
16.0 31
16.3 22
16.9 42
17.1 51
17.7 44
18.2 42
18.6 27
18.8 32

19.0 13
19.1 4

Choose the correct histogram for the data.

A
B

D
E

SOLUTION

Let mark the histograms with letters as follows.

(a)
(b)

(c)

(d)
(e)

First, consider histogram (b). It shows about 60 observations between 15.5 and 16.5 degrees, but
the sum of frequencies from 15.6°C to 16.3°C is 85, so the histogram is incorrect.

Histogram (c) has an outstanding value above 20°C, and histogram (d) has one below 10°C, so
they do not correspond to our data as well.

The sum of frequencies of values between 16.5°C and 17.5°C is 93, but histogram (e) represents
less than 60.

So, the correct answer is (a).

Reserve Problems Chapter 6 Section 3 Problem 5

The following data represent the yield on 90 consecutive batches for ceramic substrate to which
a metal coating has been applied by a vapor-deposition process.

94.1 86.1 95.3 84.9 88.8 84.6 94.4 84.1


93.2 90.4 94.1 78.3 86.4 83.6 96.1 83.7
90.6 89.1 97.8 89.6 85.1 85.4 98.0 82.9
91.4 87.3 93.1 90.3 84.0 89.7 85.4 87.3
88.2 84.1 86.4 93.1 93.7 87.6 86.6 86.4
86.1 90.1 87.6 94.6 87.7 85.1 91.7 84.5
95.1 95.2 94.1 96.3 90.6 89.6 87.5
90.0 86.1 92.1 94.7 89.4 90.0 84.2
92.4 94.3 96.4 91.1 88.6 90.1 85.1
87.3 93.2 88.2 92.4 84.1 94.3 90.5
86.6 86.7 86.4 90.6 82.6 97.3 95.6
91.2 83.0 85.0 89.1 83.1 96.8 88.3

Construct a frequence distribution for these data. Use 9 bins.


Select the correct histogram.

B
C

E
SOLUTION

Class
Frequency 1 0 11 19 14

Class
Frequency 18 11 12 4

Reserve Problems Chapter 6 Section 3 Problem 6

The net energy consumption (in billions of kilowatt-hours) for countries in Asia in 2003 was as
follows.

Billions of Kilowatt-Hours
Afghanistan 1.04
Australia 200.66
Bangladesh 16.20
Burma 6.88
China 1671.23
Hong Kong 38.43
India 519.04
Indonesia 101.80
Japan 946.27
Korea, North 17.43
Korea, South 303.33
Laos 3.30
Malaysia 73.63
Mongolia 2.91
Nepal 2.30
New Zealand 37.03
Pakistan 71.54
Philippines 44.48
Singapore 30.89
Sri Lanka 6.80
Taiwan 154.34
Thailand 107.34
Vietnam 36.92
Total 4393.8

Construct a frequence distribution for these data. Use 9 bins.

Select the correct histogram.


A

B
C

D
E

SOLUTION

Class
Frequency 15 4 1 1 0

Class
Frequency 1 0 0 1

Reserve Problems Chapter 6 Section 3 Problem 7

The united States Golf Association tests golf balls to ensure that they conform to the rules of
golf. Balls are tested for weight, diameter, roundness and overall distance. The overall distance
test is conducted by hitting balls with a driver swung by a mechanical device nicknamed "Iron
Byron" after the legendary great Byron Nelson, whose swing the machine is said to emulate.
Following are 100 distances (in yards) achieved by a particular brand of golf ball in the overall
distance test.

261.3 259.4 265.7 270.6 274.2 261.4 254.5 283.7


258.1 270.5 255.1 268.9 267.4 253.6 234.3 263.2
254.2 270.7 233.7 263.5 244.5 251.8 259.5 257.5
257.7 272.6 253.7 262.2 252.0 280.3 274.9 233.7
237.9 274.0 264.5 244.8 264.0 268.3 272.1 260.2
255.8 260.7 245.5 279.6 237.8 278.5 273.3 263.7
241.4 260.6 280.3 272.7 261.0 260.0 279.3 252.1
244.3 272.2 248.3 278.7 236.0 271.2 279.8 245.6
241.2 251.1 267.0 273.4 247.7 254.8 272.8 270.5
254.4 232.1 271.5 242.9 273.6 256.1 251.6
256.8 273.0 240.8 276.6 264.5 264.5 226.8
255.3 266.6 250.2 255.8 285.3 255.4 240.5
255.0 273.2 251.4 276.1 277.8 266.8 268.5

Construct a frequence distribution for these data. Use 13 bins.

Select the correct histogram.

A
B

C
D

Does it convey the same information as the prsented stem-and-leaf display for this data?

SOLUTION

Class Frequency
1
1
4
6
6
9
14
13
12
11
13
8
2
Stem-and-leaf display. Yard: unit = 1.0

1 22 | 6

5 23 | 2334

8 23 | 677

16 24 | 00112444

20 24 | 5578

33 25 | 0111122334444

46 25 | 5555556677899

(15) 26 | 000011123334444

39 26 | 56677888

31 27 | 0000112222233333444

12 27 | 66788999

4 28 | 003

1 28 | 5

Yes, the histogram shows the same shape as the stem-and-leaf display.

Reserve Problems Chapter 6 Section 3 Problem 8

A semiconductor manufacturer produces devices used as central processing units in personal


computers. The speed of the devices (in megahertz) is important because it determines the price
that the manufacturer can charge for the devices. The following table contains measurements on
120 devices.

680 669 719 699 670 710 722 663 658 634 720 690
677 669 700 718 690 681 702 696 692 690 694 660
679 675 701 721 683 735 688 763 672 698 659 704
681 679 691 683 705 746 706 649 668 672 690 724
652 720 660 695 701 724 668 698 668 660 680 739
717 727 653 637 660 693 679 682 724 642 704 695
704 652 664 702 661 720 695 670 656 718 660 648
683 723 710 680 684 705 681 748 697 703 660 722
662 644 683 695 678 674 656 667 683 691 680 685
681 715 665 676 665 675 655 659 720 675 697 663
Construct a frequence distribution for these data. Use 14 bins.

Select the correct histogram.

B
C

Does it convey the same information as the prsented stem-and-leaf display for this data?

SOLUTION

Class Frequency
1
3
5
17
13
24
11
19
5
16
1
2
2
1

Stem-and-leaf of speed (in megahertz) N = 120

Leaf Unit = 1.0 63|4 represents 634 megahertz

2 63|47

7 64|24899

16 65|223566899

35 66|0000001233455788899

48 67|0022455567899

(17) 68|00001111233333458

55 69|0000112345555677889

36 70|011223444556

24 71|0057889

17 72|000012234447

5 73|59

3 74|68

1 75|

1 76|3
A Yes, the histogram shows the same shape as the stem-and-leaf display.

Reserve Problems Chapter 6 Section 3 Problem 9

The United States has an aging infrastructure as witnessed by several resent disasters, including
the I-35 bridge failure in Minnesota. Most states inspect their bridges regularly and report their
condition (on a scale from 1-7) to the public. Here are condition numbers from a sample of 30
bridges in New York State:

5.08 5.44 6.66 5.07 6.80 5.43 4.83 4.00 4.41 4.38
7.00 5.72 4.53 6.43 3.97 4.19 6.26 6.72 5.26 5.48
4.95 6.33 4.93 5.61 4.66 7.00 5.57 3.42 5.18 4.54

Construct a frequence distribution for these data. Use 5 bins.

Select the correct histogram.

A
B

C
D

SOLUTION

Class Frequency
1
5
13
7
4

Reserve Problems Chapter 6 Section 3 Problem 10

In an attempt to measure the effects of acid rain, researchers measured the pH (7 is neutral and
values below 7 are acidic) of water collected from rain in Ingham County, Michigan.

5.47 5.37 5.38 4.63 5.37 3.74 3.71 4.96 4.64 5.11
5.65 5.39 4.16 5.62 4.57 4.64 5.48 4.57 4.57 4.51
4.86 4.56 4.61 4.32 3.98 5.70 4.15 3.98 5.65 3.10
5.04 4.62 4.51 4.34 4.16 4.64 5.12 3.71 4.64 5.59

Construct a frequence distribution for these data. Use 6 bins.

Select the correct histogram.


A

B
C

D
E

SOLUTION

Class Frequency
1
5
5
15
9
5
Reserve Problems Chapter 6 Section 3 Problem 11

Cloud seeding, a process in which chemicals such as silver iodide and frozen carbon dioxide are
introduced by aircraft into clouds to promote rainfall was widely used in the 20-th century.
Recent research has questioned its effectiveness [Journal of Atmospheric Research (2010, Vol.
97 (2), pp. 513-525)]. An experiment was performed by randomly assigning 52 clouds to be
seeded or not. The amount of rain generated was then measured in acre-feet. Here are the data
for the unseeded and seeded clouds:

Unseeded:

81.2 26.1 95.0 41.1 28.6 21.7 11.5 68.5 345.5 321.2
1202.6 1.0 4.9 163.0 372.4 244.3 47.3 87.0 26.3 24.4
830.1 4.9 36.6 147.8 17.3 29.0

Seeded:

274.7 302.8 242.5 255.0 17.5 115.3 31.4 703.4 334.1


1697.8 118.3 198.6 129.6 274.7 119.0 1656.0 7.7 430.0
40.6 92.4 200.7 32.7 4.1 978.0 489.1 2745.6

Construct a frequence distribution for the combined cloud-seeding rain measurements. Use 6
bins.

Select the correct histogram.


A

B
C

SOLUTION

Class Frequency
35
11
3
2
0
1

Reserve Problems Chapter 6 Section 3 Problem 12

In the 2000 Sydney Olympics, a special program initiated by IOC president Juan Antonio
Samaranch allowed developing countries to send athletes to the Olympics without the usual
qualifying procedure. Here are the 71 times for the first round of the 100 meter men's swim (in
seconds).

60.39 49.93 53.40 51.32 50.46 51.34 50.28 50.19 52.14


50.56 52.72 50.95 49.74 49.16 52.57 52.53 52.09 52.40
49.75 54.06 53.50 50.63 51.93 51.62 52.58 53.55 51.07
49.76 49.73 50.90 59.26 49.29 52.78 112.72 49.79 49.83
52.43 51.28 52.22 49.76 49.70 52.90 50.19 54.33 62.45
51.93 52.24 52.82 50.96 48.64 51.11 50.87 52.18 54.12
50.49 49.84 52.91 52.52 50.32 51.52 52.0 52.35 52.24
49.45 51.28 49.09 58.79 49.74 49.32 50.62 49.45

Construct a frequence distribution for this data. Use 9 bins.

Select the correct histogram.


A

B
C

SOLUTION

Class Frequency
41
27
2
0
0
0
0
0
1

Reserve Problems Chapter 6 Section 4 Problem 1

The driver's reaction time in response to a particular potential traffic hazard is the time required
from the point of initial detection of the hazard in one’s field of view to the time that vehicle
control components are actuated (such as movement of one’s foot to the brake pedal). The
following data represents the measurements of the driver’s reaction time in seconds for the male
and female participants of the experiment:

0.50 0.49 0.45 0.54 0.44 0.49 0.51 0.48


0.51 0.53 0.51 0.63 0.69 0.76 0.76 0.85

(a) Calculate the sample mean of the data. Calculate the sample variance of the data. Calculate
the corresponding standard deviation.

(b) Find the median and quartiles for the data.


(c) Construct a box plot of the data. Find the interquartile range and determine the number of
outliers.

SOLUTION

(a) Sample mean:

Sample variance:

Sample standard deviation:

(b)

(c)

Therefore, there is no outliers and the box plot is


Reserve Problems Chapter 6 Section 4 Problem 2

The weather forecast for New York for the period from 1 May 2017 to 15 May 2017 represents
the following set of maximal temperatures in °F: 75, 71, 61, 59, 64, 61, 62, 59, 62, 65, 65, 65, 68,
72, 72.

(a) Calculate the sample mean, sample variance, and standard deviation.

(b) Find the median and quartiles for the data.

(c) Construct a box plot of the data. Find the interquartile range and determine the number of
outliers.

SOLUTION

(a) Sample mean:

Sample variance:
Sample standard deviation:

(b)

(c)

Therefore, there is no outliers and the box plot is

Reserve Problems Chapter 6 Section 4 Problem 3


The triathlon is a multiple-stage competition involving the completion of three disciplines, in its
most popular form, swimming, cycling and running. The following data represents the results of
2017 Hamilton CAMTRI Sprint Triathlon American Cup for men: 00:55:14, 00:55:21, 00:56:00,
00:56:12, 00:56:20, 00:56:23, 00:56:36, 00:56:37, 00:56:50, 00:57:08, 00:58:41, 00:58:44,
00:58:48, 00:58:53, 00:59:14, 00:59:30, 00:59:30, 01:00:15.

(a) Calculate the sample mean, sample variance, and standard deviation.

(b) Find the median and quartiles for the data.

(c) Construct a box plot of the data. Find the interquartile range.

SOLUTION

Before analyzing the results, they should be converted to minutes. Therefore, the set of results in
minutes is: 55.23, 55.35, 56.00, 56.20, 56.33, 56.38, 56.60, 56.62, 56.83, 57.13, 58.68, 58.73,
58.80, 58.88, 59.23, 59.50, 59.50, 60.25.

(a) Sample mean:

Sample variance:

Sample standard deviation:

(b)

(c)
Therefore, there is no outliers and the box plot is

Reserve Problems Chapter 6 Section 4 Problem 4

The data below represents the 14 greatest net capacities in GW of currently operated nuclear
power plants with the current net capacities over 1000 MW for three different countries.

USA France Japan


4.214 5.460 7.965
3.300 5.320 4.494
2.560 5.200 4.400
2.538 3.660 3.473
2.429 3.660 3.220
2.367 3.640 2.783
2.332 3.620 2.089
2.330 3.580 1.966
2.302 3.560 1.922
2.300 3.000 1.692
2.288 2.990 1.613
2.278 2.670 1.108
2.264 2.660 1.067
2.258 2.660 1.060
For each country, perform the following steps.

(a) Calculate the sample mean, sample variance, and standard deviation.

(b) Find the median and quartiles for the data.

(c) Construct a box plot of the data. Find the interquartile range and determine the number of
outliers.

SOLUTION

(a) Sample mean:

Sample variance:
Sample standard deviation:

(b)
USA:

France:

Japan:
(c)
USA:

The outliers are 3.300 and 4.214. , therefore, 3.300 is an outlier and 4.214 is
an extreme outlier.

France:

There is no outliers.

Japan:

The outlier is 7.965. , therefore, 7.965 is an outlier.

The box plot is


Reserve Problems Chapter 6 Section 4 Problem 5

The United States has an aging infrastructure as witnessed by several resent disasters, including
the I-35 bridge failure in Minnesota. Most states inspect their bridges regularly and report their
condition (on a scale from 1-7) to the public. Here are condition numbers from a sample of 30
bridges in New York State:

5.08 5.44 6.66 5.07 6.80 5.43 4.83 4.00 4.41 4.38
7.00 5.72 4.53 6.43 3.97 4.19 6.26 6.72 5.26 5.48
4.95 6.33 4.93 5.61 4.66 7.00 5.57 3.42 5.18 4.54

Using the data on bridge conditions,

(a) Find the quartiles and median of the data.

(b) Select the correct boxplot.


A

B
C

(c) The amount of potential outliers.

SOLUTION

(a)
(b)

(c)

The amount of potential outliers .

Reserve Problems Chapter 6 Section 4 Problem 6

In an attempt to measure the effects of acid rain, researchers measured the pH (7 is neutral and
values below 7 are acidic) of water collected from rain in Ingham County, Michigan.

5.47 5.37 5.38 4.63 5.37 3.74 3.71 4.96 4.64 5.11
5.65 5.39 4.16 5.62 4.57 4.64 5.48 4.57 4.57 4.51
4.86 4.56 4.61 4.32 3.98 5.70 4.15 3.98 5.65 3.10
5.04 4.62 4.51 4.34 4.16 4.64 5.12 3.71 4.64 5.59

Using the data on acid rain,

(a) Find the quartiles and median of the data.

(b) Select the correct boxplot.


A

B
C

(c) The amount of potential outliers

SOLUTION

(a)
Median = 4.635.

(b)

(c)

The amount of potential outliers = 0.

Reserve Problems Chapter 6 Section 4 Problem 7

An article in the Journal of Aircraft (1988) described the computation of drag coefficients for the
NASA 0012 airfoil. Different computational algorithms were used at with the
following results (drag coefficients are in units of drag counts; that is, one count is equivalent to
a drag coefficient of 0.0001): 79, 100, 74, 83, 81, 85, 82, 80, and 84.

(a) Find the median and the upper and lower quartiles of the

(b) Select the correct boxplot.


A

B
C

A
B
C
D
(c) Find the median and the upper and lower quartiles.

(d) Select the correct boxplot.

SOLUTION

(a)

(b)

Set aside the highest observation (100) and

Reserve Problems Chapter 6 Section 4 Problem 8

The net energy consumption (in billions of kilowatt-hours) for countries in Asia in 2003 was as
follows.

Billions of Kilowatt-Hours
Afghanistan 1.04
Australia 200.66
Bangladesh 16.20
Burma 6.88
China 1671.23
Hong Kong 38.43
India 519.04
Indonesia 101.80
Japan 946.27
Korea, North 17.43
Korea, South 303.33
Laos 3.30
Malaysia 73.63
Mongolia 2.91
Nepal 2.30
New Zealand 37.03
Pakistan 71.54
Philippines 44.48
Singapore 30.89
Sri Lanka 6.80
Taiwan 154.34
Thailand 107.34
Vietnam 36.92
Total 4393.8

Select the correct boxplot.

A
B

C
D

How does the boxplot compare in interpretive value to the original stem-and-leaf diagram?

SOLUTION

Construct a boxplot of the data.

The boxplot shows the same basic information as the stem-and-leaf plot but in a different format.
Reserve Problems Chapter 6 Section 4 Problem 9

A semiconductor manufacturer produces devices used as central processing units in personal


computers. The speed of the devices (in megahertz) is important because it determines the price
that the manufacturer can charge for the devices. The following table contains measurements on
120 devices.

680 669 719 699 670 710 722 663 658 634 720 690
677 669 700 718 690 681 702 696 692 690 694 660
679 675 701 721 683 735 688 763 672 698 659 704
681 679 691 683 705 746 706 649 668 672 690 724
652 720 660 695 701 724 668 698 668 660 680 739
717 727 653 637 660 693 679 682 724 642 704 695
704 652 664 702 661 720 695 670 656 718 660 648
683 723 710 680 684 705 681 748 697 703 660 722
662 644 683 695 678 674 656 667 683 691 680 685
681 715 665 676 665 675 655 659 720 675 697 663

Select the correct boxplot.

A
B

C
D

How does the boxplot compare in interpretive value to the original stem-and-leaf diagram?

SOLUTION

The boxplot shows the same basic information as the stem-and-leaf plot but in a different format.

Reserve Problems Chapter 6 Section 6 Problem 1


The following table presents data on the ratings of quarterbacks for the 2008 National Football
league season (source: The Sports Network). It is suspected that the rating (y) is related to the
average number of yards gained per pass attempt (x).

Player Team Yards per Attempt Rating Points


Philip Rivers SD 8.39 105.5
Chad Pennington MIA 7.67 97.4
Kurt Warner ARI 7.66 96.9
Drew Brees NO 7.98 96.2
Peyton Manning IND 7.21 95
Aaron Rodgers GB 7.53 93.8
Matt Schaub HOU 8.01 92.7
Tony Romo DAL 7.66 91.4
Jeff Garcia TB 7.21 90.2
Matt Cassel NE 7.16 89.4
Matt Ryan ATL 7.93 87.7
Shaun Hill SF 7.10 87.5
Seneca Wallace SEA 6.33 87
Eli Manning NYG 6.76 86.4
Donovan McNabb PHI 6.86 86.4
Jay Cutler DEN 7.35 86
Trent Edwards BUF 7.22 85.4
Jake Delhomme CAR 7.94 84.7
Jason Campbell WAS 6.41 84.3
David Garrard JAC 6.77 81.7
Brett Favre NYJ 6.65 81
Joe Fiacco BAL 6.94 80.3
Kerry Collins TEN 6.45 80.2
Ben Roethlisberger PIT 7.04 80.1
Kyle Orton CHI 6.39 79.6
JaMarcus Russel OAK 6.58 77.1
Tyler Thigpen KC 6.21 76
Gus Freotte MIN 7.17 73.7
Dan Orlovsky DET 6.34 72.6
Marc Bulger STL 6.18 71.4
Ryan Fitzpatrick CIN 5.12 70
Derek Anderson CLE 5.71 66.5

(a) Select the correct scatterplot.


A

B
C

(b) Comment on the suspicion that rating is related to yards per attempt.

(c) What is the simple correlation coefficient between these two variables?

SOLUTION

(a)
(b)

As the yards per attempt increase, the rating tends to increase.

(c)

Reserve Problems Chapter 6 Section 6 Problem 2

An article in Technometrics by S. C. narula and J. F. Wellington ["Prediction, Linear Regression,


and a Minimum Sum of Relative Errors" (1977, Vol. 19)] presents data on the selling price and
annual taxes for 24 houses. The data are shown in the following table.

Taxes (local, Taxes (local,


Sale Sale
school, school,
Price/1000 Price/1000
country)/1000 country)/1000
25.9 4.9176 30.0 5.0500
29.5 5.0208 36.9 8.2464
27.9 4.5429 41.9 6.6969
25.9 4.5573 40.5 7.7841
29.9 5.0597 43.9 9.0384
29.9 3.8910 37.5 5.9894
30.9 5.8980 37.9 7.5422
28.9 5.6039 44.5 8.7951
35.9 5.8282 37.9 6.0831
31.5 5.3003 38.9 8.3607
31.0 6.2712 36.9 8.1400
30.9 5.9592 45.8 9.1416

(a) Select the correct scatterplot.

B
C

(b) Comment on the widely held belief that price is related to taxes paid.

(c) What is the simple correlation coefficient between these two variables?

SOLUTION

(a)
(b)

As the taxes increase, the price tends to increase.

(c)

Reserve Problems Chapter 6 Section 7 Problem 1

The following data are the numbers of cakes sold by a bakery on different days: 234, 246, 214,
267, 253, 220, 259, 264, 240, 251, 230, 247, 255.

Choose the correct normal probability plot of this data:


A

B
C

Does it seem reasonable to assume that the number of cakes sold is normally distributed?

SOLUTION
According to the normal probability plot, it seems reasonable to assume that the number of cakes
is normally distributed.

Reserve Problems Chapter 6 Section 7 Problem 2

The following data are the measurements of October rainfall (mm) in Kiev from 2010 to 2016:
44.9, 78.8, 50.6, 14.2, 21.3, 39, 103.1

Choose the correct normal probability plot of this data:

A
B

C
D

Does it seem reasonable to assume that the rainfall is normally distributed?

SOLUTION

According to the normal probability plot, it seems reasonable to assume that the rainfall is
normally distributed.

Reserve Problems Chapter 6 Section 7 Problem 3


On Mondays 19 Patients receive hemodialysis treatment in a dialysis unit. Here are the durations
of their treatment in months: 1, 4, 7, 12, 13, 15, 18, 22, 26, 27, 28, 33, 38, 45, 47, 56, 65, 79, 109.

Choose the correct normal probability plot of this data:

B
C

Does it seem reasonable to assume that the duration of treatment is normally distributed?

SOLUTION

According to the normal probability plot, it seems reasonable to assume that the duration of
treatment is normally distributed. However, the distribution seems to have a positive (or right)
skew.
Reserve Problems Chapter 6 Section 7 Problem 4

The following data are the results of 20 measurements of patient’s systolic blood pressure during
the period of illness: 98, 160, 136, 128, 130, 114, 123, 134, 128, 107, 123, 125, 129, 132, 154,
115, 126, 132, 136, 130.

Choose the correct normal probability plot of this data:

A
B

C
D

Does it seem reasonable to assume that the systolic blood pressure is normally distributed?

SOLUTION

According to the normal probability plot, it seems reasonable to assume that the systolic blood
pressure is normally distributed. However, the distribution seems to be heavy-tailed.
Reserve Problems Chapter 6 Section 7 Problem 5

The following data are the joint temperatures of the O-rings (°F) for each test firing or actual
launch of the space shuttle rocket motor (from Presidential Commission on the Space Shuttle
Challenger Accident, Vol. 1, pp. 129–131): 84, 49, 61, 40, 83, 67, 45, 66, 70, 69, 80, 58, 68, 60,
67, 72, 73, 70, 57, 63, 70, 78, 52, 67, 53, 67, 75, 61, 70, 81, 76, 79, 75, 76, 58, 31.

Select the correct normal probability plot of the O-ring joint temperature data.

B
C

Does it seem reasonable to assume that O-ring joint temperature is normally distributed?

SOLUTION
Yes, O-ring joint temperature is approximately normally distributed with some departures from
the line at the ends of the distribution.

Reserve Supplemental Exercises Chapter 6 Problem 1

The National Oceanic and Atmospheric Administration provided the monthly absolute estimates
of global (land and ocean combined) temperature index (degrees C) from 2000. See the table
below. Construct and interpret either a digidot plot or a separate stem-and-leaf and time series
plot of these data.

Table. Global Monthly Temperature


Year 1 2 3 4 5 6 7 8 9 10 11 12
2000 12.3 12.6 13.2 14.3 15.3 15.9 16.2 16.0 15.4 14.3 13.1 12.5
2001 12.4 12.5 13.3 14.2 15.4 16.0 16.3 16.2 15.5 14.5 13.5 12.7
2002 12.7 12.9 13.4 14.2 15.3 16.1 16.4 16.1 15.5 14.5 13.5 12.6
2003 12.6 12.6 13.2 14.2 15.4 16.0 16.3 16.2 15.6 14.7 13.4 12.9
2004 12.6 12.8 13.3 14.3 15.2 16.0 16.3 16.1 15.5 14.6 13.6 12.7
2005 12.6 12.5 13.4 14.4 15.4 16.2 16.4 16.2 15.7 14.6 13.6 12.8
2006 12.4 12.6 13.2 14.2 15.3 16.1 16.4 16.2 15.6 14.6 13.5 12.9
2007 12.8 12.7 13.3 14.4 15.3 16.0 16.3 16.1 15.5 14.5 13.4 12.6
2008 12.2 12.4 13.4 14.1 15.2 16.0 16.3 16.1 15.5 14.6 13.5 12.7
2009 12.5 12.6 13.2 14.3 15.3 16.1 16.4 16.2 15.6

In each year the temperature has a _____ distribution. In each year, the temperature ______ until
the mid year and then it starts to ______.
SOLUTION

Stem-and-leaf of Global Temperature N = 117


Leaf Unit = 0.10

5 12 23444
29 12 555566666666677777888999
42 13 1222233344444
48 13 555566
(11) 14 12222333344
58 14 55566667
50 15 22333334444
39 15 5555566679
29 16 00000011111112222222333334444

Time-series plot by month over 10 years


Based on the time series plots of these data, in each year the temperature has a similar
distribution. In each year, the temperature increases until the mid year and then it starts to
decrease.

Reserve Supplemental Exercises Chapter 6 Problem 2

An article in Quality Engineering (1992, Vol. 4, pp. 487–495) presents viscosity data from a
batch chemical process. A sample of these data is in the table below.

13.3 14.3 14.9 15.2 15.8 14.2 16.0 14.0


14.5 16.1 13.7 15.2 13.7 16.9 14.9 14.4
15.3 13.1 15.2 15.9 15.1 14.9 13.6 13.7
15.3 15.5 14.5 16.5 13.4 15.2 15.3 13.8
14.3 12.6 15.3 14.8 14.1 14.4 14.3 15.6
14.8 14.6 15.6 15.1 14.8 15.2 15.6 14.5
15.2 14.3 15.8 17.0 14.3 14.6 16.1 12.8
14.5 15.4 13.3 14.9 14.3 16.4 13.9 16.1
14.6 15.2 14.1 14.8 16.4 14.2 15.2 16.6
14.1 16.8 15.4 14.0 16.9 15.7 14.4 15.6

Reading left to right and up to down, draw a time series plot of all the data and comment on any
features of the data that are revealed by this plot.

Consider the notion that the first 40 observations (the first 4 columns) were generated from a
specific process, whereas the last 40 observations were generated from a different process.

Does the plot indicate that the two processes generate similar results?

Calculate the sample mean and sample variance of the first 40 and the second 40 observations
SOLUTION

No, the mean of the second set of 40 data points appears to be slightly greater
than the mean of the first set of 40.
14.875
The first 40 observations:
0.899

The second 40 observations: 14.923


1.046

There is a slight difference in the mean levels and the sample variances.

Reserve Supplemental Exercises Chapter 6 Problem 3

An article in Quality Engineering (1992, Vol. 4, pp. 487–495) presents viscosity data from a
batch chemical process. A sample of these data is in the table below.

13.3 14.3 14.9 15.2 15.8 14.2 16.0 14.0


14.5 16.1 13.7 15.2 13.7 16.9 14.9 14.4
15.3 13.1 15.2 15.9 15.1 14.9 13.6 13.7
15.3 15.5 14.5 16.5 13.4 15.2 15.3 13.8
14.3 12.6 15.3 14.8 14.1 14.4 14.3 15.6
14.8 14.6 15.6 15.1 14.8 15.2 15.6 14.5
15.2 14.3 15.8 17.0 14.3 14.6 16.1 12.8
14.5 15.4 13.3 14.9 14.3 16.4 13.9 16.1
14.6 15.2 14.1 14.8 16.4 14.2 15.2 16.6
14.1 16.8 15.4 14.0 16.9 15.7 14.4 15.6

Use the data above to calculate the descriptive statistics for two groups of observations: the first
40 observations (the first 4 columns) and the last 40.

Prepare comparative box plots for two groups of observations:

The median of the first set of observations is _____ the median of the second set. The ___ half of
the data seems to be concentrated a little more tightly. ____ data point(s) appear(s) as outliers in
the first half of the data, ____ data point(s) appear(s) as outliers in the second half of the data.

SOLUTION

The first 40 observations: Sample mean 14.875


Sample variance 0.899
Sample standard deviation 0.948

First quartile 14.3

Median (second quartile) 14.9

Third quartile 15.375


The second 40 observations: Sample mean 14.923
Sample variance 1.046
Sample standard deviation 1.023

First quartile 14.2

Median (second quartile) 14.85

Third quartile 15.675

There is a slight difference in the descriptive statistics for two sets.


The median of the first set of observations is slightly higher than the median of the second set.
The first half of the data seems to be concentrated a little more tightly. Two data points appear as
outliers in the first half of the data, no data points appear as outliers in the second half of the
data.

Reserve Supplemental Exercises Chapter 6 Problem 4

The data shown in the table are monthly champagne sales in France (1962–1969) in thousands of
bottles.

Month 1962 1963 1964 1965 1966 1967 1968 1969

Jan. 2.851 2.541 3.113 5.375 3.633 4.016 2.639 3.934

Feb. 2.672 2.475 3.006 3.088 4.292 3.957 2.899 3.162

Mar. 2.755 3.031 4.047 3.718 4.154 4.510 3.370 4.286

Apr. 2.721 3.266 3.523 4.514 4.121 4.276 3.740 4.676

May 2.946 3.776 3.937 4.520 4.647 4.968 2.927 5.010

June 3.036 3.230 3.986 4.539 4.753 4.677 3.986 4.874

July 2.282 3.028 3.260 3.663 3.965 3.523 4.217 4.633

Aug. 2.212 1.759 1.573 1.643 1.723 1.821 1.738 1.659

Sept. 2.922 3.595 3.528 4.739 5.048 5.222 5.221 5.591


Oct. 4.301 4.474 5.211 5.428 6.922 6.873 6.424 6.981

Nov. 5.764 6.838 7.614 8.314 9.858 10.803 9.842 9.851

Dec. 7.132 8.357 9.254 10.651 11.331 13.916 13.076 12.670

Construct a time series plot of the data and comment on any features of the data that reveals by
this plot.

There appears to be a ____ variation in the data with the highest value of the year _____. The
high values are during the _______.

Speculate on how you would use a graphical procedure to forecast monthly champagne sales for
the year 1970.

SOLUTION

There appears to be a cyclic variation in the data with the high value of the cycle generally
increasing. The high values are during the winter holiday months.

We might draw another cycle, with the peak similar to the last year’s data (1969) at about 12.7
thousand bottles.

Reserve Supplemental Exercises Chapter 6 Problem 5

A communication channel is being monitored by recording the number of errors in a string of


1000 bits. Data for 20 of these strings follow:
130 33 2 41 31

112 33 2 02 01

Fill the cells of the stem-and-leaf plot of the data with the number of zeros in the sequences. Leaf
unit is equal to 0.10.

Find the sample average and standard deviation.

Construct a time series plot of the data. Is there evidence that there was an increase or decrease
in the number of errors in a string?

SOLUTION

Stem Leaf Number of zeros in sequence


0 {0,...} 3
1 {0,...} 6
2 {0,...} 4
3 {0,...} 6
4 {0,...} 1

Computer-generated stem-and-leaf diagram is

Stem-and-leaf of errors N = 20
Leaf Unit = 0.10

3 0 000
9 1 000000
9 2 0000
7 3 000000
140

Sample Average

Sample Standard Deviation

A communication channel is being monitored by recording the number of errors in a string of


1000 bits. Data for 20 of these strings follow:
Read data across and down
1 3 03 3 24 13 1
1 1 23 3 20 20 1
310 13 2 41 31
112 33 2 02 01

3.5

2.5
errors

1.5

0.5

0
0 5 10 15 20
Index
The time series plot indicates a slight decrease in the number of errors for strings 16 - 20.

Reserve Supplemental Exercises Chapter 6 Problem 6

The January 1990 issue of Arizona Trend contains a supplement describing the 12 “best” golf
courses in the state. The yardages (lengths) of these courses are as follows: 6981, 7099, 6930,
6992, 7518, 7100, 6935, 7518, 7013, 6800, 7041, and 6890.

Use the data above to calculate the descriptive statistics for the golf course yardage data.

Construct a box plot of the yardages and fill the blanks in the interpretation of the plot.

The golf course yardage data appear to be _____. Also, there ____ outlying data point(s) ____.

SOLUTION

The golf course yardage data appear to be skewed. Also, there is an outlying data point (double)
above 7500 yards.

Reserve Supplemental Exercises Chapter 6 Problem 7

An article in Quality Engineering (1992, Vol. 4, pp. 487–495) presents viscosity data from a
batch chemical process. A sample of these data is in the table below.
13.3 14.3 14.9 15.2 15.8 14.2 16.0 14.0
14.5 16.1 13.7 15.2 13.7 16.9 14.9 14.4
15.3 13.1 15.2 15.9 15.1 14.9 13.6 13.7
15.3 15.5 14.5 16.5 13.4 15.2 15.3 13.8
14.3 12.6 15.3 14.8 14.1 14.4 14.3 15.6
14.8 14.6 15.6 15.1 14.8 15.2 15.6 14.5
15.2 14.3 15.8 17.0 14.3 14.6 16.1 12.8
14.5 15.4 13.3 14.9 14.3 16.4 13.9 16.1
14.6 15.2 14.1 14.8 16.4 14.2 15.2 16.6
14.1 16.8 15.4 14.0 16.9 15.7 14.4 15.6

Construct normal probability plots for two groups of the data: the first 40 observations (the first
4 columns) and the last 40 observations. Construct both plots on the same axes. What tentative
conclusions can you draw?

Both sets of data appear ____ normally distributed and with roughly the same ____ value. The
difference in slopes for the two lines indicates that a change in ____ might have occurred. This
could have been the result of a change in processing conditions, the quality of the raw material or
some other factor.

SOLUTION

Both sets of data appear to be normally distributed and with roughly the same mean value. The
difference in slopes for the two lines indicates that a change in variance might have occurred.
This could have been the result of a change in processing conditions, the quality of the raw
material or some other factor.

Reserve Supplemental Exercises Chapter 6 Problem 8

The following data are the temperatures of effluent at discharge from a sewage treatment facility
on consecutive days:

43 47 51 48 52 50 46 49
45 52 46 51 44 49 46 51
49 45 44 50 48 50 49 50

Construct a normal probability plot of the data. Based on the plot, what tentative conclusions can
you draw?

A normal distribution is _____ for these data. There _____ repeated values in the data so the plot
_____ off the line.

SOLUTION

A normal distribution is reasonable for these data. There are some repeated values in the data
that cause some points to fall off the line.

Reserve Supplemental Exercises Chapter 6 Problem 9


The “cold start ignition time” of an automobile engine is being investigated by a gasoline
manufacturer. The following times (in seconds) were obtained for a test vehicle: 1.75, 1.92, 2.62,
2.35, 3.09, 3.15, 2.53, 1.91. A second formulation of the gasoline was tested in the same vehicle,
with the following times (in seconds): 1.83, 1.99, 3.13, 3.29, 2.65, 2.87, 3.40, 2.46, 1.89, and
3.35. Construct normal probability plots of the cold start ignition time data. Construct a separate
plot for each gasoline formulation, but arrange the plots on the same axes. What tentative
conclusions can you draw?

Although we do not have sufficient data points to really see a pattern, there seem to ____
significant deviations from normality for either sample. The ____ difference in slopes indicates
that the _____ of the populations are very _______.

SOLUTION

Although we do not have sufficient data points to really see a pattern, there seem to be no
significant deviations from normality for either sample. The large difference in slopes indicates
that the variances of the populations are very different.

Reserve Supplemental Exercises Chapter 6 Problem 10

The United States Golf Association tests golf balls to ensure that they conform to the rules of
golf. Balls are tested for weight, diameter, roundness, and overall distance. The overall distance
test is conducted by hitting balls with a driver swung by a mechanical device nicknamed “Iron
Byron” after the legendary great Byron Nelson, whose swing the machine is said to emulate.
Following are 100 distances (in yards) achieved by a particular brand of golf ball in the overall
distance test.

261.3 259.4 265.7 270.6 274.2 261.4 254.5 283.7


258.1 270.5 255.1 268.9 267.4 253.6 234.3 263.2
254.2 270.7 233.7 263.5 244.5 251.8 259.5 257.5
257.7 272.6 253.7 262.2 252.0 280.3 274.9 233.7
237.9 274.0 264.5 244.8 264.0 268.3 272.1 260.2
255.8 260.7 245.5 279.6 237.8 278.5 273.3 263.7
241.4 260.6 280.3 272.7 261.0 260.0 279.3 252.1
244.3 272.2 248.3 278.7 236.0 271.2 279.8 245.6
241.2 251.1 267.0 273.4 247.7 254.8 272.8 270.5
254.4 232.1 271.5 242.9 273.6 256.1 251.6
256.8 273.0 240.8 276.6 264.5 264.5 226.8
255.3 266.6 250.2 255.8 285.3 255.4 240.5
255.0 273.2 251.4 276.1 277.8 266.8 268.5

Use the data above to calculate the descriptive statistics for the yardage distance.

Construct a box plot of the yardage distance and complete the interpretation of the plot. How
does the box plot compare in interpretive value to the original stem-and-leaf diagram?

The plot indicates that most balls (between first and third quartiles) will fall somewhere in the
range from ____ to _____. This type of information _____ be obtained from the stem-and-leaf
graph.

SOLUTION

Sample mean = 260.3


Sample variance = 179.8
Sample standard deviation s = 13.4

First quartile
= 251.85

Median (second quartile)


= 260.85

Third quartile
= 271.95
The plot indicates that most balls will fall somewhere in the 250-275 range. The same type of
information could have been obtained from the stem-and-leaf graph.

Reserve Supplemental Exercises Chapter 6 Problem 11

Patients arriving at a hospital emergency department present a variety of symptoms and


complaints. The following data were collected during one weekend night shift (11:00 p.m. to
7:00 a.m.):

Chest pain 8
Difficulty breathing 7
Numbness in extremities 3
Broken bones 11
Abrasions 16
Cuts 21
Stab wounds 9
Gunshot wounds 4
Blunt force trauma 10
Fainting, loss of consciousness 4
Other 10

Calculate the numerical summaries of these data and fill the table.
SOLUTION

Sort the categories by the number of instances in each category. Bars are used to indicate the counts and this sorted bar chart is known as a Pareto chart
(discussed in Chapter 15).

Fainting, loss Numbness in


Broken Blunt force Stab Chest Difficulty Gunshot
Cuts Abrasions Other of
bones trauma wounds pain breathing wounds
conciousness extremities
Frequency 21 16 11 10 10 9 8 7 4 4 3
Percent 20.4 15.5 10.7 9.7 9.7 8.7 7.8 6.8 3.9 3.9 2.9
Cumulative 20.4 35.9 46.6 56.3 66 74.8 82.5 89.3 93.2 97.1 100.0
frequency, %
Reserve Supplemental Exercises Chapter 6 Problem 12

The force needed to remove the cap from a medicine bottle is an important feature of the product
because requiring too much force may cause difficulty for elderly patients or patients with
arthritis or similar conditions. The data below presents the results of testing a sample of 68 caps
attached to bottles for the force (in pounds) required for removing the cap.

14 18 27 24 24 28 22 21 16
17 22 16 16 18 30 16 14 15
25 15 16 15 15 19 19 10 22
17 15 17 20 17 20 15 17 20
24 27 17 32 31 27 21 21 26
31 34 32 24 16 37 36 34 20
19 21 14 14 19 15 30 24 15
17 17 21 34 24

Stem-and-leaf of Force N = 68
Leaf Unit = 1.0

1 10
1 1
13 1 444455555555
27 1 66666677777777
33 1 889999
(9) 2 000011111
26 2 222
23 2 4444445
16 2 6777
12 2 8
11 3 0011
7 3 22
5 3 444
2 3 67

What are the average and the standard deviation of the force?

The average is ____ and the sample standard deviation is _____.

Comment on the normal probability plot of the data.


There is a number of _________ values for force that are seen as points stacked vertically on the
plot. This is probably due to rounding of the force value to two digits. There are _______ lower
force values than are expected from a normal distribution.

If the upper specification on required force is 30 pounds, what proportion of the caps do not meet
this requirement? Round your answer to three decimal places (eg. 98.765).

What proportion of the caps exceeds the average force plus 2 standard deviations? Round your
answer to three decimal places (eg. 98.765).

Suppose that the first 36 observations in the table come from one machine and the remaining
come from a second machine (read across the rows and the down). Construct box plots of the
data and calculate the descriptive statistics for two groups of observations. Round your answers
to two decimal places (eg. 98.76).

Does there seem to be a possible difference in the two machines?

Plot the first 36 observations in the table on a normal probability plot and the remaining
observations on another normal probability plot. Compare the results with the single normal
probability plot constructed for all of the data.

A separate normal distribution for each group of caps fits the data ____ the distribution for all of
the data. This is to be expected when the mean and standard deviations of the groups ____ as
they do here.

SOLUTION

The sample mean is 21.265 and the sample standard deviation is 6.422.
There is a number of repeated values for force that are seen as points stacked vertically on the
plot. This is probably due to rounding of the force value to two digits. There are fewer lower
force values than are expected from a normal distribution.

0.132 ? 0.001 % of caps do not meet the requirement.

From the stem-and-leaf display, 9 caps exceed the force limit of 30. This is 9/68 = 0.132.

0.029 ? 0.001 % of caps exceed this force limit.

The mean plus two standard deviations equals 21.265 + 2(6.422) = 34.109. From the stem-and-
leaf display, 2 caps exceed this force limit. This is 2/68 = 0.029.

The first 36 observations: Sample mean = 18.67


Sample variance = 19.31
Sample standard deviation s = 4.39

First quartile = 15.25

Median (second quartile) = 17

Third quartile = 21.75


The last 32 observations: Sample mean = 24.19
Sample variance = 50.67
Sample standard deviation s = 7.12

First quartile = 17.5

Median (second quartile) = 24

Third quartile = 31

In the following box plots Force1 denotes the subset of the first 36 observations and Force2
denotes the remaining observation. The mean and variability of force is greater for the second set
of data in Force2.
Yes.

Plot the first 36 observations in the table on a normal probability plot and the remaining
observations on another normal probability plot. Compare the results with the single normal
probability plot constructed for all of the data.
A separate normal distribution for each group of caps fits the data better. This is to be expected
when the mean and standard deviations of the groups differ as they do here.

You might also like