0% found this document useful (0 votes)
46 views12 pages

Revision

This document provides solutions to several statistics problems involving calculating measures of central tendency, variation, percentiles, and constructing box-and-whisker plots from data sets. For problem 1, the mean, median, mode, midrange, variance, standard deviation, and coefficient of variation are calculated for earthquake magnitude data. Problem 2 compares the coefficient of variation between employee age and salary data. Problems 3-5 involve finding quartiles, percentiles, interquartile ranges, and constructing box-and-whisker plots for additional data sets, as well as checking for outliers.

Uploaded by

loooooooooool
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
46 views12 pages

Revision

This document provides solutions to several statistics problems involving calculating measures of central tendency, variation, percentiles, and constructing box-and-whisker plots from data sets. For problem 1, the mean, median, mode, midrange, variance, standard deviation, and coefficient of variation are calculated for earthquake magnitude data. Problem 2 compares the coefficient of variation between employee age and salary data. Problems 3-5 involve finding quartiles, percentiles, interquartile ranges, and constructing box-and-whisker plots for additional data sets, as well as checking for outliers.

Uploaded by

loooooooooool
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 12

Revision

1. Twelve major earthquakes had Richter magnitudes shown here.

7.0 6.2 7.7 8.0 6.4 6.2


7.2 5.4 6.4 6.5 7.2 5.4
a) Calculate the Mean, Median, Mode, and Midrange for these data.
b) Calculate the variance, standard deviation and coefficient of variation.

Solution:
5.4, 5.4, 6.2, 6.2, 6.4, 6.4, 6.5, 7, 7.2,
7.2, 7.7, 8

̅) = ∑ X = 5.4+5.4+6.2+6.2+6.4+6.4+6.5+7+7.2+ 7.2+7.7+ 8 = 6.633


a) *Mean (X
n 12

*To get median, you must sort the data n = 12 (even)

5.4, 5.4, 6.2, 6.2, 6.4, 6.4, 6.5, 7, 7.2, 7.2,


7.7, 8

6.4+6.5
Median is the average of mid points = = 6.45
2

*Mode is the most repeated value, there are 4 modes


which are
5.4, 6.2, 6.4, 7.2

Max+Min
*Midrange = = 6.45
2
(∑ 𝑋)2
∑ 𝑋2−
𝑛
b) Variance =
𝑛−1

∑ 𝑋 = 7 + 6.2 + 7.7 + 8 + 6.4 + 6.2 + 7.2 + 5.4 + 6.4 + 6.5 +


7.2 + 5.4 = 79.6
∑ 𝑋 2 = 72 + 6.22 + 7.72 + ⋯ + 5.42 = 535.34
(∑ 𝑋)2 (79.6)2
∑ 𝑋2− 535.34−
* Variance (𝑆 2 ) = 𝑛
= 12
= 0.666
𝑛−1 12−1

* Standard deviation (S) = √Variance = √0.666 = 0.816

𝑆 0.816
*Coefficient of variation (C.V) = ̅ × 100% = ×
𝑋 6.633
100% = 12.3%.

2. The average age of the accountants at Three Rivers Corp. is 26 years, with
a standard deviation of 6 years; the average salary of the accountants is $31,000,
with a standard deviation of $4000. Compare the variations of age and income

Solution:
Age 𝑋̅1 = 26, 𝑆1 = 6
Salary 𝑋̅2 = 31000, 𝑆2 = 4000
𝑆 6
C.V (age) = ̅1 × 100% = × 100% = 23.07%
𝑋1 26

𝑆2 4000
C.V (salary) = × 100% = × 100% = 12.9%
𝑋̅2 31000
Coefficient of variation for age is larger than coefficient of variation for
salaries. Therefore, age varies more than salary.
3. A sample of 20 production workers in a company earned the following
net pay amounts after all deductions for a given week, rounded to the
nearest dollar:
240 340 330 240 325 300 240 305 280 240
290 240 265 280 255 265 240 255 240 240

(a)Find the Quartiles and the 10th percentile of the data and 𝑃60 .
(b)Find the percentile rank of 305.
(c)Calculate the inter-quartile.
(d)Construct a box-and-whisker plot for these data.
(e)Check the data for outliers if exist.
Solution:
(a)Arrange the data in ascending order
240, 240, 240, 240, 240, 240, 240,
240, 255, 255, 265, 265, 280, 280,
290, 300, 305, 325, 330, 340

Check if there exist data less than 157.5 and higher than 377.5. So, NO outliers in these
data

To find 𝑸𝟏 = 𝑷𝟐𝟓
𝑛. 𝑃 (20) (25)
𝐶= = =5
100 100
If c is a whole number, use the value halfway between the 𝐶 and 𝐶 + 1 values when
counting up from the lowest value. In this case, the 5th and 6th values
𝟐𝟒𝟎 + 𝟐𝟒𝟎
𝑸𝟏 = 𝑷𝟐𝟓 = = 𝟐𝟒𝟎
𝟐
To find 𝑸𝟐 = 𝑷𝟓𝟎
𝑛. 𝑃 (20) (50)
𝐶= = = 10
100 100
If c is a whole number, use the value halfway between the 𝐶 and 𝐶 + 1 values when
counting up from the lowest value. In this case, the 10th and 11th values
𝟐𝟓𝟓 + 𝟐𝟔𝟓
𝑸𝟐 = 𝑷𝟓𝟎 = = 𝟐𝟔𝟎
𝟐
To find 𝐐𝟑 = 𝐏𝟕𝟓
𝑛. 𝑃 (20) (75)
𝐶= = = 15
100 100
If c is a whole number, use the value halfway between the 𝐶 and 𝐶 + 1 values when
counting up from the lowest value. In this case, the 15th and 16th values
𝟐𝟗𝟎 + 𝟑𝟎𝟎
𝑸𝟑 = 𝑷𝟕𝟓 = = 𝟐𝟗𝟓
𝟐

To find The 10th percentile


𝑛×𝑝 20×10
𝑐= = = 2,
100 100

𝟐𝟒𝟎 + 𝟐𝟒𝟎
𝑷𝟏𝟎 = = 𝟐𝟒𝟎
𝟐
To find 𝐏𝟔𝟎 .
𝑛×𝑝 20×60
𝑐= = = 12
100 100
𝟐𝟔𝟓 + 𝟐𝟖𝟎
𝑷𝟔𝟎 = = 𝟐𝟕𝟐. 𝟓
𝟐
(𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑣𝑎𝑙𝑢𝑒 𝑏𝑒𝑙𝑜𝑤 𝑋=305)+0.5 16+0.5
(b) Percentile = ∗ 100 = ∗ 100 = 82.5
(𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟𝑠 𝑜𝑓 𝑣𝑎𝑙𝑢𝑒𝑠) 20
𝟑𝟎𝟓 = 𝑷𝟖𝟑

(c)Interquartile range (IQR) = 𝑸𝟑 − 𝑸𝟏 = 𝟐𝟗𝟓 − 𝟐𝟒𝟎 = 𝟓𝟓


(d)Box-and-whisker plot|

e)
Lower inference =𝑄1 − 1.5( 𝑄3 − 𝑄1 ) = 240 − 1.5( 295 − 240) = 157.5
Upper inference= 𝑄3 + 1.5( 𝑄3 − 𝑄1 ) = 295 + 1.5( 295 − 240) = 377.5
Check if there exist data less than 157.5 and higher than 377.5. So, NO outliers in these
data.
4. The following data are the incomes (in thousands of dollars) for a sample of 13
households.
75 69 84 112 74 104 81 90 94 144 79 13 98
(a)Construct a box-and-whisker plot for these data.
(b)Find the percentile rank of 104.
(c)Check the data for outliers if exist.

Solution:
(a)Arrange the data in ascending order
13 69 74 75 79 81 84 90 94 98 104 112 114

Lower inference =𝑄1 − 1.5( 𝑄3 − 𝑄1 ) = 75 − 1.5( 98 − 75) = 40.5


Upper inference= 𝑄3 + 1.5( 𝑄3 − 𝑄1 ) = 98 + 1.5( 98 − 75) = 132.5
To find 𝑸𝟏 = 𝑷𝟐𝟓
𝑛. 𝑃 (13) (25)
𝐶= = = 3.25
100 100
If 𝐶 is not a whole number, round it up to the next whole number; in this case, 𝐶 = 4
𝑸𝟏 = 𝑷𝟐𝟓 = 𝟕𝟓

To find 𝑸𝟐 = 𝑷𝟓𝟎
𝑛. 𝑃 (13) (50)
𝐶= = = 6.5
100 100
If c is not a whole number, round it up to the next whole number; in this case, 𝐶 = 7
𝑸𝟐 = 𝑷𝟓𝟎 = 𝟖𝟒
To find 𝐐𝟑 = 𝐏𝟕𝟓
𝑛. 𝑃 (13) (75)
𝐶= = = 9.75
100 100
If c is not a whole number, round it up to the next whole number; in this case, 𝐶 = 10

𝑸𝟑 = 𝑷𝟕𝟓 = 𝟗𝟖
(𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑣𝑎𝑙𝑢𝑒 𝑏𝑒𝑙𝑜𝑤 𝑋=104)+0.5
(b) Percentile = ∗ 100
(𝑡𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟𝑠 𝑜𝑓 𝑣𝑎𝑙𝑢𝑒𝑠)
10 + 0.5
= ∗ 100 = 80.76
13
𝟏𝟎𝟒 = 𝑷𝟖𝟏

(c)Lower inference =𝑄1 − 1.5( 𝑄3 − 𝑄1 ) = 75 − 1.5( 98 − 75) = 40.5


Upper inference= 𝑄3 + 1.5( 𝑄3 − 𝑄1 ) = 98 + 1.5( 98 − 75) = 132.5

Check if there exist data less than 𝟒𝟎. 𝟓and higher than 132.5. So, 13 is the only
outliers in these data.

5. The following data represent the record weight for each of the 40 people

102 104 140 136 152 132 158 193 128 141
130 133 147 148 141 129 133 137 179 147
152 114 124 138 129 164 135 128 139 154
168 148 152 116 107 136 167 143 139 152

𝑅𝑎𝑛𝑔𝑒 𝑔𝑟𝑒𝑎𝑡𝑒𝑠𝑡 𝑣𝑎𝑙𝑢𝑒−𝑠𝑚𝑎𝑙𝑙𝑒𝑠 𝑣𝑎𝑙𝑢𝑒 193−102


Class width = = = = 8.27 ≈ 9
𝑛𝑜.𝑜𝑓 𝑐𝑙𝑎𝑠𝑠𝑒𝑠 𝑛𝑜.𝑜𝑓 𝑐𝑙𝑎𝑠𝑠𝑒𝑠 11

10 2 4 7
11 4 6
12 4 8 8 9 9
13 0 2 3 3 5 6 6 7 8 9 9
14 0 1 1 3 7 7 8 8
15 2 2 2 2 4 8
16 4 7 8
17 9
19 3
Classes Frequency Relative Frequency
102-111 3 3⁄
40
111-120 2 2⁄
40
120-129 3 3⁄
40
129-138 10 10⁄
40
138-147 7 7⁄
40
147-156 9 9⁄
40
156-165 2 2⁄
40
165-174 2 2⁄
40
174-183 1 1⁄
40
183-192 0 0
192-201 1 1⁄
40
∑ 40 1

I. Draw Histogram, Frequency Polygon, and Histogram by using relative frequency and
an O-give.
Polygon

Classes Frequency Midpoint


102-111 3 106.5
111-120 2 115.5
120-129 3 124.5
129-138 10 133.5
138-147 7 142.5
147-156 9 151.5
156-165 2 160.5
165-174 2 169.5
174-183 1 178.5
183-192 0 187.5
192-201 1 196.5
∑ 40
O-give.
<Upper bound Frequency Cumulative
Frequency
<111 3 3
<120 2 5
<129 3 8
<138 10 18
<147 7 25
<156 9 34
<165 2 36
<174 2 38
<183 1 39
<192 0 39
<201 1 40
40

How many people was the record weight less than 149?
29
6. The data shows the results of women who run the marathon

a. How many women ran of 50 minutes and more


49-37=12
b. How many women ran less than 50
37
7. Given the following Qualitative data, construct the Pie and Bar graphs.

Solution

Classes Frequency Percent 𝑓


Degree= ∗ 360
𝑛
N 10 20% 72
T 16 32% 115.2
I 12 24% 86.4
R 12 24% 86.4
∑ 50 100% 360
Classes Frequency Percent 𝑓
Degree= ∗
𝑛=50
360
N 10 20% 72
T 16 32% 115.2
I 12 24% 86.4
R 12 24% 86.4
∑ 50 100% 360

72

72+115.2=187.2

187.2+86.4=273.6
273.6+86.4=360

You might also like