0% found this document useful (0 votes)
333 views15 pages

Chapter 4 Statistics and Data Measures of Variation

This document provides an overview of measures of variation in statistics and data. It discusses two key measures - range and interquartile range. Range is defined as the difference between the highest and lowest values in a data set. Interquartile range is the difference between the third (Q3) and first (Q1) quartiles, measuring the spread among the middle 50% of data values. Examples are provided to demonstrate calculating each measure and to compare the variation between two data sets. The document serves as an introduction to further lessons on measures of variation for grouped and ungrouped data.

Uploaded by

Ateng Ford
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
333 views15 pages

Chapter 4 Statistics and Data Measures of Variation

This document provides an overview of measures of variation in statistics and data. It discusses two key measures - range and interquartile range. Range is defined as the difference between the highest and lowest values in a data set. Interquartile range is the difference between the third (Q3) and first (Q1) quartiles, measuring the spread among the middle 50% of data values. Examples are provided to demonstrate calculating each measure and to compare the variation between two data sets. The document serves as an introduction to further lessons on measures of variation for grouped and ungrouped data.

Uploaded by

Ateng Ford
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

Learning Module in Mathematics in the Modern World A.Y.

2020 - 2021

CHAPTER 4 – STATISTICS AND DATA


CHAPTER INTRODUCTION
Statistics is a branch of mathematics that deals with the scientific collection, organization, presentation,
analysis, and interpretation of numerical data in order to obtain useful and meaningful information. Data is
everywhere. It is observable or measurable. With the advancement of technology every day, data can be accessed
anywhere and anyone. When data is correct, valid analysis and interpretation can be generated to produce valuable
information.
LESSONS AND COVERAGE
 LESSON 4 – Measures of Variation of Ungrouped Data
 LESSON 5 – Measures of Variation of Grouped Data

LEARNING OBJECTIVES

In these lessons, you will learn to:

LESSON OBJECTIVES

 Define the measures of variation


 Calculate the range, the semi- interquartile range, the quartile range, the
4
mean deviation or average deviation, the variance, and standard deviation
of ungrouped data
 Define the measures of variation
 Calculate the range, the semi- interquartile range, the quartile range, the
5
mean deviation or average deviation, the variance, and standard deviation
of grouped data

CONTENT

LESSON 4 – MEASURES OF VARIATION OF UNGROUPED DATA

In the preceding chapter, we discussed the computation, comparison and uses of the several types of
measuring the averages of a set of numerical data – the mean, the median, and the mode. These averages represent
the typical or representative values that tend to be centrally located in a set of data, arranged by magnitude in an
increasing or decreasing order.
The information on the measures of central location of a set of numerical data, however, are not certainly
sufficient to give an adequate description of data. We have to know also how the numerical data tend to spread or
scatter about the average value. Two sets of data having the same mean or median may differ considerably in the
spread of their data from the average. For instance, 2 sets of data, A and B, have the same mean of 50. Data A has
a lowest score of 5 and a highest score of 95, while data B has 20 and 80 for its lowest and highest scores,
respectively. While the 2 sets of data have the same mean, they differ considerably in the spread of their data in
terms of the range. The range of data A is 95-5 or 90, while for data B is 80-20 or 60. This indicates that the values
in B are less scattered about the mean of 50; hence, more homogenous than those values in data. If we will find the
differences of the 2 ranges, 90 for A and 60 for B, we will note a big gap of 30.
Since the measures of central tendency, therefore, are not enough to provide complete and useful data and
information, there is a need to support them with other computational measures of description, commonly known as
the measures of variation or measures of dispersion.
The measures of variation or dispersion indicate the degree of extent to which numerical values are dispersed
or spread out about the average value in a distribution. We will discuss more popularly used measures of variation.
These are the range, the semi – quartile range, the quartile range, the mean deviation or average deviation, the
variance, and standard deviation.

1|Page Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021

1. THE RANGE
The range which is the simplest to compute, is the difference between the largest and the lowest values in
the set of numerical data. The range of ungrouped data is obtained by obtaining the difference between the largest
value and lowest value.

Thus, we have the following formula for ungrouped data:


𝑅𝑎𝑛𝑔𝑒(𝑅 ) = 𝐻𝑖𝑔ℎ𝑒𝑠𝑡 𝑉𝑎𝑙𝑢𝑒 (𝐻𝑉 ) − 𝐿𝑜𝑤𝑒𝑠𝑡 𝑉𝑎𝑙𝑢𝑒(𝐿𝑉 ) 𝑜𝑟
𝑹 = 𝑯𝑽 − 𝑳𝑽

Example 1
The scores obtained by 12 students in Statistics class are 80, 75, 63, 95, 98, 78, 85, 90, 73, 68, 87, and 81.
Find the range.

Solution:
The highest score is 98, while the lowest score is 63. Hence,
𝑅 = 𝐻𝑉 − 𝐿𝑉
= 98 − 63
𝑹 = 𝟑𝟓
Example 2
The following are the daily wages of 8 factory workers of two garment factories. Factory A and factory B.
Find the range of salaries in peso (Php).
Factory A: 400, 450, 520, 380, 482, 495, 575, 450
Factory B: 450, 400, 450, 480, 450, 450, 400, 672

Workers of both factories have mean wage = 469

Finding the range of wages: Range = Highest wage – Lowest wage

Range A = 575 – 380 = 195


Range B = 672 – 400 = 272

Comparing the two wages, you will note that wages of workers of factory B have a higher range than wages
of workers of factory A. These ranges tell us that the wages of workers of factory B are more scattered than the
wages of workers of factory A.
Look closely at wages of workers of factory B. You will see that except for 672 the highest wage, the wages
of the workers are more consistent than the wages in A. Without the highest wage of 672 the range would be 80
from 480 – 400 = 80. Whereas, if you exclude the highest wage 575 in A, the range would be 140 from 520 – 380 =
140.
Can you now say that the wages of workers of factory B are more scattered than the wages of workers of
factory A?
The range tells us that it is not a stable measure of variability because its value can fluctuate greatly even
with a change in just a single value, either the highest or lowest.

2. THE INTER-QUARTILE RANGE


The range measures the total spread in the data set. Although the range is a simple measure of total variation
in the data, its distinct weakness is that it does not take into account how the data are actually distributed between
the smallest and the largest values.
On the other hand, the interquartile range (IQR) is the difference between the third and first quartile in a set
of data. This measure considers the spread in the middle 50% of the data; therefore, it is not influenced by extreme
values.

IQR

2|Page Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
𝑰𝑸𝑹 = 𝑸𝟑 − 𝑸𝟏
Example 1
The data below shows the number of laptops sold by 15 sales agents during the last financial year.
43 48 62 52 46 90 58 37 48 73 84 68 54 34 78
Calculate the interquartile range.
Solution:
Step 1 Order the data from least to greatest.
34 37 43 46 48 48 52 54 58 62 68 73 78 84 90

Step 2 Find the median.


𝑛+1 15+1
𝑄2 = ( ) 𝑡ℎ 𝑡𝑒𝑟𝑚 = ( ) 𝑡ℎ 𝑡𝑒𝑟𝑚 = 𝟖𝒕𝒉 𝒕𝒆𝒓𝒎 = 𝟓𝟒
2 2
34 37 43 46 48 48 52 54 58 62 68 73 78 84 90

Step 3 Calculate the median of both the lower and upper half of the data.

34 37 43 46 48 48 52 54 58 62 68 73 78 84 90
1st 2nd 3rd 4th 5th 6th 7th 8th 9th 10th 11th 12th 13th 14th 15th

1+7 9+15
𝑄1 = ( ) 𝑡ℎ 𝑡𝑒𝑟𝑚 𝑄3 = ( ) 𝑡ℎ 𝑡𝑒𝑟𝑚
2 2
𝑄1 = 4𝑡ℎ 𝑡𝑒𝑟𝑚 𝑄3 = 12𝑡ℎ 𝑡𝑒𝑟𝑚
𝑸𝟏 = 𝟒𝟔 𝑸𝟑 = 𝟕𝟑
Step 4 Calculate the 𝐼𝑄𝑅.
𝐼𝑄𝑅 = 𝑄3 − 𝑄1
𝐼𝑄𝑅 = 73 − 46
𝑰𝑸𝑹 = 𝟐𝟕
Example 2
Find the interquartile range of the following data set:
25, 10, 40, 88, 45, 60, 77, 36, 18, 95, 56, 65, 7, 0, 38 and 83

Step 1 Order the data from least to greatest.


0, 7, 10, 18, 25, 36, 38, 40, 45, 56, 60, 65, 77, 83, 88, 95
Step 2 Find the median.
𝑛+1 16 + 1
𝑄2 = ( ) 𝑡ℎ 𝑡𝑒𝑟𝑚 = ( ) 𝑡ℎ 𝑡𝑒𝑟𝑚 = 𝟖. 𝟓 𝒕𝒉 𝒕𝒆𝒓𝒎
2 2

0, 7, 10, 18, 25, 36, 38, 40, 45, 56, 60, 65, 77, 83, 88, 95

8th 9th
Find the average of the two centermost values.
40+45
𝐴𝑣𝑒𝑟𝑎𝑔𝑒 = = 𝟒𝟐. 𝟓
2
Step 3 Calculate the median of both the lower and upper half of the data.
0, 7, 10, 18, 25, 36, 38, 40 45, 56, 60, 65, 77, 83, 88, 95
1st 2nd 3rd 4th 5th 6th 7th 8th 9th 10th 11th 12th 13th 14th 15th 16th

1+8 9+16
𝑄1 = ( 2
) 𝑡ℎ 𝑡𝑒𝑟𝑚 𝑄3 = ( 2
) 𝑡ℎ 𝑡𝑒𝑟𝑚
𝑄1 = 4.5𝑡ℎ 𝑡𝑒𝑟𝑚 𝑄3 = 12.5𝑡ℎ 𝑡𝑒𝑟𝑚
Calculate the average of 18 and 25. Calculate the average of 65 and 77.
18+25 65+77
𝑄1 = 𝑄3 =
2 2
𝑸𝟏 = 𝟐𝟏. 𝟓 𝑸𝟑 = 𝟕𝟏
Step 4 Calculate the 𝐼𝑄𝑅.
𝐼𝑄𝑅 = 𝑄3 − 𝑄1
3|Page Prepared by: Sir Ross Christian Corpuz Manuel
Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
𝐼𝑄𝑅 = 71 − 21.5
𝑰𝑸𝑹 = 𝟒𝟗. 𝟓

3. THE SEMI-INTERQUARTILE RANGE OR QUARTILE DEVIATION

The semi-interquartile range (SIQR) or quartile deviation (QD) indicates the variation or dispersion of the
values covering the middle 50% of the distribution of the data. It is found by getting half of the value or distance
between the third or upper quartile and the first quartile or the lower quartile. Thus, we have the equation:

𝑸𝟑 − 𝑸 𝟏
𝑺𝑰𝑸𝑹 𝒐𝒓 𝑸𝑫 =
𝟐
Note that the range, the interquartile range, the semi-interquartile range have the same disadvantage. Each
does not provide idea on the density of observations, and hence, gives only little information on the concentration of
the observations about the central values.

The semi-interquartile range or quartile deviation is an appropriate measure of variation only if the median
is the one that is used as the measure of central tendency, and especially if the distribution is skewed.

Example 1

A manufacturing company produced the following number of units per day for a given period: 21,
25, 20, 28, 30, 23, 22, 31, 32, 27, 19, 33, 24, 29, 26, and 34.
Determine the:
a. Range
b. Interquartile Range (IQR)
c. Semi-interquartile Range (SIQR) or Quartile Deviation (QD)

Solutions:
a. Range = 𝐻𝑉 − 𝐿𝑉
= 34 − 19
= 15

To solve IQR and SIQR or QD, let us first arrange the production units according to
magnitude and calculate 𝑄1 and 𝑄3 , as shown below:

19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34

𝑄1 𝑄2 𝑄3

Note that 𝑄1 is up to the point containing the lower 25% of the data arranged from lowest
to highest, while 𝑄3 is up to the point containing the upper 75% of the data.

Thus, we apply the formulas as follows:

b. 𝐼𝑄𝑅 = 𝑄3 − 𝑄1
= 30.5 − 22.5
𝑰𝑸𝑹 = 𝟖

𝑄3 −𝑄1
c. 𝑆𝐼𝑄𝑅 𝑜𝑟 𝑄𝐷 = 2
30.5−22.5
= 2
𝑺𝑰𝑸𝑹 𝒐𝒓 𝑸𝑫 = 𝟒

4|Page Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
4. THE MEAN DEVIATION OR AVERAGE DEVIATION

A more reliable measure of variation than the range, the interquartile range and the semi – interquartile range
or quartile deviation is the mean deviation (MD) or average deviation (AD). The mean deviation or average deviation
takes into account the deviations of the individual values from the mean. While the deviations from the mean of the
individual values could either be negative or positive, we shall calculate the mean deviation or average deviation
using the absolute values of the deviations.
The absolute value of a number is the value of the number regardless of its sign. Hence, the absolute value
of -7 or +7 is simply 7. Since its uses the absolute values of the deviations, it is sometimes referred to as mean
absolute deviation.
The mean deviation or average deviation is, thus, defined as the average of the absolute deviations of the
individual values of a set of numerical data from either the mean, the median, or the mode. Among the 3, however,
the mean is the most preferred and commonly used measure of central tendency for computing the mean deviation
or average deviation. To determine the mean or average deviation, we shall use the following formula:

For Ungrouped Data


∑|𝑥 − 𝑥̅ |
𝑀𝐷 𝑜𝑟 𝐴𝐷 =
𝑛
where:
𝑥 = refers to the individual value for ungrouped data, and the midpoint of each class
interval for grouped data
𝑥̅ = the mean of the data
𝑛 = the total number of frequencies

For ungrouped data, the mean deviation or average deviation is determined by following the
procedures enumerated below.
1. Arrange the values from lowest to highest or vice versa.
2. Compute the values of the mean.
3. Find the individual absolute value of each deviation from the mean.
4. Find the sum of the absolute values in Step 3.
5. Substitute the values in the formula and solve.

The mean deviation or average deviation is an appropriate measure for economic and business concerns if
they are influenced directly from the mean in either direction. For instance, Company A has a mean deviation of
₱3,000.00 for the salaries of its employees, while Company B has a mean deviation of only ₱ 1,000.00 for its
employees. A comparison of the mean deviations indicates that the salaries of the employees of Company A are
thrice as dispersed as the salaries of the employees of Company B.
Hence, companies sometimes use this measure for planning exercises, notably when making economic
forecastings.

Example 1
The number of TV units sold by an appliance store for a 10-day period are: 8, 9, 12, 6, 7,13, 4, 2, 11
and 8. Determine the mean deviation or average deviation.

Solutions:
We shall first arrange the number of TV units sold according to magnitude (lowest to highest) and
then find the individual absolute value of each deviation from the mean, as shown below.

Absolute Values of Deviations from


Age
the Mean
(𝑥)
|𝑥 − 𝑥̅ |
2 6
4 4

5|Page Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021

6 2
7 1
8 0
8 0
9 1
11 3
12 4
13 5

𝒏 = 𝟏𝟎 ̅| = 𝟐𝟔
∑|𝒙 − 𝒙

Mean (𝑥̅ )
∑ 𝑥 80
𝑥̅ = =
𝑛 10
̅=𝟖
𝒙

Mean Deviation (MD) or Average Deviation (AD)

∑|𝑥 − 𝑥̅ |
𝑀𝐷 𝑜𝑟 𝐴𝐷 =
𝑛
26
𝑀𝐷 𝑜𝑟 𝐴𝐷 =
10
𝑴𝑫 𝒐𝒓 𝑨𝑫 = 𝟐. 𝟔

The MD or AD of 2.6 indicates the amount by which the individual sales are dispersed around
the mean of 8. The higher the value of MD or AD, the larger the dispersion.

5. THE VARIANCE

In the preceding topic, we obtained the mean deviation or average deviation in terms of the absolute values
of the deviations from the mean. The use of the absolute values for the mean deviation is particularly done to avoid
having negative deviations. This, of course, results in an awkward mathematical situation since we obtain a very small
significant finding about the variability of the data. To avoid this disadvantageous situation, let us work with the
squares of the deviations by calculating the 2 most important measures of variability, called the variance and its
corresponding square root.
The variance is defined as the average of the squared deviations from the mean. The square root of this
variance is known as standard deviation. The variance for a sample data is denoted by 𝑆 2 (reads S squared or the
square of S), while the symbol for variance of the population is 𝜎 2 , and is read sigma squared.

The formulas for calculating the variance for ungrouped data are:

For the variance of a sample data:

̅)𝟐
∑(𝒙−𝒙
𝑺𝟐 = where 𝑥̅ is the mean of the sample data
𝒏−𝟏

For the variance of a population:

∑(𝒙−𝝁)𝟐
𝝈𝟐 = where 𝜇 represents the mean of the population
𝑵

6|Page Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
We will use 𝑛 − 1 instead of n as divisor in computing the variance of a sample data although the variance
is defined as the average of the squared deviations about the mean. The reason for this is to avoid the likely existence
of biases that are normally associated with the use of the variance computed from different random samples,
especially when the sample sizes 𝑛 are small. The 𝑛 different sizes selected from the same population generally
yield different values for the variance. But, the average of these values computed from several samples of the
population tends to be closer to the actual variance – the population variance.

To determine the variance of an ungrouped data, let us follow the steps below.
1. Arrange the values according to magnitude (lowest to highest or vice versa) vertically.
2. Calculate the mean.
3. Obtain the individual deviations from the mean.
4. Square each deviation and write the results under column (𝑥 − 𝑥̅ )2 .
5. Find the sum of the squared deviations.
6. Divide the sum in Step 5 by 𝑛 − 1 for sample data or by 𝑁 for population data.

Example 1
Determine the variance of the following 8 sample production units of a certain company: 10, 11, 9,
17, 13, 15,13, and 20.

Solutions:

The 8 sample data, showing values under 𝑥 − 𝑥̅ and (𝑥 − 𝑥̅ )2 , are as follows:

𝑥 𝑥 − 𝑥̅ (𝑥 − 𝑥̅ )2
9 −4.5 20.25
10 −3.5 12.25
11 −2.5 6.25
13 −0.5 0.25
13 −0.5 0.25
15 +1.5 2.25
17 +3.5 12.25
20 +6.5 42.25
𝒏=𝟖 ̅)𝟐 = 𝟗𝟔
∑(𝒙 − 𝒙

Thus, the mean is obtained, as follows:


∑𝑥
𝑥̅ =
𝑛
9+10+11+13+13+15+17+20 108
= =
8 8
̅ = 𝟏𝟑. 𝟓
𝒙

and the variance, as shown below.


∑(𝑥 − 𝑥̅ )2
𝑆2 =
𝑛−1
96
𝑆2 =
8−1
𝑺𝟐 = 𝟏𝟑. 𝟕𝟏

As you may have observed, the computation of the variance (𝑆 2 ) for ungrouped data using the
above procedures is laborious and time – consuming. A simpler and easier solution can be done
through the raw data formula.
The raw data formulas for computing the variance of ungrouped data are:

𝒏 ∑ 𝒙𝟐 −(∑ 𝒙)𝟐
𝑺𝟐 = for sample data
𝒏(𝒏−𝟏)
7|Page Prepared by: Sir Ross Christian Corpuz Manuel
Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
and
𝑵 ∑ 𝒙𝟐 −(∑ 𝒙)𝟐
𝝈𝟐 = for population data
𝑵𝟐

We will use 𝑛 for the number of observations for sample data and 𝑁 for population data.

To solve the variance of ungrouped data by the raw data formula, we will follow the process
enumerated below.
a. Arrange the values in terms of their magnitude.
b. Find the sum of the values.
c. Square each value and write the results under the column 𝑥 2 .
d. Get the sum of the squared values in Step 3.
e. Substitute the results obtained in Step 2 and Step 4 in the raw data formula.

Example 2
Using the same 8 sample production units in the preceding example with
𝑐𝑜𝑚𝑝𝑢𝑡𝑒𝑑 𝑚𝑒𝑎𝑛 (𝑥̅ ) = 13.5, find the variance by the raw data formula.

Solutions:
The 8 sample data with their corresponding values under 𝑥 2
𝑥 𝑥2
9 81
10 100
11 121
13 169
13 169
15 225
17 289
20 400
∑ 𝑥 = 108 ∑ 𝑥 2 = 1,554
𝑛=8

Substituting the computed values obtained above in the raw data formula, we shall have:

𝑛 ∑ 𝑥 2 −(∑ 𝑥)2
𝑆2 = 𝑛(𝑛−1)
8(1,554)−(108)2
= 8(8−1)
12,432−11,664
= 56
768
= 56
𝑺𝟐 = 𝟏𝟑. 𝟕𝟏

6. THE STANDARD DEVIATION

Let us recall that the variance is the average of the squared deviations from the mean. This means that the
variance obtained for a group of data is expressed in terms of squared units, and therefore, it is difficult to use, to
treat, analyze, and interpret data. For instance, if the unit of measure of length is in feet, the value that we will obtain
when working with a problem has to be expressed also in feet. Because of this, it is more convenient to use the
square root of the variance as the standard deviation.
The standard deviation is the most important measure of variation. By knowing the standard deviation, we
will be able to determine the position of the scores in a frequency distribution in relation to the mean. A standard
deviation of a small value means that the values, in a distribution are scattered or spread out near the mean or vice
versa.

8|Page Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
The formulas to compute the standard deviation for ungrouped data are:

∑(𝑥−𝑥̅ )2 𝑛 ∑ 𝑥 2 −(∑ 𝑥)2


𝑆=√ or √[ ] for sample data
𝑛−1 𝑛(𝑛−1)

and

∑(𝑥−𝜇)2 𝑁 ∑ 𝑥 2 −(∑ 𝑥)2


𝜎=√ 𝑜𝑟√ for population data
𝑁 𝑁2

Hence, the standard deviation of the ungrouped 8 sample data in Example 1 in which the variance 𝑆 2 =
13.71 is calculated in the following manner.
𝑆 = √𝑆 2
= √13.71
𝑺 = 𝟑. 𝟕𝟎

The variance and the standard deviation are generally accepted measures of dispersion, especially in
discussions and presentation of reports containing basic statistics. The standard deviation is more popularly used
than the variance since its value is expressed in the unit of measurement as the observations and the mean. Hence,
it presents values that can be used directly for analysis and interpretation. For instance, when the unit of measurement
of the data in kilos, the standard deviation, and the mean are in kilos. The variance on the other hand is in kilos
squared.

LESSON 4 – MEASURES OF VARIATION OF GROUPED DATA

To find the range, the interquartile range, the semi-interquartile range or quartile deviation, the mean deviation
or average deviation, the variance, and the standard deviation.

1. THE RANGE

The range is the simplest measure of variability. The range of a frequency distribution is simply the difference
between the upper boundary of the highest class interval and the lower boundary of the lowest class interval.

𝑅𝑎𝑛𝑔𝑒 (𝑅) = 𝑈𝑝𝑝𝑒𝑟 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝐻𝑖𝑔ℎ𝑒𝑠𝑡 𝐶𝑙𝑎𝑠𝑠 𝐼𝑛𝑡𝑒𝑟𝑣𝑎𝑙 (𝑈𝐵𝐻𝐶𝐼 )


− 𝐿𝑜𝑤𝑒𝑟 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝐿𝑜𝑤𝑒𝑠𝑡 𝐶𝑙𝑎𝑠𝑠 𝐼𝑛𝑡𝑒𝑟𝑣𝑎𝑙 (𝐿𝐵𝐿𝐶𝐼 )

Example 1
Table 1 below shows the average production of 60 employees of a manufacturing company during
a given week. Find the range.

Table 1. Average Production of 60 Employees


Number of
Average Production
Employees Class Boundaries
(classes)
(𝑓)
91 – 95 4 90.5 – 95.5
86 – 90 7 85.5 – 90.5
81 – 85 11 80.5 – 85.5
76 – 80 16 75.5 – 80.5
71 – 75 10 70.5 – 75.5
66 – 70 8 65.5 – 70.5
61 – 65 4 60.5 – 65.5

9|Page Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
Solution:
𝑅𝑎𝑛𝑔𝑒 (𝑅) = 𝑈𝐵𝐻𝐶𝐼 − 𝐿𝐵𝐿𝐶𝐼
= 95.5 − 60.5
𝑹 = 𝟑𝟓

2. THE INTERQUARTILE RANGE

The interquartile range (IQR) is found by finding the difference between the values of the third quartile (𝑄3 )
or upper quartile and the first quartile (𝑄1 ) or the lower quartile.
In symbols, we have:
𝑰𝑸𝑹 = 𝑸𝟑 − 𝑸𝟏

Computing the values of the first and third quartiles is similar in computing the value of median, where 𝑛 is
multiplied to k (where 𝑘 = 1,2,3) and divided it by four instead of two. The formula for computing the quartile is, as
follows:
𝒌𝒏
− 𝒄𝒇𝒃
𝑸𝒌 = 𝑿𝒍𝒃 + ( 𝟒 )𝒊
𝒇𝒒𝒌
where:
𝑿𝒍𝒃 = 𝑙𝑜𝑤𝑒𝑟 𝑏𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑘 𝑡ℎ 𝑞𝑢𝑎𝑟𝑡𝑖𝑙𝑒 𝑐𝑙𝑎𝑠𝑠
𝒄𝒇𝒃 = 𝑐𝑢𝑚𝑢𝑙𝑎𝑡𝑖𝑣𝑒 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑏𝑒𝑓𝑜𝑟𝑒 𝑡ℎ𝑒 𝑘 𝑡ℎ 𝑞𝑢𝑎𝑟𝑡𝑖𝑙𝑒 𝑐𝑙𝑎𝑠𝑠
𝒇𝒒𝒌 = 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑘 𝑡ℎ 𝑞𝑢𝑎𝑟𝑡𝑖𝑙𝑒 𝑐𝑙𝑎𝑠𝑠
𝒊 = 𝑖𝑛𝑡𝑒𝑟𝑣𝑎𝑙 𝑠𝑖𝑧𝑒 𝑜𝑟 𝑐𝑙𝑎𝑠𝑠 𝑠𝑖𝑧𝑒
The formulas of the quartiles when 𝑘 = 1,2, 𝑎𝑛𝑑 3 are as follows:

𝒘𝒉𝒆𝒏 𝒌 = 𝟏
𝒏
− 𝒄𝒇𝒃
𝑸𝟏 = 𝑿𝒍𝒃 + (𝟒 )𝒊
𝒇𝒒𝟏

𝒘𝒉𝒆𝒏 𝒌 = 𝟐
𝟐𝒏
− 𝒄𝒇𝒃
𝑸𝟐 = 𝑿𝒍𝒃 + ( 𝟒 )𝒊
𝒇𝒒𝟐

𝒘𝒉𝒆𝒏 𝒌 = 𝟑
𝟑𝒏
− 𝒄𝒇𝒃
𝑸𝟑 = 𝑿𝒍𝒃 + ( 𝟒 )𝒊
𝒇𝒒𝟑
Example 1
Table 1 below shows the average production of 60 employees of a manufacturing company during
a given week. Find the interquartile range (IQR).
Table 1. Average Production of 60 Employees
Number of
Average Production
Employees < 𝑐𝑓
(classes)
(𝑓)
91 – 95 4 60
86 – 90 7 56
81 – 85 11 49
76 – 80 16 38
71 – 75 10 22
66 – 70 8 12

10 | P a g e Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
61 – 65 4 4

Solutions:

First, we have to calculate the first and third quartiles.

The formula to determine the position of the 𝑖 𝑡ℎ quartile is


𝑖(𝑁) 𝑡ℎ
𝑄𝑖 = ( ) , 𝑖 = 1,2,3
4
where N is the total number of observations

First Quartile (𝑸𝟏 )


1(𝑁) 𝑡ℎ
𝑄1 = [
] 𝑣𝑎𝑙𝑢𝑒
4
1(60) 𝑡ℎ
𝑄1 = [ ] 𝑣𝑎𝑙𝑢𝑒
4
𝑄1 = 15𝑡ℎ 𝑣𝑎𝑙𝑢𝑒
The lowest cumulative frequency just greater than or equal to 15 is 22. Therefore, 𝑄1 𝑐𝑙𝑎𝑠𝑠 is 71 – 75.
𝑛
−𝑐𝑓𝑏
𝑄1 = 𝑋𝑙𝑏 + ( 4 𝑓𝑞 ) 𝑖
1
𝑛
−𝑐𝑓𝑏
4
= 𝑋𝑙𝑏 + ( )𝑖
𝑓𝑞1
15−12
= 70.5 + ( )5
10
15−12
= 70.5 + ( )5
10
3
= 70.5 + (10) 5
15
= 70.5 + 10
= 70.5 + 1.5
𝑸𝟏 = 𝟕𝟐

Third Quartile (𝑸𝟑 )


3(𝑁) 𝑡ℎ
𝑄3 = [ ] 𝑣𝑎𝑙𝑢𝑒
4
3(60) 𝑡ℎ
𝑄3 = [ ] 𝑣𝑎𝑙𝑢𝑒
4
𝑄3 = 45𝑡ℎ 𝑣𝑎𝑙𝑢𝑒
The lowest cumulative frequency just greater than or equal to 45 is 49. Therefore, 𝑄3 𝑐𝑙𝑎𝑠𝑠 is 81 – 85.
3𝑛
−𝑐𝑓𝑏
𝑄3 = 𝑋𝑙𝑏 + ( 4 𝑓𝑞 )𝑖
3
3𝑛
−𝑐𝑓𝑏
= 𝑋𝑙𝑏 + ( 4 𝑓𝑞 )𝑖
3
45−38
= 80.5 + ( )5
11
7
= 80.5 + (11) 5
35
= 80.5 +
11
= 80.5 + 3.18
𝑸𝟑 = 𝟖𝟎. 𝟔𝟖

Therefore,
𝐼𝑄𝑅 = 𝑄3 − 𝑄1
= 83.68 − 72
𝑰𝑸𝑹 = 𝟏𝟏. 𝟔𝟖
11 | P a g e Prepared by: Sir Ross Christian Corpuz Manuel
Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
3. THE SEMI-INTERQUARTILE RANGE OR QUARTILE DEVIATION
The semi-interquartile range is half of the difference between the third quartile and the first quartile. This is
half the interquartile range.

In the previous example, the quartiles were 𝑄3 = 80.68 and 𝑄1 = 72.

The semi – interquartile range is:

𝑄3 −𝑄1
𝑆𝐼𝑄𝑅 = 2
80.68−72
= 2
11.68
= 2
𝑺𝑰𝑸𝑹 = 𝟓. 𝟖𝟒

4. THE MEAN DEVIATION OR AVERAGE DEVIATION


The mean deviation (also called the mean absolute deviation) is the mean of the absolute deviations of a
set of data about the data's mean. It gives equal weight to the deviation of every observation.

For grouped data, we shall compute the mean deviation (MD) or average deviation (AD) by following the
procedures below.
1. Compute the mean (𝑥̅ ) of the distribution.
2. Subtract the mean (𝑥̅ ) from each of the midpoints and write the absolute values of the results under the
column |𝑥 − 𝑥̅ |.
3. Find the products of items under column 𝑓 and items under column |𝑥 − 𝑥̅ |.
4. Add the products in Step 3 to obtain the value of ∑ 𝑓|𝑥 − 𝑥̅ |.
5. Divide the sum obtained in Step 4 by 𝑛.

Example 1

Calculate the mean deviation (MD) or average deviation (AD) of the scores obtained by 89 applicants
for employment, as shown in Table 2 below.

Table 2. Scores of 89 Applicants for Employment


Classes 𝑓 𝑥 𝑓𝑥 |𝑥 − 𝑥̅ | 𝑓|𝑥 − 𝑥̅ |
71 – 75 6 43 172 14.83 88.98
66 – 70 9 48 480 9.83 88.47
61 – 65 17 53 954 4.83 82.11
56 – 60 25 58 1450 0.17 4.25
51 – 55 18 63 1071 5.17 93.06
46 – 50 10 68 612 10.17 101.70
41 – 45 4 73 438 15.17 60.68
𝑛 = 89 ∑ 𝑓𝑥 = 5,177 ∑ 𝑓 |𝑥 − 𝑥̅ | = 519.25

The mean (𝑥̅ ) of the distribution is calculated by using the formula presented in Lesson 3, as follows:
∑ 𝑓𝑥
𝑥̅ = 𝑛
5,177
𝑥̅ = 89
̅
𝒙 = 𝟓𝟖. 𝟏𝟕

Thus, the Mean Deviation (MD) or Average Deviation (AD) shall be computed, as follows:
∑ 𝑓|𝑥−𝑥̅ |
𝑀𝐷 𝑜𝑟 𝐴𝐷 = 𝑛
519.25
= = 𝟓. 𝟖𝟑
89

12 | P a g e Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
5. THE VARIANCE
The variance for grouped data may be calculated by 2 methods. The first method involves rather longer
procedures using the mean deviation formula as follows:

Method 1 (Long Method Formula)


̅)𝟐
∑ 𝒇(𝒙−𝒙
𝑺𝟐 = for sample data and
𝒏−𝟏
∑ 𝒇(𝒙−𝝁)𝟐
𝝈𝟐 = for population data
𝑵
We shall use 𝑥̅ for the mean of a sample data and 𝜇 for the mean of a population data.

We will follow the procedures below to solve the variance of a grouped data by using the above formula
(long method).
1. Find the value of the mean.
2. Subtract the mean from each of the midpoints of each class interval.
3. Square each of the deviations in Step 2 and write the results under (𝑥 − 𝑥̅ )2 .
4. Multiply the squared deviations in Step 3 by their corresponding frequencies.
5. Obtain the sum of the results in Step 4.
6. Divide the results in Step 5 by 𝑛 − 1 for sample data and by N for population data.

Example 1
A distribution of the ages of a sample of 87 managerial employees of a manufacturing company is shown in
Table 3 below. Find the variance by the use of the long method formula.

Table 3. Ages of 87 Managerial Employees


Age Classes 𝑓 𝑥 𝑓𝑥 𝑥 − 𝑥̅ (𝑥 − 𝑥̅ )2 𝑓(𝑥 − 𝑥̅ )2
31 – 35 5 33 165 -14.48 209.67 1048.35
36 – 40 10 38 380 -9.48 89.87 898.70
41 – 45 18 43 774 -4.48 20.07 361.26
46 – 50 25 48 1200 0.52 0.27 6.75
51 – 55 17 53 901 5.52 30.47 517.99
56 – 60 9 58 522 10.52 110.67 996.03
61 – 65 3 63 189 15.52 240.87 722.61
𝑛 = 87 ∑ 𝑓𝑥 = 4,131 ∑ 𝑓 (𝑥 − 𝑥̅ )2 = 4,551.69

Solutions:
The Mean (𝑥̅ )
∑ 𝑓𝑥
𝑥̅ =
𝑛
4,131
=
87
̅ = 𝟒𝟕. 𝟒𝟖
𝒙
Hence, we will obtain:
∑ 𝑓(𝑥−𝑥̅ )2
𝑆2 = 𝑛−1
4,551.69
= for sample data
86
𝑺𝟐 = 𝟓𝟐. 𝟗𝟑

Now, let us simplify the computation in solving variance of a grouped data by using a short method known
as coding method formula, as follows:
Method 2 – Short Method
𝑛 ∑(𝑓∙𝑑2 )−(∑ 𝑓𝑑)2
𝑆2 = [ ] 𝑖2 for sample data
𝑛(𝑛−1)
and
𝑁 ∑(𝑓∙𝑑2 )−(∑ 𝑓𝑑)2
𝜎2 = [ ] 𝑖2 for population data
𝑁2
13 | P a g e Prepared by: Sir Ross Christian Corpuz Manuel
Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
The 𝑑 represents the coded value of a class interval and 𝑖, the interval size of the class interval.
To obtain the variance of a grouped data by the short method or coding formula, we shall follow the
procedures listed below.
1. Write the coded values of the class intervals under the 𝑑 column.
2. Multiply the frequencies by the corresponding coded values.
3. Multiply the squared coded values by the corresponding frequencies.
4. Add the results in Step 2 and in Step 3.
5. Substitute the values in the coding formula.

Example 2
Calculate the variance of the distribution of the ages of a sample of 87 managerial employees contained in
Table 3 in the preceding example, by the short method (coded formula).

Solutions:
By following the above procedures, complete Table 3, as shown hereunder.

Table 3. Ages of 87 Managerial Employees


Age (classes) 𝑓 𝑑 𝑓𝑑 𝑓 ∙ 𝑑2
31 – 35 5 -3 -15 45
36 – 40 10 -2 -20 40
41 – 45 18 -1 -18 18
46 – 50 25 0 0 0
51 – 55 17 1 17 17
56 – 60 9 2 18 36
61 – 65 3 3 9 27
𝑖=5 𝑛 = 87 ∑ 𝑓𝑑 = −9 ∑(𝑓 ∙ 𝑑 2 ) = 183

Substituting the obtained values in the formula, we shall find:

𝑛 ∑(𝑓 ∙ 𝑑 2 ) − (∑ 𝑓𝑑 )2 2
𝑆2 = [ ]𝑖
𝑛(𝑛 − 1)
(87)(183)−(−9)2
=[ ] 52
87(87−1)
15,921−81
= 25 [ ]
7,482
𝑺𝟐 = 𝟓𝟐. 𝟗𝟑

6. THE STANDARD DEVIATION


The formulas to compute the standard deviation for grouped data are:

̅)𝟐
∑ 𝒇(𝒙−𝒙 𝑛 ∑(𝑓∙𝑑2 )−(∑ 𝑓𝑑)2
𝑆=√ or √[ ] 𝑖2 for sample data
𝒏−𝟏 𝑛(𝑛−1)
and
∑ 𝒇(𝒙−𝝁)𝟐 𝑁 ∑(𝑓∙𝑑2 )−(∑ 𝑓𝑑)2
𝜎=√ or √[ ] 𝑖2 for population data
𝑵 𝑁2

Hence, the standard deviation of the grouped data on the ages of 87 managerial employees presented in
Table 3, where the variance 𝑆 2 = 52.53 is found as follows:

𝑆 = √𝑆 2
= √52.93
𝑺 = 𝟕. 𝟐𝟖

14 | P a g e Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I
Learning Module in Mathematics in the Modern World A.Y. 2020 - 2021
ASSESSMENT

1. Consider a sample data of customers having various interest rates (in percent) on their loans, as follows:

8.52 9.55 8.95 13.35


9.05 10.54 12.91 8.39
9.35 7.58 7.93 11.29
10.83 12.31 8.54 14.21

Find the:
a. Range
b. Interquartile Range
c. Semi – interquartile range
d. Mean Deviation or Average Deviation
e. Variance
f. Standard Deviation

2. The monthly losses by a sample of 80 restaurants are tabulated in the following frequency distribution table.

Losses (in Frequency


thousand (number of
pesos) restaurants)
61 - 70 9
51 – 60 11
41 - 50 12
31 - 40 20
21 - 30 15
11 – 20 10
1 – 10 3

Find the:
a. Range
b. Interquartile Range
c. Semi – interquartile range
d. Mean Deviation or Average Deviation
e. Variance
f. Standard Deviation

REFERENCE:

Zorilla, R. S., Esller, B. H., Fe, P. G., Violeta, M. C., & Cabrera, M. K. (2011). Statistics: Basic Concepts and
Application (2011 ed.). Malabon City, Philippines: MUTYA Publishing House.

15 | P a g e Prepared by: Sir Ross Christian Corpuz Manuel


Instructor I

You might also like