Lesson 3.2 Measures of Central Tendency Position and Variation
Lesson 3.2 Measures of Central Tendency Position and Variation
Happy Care
Sad Next
Next
E N M A - MEAN
BACK
BACK
I S O P O I N T - POSITION
BACK
BACK
I O N R A V A I T - VARIATION
BACK
BACK
What do you think is
our topic for today?
Lesson 3.2:
Measures of Central
Tendency, Position,
and Variation
Objectives:
Find the mean, median, and
01 mode of a given data set. 02 Interpret data using the
measure of central tendency.
➢ To calculate: just add up all the numbers, then divide by how many numbers there are.
∑ 𝑥̄
𝑥̄ =
𝑛
Solution:
∑ 𝑥̄
𝑥̄ =
𝑛
Example 1: Solve for the mean of the set 1, 5, 3, 9, 7.
Solution:
∑ 𝑥̄
𝑥̄ =
𝑛
1+ 5+ 3+ 9+7
=
5
25
=
5
Thus, the mean of the given set is 5.
𝑥̄ = 5
Example 2:
Suppose a basketball team has 15 players, and the heights (in cm) are as follows:
180 204 188 184 189 190 190 200
193 186 186 187 182 201 187
Solution:
∑ 𝑥̄
𝑥̄ =
𝑛
Example 2:
Suppose a basketball team has 15 players, and the heights (in cm) are as follows:
180 204 188 184 189 190 190 200
193 186 186 187 182 201 187
Solution:
∑ 𝑥̄
𝑥̄ =
𝑛
2847
=
15
Thus, the average height of basketball
𝑥̄ = 189.8 cm players is 189.8 cm.
Weighted Mean
➢ It is particularly useful when various classes or groups contribute
differently to the total.
Solution:
𝑥̄1 𝑤1 + 𝑥̄2 𝑤2 + 𝑥̄3 𝑤3 + ⋯ + 𝑥̄𝑛 𝑤𝑛
x𝑤 =
𝑤1 + 𝑤2 + 𝑤3 + ⋯ + 𝑤𝑛
Example 3:
Joel’s first quarter grade is shown in the table below. Use the weighted
mean formula to find Joel’s GPA for the first quarter.
Subjects English Math Filipino Science MAPEH Soc Stud
Grade 90 87 88 93 95 96
No. of Units 3 3 3 3 2 1
Solution:
𝑥̄1 𝑤1 + 𝑥̄2 𝑤2 + 𝑥̄3 𝑤3 + ⋯ + 𝑥̄𝑛 𝑤𝑛
x𝑤 =
𝑤1 + 𝑤2 + 𝑤3 + ⋯ + 𝑤𝑛
90 (3) + 87 (3) + 88 (3) + 93 (3) + 95 (2) + 96 (1)
=
3+ 3+ 3+ 3+2+ 1
= 1,360
15
x𝑤 = 90.67 Therefore, Joel’s GPA for the first quarter is 90.67.
B. Median
➢ The median (𝑀𝑑 ) is the middle value in a set of observations arrange
from highest to lowest or vice versa.
Consider again the heights (in cm) of 15 basketball players listed. Find the median
height of the players.
180 204 188 184 189 190 190 200
193 186 186 187 182 201 187
Solution:
First, arrange the heights from the shortest to the tallest and pick the
height of the middle player.
Example 4:
Consider again the heights (in cm) of 15 basketball players listed. Find the median
height of the players.
180 204 188 184 189 190 190 200
193 186 186 187 182 201 187
Solution:
First, arrange the heights from the shortest to the tallest and pick the
height of the middle player.
180 182 184 186 186 187 187 188 189 190 190 193 200 201 204
Since there are 15 players in the team, the eighth observation is the median.
Remarks:
⮚ If 𝑛 is odd, the median of the observation corresponds to the
𝑛+1
𝑡ℎ observation (middle-ranked).
2
There are two modes 20 and 25, since each of these values occurs four
times. The data set is bimodal.
Comparison of the Mean, Median, and Mode
➢ When the distribution of the observations is fairly symmetric or when
there are no extreme observations, the mean is the most meaningful
measure of central tendency.
➢ If extremely low observations are present, these observations pull the mean to the left, and the
relationship between the three measures becomes: x < 𝑀𝑑 < 𝑀𝑜 (negatively skewed distribution).
➢ With the presence of extreme observations, the median is a more meaningful measure in as much as it is
not affected by these extreme values.
2. Measures of Relative Position
➢ It is referred to as the measure of location, and considered as
the extension of the median.
➢ The Q1 is the middle point between the smallest value and the
center value, also called Q2. The Q2 is also called the median,
➢ Q3 is the middle value between the median and the highest value
of the data set.
Example 1:
Find the first, second, and third quartiles of the ages of 9 employees of a certain company. Their ages
are: 53, 45, 59, 48, 54, 46, 51, 58, and 55.
Solution:
Step 1: Arrange the data in ascending order.
45, 46, 48, 51, 53, 54, 55, 58, 59
Step 2: Select the first, second, and third quartiles value using the formula:
𝑘 𝑛+1 where: 𝑄𝑘 - quartile
𝑄𝑘 =
4 n - no. of observations
1 9+1 10 k - quartile location
→ 𝑄1 = = = 2.5
4 4
2 9+1 2 10
→ 𝑄2 = = =5
4 4
3 9+1 3 10
→ 𝑄3 = = = 7.5
4 4
45, 46, 48, 51, 53, 54, 55, 58, 59
Step 3: Identify the first, second, and third quartile values in the data set.
Since the 2.5th falls between 46 and 48; and 7.5th falls between 55 and 58, we can determine the
first and third quartiles of the data set by getting the average of the two values.
46 + 48 94
𝑄1 = = = 47
2 2
55 + 58 113
𝑄3 = = = 56.5
2 2
Therefore, 𝑄1 = 47, 𝑄2 = 53, 𝑎𝑛𝑑 𝑄3 = 56.5.
Interpretations:
1. 𝑄1 = 47 implies that one-fourth or 25% of the ages fall below 47.
3. 𝑄3 = 56.5 implies that three-fourths or 75% of the ages fall below 56.5.
B. Percentiles
➢ This divides the observation into 100 equal parts. It is used to indicate how much of the observation may
be found below.
➢ For instance, if the 30th percentile is the value, this means that 30% of the observation may fall below it.
To find the k-th percentile:
Step 1: Arrange the observations from lowest to highest.
Step 2: Compute the position using the formula:
𝑘
𝐿= ∗𝑛
100
Remarks:
1. If L is whole number, k-th is midway between L and the next value.
2. If L is not a whole number, round it up to the next integer and the value at that position
is the k-th percentile.
Example 2:
Suppose the arrayed scores of 20 students in a Math 11n exam are as follows: 80, 90,
91, 100, 120, 122, 123, 125, 201, 90, 88, 98, 130, 124, 111, 109, 140, 102, 85, 91. Solve for the
for the 40𝑡ℎ percentile, 28𝑡ℎ percentile.
Solution:
Step 1: Arrange the scores in ascending order.
80, 85, 88, 90, 90, 91, 91, 98, 100, 102, 109, 111, 120, 122, 123, 124, 125, 130, 140, 201
Step 2: Compute the position.
40
→𝐿= ∗ 20 = 8 (Remark 1)
100
28
→𝐿= ∗ 20 = 5.6 (Remark 2)
100
Therefore, the 40th percentile is the value between the 8th and 9th observation, and it is 99.
Interpretations:
1. Four-tenths or 40% of the scores fall below 99; or
2. Six-tenths or 60% of the scores are above 99.
Therefore, the 28th percentile is the 6th item in the observation, which is 91.
Interpretations:
➢ The following formula is used to compute the z-score for a data value 𝑥̄ in a
sample.
𝑥̄ − x
𝑧=
𝑠
Example 3:
The monthly expenditures of a large group of households are normally distributed
with a mean of P48,700 and a standard deviation of P10,400. What is the z-value of
monthly expenditures of P59,400 and P38,300?
Solution:
Let x = 48,700 and 𝑠 = 10,400. Then, using the formula of z, determine z-values for
the two x values. We have:
𝑥̄ − x 59,400 − 48,700
For 𝑥̄ = 59,400: 𝑧= = = 1.00
𝑠 10,400
𝑥̄ − x 38,300 − 48,700
For 𝑥̄ = 38,300: 𝑧= = = - 1.00
𝑠 10,400
The 𝑧 of 1.00 indicates that a monthly expenditure of P59,400 for households is one
standard deviation above the mean, and a 𝑧 of −1.00 shows that a P38,300 monthly
expenditure is one standard deviation below the mean. Note that both monthly household
expenditures (P59,400 and P38,300) are the same distance (P10,400) from the mean.
3. Common Measures of Variation
➢ Measures of variation or dispersion tell us how to scatter or spread out a distribution is. One measure of
dispersion is the range.
A. Range
➢ Probably, the simplest and easiest way to determine the measure of dispersion is the range. The range
(𝑅) is the difference between the largest and the smallest in the given set of data.
Example
1: Data Set Values Mean
A 5, 5, 5, 5, 5 5
B 3, 4, 5, 6, 7 5
C 4, 4, 5, 6, 6 5
Given the data sets A, B, and C, we find the following:
𝑅𝐴 = 5 − 5 = 0
𝑅𝐵 = 7 − 3 = 4
𝑅𝐶 = 6 − 4 = 2
Note:
➢ Range of zero simply means that all the values in the data set are the same.
There is no variability in the values, or the variable under consideration is a
constant for this data set.
➢ The larger is the difference between the two extreme values; the larger is the
range.
➢ Comparing the three data sets with respect to variability based on the range,
we can say that while data set A is perfectly homogeneous, data set B is the
most heterogeneous. Data set C ranks second to data set B in terms of
variability.
➢ Although the range is easily computed, it is not considered to be the best
measure of dispersion because it involves only the two extreme values. It
does not tell anything about the remaining values in a set of data.
B. Interquartile Range
➢ Interquartile range is also called the midspread. It is the difference between the 75th and 25th percentile or
between the upper and lower quartile. It is denoted by IQR and computed as 𝐼𝑄𝑅 = 𝑄3 − 𝑄1.
Example 2:
Given: 4, 5, 8, 9, 10, 11, 15 → 𝑛 = 7
Solution:
1 7 + 1 𝑡ℎ
𝑄1 : → 2𝑛𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛
4
: 2𝑛𝑑
𝑄1 = 5
3 7 + 1 𝑡ℎ
𝑄3 : → 6𝑡ℎ 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛
4
: 6𝑡ℎ
𝑄3 = 11
Therefore, 𝐼𝑄𝑅 = 11 − 5 = 6.
C. Absolute Deviation
➢ The absolute deviation (AD) is the average of the absolute deviation from the central point
or the average of the average distance between each data value and the mean.
➢ This is best used when the median is the appropriate measure of central tendency (in the
presence of extreme values/skewed distributions).
➢ The formula for finding the absolute deviation is:
∑ 𝑥̄ − 𝑚𝑒𝑎𝑛
𝐴𝐷 =
𝑛
Example 3:
Consider the number of blender units sold by a store for one week: 5, 4, 2, 10, 8, 9, 6, 12.
Solution:
Step 1: Arrange the values from lowest to highest.
2, 4, 5, 6, 8, 9, 10, 12
Step 2: Compute the mean and the absolute deviation of each value from the mean.
Unit Sold (𝑥̄) 𝑥̄ − 𝑚𝑒𝑎𝑛
2 5
4 3
5 2 Step 3: Compute the absolute deviation using the
6 1 formula:
8 1
∑ 𝑥̄ − 𝑚𝑒𝑎𝑛
9 2 𝐴𝐷 =
𝑛
10 3
22
12 5 𝐴𝐷 =
8
𝐴𝐷 = 2.75
∑ 𝑥̄ 56
x= = =7 𝑥̄ − 𝑚𝑒𝑎𝑛 = 22
𝑛 8
D. Variance and Standard Deviation
➢ One of the most widely used measures of dispersion is the standard deviation. The more
spread apart the data, the higher the deviation.
➢ The variance of a set of numbers is the mean of the squared deviations of these
numbers from their mean.
➢ To facilitate calculations, we have the following formula to find the variance for
ungrouped data:
∑ 𝑥̄ − x 2
2
𝑠 =
𝑛−1
Example 4:
Consider the following daily rates of a sample of eight employees at GMS Inc.: P550,
P420, P560, P500, P700, P670, P860, P480. Find the variance and standard deviation.
Solution:
Step 1: Compute the mean of the data set.
∑ 𝑥̄
x=
𝑛
4,740
=
8
= 592.5
Step 2: Calculate the mean and the deviations of each item from the mean.
2
Daily Rate (𝑥̄) 𝑥̄ − x 𝑥̄ − x Step 3: Solve for the variance and standard deviation.
550 −42.5 1,806.25
420 −172.5 29,756.25 ∑ 𝑥̄ − x 2
2 142,950
560 −32.5 1,056.25
𝑠 = = = 20,421.43
𝑛−1 8−1
500 −92.5 8,556.25
700 107.5 11,556.25
∑ 𝑥̄ − x 2
670 77.5 6, 006.25 𝑠= = 20,421.43 = 142.90
860 267.5 71,556.25
𝑛−1
480 −112.5 12,656.25
2
∑ 𝑥̄ = 4,740 ∑ 𝑥̄ − x = 0 ∑ 𝑥̄ − x = 142,950
Generalization:
1. What are the measures of central tendency?