0% found this document useful (0 votes)
56 views51 pages

Lesson 3.2 Measures of Central Tendency Position and Variation

Uploaded by

Nics b0rja
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
56 views51 pages

Lesson 3.2 Measures of Central Tendency Position and Variation

Uploaded by

Nics b0rja
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 51

Activity

Happy Care

Sad Next

Next
E N M A - MEAN
BACK

BACK
I S O P O I N T - POSITION
BACK

BACK
I O N R A V A I T - VARIATION

BACK

BACK
What do you think is
our topic for today?
Lesson 3.2:
Measures of Central
Tendency, Position,
and Variation
Objectives:
Find the mean, median, and
01 mode of a given data set. 02 Interpret data using the
measure of central tendency.

03 Find the range of a data


set. 04 Find the interquartile range.
Objectives:
Find the variance and Find the absolute
05 standard deviation. 06 deviation.

Find the first, second, Identify the position of the


07 and third quartiles. 08 data value in a data set using
percentiles.
1. Measures of Central Tendency

➢ It represents the center point or typical value of a set of data.

➢ In layman’s terms, a measure of central tendency is an average.

➢ In statistics, the three most common measures value of central


tendency are the mean, median, and mode.
A. Mean or Arithmetic Mean
➢ The most frequently used measure of central tendency. The mean is the only common
measure in which all values play an equal role, meaning, to determine its value, we need
to consider all the values of any data set.

➢ To calculate: just add up all the numbers, then divide by how many numbers there are.

➢ The mean is denoted by 𝐱̄ and can be computed using the formula:

∑ 𝑥̄
𝑥̄ =
𝑛

where: x = the value of an observation, n = the total no. of observations


Properties of Mean:
1. A set of data has only one mean.
2. Mean can be applied for interval and ratio data.
3. All values in the data set are included in computing the
mean.
4. The mean is very useful in comparing two or more data sets.
5. Mean is affected by the extremely small or large values in
the data set.
6. Mean is most appropriate in symmetrical data.
Example 1: Solve for the mean of the set 1, 5, 3, 9, 7.

Solution:
∑ 𝑥̄
𝑥̄ =
𝑛
Example 1: Solve for the mean of the set 1, 5, 3, 9, 7.

Solution:
∑ 𝑥̄
𝑥̄ =
𝑛
1+ 5+ 3+ 9+7
=
5
25
=
5
Thus, the mean of the given set is 5.
𝑥̄ = 5
Example 2:
Suppose a basketball team has 15 players, and the heights (in cm) are as follows:
180 204 188 184 189 190 190 200
193 186 186 187 182 201 187

Solution:
∑ 𝑥̄
𝑥̄ =
𝑛
Example 2:
Suppose a basketball team has 15 players, and the heights (in cm) are as follows:
180 204 188 184 189 190 190 200
193 186 186 187 182 201 187

Solution:
∑ 𝑥̄
𝑥̄ =
𝑛
2847
=
15
Thus, the average height of basketball
𝑥̄ = 189.8 cm players is 189.8 cm.
Weighted Mean
➢ It is particularly useful when various classes or groups contribute
differently to the total.

➢ To calculate: multiply each value by its corresponding weight and


dividing by the sum of the weights.

➢ The formula for finding the weighted mean is:

𝑥̄1 𝑤1 + 𝑥̄2 𝑤2 + 𝑥̄3 𝑤3 + ⋯ + 𝑥̄𝑛 𝑤𝑛


x𝑤 =
𝑤1 + 𝑤2 + 𝑤3 + ⋯ + 𝑤𝑛
Example 3:
Joel’s first quarter grade is shown in the table below. Use the weighted
mean formula to find Joel’s GPA for the first quarter.
Subjects English Math Filipino Science MAPEH Soc Stud
Grade 90 87 88 93 95 96
No. of Units 3 3 3 3 2 1

Solution:
𝑥̄1 𝑤1 + 𝑥̄2 𝑤2 + 𝑥̄3 𝑤3 + ⋯ + 𝑥̄𝑛 𝑤𝑛
x𝑤 =
𝑤1 + 𝑤2 + 𝑤3 + ⋯ + 𝑤𝑛
Example 3:
Joel’s first quarter grade is shown in the table below. Use the weighted
mean formula to find Joel’s GPA for the first quarter.
Subjects English Math Filipino Science MAPEH Soc Stud
Grade 90 87 88 93 95 96
No. of Units 3 3 3 3 2 1

Solution:
𝑥̄1 𝑤1 + 𝑥̄2 𝑤2 + 𝑥̄3 𝑤3 + ⋯ + 𝑥̄𝑛 𝑤𝑛
x𝑤 =
𝑤1 + 𝑤2 + 𝑤3 + ⋯ + 𝑤𝑛
90 (3) + 87 (3) + 88 (3) + 93 (3) + 95 (2) + 96 (1)
=
3+ 3+ 3+ 3+2+ 1
= 1,360
15
x𝑤 = 90.67 Therefore, Joel’s GPA for the first quarter is 90.67.
B. Median
➢ The median (𝑀𝑑 ) is the middle value in a set of observations arrange
from highest to lowest or vice versa.

➢ To get the value of the median, we arrange the observations from


highest to lowest or from lowest to highest.

➢ The observation in the middle is considered as the median.


Properties of Median:
1. The median is unique; there is only one median for a
given set of data.
2. Median is not affected by the extremely small or large
values.
3. Median can be applied for ordinal, interval, and ratio
data.
4. Median is most appropriate in skewed data.
Example 4:

Consider again the heights (in cm) of 15 basketball players listed. Find the median
height of the players.
180 204 188 184 189 190 190 200
193 186 186 187 182 201 187

Solution:
First, arrange the heights from the shortest to the tallest and pick the
height of the middle player.
Example 4:

Consider again the heights (in cm) of 15 basketball players listed. Find the median
height of the players.
180 204 188 184 189 190 190 200
193 186 186 187 182 201 187

Solution:
First, arrange the heights from the shortest to the tallest and pick the
height of the middle player.

180 182 184 186 186 187 187 188 189 190 190 193 200 201 204

Since there are 15 players in the team, the eighth observation is the median.
Remarks:
⮚ If 𝑛 is odd, the median of the observation corresponds to the
𝑛+1
𝑡ℎ observation (middle-ranked).
2

⮚ If 𝑛 is even, the median is the average of the two


middle-ranked values.
𝑛+1
Median (Rank Value) = 2
Example 5:
The daily rates of the sample of eight employees at GMS Inc. are 𝑃550, 𝑃420, 𝑃560,
𝑃500, 𝑃700, 𝑃670, 𝑃860, 𝑃480. Find the median daily rate of the employees.
Solution:
Step 1: Arrange the data in order.
𝑃420, 𝑃480, 𝑃500, 𝑃550, 𝑃560, 𝑃670, 𝑃700, 𝑃860
Step 2: Select the middle-rank value.
𝑛+1 8+1 9
Median (Rank Value) = = = = 4.5
2 2 2
Step 3: Identify the median in the data set.
𝑃420, 𝑃480, 𝑃500, 𝑃550, 𝑃560, 𝑃670, 𝑃700, 𝑃860

550 + 560 1,110


𝑀𝑒𝑑𝑖𝑎𝑛 = = = 555
2 2
Therefore, the median daily rate is P555.
C. Mode
➢ The mode is the value that occurs the most frequently in a given data set. Like
the median and unlike the mean, extreme values in a data set do not affect the
mode.
Types of Mode
✓ Unimodal- A data set that has only one value that occurs the greatest frequency.
✓ Bimodal- If the data has two values with the same greatest frequency, both
values are considered the mode, and the data set.
✓ Multimodal- If a data set has more than two modes.

✓ No mode- A data set values have the same number frequency.


Properties of Mode:
1. The mode is found by locating the most frequently occurring
value.
2. The mode is the easiest average to compute.
3. There can be more than one mode or even no mode in any given
set.
4. The mode is not affected by extreme small or large values.
5. The mode can be applied for nominal, ordinal, interval, and ratio
data.
Example 6:
The following data represent the total unit sales for Smartphones from a sample
of 10 Communication Centers for the month of August:
15, 17, 10, 12, 13, 10, 14, 10, 8, 𝑎𝑛𝑑 9. Find the mode.
Solution:
The ordered array for these data is:
8, 9, 10, 10, 10, 12, 13, 14, 15, 17.
Because 10 appears 3 times, more times than the other values, therefore, the
mode is 10. The data set is unimodal.
Example 7:
Compute for the following data that represents the number of LED
television manufactured for the past three weeks:
20, 18, 19, 25, 20, 21, 20, 25, 30, 29, 28, 29, 25, 25, 27, 26, 22, 𝑎𝑛𝑑 20.
Find the mode of the given set.
Solution:
The ordered array for these data is:
18, 19, 20, 20, 20, 20, 21, 22, 25, 25, 25, 25, 26, 27, 28, 29, 29, 30

There are two modes 20 and 25, since each of these values occurs four
times. The data set is bimodal.
Comparison of the Mean, Median, and Mode
➢ When the distribution of the observations is fairly symmetric or when
there are no extreme observations, the mean is the most meaningful
measure of central tendency.

➢ In the case of a perfectly symmetric distribution, all three measures are


equal.
➢ When extreme observations are found on the right end of the distribution, these extreme values pull the
value of the mean to the right; therefore, the relationship between the three is illustrated as: 𝑀𝑜 < 𝑀𝑑 < x
(positively skewed distribution).

➢ If extremely low observations are present, these observations pull the mean to the left, and the
relationship between the three measures becomes: x < 𝑀𝑑 < 𝑀𝑜 (negatively skewed distribution).

➢ With the presence of extreme observations, the median is a more meaningful measure in as much as it is
not affected by these extreme values.
2. Measures of Relative Position
➢ It is referred to as the measure of location, and considered as
the extension of the median.

➢ It talks about the position/location of the value relative to the other


values in the data set.

➢ The common measures of position are quartiles, percentiles,


standard scores or z-scores, box-and-whisker plot.
A. Quartiles
➢ This measure divides the observation into four equal parts.

➢ The Q1 is the middle point between the smallest value and the
center value, also called Q2. The Q2 is also called the median,

➢ Q3 is the middle value between the median and the highest value
of the data set.
Example 1:
Find the first, second, and third quartiles of the ages of 9 employees of a certain company. Their ages
are: 53, 45, 59, 48, 54, 46, 51, 58, and 55.
Solution:
Step 1: Arrange the data in ascending order.
45, 46, 48, 51, 53, 54, 55, 58, 59
Step 2: Select the first, second, and third quartiles value using the formula:
𝑘 𝑛+1 where: 𝑄𝑘 - quartile
𝑄𝑘 =
4 n - no. of observations
1 9+1 10 k - quartile location
→ 𝑄1 = = = 2.5
4 4
2 9+1 2 10
→ 𝑄2 = = =5
4 4

3 9+1 3 10
→ 𝑄3 = = = 7.5
4 4
45, 46, 48, 51, 53, 54, 55, 58, 59
Step 3: Identify the first, second, and third quartile values in the data set.
Since the 2.5th falls between 46 and 48; and 7.5th falls between 55 and 58, we can determine the
first and third quartiles of the data set by getting the average of the two values.

46 + 48 94
𝑄1 = = = 47
2 2

55 + 58 113
𝑄3 = = = 56.5
2 2
Therefore, 𝑄1 = 47, 𝑄2 = 53, 𝑎𝑛𝑑 𝑄3 = 56.5.

Interpretations:
1. 𝑄1 = 47 implies that one-fourth or 25% of the ages fall below 47.

2. 𝑄2 = 53 implies that one-half or 50% of the ages fall below 53.

3. 𝑄3 = 56.5 implies that three-fourths or 75% of the ages fall below 56.5.
B. Percentiles
➢ This divides the observation into 100 equal parts. It is used to indicate how much of the observation may
be found below.

➢ For instance, if the 30th percentile is the value, this means that 30% of the observation may fall below it.
To find the k-th percentile:
Step 1: Arrange the observations from lowest to highest.
Step 2: Compute the position using the formula:
𝑘
𝐿= ∗𝑛
100

Remarks:
1. If L is whole number, k-th is midway between L and the next value.
2. If L is not a whole number, round it up to the next integer and the value at that position
is the k-th percentile.
Example 2:
Suppose the arrayed scores of 20 students in a Math 11n exam are as follows: 80, 90,
91, 100, 120, 122, 123, 125, 201, 90, 88, 98, 130, 124, 111, 109, 140, 102, 85, 91. Solve for the
for the 40𝑡ℎ percentile, 28𝑡ℎ percentile.
Solution:
Step 1: Arrange the scores in ascending order.
80, 85, 88, 90, 90, 91, 91, 98, 100, 102, 109, 111, 120, 122, 123, 124, 125, 130, 140, 201
Step 2: Compute the position.
40
→𝐿= ∗ 20 = 8 (Remark 1)
100
28
→𝐿= ∗ 20 = 5.6 (Remark 2)
100

Therefore, the 40th percentile is the value between the 8th and 9th observation, and it is 99.
Interpretations:
1. Four-tenths or 40% of the scores fall below 99; or
2. Six-tenths or 60% of the scores are above 99.

Therefore, the 28th percentile is the 6th item in the observation, which is 91.
Interpretations:

1. 28% of the scores fall below 91; or


2. 72% of the scores are above 91.
C. Z-Scores
➢ A z-score measures the distance between an observation and the mean,
measured in units of standard deviation.
➢ Z-score is used to know the position of one observation relative to others
in a set of data.

➢ The following formula is used to compute the z-score for a data value 𝑥̄ in a
sample.
𝑥̄ − x
𝑧=
𝑠
Example 3:
The monthly expenditures of a large group of households are normally distributed
with a mean of P48,700 and a standard deviation of P10,400. What is the z-value of
monthly expenditures of P59,400 and P38,300?
Solution:
Let x = 48,700 and 𝑠 = 10,400. Then, using the formula of z, determine z-values for
the two x values. We have:
𝑥̄ − x 59,400 − 48,700
For 𝑥̄ = 59,400: 𝑧= = = 1.00
𝑠 10,400

𝑥̄ − x 38,300 − 48,700
For 𝑥̄ = 38,300: 𝑧= = = - 1.00
𝑠 10,400

The 𝑧 of 1.00 indicates that a monthly expenditure of P59,400 for households is one
standard deviation above the mean, and a 𝑧 of −1.00 shows that a P38,300 monthly
expenditure is one standard deviation below the mean. Note that both monthly household
expenditures (P59,400 and P38,300) are the same distance (P10,400) from the mean.
3. Common Measures of Variation
➢ Measures of variation or dispersion tell us how to scatter or spread out a distribution is. One measure of
dispersion is the range.
A. Range
➢ Probably, the simplest and easiest way to determine the measure of dispersion is the range. The range
(𝑅) is the difference between the largest and the smallest in the given set of data.
Example
1: Data Set Values Mean
A 5, 5, 5, 5, 5 5
B 3, 4, 5, 6, 7 5
C 4, 4, 5, 6, 6 5
Given the data sets A, B, and C, we find the following:
𝑅𝐴 = 5 − 5 = 0
𝑅𝐵 = 7 − 3 = 4

𝑅𝐶 = 6 − 4 = 2
Note:
➢ Range of zero simply means that all the values in the data set are the same.
There is no variability in the values, or the variable under consideration is a
constant for this data set.
➢ The larger is the difference between the two extreme values; the larger is the
range.
➢ Comparing the three data sets with respect to variability based on the range,
we can say that while data set A is perfectly homogeneous, data set B is the
most heterogeneous. Data set C ranks second to data set B in terms of
variability.
➢ Although the range is easily computed, it is not considered to be the best
measure of dispersion because it involves only the two extreme values. It
does not tell anything about the remaining values in a set of data.
B. Interquartile Range
➢ Interquartile range is also called the midspread. It is the difference between the 75th and 25th percentile or
between the upper and lower quartile. It is denoted by IQR and computed as 𝐼𝑄𝑅 = 𝑄3 − 𝑄1.
Example 2:
Given: 4, 5, 8, 9, 10, 11, 15 → 𝑛 = 7
Solution:
1 7 + 1 𝑡ℎ
𝑄1 : → 2𝑛𝑑 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛
4
: 2𝑛𝑑
𝑄1 = 5

3 7 + 1 𝑡ℎ
𝑄3 : → 6𝑡ℎ 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛
4

: 6𝑡ℎ
𝑄3 = 11
Therefore, 𝐼𝑄𝑅 = 11 − 5 = 6.
C. Absolute Deviation
➢ The absolute deviation (AD) is the average of the absolute deviation from the central point
or the average of the average distance between each data value and the mean.

➢ This is best used when the median is the appropriate measure of central tendency (in the
presence of extreme values/skewed distributions).
➢ The formula for finding the absolute deviation is:
∑ 𝑥̄ − 𝑚𝑒𝑎𝑛
𝐴𝐷 =
𝑛
Example 3:
Consider the number of blender units sold by a store for one week: 5, 4, 2, 10, 8, 9, 6, 12.
Solution:
Step 1: Arrange the values from lowest to highest.
2, 4, 5, 6, 8, 9, 10, 12
Step 2: Compute the mean and the absolute deviation of each value from the mean.
Unit Sold (𝑥̄) 𝑥̄ − 𝑚𝑒𝑎𝑛

2 5
4 3
5 2 Step 3: Compute the absolute deviation using the
6 1 formula:
8 1
∑ 𝑥̄ − 𝑚𝑒𝑎𝑛
9 2 𝐴𝐷 =
𝑛
10 3
22
12 5 𝐴𝐷 =
8
𝐴𝐷 = 2.75
∑ 𝑥̄ 56
x= = =7 ෍ 𝑥̄ − 𝑚𝑒𝑎𝑛 = 22
𝑛 8
D. Variance and Standard Deviation
➢ One of the most widely used measures of dispersion is the standard deviation. The more
spread apart the data, the higher the deviation.

➢ The variance of a set of numbers is the mean of the squared deviations of these
numbers from their mean.

➢ Standard deviation is the square root of variance.

➢ To facilitate calculations, we have the following formula to find the variance for
ungrouped data:
∑ 𝑥̄ − x 2
2
𝑠 =
𝑛−1
Example 4:
Consider the following daily rates of a sample of eight employees at GMS Inc.: P550,
P420, P560, P500, P700, P670, P860, P480. Find the variance and standard deviation.
Solution:
Step 1: Compute the mean of the data set.
∑ 𝑥̄
x=
𝑛

550 + 420 + 560 + 500 + 700 + 670 + 860 + 480


=
8

4,740
=
8

= 592.5
Step 2: Calculate the mean and the deviations of each item from the mean.

2
Daily Rate (𝑥̄) 𝑥̄ − x 𝑥̄ − x Step 3: Solve for the variance and standard deviation.
550 −42.5 1,806.25
420 −172.5 29,756.25 ∑ 𝑥̄ − x 2
2 142,950
560 −32.5 1,056.25
𝑠 = = = 20,421.43
𝑛−1 8−1
500 −92.5 8,556.25
700 107.5 11,556.25
∑ 𝑥̄ − x 2
670 77.5 6, 006.25 𝑠= = 20,421.43 = 142.90
860 267.5 71,556.25
𝑛−1
480 −112.5 12,656.25
2
∑ 𝑥̄ = 4,740 ∑ 𝑥̄ − x = 0 ∑ 𝑥̄ − x = 142,950
Generalization:
1. What are the measures of central tendency?

2. How about the measures of relative position?

3. What are the common measures of variation?


THANK YOU!
Slidesgo

CREDITS: This presentation template was created by Slidesgo,


including icons by Flaticon and infographics & images by Freepik
Flaticon Freepik

You might also like