0% found this document useful (0 votes)
145 views

Module 4 PDF

This document discusses measures of central tendency and location, including the mean, median, and mode. It provides formulas and examples for calculating each measure from both grouped and ungrouped data. The mean is the average value and is affected most by outliers. The median divides the data in half and is less affected by outliers. The mode is the most frequent value. Measures of central tendency provide a representative central value for a data set.

Uploaded by

yoonginism
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
145 views

Module 4 PDF

This document discusses measures of central tendency and location, including the mean, median, and mode. It provides formulas and examples for calculating each measure from both grouped and ungrouped data. The mean is the average value and is affected most by outliers. The median divides the data in half and is less affected by outliers. The mode is the most frequent value. Measures of central tendency provide a representative central value for a data set.

Uploaded by

yoonginism
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 15

Lesson

Measures of Central Tendency


4 and Other Measures of Location

4.1 Measures of the Central Tendency


The measure of central tendency refers to measures, which describes the middle
or the center of the data. It is also a value used to represent a set of scores or
frequencies.

Mean - the most popular and widely used measure of central tendency,
commonly called the average. It is also used to identify the “center” of the data.
Uses of the Mean
1. provides a summary of the data
2. provides a common denominator in comparing two groups of
data

Formula for Ungrouped Data:


∑𝒙
Mean (X)=
𝒏

Where x is the sum of quantities divided by the number of


quantities under consideration.
Examples:
1. Find the average of Beatrice’s grades in Math 12.
First quarter – 94
Second quarter – 98
Third quarter – 97
Fourth quarter – 99

Solution :
94+98+97+99
X=
4
= 97

2. The change in fixed assets of 10 companies are as follows in percent.


+1.2, -1.5, +3.4, +2.1, -2.7, +4.1, -3.3, +3.8, +1.9, -3.6

The mean percent change in fixed assests of the companies is

(1.2 − 1.5 + 3.4 + 2.1 − 2.7 + 4.1 − 3.3 + 3.8 + 1.9 − 3.6)
= 0.54
10

Computation of the Mean from Grouped Data


For group data or from a frequency distribution table, the mean is computed by
the formula,

∑𝑓𝑖 𝑥𝑖
x̅ =
𝒏
where
𝑋𝑖 – stands for the class mark or midpoint
𝑓𝑖 - is the corresponding frequency

Example. Consider the frequency distribution of 50 scores

CI f CM (x) 𝑓𝑖 𝑥𝑖
15-21 2 18 36
22-28 2 25 50
29-35 4 32 128
36-42 8 39 312
43-49 11 46 506
50-56 9 53 477
57-63 4 60 240
64-70 6 67 402
71-77 2 74 148
78-84 2 81 162
C=7 n = 50 ∑ 𝑓𝑖 𝑥𝑖 = 2461

To apply the formula, the following steps are involved:


Step 1. Get the class mark of each class interval.
Step 2. Multiply the class mark by its corresponding frequency to obtain the 𝑓𝑖 𝑥𝑖
The product of column 2 and column 3
Step 3. Sum up the 𝑓𝑖 𝑥𝑖 column to get ∑𝑓𝑖 𝑥𝑖
Step 4. Substitute the values of ∑𝑓𝑖 𝑥𝑖 and n in the formula.
Thus,

∑𝑓𝑖 𝑥𝑖 2461
x̅ = 𝒏
= 50
= 49.22

Weighted Average:
Weighted average is a computation wherein each data in the set is
multiplied by a predetermined weight before the ultimate calculation is done. It is
more precise than a simple average wherein all numbers in a data set are
apportioned an equal weight.

Example: Given the grades of the two employees. Who among them is qualified
for:
a) Full scholarship – with GWA of 1.5 and above
b) Partial Scholarship – with GWA of 1.6 and below
Median - divides the ordered observations into two equal parts.
1. If the number of observations n is odd, the median is the middle position in the
ordered observations.
2. If the number of observations n is even, the median is the average of the two
middle observations in the ordered observations.
Examples:
1. The following are the total receipts of 7 mining companies (in million pesos)
1.2, 4.5, 6.5, 7.2, 10.4, 12.5, 50.6 The median is 7.2.
2. The following are the number of years of operation of
8 mining companies:
8, 10, 10, 11, 16, 17, 17, 18 The median is (11+16) / 2 = 13.5

Computation of the Median for Grouped Data


For grouped data presented in a frequency distribution, the formula used for the
median is
Steps in Calculating the Median
Step1. Compute the cumulative frequency (cf)
𝑛
Step2. Solve for , which is 25. The 25th item is the median class. It is class
2
where the median lies. Our median class 43-49 because it is the class interval
which contains the 25th item is 27. Across 27 is the median class, 43-49.
Step3. Get the <cf of the class immediately below the median class which is 16.
This is the sum of all the frequencies below the median class.
Step4. Determine f, or the frequency of the median class which is 11.
Step5. Determine the class size which is 7.
Step6. Substitute all the needed values in the formula.

The median is computed as follows:


𝑛
−<𝑐𝑓𝑏
md = 𝑙𝑚 + ( 2 )𝑐
𝑓𝑚
25−16
md= 42.5 + ( )7
11
9
md= 42.5 + ( )7
11
md= 42.5 + ( 0.818 )7
md= 42.5 + 5.727
md= 48.227
Some Characteristics of the Median
1. The score or class in a distribution below the median is 50% of the
scores fall the above where the other 50% lies.
2. The median is a positional measure.
3. The median is affected by the position of each item in the series but
not by the value of each item. This means that extreme values
affect the median less than the arithmetic mean.
4. Used when the data are ordinal.

The Mode - is the most frequent observation in a data set, the mode is determined
by counting the frequency of each value and finding the value with the highest
frequency of occurrence
Examples: Find the mode of the following data:
1. The following are waistlines of 12 males:
25, 26, 29, 30, 30, 29, 30, 30, 30, 31, 34, 36
Answer: The mode is 30.

2. Given the number of children of 20 male


respondents:2,5,5,2,2,5,1,3,5,4,2,5,5,2,2,5,5,2,2,1
Answer: 2 and 5
3. Given the following data:
1,2,3,3,2,1,2,3,1,4,4,5,5,1,2,3,4,5,4,5
Answer: no mode

Computation of the Mode for Grouped Data


For grouped data, the mode is the midpoint or class mark of the class
interval with the highest frequency. This class interval is called the modal class.
The mode obtained in this manner is called a crude mode or empirical mode it
is just a rough estimate of the actual mode. To compute accurately, we used the
formula
𝑓𝑚− 𝑓1
mo = (𝑙𝑚𝑜 + (2𝑓 )c
𝑚𝑜 −𝑓1 −𝑓2

where
𝑙𝑚𝑜 = lower class boundary of the modal class. The class interval with the highest
frequency.
𝑓𝑚𝑜 =frequency of the modal class
𝑓1= frequency preceding the modal class
𝑓2 = frequency following the modal class
c = class size of the modal class

Example.
CI f
15-21 2
22-28 2
29-35 4
36-42 8𝑓1
Modal class 43-49 11 𝑓𝑚𝑜
50-56 9𝑓2
57-63 4
64-70 6
71-77 2
78-84 2
c=7 n = 50

Calculating the mode


𝑓𝑚− 𝑓1
mo = (𝑙𝑚𝑜 + (2𝑓 )c
𝑚𝑜 −𝑓1 −𝑓2

Substituting, we get
11−8
mo = (42.5 + (2( 11)−8−9)7

= 42.5 + 4.2
= 46.7
Some Characteristics of the Mode
1. For ungrouped data, it requires no calculation only counting.
2. It can be determined for qualitative as well as quantitative data.
3. Some distributions have one or more modes, but some have none.
4. Used quick approximation if central tendency is desired.
5. Used when the data are nominal.

Ex. 1
CI f
62 – 65 5
66 – 69 10
70 – 73 8
74 – above 2
Ex. 2
CI f
58 – 61 5
62 – 65 10
66 – 69 10
70 – 73 8
74 – above 2

Skewness in Relationship to Central Tendency

The measure of central tendency helps to describe the characteristics of


a given distribution. If the value of the mean, median and mode are all equal, then
they are all represented by a simple point in a distribution as shown in Figure 4.1.
The distribution in such case in normal is symmetrical, the figure shows that the
sides of its curve have the same or identical shape when folded vertically at the
center. If the values of the mean, median, and mode, however, are not the same,
the curve or distribution is asymmetrical or skewed as shown in Figure 4.2.
Figure 4.2. shows to types of skewed distribution. Figure 4.2.(A) is
positively skewed or skewed to the right, thus the curve has a heavy right tail. This
distribution implies that there are more high values, so the scores accumulate at
the right. The mean, therefore, is pulled into the tail of distribution and its value is
higher than the median. Remember that the mean is easily affected by extreme
cases which is positively skewed distribution are found to the right. Furthermore,
the mean is also found to the right of the mode since skewness in this case is
approximated b the distance of the mean from the mode.

Figure 4.2. (B) illustrates a negatively skewed distribution, thus the curve has a
heavy left nail. This type of distribution implies that there are more low scores, so
the values accumulate at the left. The mean is, therefore, pulled into the tail of
the curve which is found at the left. If the value of mean is lower than the median
extreme cases are found at the left end of the distribution. Whether positively or
negatively skewed, the area under the curve to left of the median is equal to the
right.
Other Measure of Location (Quantiles)

The idea of the median can be extended to the distribution of quantiles.


Quantiles are values that divide the distribution into a given number of equal parts.
The median divides the distribution into to equal parts. Other types of quantiles
include that following: quartiles, deciles, and percentiles or centiles.
The quartiles divide the distribution into four equal arts. The quartiles are
𝑄1 (first quartile), 𝑄2 (second quartile), 𝑄3 (third quartile), and 𝑄4 (fourth quartile),
The deciles divide the distributions into 10 equal parts. The deciles are 𝐷1
(first decile), 𝐷2 (second decile), 𝐷3 (third decile), 𝐷4 (fourth decile) and so on up
to 𝐷10 ( tenth decile).
The percentile or centiles divide the distribution into one hundred equal
parts. They are 𝑃1 (first percentile), 𝑃2 (second percentile), and so on up to 𝑃100 .
Notice that 𝑃50=𝑄2 = Median; 𝑃20 = 𝐷2 : 𝐶20 =𝐷2 .
Quartiles are computed if the values in the distribution are already
arranged according to magnitude for ungrouped data. For grouped data, the
computation of the quantiles is similar to that of the median.
To understand the concept of quantiles, let us take the percentile or
centiles. A percentile is a point in a distribution below which a given percent of
cases lie. For example, the 60th percentile or is the point or score in a distribution
below which 60% of the cases lie. If the 60th percentile is equal to 75, that is in a
distribution of scores given to freshmen entrance in a certain college, a student
who got 75 in the admission exam surpassed 60% of the cases with only 40%
examinees higher than his grade.
Computation of the Quartiles for Ungrouped Data

To solve any quantile, changes it first to percentile and follow the steps below.

Step1. Arrange first the scores according to magnitude or size.

Step2. Compute the position of the given percentile in the distribution using the
formula P(n+1) 100 where P is the given percent and n is the number of cases.

Step3. Starting from the lowest score, locate the score corresponding to the
obtained position in the distribution.

Step4. Interpolate to get the score if the obtained position from step 2 is not
exact.

Example: Find the 20th percentile or of the following scores;

25 22 20 16 17 12 8 6 5

Solution: Need to rank the given scores

Steps

1. Locate the position of the score corresponding to the 20 th percentile


Solution: = 2.0

2. Locate the second score from the lowest. The answer is 6.

3. Thus, the 20th percentile or = 6

This mean that 20% of the cases scored below 6.

Example: Using the following scores:

25 22 20 16 17 12 8
6 5

a. Below what score is the lower 38%.

b. Above what score is the upper 22%

c. Between what scores is the middle 50%

Solution

Ranking the score:

6 8 12 16 17 20 22
25

38(𝑛+1) 38(9+1)
A. 𝑃38= = = 3.8 this rank is between the third and fourth:
100 100

Interpolating the approximated value:

1. Value of the fourth – Value of the third = 12 – 8 = 4

2. Multiply the difference by the .8 = 4(8) =3.2

3. Add the product to the value of the third = 3.2 + 8 = 11.2

Hence, the lower 38% is below 11.2

B. The upper 22% can be derived by


1. 100 – 22 = 78

78(𝑛+1) 78(9+1)
2. Solve for 𝑃38= = = 7.8
100 100

3. 7.8 is between the 7th and 8th

4. Interpolating: Value of the 8th – Value of the 7th = 22-20 2

5. 2 ( .8) = 1.6

6. 1.6 + value 7th = 1.6 + 20 = 21.6

Hence, the upper 22% is above 21.6.

C. The middle 50% can be derived by 100 – 50 = 50/2 = 25 𝑃25 --->

and the second score 25 + 50 = 75 𝑃75 --->

25(𝑛+1) 25(9+1) 25(𝑛+1) 25(9+1)


D. 𝑃25= = = 2.5 and 𝑃75= = = = 7.5
100 100 100 100

2. For case where the decimal part is 0.5 the average can be obtained for the two
value. For instance, 2.5 rank is between second and third, thus the value can
obtained by

value (third) + value (second) 8+6


𝑃25 = = =7
2 2

value (eight) + value (seventh) 22+20


𝑃75 = = = 21
2 2

Hence, the middle 50% is between 7 and 21.


Computation of the Quantiles for grouped data:
𝒊𝒏
𝟏𝟎𝟎
− <𝒄𝒇𝒃𝒊
𝑷𝒊 = 𝒍𝒊 +( )𝒄
𝒇𝒊
𝒊𝒏
− <𝒄𝒇𝒃𝒊
𝟏𝟎
𝑫𝒊 = 𝒍𝒊 +( )𝒄
𝒇𝒊
𝒊𝒏
𝟒
− <𝒄𝒇𝒃𝒊
𝑸𝒊 = 𝒍 𝒊 +( )𝒄
𝒇𝒊

CI f <cf

61-65 5 5

66-70 8 13

71-75 12 25

76-80 6 31

81-85 4 35

C= 5 35

1. Find the 25th percentile


2. Find the 75th percentile
3. Solve for Q3
4. Solve for D4
5. Above what score is the upper 15%

𝟐𝟓𝒏 𝟕𝟓𝒏
− <𝒄𝒇𝒃𝟐𝟓 − <𝒄𝒇𝒃𝟕𝟓
𝟏𝟎𝟎 𝟏𝟎𝟎
𝑷𝟐𝟓 = 𝒍𝟐𝟓 +( )𝒄 𝑷𝟕𝟓 = 𝒍𝟕𝟓 +( )𝒄
𝒇𝟐𝟓 𝒇𝟕𝟓
25(35) 75(35)
= 8.75 = 26.25
100 100

8.75−5 26.25−25
P 25 = 65.5 + ( )5 P 75 = 75.5 + ( )5
8 6

= 65.5 + 2.34 = 75.5 + 1.04


= 67.84 = 76.54

You might also like