0% found this document useful (0 votes)
18 views9 pages

EPSC 224 Topic 9

The document discusses measures of central tendency, specifically the mean, median, and mode, which summarize distributions of numeric values. It outlines the properties, merits, and demerits of each measure, providing examples and formulas for calculation. The choice of which measure to use depends on the context and purpose of the analysis.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views9 pages

EPSC 224 Topic 9

The document discusses measures of central tendency, specifically the mean, median, and mode, which summarize distributions of numeric values. It outlines the properties, merits, and demerits of each measure, providing examples and formulas for calculation. The choice of which measure to use depends on the context and purpose of the analysis.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

EGERTON UNIVERSITY

COLLEGE OF OPEN AND DISTANCE LEARNING


E-CAMPUS

EPSC 224: Introduction to Statistics and Research


Methods in Education

Topic 9 Handout

Copyright
Copyright© Egerton University
Published 2015
All rights reserved. No part of this publication may be reproduced, stored in a retrieval
system or transmitted in any form or by any means, electronic, mechanical, photocopying,
recording, or otherwise, without the prior written permission of the copyright owner.

"Transforming Lives through Quality Education"


Egerton University is ISO 9001:2008 Certified
Page 1 of 9
Topic 9: Measures of Central Tendency

Introduction
Imagine you have a distribution of numeric values such as the age of First Year students
that include 17, 18, 18, 19, 20, 20, 21, 17, 22, 20, 19, 18, 18, 20, 20, 40 and 40. If you are
asked to state one value that will best capture and communicate the distribution, which
value will you choose? This is the focus of discussion in this topic, the measures of
central tendency. Enjoy the discussion.

Learning Outcomes
By the end of this topic you should be able to:

 Discuss the merits and demerits of the measures of central tendency.


 Compute the different measures of central tendency for given data sets.

9.1 Measures of Central Tendency


Measures of central tendency or centrality measures are values or characteristics that
describe the average of a distribution. They are values around which other items of a
distribution congregate. They condense a huge set of numerical data into single numerical
values which are representative of the entire distribution.

9.2 The Mean


This is the average which is obtained by dividing the sum total of the scores in a
distribution by the number of scores i.e. sample size or population size. Means are
derived for interval and ratio data and they help to give the general characteristic or trend
of a sample or population.

Properties of the Mean


i. The mean is appropriate for interval data.
ii. The mean may be interpreted as a point of balance where equal weight is placed at
each measurement point.
iii. The mean is a property of all observations in a distribution.
iv. If all observations of a distribution are added, subtracted, multiplied or divided by
a constant, then the mean is also added, subtracted, multiplied or divided by the
same constant.
v. The aggregate sum of all the deviations of a given set of observations from the
mean is zero.
"Transforming Lives through Quality Education"
Egerton University is ISO 9001:2008 Certified
Page 2 of 9
Merits of the Mean

i. It is rigidly defined. Its definition is clear, unambiguous and only one.


ii. It is easy to calculate and comprehend.
iii. It is based on all observations.
iv. It is affected least by fluctuations of sampling i.e. it is a stable average.

Demerits of the Mean


i. It is very much affected by the extreme observations [outliers].
ii. It cannot be determined by a single inspection, nor can it be located graphically.
iii. It cannot be used when dealing with qualitative characteristics which cannot be
quantified. e.g. beauty, honesty, etc.
X
The mean is calculated as follows: -X = ∑
N

X = ∑X/N X - mean

∑X – summation of x

N – Sample of population size

Example
The mean of the scores 7, 7, 6, 5, 4, 4, 3 is calculated as:-

X = 7+7+6+5+4+4+3

= 40/8

=5

For grouped data the mean is calculated as: -


Σfx
Mean=
Σf

̅ = ∑ fX
= ∑fxX
fX
Or = ∑ ∑ = ∑fx f - frequency
N f

"Transforming Lives through Quality Education"


Egerton University is ISO 9001:2008 Certified
Page 3 of 9
N – Same as ∑f

x – midpoints of classes

E.g. determine the mean of the data represented in the table below.

CLASS f x f[x] c.f


65-69 1 67 67 50

60-64 3 62 186 49

55-59 4 57 228 46

50-54 7 52 364 42

45-49 9 47 423 35

40-44 11 42 462 26

35-39 8 37 296 15

30-34 4 32 128 7

25-29 2 27 54 3

20-24 1 22 22 1

∑f = 50 ∑fx =2230

Σfx 2230
Mean= = = 44.60 =
Σf 50

9.2.1 Weighted Mean


This is obtained when the relative importance of all items in a distribution is not the
same. Weighted proportion to relative importance is given to each value.
∑ Wi Xi
Weighted mean= ∑ Wi

"Transforming Lives through Quality Education"


Egerton University is ISO 9001:2008 Certified
Page 4 of 9
The overall sample means for two sets of data are based on the same sizes n1 and n2, and
n1 X1+n2 X2
means X1 and X2 X =
n1+n2

Each separate sample mean receives weight proportional to the respective sample size.

Example

The mean yearly income for 200 civil servants in Nakuru is 21,600/=, whereas the mean
yearly income for 400 municipal workers is 19,500/=. What is the overall average salary
for these public servants?

X = 200*21600+19500*400

600

= 20,200/=

9.3 The Mode


This is the most frequently occurring score in a distribution. It is the score with the
highest frequency in a distribution, also known as the typical value. In grouped data, the
mode is the midpoint of the class with the highest frequency. This class is known as the
modal class. E.g. in the above example, the modal class is 40-44, while the mode is 42.

Properties of the Mode


i. A distribution may have more than one mode. E.g. scores 3, 4, 4, 4, 5, 6, 7, 7, 7, 8,
10, 11 have two modes 4 and 7. Such a distribution is said to be ‘bimodal’.
ii. It is possible for a set of scores or measurements not to have a mode when all
scores in a group have the same frequency. In this case, the mode is not helpful in
the distribution.

Merits of the Mode


i. It is easily understood and calculated.
ii. It is not affected at all by extreme observations.
iii. It can be conveniently obtained in case of open ended classes.

Demerits of the Mode


i. It is affected greatly by the fluctuations of sampling.
ii. It is not based on all observations in a distribution.

"Transforming Lives through Quality Education"


Egerton University is ISO 9001:2008 Certified
Page 5 of 9
9.4 The Median

The median is the 50th percentile in a group of scores. It is the point below and above
which 50% of the scores fall. It divides ranked scores into two equal parts such that it
exceeds and is exceeded by the same number of observations. The determination of the
median is dependent on the type of data. For ungrouped data, the median is obtained as
follows: -

 Arrange the data in order of magnitude.


 Determine the value at the exact centre in the order.

Example
The data below represents scores in a CAT

9, 9, 6, 4, 7, 6, 8, 2, 9, 10, 3

On order of magnitude

2, 3, 4, 6, 6, 7, 8, 9, 9, 9, 10.

The median is 7.

N.B. If the data set is even, the median id half way between each number of the mid pair.
In this case, determine the mid pair and average them

e.g. 6+7 = 6.5

If similar scores surround the median e.g. 7, 7, 7. Determine the exact lower and exact
upper limits of 7 i.e. 6.5 and 7.5. Divide the interval by 3, add values of 2 portions to the
lower limit e.g. 6.5 + 0.66 = 7.16 this then becomes the median.

For grouped data, determine the median class, then obtain the median as follows: -
N+1
{ −F}
2
Median = L + i
f

Where, L – Exact lower limit of the median class.

F – Total frequencies below the median class


"Transforming Lives through Quality Education"
Egerton University is ISO 9001:2008 Certified
Page 6 of 9
f – Frequency of the median class

i – Class interval

N – Total frequencies/sample size/population size

Example

CLASS f x f[x] c.f


65-69 1 67 67 50

60-64 3 62 186 49

55-59 4 57 228 46

50-54 7 52 364 42

45-49 9 47 423 35

40-44 11 42 462 26

35-39 8 37 296 15

30-34 4 32 128 7

25-29 2 27 54 3

20-24 1 22 22 1

∑f = 50 ∑fx =2230

In the distribution repeated above, the scores are 50, hence the median is between the 25th
and 26th score. This falls in the interval class 40-44. Hence;

L = 39.5, F = 15, f = 11, i= 5, N= 50

Median = 39.5 + {50+1/2 -15} *5

11

= 39.5 + {25.5 – 15}5

11

"Transforming Lives through Quality Education"


Egerton University is ISO 9001:2008 Certified
Page 7 of 9
= 39.5 + {10.5/11}5

= 39.5 + 4.772727272727273

= 44.272727272727

= 44.27

Merits of the Median


(i) It is rigidly defined.

(ii) It is easy to understand and comprehend.

(iii) It is not affected at all by the extreme observations since it’s a positional average.

(iv) It can be calculated when dealing with a distribution having open ended classes.

(v) It can sometimes be located by a single inspection and can be computed


graphically.

Demerits of the Median


(i) It cannot be obtained exactly for an even number of observations.

(ii) It is not a function of all observations in a distribution.

(iii) It is affected by fluctuations of sampling and thus is a less stable average than the
mean.

Mathematical relationship between the 3 averages.


A distribution in which the values of the mean, median and mode coincide and are
identical is known as symmetrical (normal) distribution. Conversely, when the values of
the mean, median and mode are not equal, the distribution is known as asymmetrical or
skewed. In moderately skewed or asymmetrical distributions, a very important
relationship exists among the mean, median and mode. In such distributions, the distance
between the mean and the median is one-third of the distance between the mean and the
mode and this is summarized by the formula.
mod e  3median  2arithmeticmean(Mo= 3Md – 2M)

The mean, median and mode being measures of central tendency, which one should one
choose to use? The choice depends on why such a measure is required. The mode is the
most useful whenever a research requires a quick and relatively informal indication of a
typical data value. The median is the most useful when one requires the midpoint of the

"Transforming Lives through Quality Education"


Egerton University is ISO 9001:2008 Certified
Page 8 of 9
data distribution. The mean is the most useful if the knowledge of exact values of the
observation is important.

Topic Summary
The mean, median and mode being measures of central tendency, which one should one
choose to use? The choice depends on why such a measure is required. The mode is the
most useful whenever a research requires a quick and relatively informal indication of a
typical data value. The median is the most useful when one requires the midpoint of the
data distribution. The mean is the most useful if the knowledge of exact values of the
observation is important.

Further Reading
Book on Statistics.

"Transforming Lives through Quality Education"


Egerton University is ISO 9001:2008 Certified
Page 9 of 9

You might also like