Measure of Central Tendency - Questions
Measure of Central Tendency - Questions
UNIT-1
FREQUENCY DISTRIBUTION
Structure:
1.0 Introduction
1.1 Objectives
1.2 Measures of Central Tendency
1.2.1 Arithmetic mean
1.2.2 Median
1.2.3 Mode
1.2.4 Empirical relation among mode, median and mode
1.2.5 Geometric mean
1.2.6 Harmonic mean
1.3 Partition values
1.3.1 Quartiles
1.3.2 Deciles
1.3.3 Percentiles
1.4 Measures of dispersion
1.4.1 Range
1.4.2 Semi-interquartile range
1.4.3 Mean deviation
1.4.4 Standard deviation
1.2.5 Geometric mean
1.5 Absolute and relative measure of dispersion
1.6 Moments
1.7 Karl Pearson’s β and γ coefficients
1.8 Skewness
1.9 Kurtosis
1.10 Let us sum up
1.11 Check your progress : The key.
2
1.0 INTRODUCTION
According to Simpson and Kafka a measure of central tendency is typical
value around which other figures aggregate‘.
According to Croxton and Cowden ‗An average is a single value within the
range of the data that is used to represent all the values in the series. Since an
average is somewhere within the range of data, it is sometimes called a measure of
central value‘.
1.1 OBJECTIVES
The main aim of this unit is to study the frequency distribution. After going through this unit you
should be able to :
know about measures of dispersion like range, semi-inter-quartile range, mean deviation,
standard deviation;
To find the arithmetic mean, add the values of all terms and them divide sum by the
number of terms, the quotient is the arithmetic mean. There are three methods to find
the mean :
(i) Direct method: In individual series of observations x1, x2,… xn the arithmetic mean is
obtained by following formula.
x x x x .............xn1 xn
A.M . 1 2 3 4
n
(ii) Short-cut method: This method is used to make the calculations simpler.
Let A be any assumed mean (or any assumed number), d the deviation of the
arithmetic mean, then we have
M. A
fd ( d=(x-A))
N
(iii)Step deviation method: If in a frequency table the class intervals have equal width,
say i than it is convenient to use the following formula.
M A
fu i
n
where u=(x-A)/ i ,and i is length of the interval, A is the assumed mean.
Example 1. Compute the arithmetic mean of the following by direct and short -cut methods
both:
Freqyebcy 8 26 30 20 16
Solution.
Example 2 Compute the mean of the following frequency distribution using step deviation
method. :
Frequency 9 17 28 26 15 8
Solution.
Property 1 The algebraic sum of the deviations of all the variates from their arithmetic
mean is zero.
Proof . Let X1, X2,… Xn be the values of the variates and their corresponding frequencies be
f1, f2, …, fn respectively.
Let xi be the deviation of the variate Xi from the mean M, where i = 1,2, …, n. Then
Xi = Xi –M, i = 1,2,…, n.
n n
fixi f ( X M )
i 1 i 1
i i
n n
=M
i 1
fi M f i
i 1
5
=0
Exercise 1(a)
52 75 40 70 43 65 40 35 48
Variate : 6 7 8 9 10 11 12
Frequency: 20 43 57 61 72 45 39
Frequency: 31 44 39 58 12
1.2.2 MEDIAN
The median is defined as the measure of the central term, when the given terms (i.e.,
values of the variate) are arranged in the ascending or descending order of magnitudes. In
other words the median is value of the variate for which total of the frequencies above this
value is equal to the total of the frequencies below this value.
Due to Corner, ―The median is the value of the variable which divides the group into two
equal parts one part comprising all values greater, and the other all values less then the
median‖.
For example. The marks obtained, by seven students in a paper of Statistics are 15, 20, 23,
32, 34, 39, 48 the maximum marks being 50, then the median is 32 since it is the value of the
4th term, which is situated such that the marks of 1st, 2nd and 3rd students are less than this
value and those of 5th, 6th and 7th students are greater then this value.
COMPUTATION OF MEDIAN
Let n be the number of values of a variate (i.e. total of all frequencies). First of all
we write the values of the variate (i.e., the terms) in ascending or descending order of
magnitudes
n 1
th
th
Case2. If n is even then there are two central terms i.e., n/2 and The mean of
2
these two values gives the median.
(b) Median in continuous series (or grouped series). In this case, the median (Md) is
computed by the following formula
n
cf
Md l 2 i
f
Where Md = median
Example 1 – According to the census of 1991, following are the population figure, in
thousands, of 10 cities :
1400, 1250, 1670, 1800, 700, 650, 570, 488, 2100, 1700.
Here n=10, therefore the median is the mean of the measure of the 5th and 6th terms.
= 1325 Thousands
No. of workers 22 38 46 35 20
Here N = 161. Therefore median is the measure of (N + 1)/2th term i.e 81st term. Clearly 81st
term is situated in the class 20-30. Thus 20-30 is the median class. Consequently.
n
cf
Median M d l 2 i
f
= 20 + (½ 161 – 60) / 46 10
125 1
th
= 63rd term.
n
cf
Median M d l 2 i
f
= 30 + 25/24
= 30+1.04 = 31.04
1.2.3 MODE
The word ‗mode is formed from the French word ‗La mode‘ which means ‗in
fashion‘. According to Dr. A. L. Bowle ‗the value of the graded quantity in a statistical
group at which the numbers registered are most numerous, is called the mode or the
position of greatest density or the predominant value.‘
Mode
According to other statisticians, ‗The value of the variable which occurs most
frequently in the distribution is called the mode.‘
―The mode of a distribution is the value around the items tends to be most heavily
concentrated. It may be regarded at the most typical value of the series‖.
Definition. The mode is that value (or size) of the variate for which the frequency is
maximum or the point of maximum frequency or the point of maximum density. In other
words, the mode is the maximum ordinate of the ideal curve which gives the closest fit to
the actual distribution.
9
Size of shoes 1 2 3 4 5 6 7 8 9
Frequency 1 1 1 1 2 3 2 1 1
Here maximum frequency is 3 whose term value is 6. Hence the mode is modal size number
6.
(b) In continuous frequency distribution the computation of mode is done by the following
formula
f1 f 0
Mode M 0 l i … (i)
2 f1 f 0 f 2
i =class interval
f1 f 0
Mode M 0 l i
2 f1 f 0 f 2
72 36
= 21 10
(2 72 36 51)
= 21 + 357 / 87
= 21 + 4.103
= 25.103.
10
(c) Method of determining mode by the method of grouping frequencies. This method is
usually applied in the cases when there are two maximum frequencies against two different
size of items. This method is also applied in the cases when it is possible that the effect of
neighboring frequencies on the size of item (of maximum frequency) may be greater. The
method is as follows :
Firstly the items are arranged in ascending or descending order and corresponding
frequencies are written against them.The frequencies are then grouped in two and then in
threes and then is fours (if necessary). In the first stage of grouping, they are grouped (i.e.,
frequencies are added) by taking, first and second, third and fourth, …, . After it, the
frequencies are added in threes. The frequencies are added in the following two ways :
1. (i) First and second, third and fourth, fifth and sixth, seventh and eighth, …
(ii) Second and third, fourth and fifth, …
2. (i) First, second and third; fourth, fifth and sixth, …
(ii) Second, third and fourth; fifth, sixth and seventh, …
(iii) Third, fourth and fifth; sixth seventh and eighth, …
Now the items with maximum frequencies are selected and the item which
contains the maximum is called the mode. For illustration see following example 1.
Size of I II III IV V VI
Items
4 2
7
5 5
13
6 8
17 15
7 9
21 22
8 12
26 35 29
9 14
11
28 40
10 14
29 40 43
11 15
26 39
12 11
24
13 13
We have used brackets against the frequencies which have been grouped. Now we
shall find the size of the item containing maximum frequency :
Column Size of item having maximum frequency
I 11
II 10,11
III 9,10
IV 10,11,12
V 8,9,10
VI 9,10,11
Here size 8 occurs 1 time, 9 occurs 3 times, 10 occurs 5 times, 11 occurs 4 times, 12
occurs 1 time.
Since 10 occurs maximum number of times (5 times).
Hence the required mode is size 10.
For moderately asymmetrical distribution (or for asymmetrical curve), the relation
Mean – Mode = 3 (Mean - Median),
approximately holds. In such a case, first evaluate mean and median and then mode
is determined by
Mode = 3 Median – 2 Mean.
If in the asymmetrical curve the area on the left of mode is greater than area on the
right then
Mean < median < mode, i. e., (M < Md < M0)
12
Mode
Median Mode
Median
Mean
Mean
If in the asymmetrical curve, the area on the left of mode is less than the area on the
right then in this case
Exercise 1(c)
Q.1) Find the Mode of the following model size number of shoes.
Model size no. of shoes : 3,4,2,1,7,6,6,7,5,6,8,9,5.
If x1,x2, … ,xn. are n values of the variate x, none of which is zero . Then their
geometric mean G is defined by
G = (x1, x2, … xn)1/n (1)
If f1, f2, … , fn are the frequencies of x1,x2,…, xn respectively, then geometric mean G
is given by
The Harmonic mean of a series of values is the reciprocal of the arithmetic means of
their reciprocals. Thus if x1,x2,…, xn (none of them being zero) is a series and H is its
harmonic mean then
1 1 1 1 1
[ .... ]
H N x1 x 2 xn
If f1, f2, …, fn be the frequencies of x1,x2, … , xn (none of them being zero) then harmonic
mean H is given by
H .M .
f
1
fx
Example 1. Find the harmonic mean of the marks obtained in a class test, given below
Marks : 11 12 13 14 15
No. of students: 3 7 8 5 2
Solution.
Marks Frequency 1/x f 1/x
X f
11 3 0.0909 0.2727
12 7 0.0833 0.5831
13 8 0.0769 0.6152
14 5 0.0714 0.3570
15 2 0.0667 0.1334
14
N = ∑f = 25 ∑f/x = 1.9614
H .M .
f
1
fx
= 25 / 1.9614
= 25/1.9614
= 250000/19614
= 12.746 marks.
Property . For two observations x1 and x2, we have
AH = G2
Where A = arithmetic mean, H = harmonic mean and G = geometric mean.
1.3.1 QUARTILES :
Definition. The values of the variate which divide the total frequency into four equal
parts, are called quartiles. That value of the variate which divides the total frequency into
two equal parts is called median. The lower quartile or first quartile denoted by Q1 divides
the frequency between the lowest value and the median into two equal parts and similarly
the upper quartile (or third quartile) denoted by Q3 divides the frequency between the
median and the greatest value into two equal parts. The formulas for computation of
quartiles are given by
n 3n
cf cf
Q1 l 4 i , Q3 l 4 i
f f
1.3.2 DECILES :