Measures OF VARIATION-final
Measures OF VARIATION-final
MEASURES OF DISPERSION
The previous lesson provides the measure of central tendency that sum up or
describe the data into a single representative value. The measures of
central tendency may not be adequate to describe data unless we know the
manner in which the individual items scatter around it. In other words, a further
description of a series on the scatter or variability known as dispersion is
necessary, if we are to gauge how representative the average is.
Let us take the following three sets.
Students Group X Group Y Group Z
1 50 45 05
2 50 50 45
3 50 55 100
mean X 50 50 50
Thus, the three groups have same mean i.e. 50. In fact the median of group X and
Y are also equal. Now if one would say that the students from the three groups are
of equal capabilities, it is totally a wrong conclusion. Close examination
reveals that in group X students have equal marks as the mean, students from
group Y are very close to the mean but in the third group Z, the marks are widely
scattered. It is thus clear that the measures of the central tendency is alone not
sufficient to describe the data. The measure of dispersion helps us to know
the degree of variability in the data and provide a better understanding of the
data.
OBJECTIVES
After completing this lesson, you will be able to:
know the meaning and need of measures of dispersion;
Measures of Dispersion
For example if the data is expressed in kilograms then the absolute variation will
also be expressed in kilograms but if the same data is expressed in grams then the
variation will appear 1000 times more. So for comparison point of view it is
necessary to calculate the relative measures of dispersion which are expressed as
percentage form (i.e. unitless number). These types of expressions are called
coefficients of dispersion. Each absolute measure of dispersion has a relative
measure of dispersion.
Range and Quartile Deviation measure the dispersion by calculating the spread
within which the values lie. Mean Deviation and Standard Deviation calculate the
extent to which the values differ from the average.
9.2.1 Range
Range (R) is the difference between the largest (L) and the smallest value (S) in
a distribution. Thus
Range (R) = L – S
Coefficient of Range: It is a relative measure of the range. It is used in the
comparative study of the dispersion
L−S
co-efficient of Range =
L+S
In case of continuous series Range is just the difference between the upper limit
of the highest class and the lower limit of the lowest class.
Range: Evaluation
Range is very simple to understand and easy to calculate. However, it is not based
on all the observations of the distribution and is unduly affected by the extreme
values. Any change in the data not related to minimum and maximum values will
not affect range. It cannot be calculated for open-ended frequency distribution.
Example 1: The amount spent (in `) by the group of 10 students in the school
canteen is as follows:
110, 117, 129, 197, 190, 100, 100, 178, 255, 790.
Find the range and the co-efficient of the range.
Solution: R = L - S = 790 - 100 = ` 690
Example 2: Find the range and it’s co-efficient from the following data.
Size 10-20 20-30 30-40 40-50 50-100
Frequency 2 3 5 4 2
Solution: R = L – S = 100 – 10 = 90
Measures of Dispersion
L −S 100 −10 90
Co-efficient of range = = = = 0.82
L+S 100 +10 110
Q3 − Q1
Thus Quartile Deviation (Q.D) =
2
Q3 − Q1
Q − Q1
Coefficient of Quartile Deviation = 2 = 3
Q3 + Q1 Q3 + Q1
2
It is pure number free of any units of measurement. It can be used for comparing
the dispersion in two or more than two sets of data.
Solution:
Arranging the data in an ascending order we get
70, 100, 120, 150, 170, 190, 250
Here n = 7,
(N +1)
Q1 = Size of th item
4
(7 +1)
= Size of th item = 2nd item = 100 rupees
4
3(N +1)
Q3 = Size of th item
4
3(7 +1)
= Size of th item = 6th item = 190 rupees
4
Q3 − Q1 190 −100
Q.D. = = = 45 rupees
2 2
Example 4: The wheat production (in Kg) of 20 acres is given as: 1120, 1240,
1320, 1040, 1080, 1200, 1440, 1360, 1680, 1730, 1785, 1342, 1960, 1880, 1755,
1720, 1600, 1470, 1750, and 1885. Find the quartile deviation and coefficient of
quartile deviation.
Solution:
After arranging the observations in ascending order, we get
1040, 1080, 1120, 1200, 1240, 1320, 1342, 1360, 1440, 1470, 1600, 1680, 1720,
1730, 1750, 1755, 1785, 1880, 1885, 1960.
( N +1
Q1 = value of th item
4 )
( 20 +1
= Value of th item
4 )
Measures of Dispersion
3( N + 1)
Q3 = Value of th item
4
3(20 +1)
= Value of th item
4
Q3 − Q1 = 1753.75 −1260
Q.D. =
2 2
492.75
= = 246.875kg.
2
Q3 − Q1 1753.75 −1260
= = = 0.164
Q3 + Q1 1753.75 +1260
Solution:
Table 9.1: Calculation of Quartile deviation
Amount of Taxes No. of Firms (f) Cummulative
(in ’000 `) Frequency (C.F.)
10 3 3
11 12 15
12 18 33
13 12 45
14 3 48
f = 48
Here N = 48,
(N + 1) th item
Q1 = Size of
4
(48 + 1) th item
= Size of
4
3(N + 1)
Q3 = Size of th item
4
3(48 + 1)
= Size of th item
4
Q3 − Q1 13 −11
Q.D. = = = 1 (in '000 rupees)
2 2
Measures of Dispersion
Q3 − Q1 13 −11
Coeff of Q.D. = Q + Q = 13 +11 = 0.083
3 1
Solution:
Table 9.2: Calculation of Quartile deviation and coefficient
of quartile deviation
Weekly Wages No. of Workers (f) Cummulative
(in ‘000 `) Frequency (C.F.)
4.5-7.5 14 14
7.5-10.5 24 38
10.5-13.5 38 76
13.5-16.5 20 96
16.5-19.5 4 100
f = 100
l2 - l1 ( N l
Q1 = l1 + ¨©4 - cf ¸¹
f
N
= 25. Q1 lies in the class of 7.5-10.5
4
l2 − l1 ( N 25 −14
Q1 = l1 + 4 − cf ) = 7.5 + 3 = 8.875 (in `000)
f 24
l2 − l1 ( 3N 75 − 38
Q3 = l1 + 4 − cf ) = 10.5 + 3 = 13.42 (in `000)
f 38
Q3 − Q1 13.42 − 8.875
Q.D. = = = 2.273 (in `000)
2 2
Measures of Dispersion
Q3 − Q1 13.42 − 8.875
Coeff. of Q.D. = = = 0.21
Q3 + Q1 13.42 + 8.875
3. Mean Deviation
Mean deviation (MD) of a series is the arithmetic average of the deviation of
various items from a measure of central tendency (mean, median and mode)
First we compute deviations of all the items from either mean or median ignoring
plus (+) and (–) signs. They are called absolute values of deviations where the two
parallel bars (ii) indicate that the absolute value is taken. This is also called modulas
value. Then the aggregate of these deviations are divided by the number of
observations this is called mean deviation.
(iii) Take deviations of items from median/mean ignoring ± signs and denote the
column as |D|
(iv) Calculate the sum of these deviation in case of discrete and continuous series
|D| is multiplied by respective frequency of the item to get f |D|
(v) Divide the total obtained by number of items to get mean deviation
f D
M.D. =
N
M.D
Coefficient of M.D. = Median / Mean / Mode
Example 6: Calculate mean deviaiton and coefficient of mean deviation from both
mean and median for the following data on the monthly income (in `) of
households
( N +1
th
and Median = Size of
2 )
item
X
Mean =
N
( 5 +1
th
= Size of
2 )
item
38650
= = 7730
5 = Size of 3rd item
D = ` 7920
M.D. =
N D
Mean Deviation =
3220 N
= = Rs 644
5 3030
= = Rs 606
M.D. 5
Coefficient of M.D. = Mean
M.D.
Coefficient of M.D. =
644 Median
= = 0.083
7730
606
= = 0.076
7920
Solution.
Table 9.4: Calculation of mean deviaiton from Median
X f cf |D| f|D|
6 4 4 18 72
12 7 11 12 84
18 9 20 6 54
24 18 38 0 0
30 15 53 6 90
36 10 63 12 120
42 5 68 18 90
f = 68 |D| = 72 f|D| = 510
( N +1
th
Median = Size of item
2 )
( 68 +1
th
= Size of item
2 )
= 34.5th item
f D 510
M.D. = = = 7.5
N 68
M.D. 7.5
Coefficient of M.D. = = = 0.312
Median 24
fd
X = A+ N c
10
= 25 + 10 = 27 Marks
50
f D 472
M.D. = = = 9.44 Marks
N 50
2. Obtain the deviation of each ith observation from X i.e. (Xi – X ). (Note that
(Xi – X ) =0)
5. Obtain the square root of the mean of these squared deviations as follows:
( X − X)2
Standard deviation (x) =
N
N = Total No. of observation
3. Direct Method
The relevance of this method is particularly useful when the items are very small.
To obtain standard deviations, we apply the following formula:
X2
(X)
2
Standard Deviation (x) =
−N
(Note: The direct method basically implies taking deviations from zero)
d2 (
Standard Deviation (x) = − 2 c
N d N )
X−A
where d = ( i.e. deviation taken from the assumed mean and divide by
c )
class interval (c)
Days 1 2 3 4 5 6
Commodity Price(` Per Quintal) 5 15 25 35 45 55
Solution:
Table 9.6: Calculation of standard deviation
4 35 5 25 –5 25 1225 –1 1
5 45 15 225 5 25 2025 1 1
X=
X 180
= = 30 (in Rupees)
N 6
( X − X)2 = 1750
Standard deviation (x) = = 17.078 (in rupees)
N 6
−60
X =A+
d
Here = 40 + = 30
N 6
2
d2 ( d
Standard Deviation (x) = −
N N )
d 2 = 2350, d = −60, N = 6
2350 ( −60 2
(X ) = − = Rs.17.078
6 6 )
Measures of Dispersion
X2 −
(X)
2
Standard Deviation (x) =
N
2
d'2 ( d '
Standard Deviation (x) = − c
N N )
94 ( −12 2
(X ) = − 5 = Rs.17.078
6 6 )
Note: The sum of deviations taken from mean is Zero. But the sum of deviations
from a value other than actual mean is not equal to zero
fx2
Standard deviation (x) =
f
2
fd2 ( fd
Standard deviation (x) = −
f f )
m−A
Divide the deviations by a common factor ‘C’ denoted by d = ( .
c )
Multiply d values with corresponding f values to obtain fd values.
Multiply fd values with d values to get fd2 values.
Obtain fd and fd2 values.
Apply the following formula.
2
fd2 ( fd
Standard Deviation (x) = − c
f f )
Measures of Dispersion
( m−A
where d = i.e. deviation taken from the assumed mean and divide by
c )
class interval (c) (or the common factor in case the class intervals are unequal), m
is the mid value of the interval.
Solution:
Table 9.7: Calculation of standard deviation
Profit No. of m fm d= d = fd fd2 fd fd2
m – 45
(in ` Companies (m – 40)
10
crores)
fx2
Standard deviation (x) =
f
36000
(X ) = = 14.142(in rupees crores)
180
2
fd2 ( fd
Standard deviation (x) = −
f f )
40500 − ( −900 2
(X ) = = 14.142(in rupees crores)
180 180 )
2
fd'2 ( fd '
Standard deviation (x) = − c
f f )
m − 45
d' = , fd'2 = 540, fd ' = −180, f = 180, c = 10
10
540 − ( −180 2
(X ) = 10 = 14.142(in rupees crores)
180 180 )
Example 11: The following table shows the daily wages of a random sample of
construction workers. Calculate its mean deviation and standard deviation.
Measures of Dispersion
Solution:
Table 9.8: Computation of mean deviation
Daily Wages Number of Class Mark fm fi |m –X|
(`) X Workers (f) (m) = fi |m – 823.5|
200 – 399 5 299.5 1497.50 2,620
400 – 599 15 499.5 7492.50 4,860
600 – 799 25 699.5 17487.50 3,100
800 – 999 30 899.5 26985.00 2,280
1000 – 1199 18 1,099.5 19791.00 4,968
1200 – 1399 7 1,299.5 9096.50 3,332
Total 100 82350.00 21,160
fi m − X 21,160
Mean deviation = = = 211.60 (`)
fi 100
Table 9.9: Computation of Standard deviation
Daily Wages Number of Class Mark fi (m – X)2
(`) Workers (M.V.)
6462400
Standard deviation = = 254.21 (Rupees)
100
Measures of Dispersion
X
C.V. = 100
X
Example 12: The following table shows the summary statistics for the daily wages
of two types of workers.
20
Variation I > II CV = 100 = 20% CV
I II
100
24
= 100 = 16%
150
Measures of Dispersion
distribution of the dependent variable is equal, the plot will show as a straight, 45°
line. Unequal distributions will yield a curve. The gap between this curve and the
45° line is the inequality gap. The farther the curve from this 45o line, the
greater is the variability present in the distribution. Lorenz curve is used to see
the degree of concentration of income or health. For example, it may show
top 25% of population accounts for 70% of income or bottom 25% of population
has only 5% of income (see figure 9.1).
Last 25%
has 70% of
Cumulative % of Income
national
income
Third 25%
has 15% of
national
income
Second 25%
has 10% of
national
income
First 25%
has 5% of
national
income
Cumulative % of population
Fig. 9.1
Q3 − Q1
The Coefficient of quartile deviation is given by Coeffof Q.D. =
Q3 + Q1
( X − X)2
Standard deviation (x) =
N
2
d2 ( d
Standard deviation (x) = −
N N )
X2 −
(X)
2
Standard deviation (x) =
N
2
d '2 ( d '
Standard deviation (x) = − c
N N )
(X−A
where d = i.e. deviation taken from the assumed mean and
c )
divide by class interval (c)
Standard Deviation in case of Continuous Series is given by
fx2
Standard deviation (x) =
f
2
fd2 ( fd
Standard deviation (x) = −
f f )
2
fd2 ( fd
Standard deviation (x) = − c
f f )
( m−A
where d = i.e. deviation taken from the assumed mean and
c )
divide by class interval (c) (or the common factor in case the class
intervals are unequal, m is the mid value of the interval.
Measures of Dispersion
X
C.V. = 100
X
X = S.D. of var iable X, X = mean of var iable X
Lorenz curve is the graphical method of estimating dispersion.
TERMINAL EXERCISES
Range
1. The following are the prices of shares of A B Co. Ltd. from Monday to
Saturday:
Days Price (in `) Days Price (in `)
Monday 200 Thursday 160
Tuesday 210 Friday 220
Wednesday 208 Saturday 250
Quartile Deviation
7. Calculate the QD for a group of data,
241, 521, 421, 250, 300, 365, 840, 958
8. From the following figures find the quartile deviation and its coefficient:
Height (cms.): 150 151 152 153 154 155 156 157 158
No. of Students: 15 20 32 35 33 22 20 12 10
9. Using quartile deviation, state which of the two variables –Aand B is more variable:
A B
Mid-Point Frequency Mid-Point Frequency
15 15 100 340
20 33 150 492
25 56 200 890
30 103 250 1420
35 40 300 620
40 32 350 360
45 10 400 187
450 140
10. Find the quartile deviation from the following table:
Size: 4-8 8-12 12-16 16-20 20-24 24-28 28-32 32-36 36-40
Frequency: 6 10 18 30 15 12 10 6 2
11. Calculate the coefficient of quartile deviation from the following data:
Class Interval Frequency
10–15 4
15–20 12
20–25 16
25–30 22
30–40 10
40–50 8
50–60 6
60–70 4
8
Measures of Dispersion
Standard deviation
12. Determine the standard deviation of the following student test results
percentages.
92% 66% 99% 75% 69% 51% 89% 75% 54% 45% 69%
13. Calculate the coefficient of variation for the following data set.
The price (in `), of a stock over five trading days was 52, 58, 55, 57, 59.
14. The frequency table of the monthly salaries of 20 people is shown below.
3500 5
4000 8
4200 5
4300 2
15. The following table shows the grouped data, in classes, for the heights of 50
people.
120 130 2
130 140 5
140 150 25
150 160 10
160 170 8
16. The following is the frequency distribution for the speeds of a sample of
automobiles traveling on an interstate highway.
Speed Miles per Hour Frequency
50 – 54 2
55 – 59 4
60 – 64 5
65 – 69 10
70 – 74 9
75 – 79 5
35
A 3083
B 1466
C -461
D 1113
E -11
F 395
G 3290
H 437
Answers:
ANSWERS TO INTEXT QUESTIONS
9.1
1. (c)
2. (c)
9.2
1. 140
9.3
1. (c)
2. (d)
3. (c)
4. (b)
Measures of Dispersion
9.4
1. Median = 5, M.D. = 1.68
2. M.D. 0.915, Coefficient of M.D. = 0.336
9.5
1. 2.1 marks
9.6
1. (b)
2. (b)
3 (d)
4. (b)
5. Standard deviation (b) C.V