Measures of Dispersion or Variation: Vijay - Gahlawat@yahoo - Co.in
Measures of Dispersion or Variation: Vijay - Gahlawat@yahoo - Co.in
Measures of Dispersion or Variation: Vijay - Gahlawat@yahoo - Co.in
1
Dispersion– The degree to which data tends to spread
about an average value is called variation or
dispersion of the data.
2
Why we need Measures of Variations?
1. The Range
2. The Interquartile Range
3. The Average Deviation
4. The Standard Deviation
5. The Lorenz Curve
5
Absolute measures of variation
6
Relative measures of variations
When the two data sets are expressed in different
units, for example – quintals of sugar versus tonnes
of sugarcane, or if the average value is very much
different, such as manager’s salary versus worker’s
salary, then relative measures of variations are
used.
A measure of relative variation is the ratio of a
measure of absolute variation to an average. It is
sometimes called a coefficient of variation ( a pure
number independent of the unit of measurement) 7
1.The Range
Range = L-S
L= Largest Value and
S= Smallest Value
8
Merits/Advantage of Range
9
Demerits/Disadvantages of Range
• Quality Control
• Fluctuations in the share market
• Weather Forecasts
11
2. Interquartile Range
The range which include middle 50% 0bservations
Inter-quartile range = Q3 Q1
Semi-interquartile range or quartile deviation
(Q.D.) = Q3 Q1
2
The Relative measure corresponding to this
measure
Coefficient of Q.D. = Q3 Q1
Q3 Q1 12
Merits of Q.D.
• It is superior to range.
• We can calculate range in case of open end
distributions.
• It is not effected by the presence of extreme
observations.
13
Demerits of Q.D.
• It ignores 50% observations.
• It is not capable for further mathematical
manipulations.
• Its value is very much effected by sampling
fluctuations.
• It is not a measure of variation as it really does not
show the scatter around an average but rather a
distance on the scale, i.e. Q.D. is not itself
measured from an avg., but it is a positional avg.
14
3.The Mean Absolute Deviation / Average
Deviation
A.D. is obtained by calculating the absolute
deviations of each observation from median or
mean, and than averaging these deviations by
taking their arithmetic mean.
Computation of Mean Absolute Deviation –
Ungrouped Data
MAD (Mean) = XX
N
Or
MAD (Median) = X Med . 15
N
Computation of Mean Absolute Deviation – Grouped Data
17
Demerits of A.D.
• The greatest drawback is that algebraic signs are
ignored
• This method may not give us very accurate results.
(because A.D. give us best results when deviations
are taken from median)
• It is not capable for further algebraic treatments.
• It is rarely used in sociological and business studies.
18
4.The Standard Deviation
19
Standard Deviation is also known as Root Mean
Square Deviation for the reason that it is the square
root of the means of square deviations from the
arithmetic mean. Standard deviation is denoted by
small Greek letter (read as sigma) and is defined
as
2
_
=
x x
=
x x
N
Or
x x
2 2
= N
N
21
Deviations taken from Assumed mean:-
d d
2 2
= N
N
Where
d x A
and A is Assumed Mean or Arbitrary point
22
Calculation of SD – Grouped Data
Deviations taken from Actual Mean:-
2
_
= f x x
N
fx fx
2
2
=
N N
N f
And
23
Deviations taken from Assumed Mean:-
fd fd
2 2
=
N
N
h
Where
d
x A
h
and A is Assumed Mean or Arbitrary point.
h is class interval
N f
24
Coefficient of Variation
The corresponding relative measure of S.D. is known as
coefficient of variation. It is most commonly used
measure of relative measure.
It is used in such problems where we want to compare
the variability of two or more than two series.
Coefficient of Variation denoted by C.V. is obtained as
follows:
100
C.V. = _
x
25
Mathematical Properties of S.D.
28
Demerits of S.D.
29
5. Lorenz Curve
• It is a graphic method of studying variation. It was
devised by Max O. Lorenz (Economic Statistician).
This curve was used by him for the first time to
measure the distribution of wealth and income.
• Now the curve is also used to study the distribution of
profits, wages, turnovers etc.
• The most common use of this curve is in the study
of the degree of inequality in the distribution of
income and wealth between countries or between
different periods of time. 30
It is a cumulative percentage curve in which
the % of items is combined with the % of
other things as wealth, profits, turnovers
etc.
31