Measures of Dispersion (Part 1)

The document discusses measures of dispersion, emphasizing their importance in understanding the variation within data sets beyond averages. It explains how dispersion indicates the reliability of averages and outlines various measures such as range, interquartile range, and quartile deviation, along with their advantages and disadvantages. Additionally, it highlights the significance of measuring dispersion for statistical analysis, quality control, and comparison of data sets.

Uploaded by

singh133167

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

18 views28 pages

Measures of Dispersion (Part 1)

Uploaded by

singh133167

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Unit 3: Measures of

Dispersion
Dr. Richa Verma, DEI
Need and Meaning: Measures of Dispersion

• In the preceding lectures we have already discussed why it is necessary to tabulate and classify
statistical series and to condense them into a single figure called average.
• The average as we have already seen has its own limitations and even an ideal average can
represent a series only" as best as a single figure can".
• No doubt averages have a very great utility in statistical analysis but they fail to reveal the entire
story of a phenomenon.
• There may be a dozen series whose averages may be identical but which may differ from each
other in a hundred ways. Obviously in such cases further statistical analysis of the data is necessary
so that these differences between various series may also be studied and accounted for.
• If this is done statistical analysis would be more accurate and we shall be more confident of
conclusions.
Need and Meaning: Measures of Dispersion

• Just as central tendency can be measured by a number in the form of an average, the
amount of variation (dispersion, spread, or scatter) among the values in the data set can also
be measured.
• The measures of central tendency describe that the major part of values in the data set
appears to concentrate (cluster) around a central value called average with the remaining
values scattered (spread or distributed)on either sides of that value.
• But these measures do not reveal how these values are dispersed (spread or scattered) on
each side of the central value.
• The dispersion of values is indicated by the extent to which these values tend to spread over
an interval rather than cluster closely around an average.
Need and Meaning: Measures of Dispersion

• A small dispersion among values in the data set indicates that data are clustered closely around the mean. The mean is therefore considered
representative of the data, i.e. mean is a reliable average.
• Conversely, a large dispersion among values in the data set indicates that the mean is not reliable, i.e. it is not representative of the data.
• Illustration
• Suppose over the six-year period the net profits (in percentage) of two firms are as follows:
Firm 1 : 5.2, 4.5, 3.9, 4.7, 5.1, 5.4
Firm 2 : 7.8, 7.1, 5.3, 14.3, 11.0, 16.1
• Since average amount of profit is 4.8 per cent for both firms, therefore operating results of both the firms are equally good and that a
choice between them for investment purposes must depend on other considerations.
• However, the difference among the values is greater in Firm, 2, that is, profit is varying from 5.3 to 16.1 per cent, while the net profit
values of Firm 1 were varying from 3.9 to 5.4 per cent.
• This shows that the values in data set 2 are spread more than those in data set 1.
• This implies that Firm 1 has a consistent performance while Firm 2 has a highly inconsistent performance.
• Thus for investment purposes, a comparison of the average (mean) profit values alone should not be sufficient.
When Dispersion is high

• Spread of Data- High

• Variability - High
• Heterogeneity- High
• Reliability - Low
• Consistency - Low
• Dependability - Low
• Uniformity – Low
• Homogeneity - Low
SIGNIFICANCE OF MEASURING DISPERSION

• Test the reliability of an average: Measures of variation are used to test to what extent an average
represents the characteristic of a data set. If the variation is small, that is, extent of dispersion or scatter is
less on each side of an average, then it indicates high unformity of values in the distribution and the
average represents an individual value in the data set. On the other hand, if the variation is large, then it
indicates a lower degree of uniformity in values in the data set, and the average may be unreliable. No
variation indicates perfect uniformity and, therefore, values in the data set are identical.
• ii) Control the variability: Measuring variation helps to identify the nature and causes of variation. Such
information is useful in controlling the variations. According to Spurr and Bonini, ‘In matters of health,
variations in, body temperature, pulse beat and blood pressure are the basic guides to diagnosis.
Prescribed treatment is designed to control their variation. In industrial production, efficient operation
requires control of quality variation, the causes of which are sought through inspection and quality control
programmes.’ In social science, the measurement of ‘inequality’ of distribution of income and wealth
requires the measurement of variability.
SIGNIFICANCE OF MEASURING DISPERSION

(iii) Compare two or more sets of data with respect to their variability: Measures of variation
help in the comparison of the spread in two or more sets of data with respect to their
uniformity or consistency. For example, (i) the measurement of variation in share prices and
their comparison with respect to different companies over a period of time requires the
measurement of variation, (ii) the measurement of variation in the length of stay of patients in
a hospital every month may be used to set staffing levels, number of beds, number of doctors,
and other trained staff, patient admission rates, and so on.
iv) Facilitate the use of other statistical techniques: Measures of variation facilitate the use of
other statistical techniques such as correlation and regression analysis, hypothesis testing,
forecasting, quality control, and so on.
Essential Requisites for a Measure of Variation

The essential requisites for a good measure of variation are listed below. These requisites
help in identifying the merits and demerits of individual measures of variation.
(i) It should be rigidly defined.
(ii) It should be based on all the values (elements) in the data set.
(iii) It should be calculated easily, quickly, and accurately.
(iv) It should not be unduly affected by the fluctuations of sampling and by extreme
observations.
(v) It should be amenable to further mathematical or algebraic manipulations.
CLASSIFICATION OF MEASURES OF DISPERSION

The various measures of dispersion (variation) can be classified into two categories:
• (i) Absolute measures, and
• (ii) Relative measures

• Absolute measures are described by a number or value to represent the amount of variation or differences among values in a data set.
Such a number or value is expressed in the same unit of measurement as the set of values in the data such as rupees, inches, feet,
kilograms, or tonnes. Such measures help in comparing two or more sets of data in terms of absolute magnitude of variation, provided
the variable values are expressed in the same unit of measurement and have almost the same average value.
• The relative measures are described as the ratio of a measure of absolute variation to an average and is termed as coefficient of
variation. The word ‘coefficient’ means a number that is independent of any unit of measurement. While computing the relative
variation, the average value used as base should be the same from which the absolute deviations were calculated.
Measures of dispersion

The following measures of dispersion are in common use-

• Range
• Inter-Quartile-Range
• Semi-Inter-Quartile-Range or Quartile Deviation
• Average Deviation or Mean Deviation
• Standard Deviation or Root-Mean-Square Deviation taken from the mean.
Range
Range

• The range is the most simple measure of dispersion and is based on the location of the largest and the
• smallest values in the data.
• Thus, the range is defined to be the difference between the largest and lowest observed values in a data set.
In other words, it is the length of an interval which covers the highest and lowest observed values in a data
set and thus measures the dispersion or spread within the interval in the most direct possible way.
Range (R) = Highest value of an observation – Lowest value of an observation
Range (R) = H – L
• For example, if the smallest value of an observation in the data set is 160 and largest value is 250, then the
range is 250 – 160 = 90.
• For grouped frequency distributions of values in the data set, the range is the difference between the upper
class limit of the last class and the lower class limit of first class. In this case, the range obtained may be
higher than as compared to ungrouped data because of the fact that the class limits are extended slightly
beyond the extreme values in the data set.
Coefficient of Range

• The relative measure of range, called the coefficient of range is obtained by applying the
following
• Formula:
• Example : The following are the sales figures of a firm for the last 12
months.
Months : 1 2 3 4 5 6 7 8 9 10 11 12
Sales: (Rs. ’000) : 80 82 82 84 84 86 86 88 88 90 90 92
Calculate the range and coefficient of range for sales.

Range: • Solution: Given that H = 92 and L = 80. Therefore

Example Range = H – L = 92 – 80 = 12
Advantages of Range

• It is independent of the measure of central tendency and easy to calculate and

understand.
• It is quite useful in cases where the purpose is only to find out the extent of extreme
variation, such as industrial quality control, temperature, rainfall, and so on.
Disadvantages of Range
• The calculation of range is based on only two values—largest and smallest in the data set and fail to
take account of any other observations.
• It is largely influenced by two extreme values and completely independent of the other values. For
example, range of two data sets {1, 2, 3, 7, 12} and {1, 1, 1, 12, 12} is 11, but the two data sets differ
in terms of overall dispersion of values
• Its value is sensitive to changes in sampling, that is, different samples of the same size from the same
population may have widely different ranges.
• It cannot be computed in case of open-ended frequency distributions because no highest or lowest
value exists in open-ended class.
• It does not describe the variation among values in the data between two extremes. For example, each
of the following set of data has a range of 21 – 9 = 12, but the variation of values is quite different in
each case between the highest and lowest values.
Set 1 : 9 21 21 21 21 21 21 21
Set 2 : 9 9 9 9 21 21 21 21
Set 3 : 9 10 12 14 15 19 20 21
Applications of Range

• Fluctuation in share prices: The range is useful in the study of small variations among values in a data
set, such as variation in share prices and other commodities that are very sensitive to price changes
from one period to another.
• Quality control: It is widely used in industrial quality control. Quality control is exercised by
preparing suitable control charts. These charts are based on setting an upper control limit (range)
and a lower control limit (range) within which produced items shall be accepted. The variation in the
quality beyond these ranges requires necessary correction in the production process or system.
• Weather forecasts: The concept of range is used to determine the difference between maximum
and minimum temperature or rainfall by meteorological departments to announce for the knowledge
of the general public.
Interquartile Range
Interquartile Range or Deviation

• The limitations or disadvantages of the range can partially be overcome by using another measure of
variation which measures the spread over the middle half of the values in the data set so as to
minimise the influence of outliers (extreme values) in the calculation of range.
• Since a large number of values in the data set lie in the central part of the frequency distribution,
therefore it is necessary to study the Interquartile Range (also called midspread).
• To compute this value, the entire data set is divided into four parts each of which contains 25 per
cent of the observed values.
• The quartiles are the highest values in each of these four parts. The interquartile range is a measure
of dispersion or spread of values in the data set between the third quartile, Q3 and the first quartile,
Q1. In other words, the interquartile range or deviation (IQR) is the range for the middle 50 per
cent of the data.
Interquartile range (IQR) = Q3 – Q1
Interquartile Range
• The median is not necessarily midway
between Q1 and Q3, although this will be so
for a symmetrical distribution. The median
and quartiles divide the data into equal
numbers of values but do not necessarily
divide the data into equally wide intervals.
• In a non-symmetrical distribution, the two
quartiles Q1 and Q3 are at equal distance
from the median, that is, Median – Q1 = Q3
– Median. Thus, Median ± Quartile
Deviation covers exactly 50 per cent of the
observed values in the data set.
• A smaller value of quartile deviation
indicates high uniformity or less variation
among the middle 50 per cent observed
values around the median value. On the
other hand, a high value of quartile deviation
indicates large variation among the middle
50 per cent observed values.
SEMI-INTER-QUARTILE
RANGE/ QUARTILE
DEVIATION
SEMI-INTER QUARTILE RANGE/ QUARTILE DEVIATION

• Semi-inter-quartile range as the name suggests is the midpoint of the

inter-quartile-range. In other words, it is one half of the difference
between the third quartile and the first quartile.
• Semi-inter-quartile range or Quartile deviation

• Where Q3 and Q1 stand for the upper and lower quartiles respectively.
SEMI-INTER QUARTILE RANGE/ QUARTILE DEVIATION

• In a symmetrical series median lies half way on the scale from and Q1
to Q3. If, therefore, the value of the quartile deviation is added to the
lower quartile or subtracted from the upper quartile, in a symmetrical
series, the resulting figure would be the value of the median.
• But generally series are not symmetrical and in a moderately
asymmetrical series Q1 - quartile deviation or Q3 - quartile deviation,
would not give true value of the median.
• There would be a difference between the two figures and the greater
the difference, the greater would be the extent of departure from
normality.
Coefficient of Quartile Deviation
• Since quartile deviation is an absolute measure of variation, therefore its value gets affected by the size
• and number of observed values in the data set. Thus, the Q.D. of two or more than two sets of data may
• differ. Due to this reason, to compare the degree of variation in different sets of data, we compute the
• relative measure corresponding to Q.D., called the coefficient of Q.D., and it is calculated as follows:

• Example : Calculate the Quartile deviation of the following data:

Advantages of Quartile Deviation

(i) It is not difficult to calculate but can only be used to evaluate variation
among observed values within the middle of the data set. Its value is not
affected by the extreme (highest and lowest) values in the data set.

(ii) It is an appropriate measure of variation for a data set summarized in

open-ended class intervals.

(iii) Since it is a positional measure of variation, therefore it is useful in

case of erratic or highly skewed distributions, where other measures of
variation get affected by extreme values in the data set.
Disadvantages of Quartile Deviation

(i) The value of Q.D. is based on the middle 50 per cent observed values in the data
set, therefore it cannot be considered as a good measure of variation as it is not
based on all the observations.

(ii) The value of Q.D. is very much affected by sampling fluctuations.

(iii) The Q.D. has no relationship with any particular value or an average in the data
set for measuring the variation. Its value is not affected by the distribution of the
individual values within the interval of the middle 50 per cent observed values.

Solid Works Training BASIC
100% (4)
Solid Works Training BASIC
176 pages
Imaging Brain Function With EEG
100% (4)
Imaging Brain Function With EEG
266 pages
Vision Based Systems For UAV Applications: Aleksander Nawrat Zygmunt Kus
100% (1)
Vision Based Systems For UAV Applications: Aleksander Nawrat Zygmunt Kus
348 pages
Zimbabwe School Examinations Council Pure Mathematics 6042/2
100% (4)
Zimbabwe School Examinations Council Pure Mathematics 6042/2
8 pages
Survey Sampling Formula Sheet
100% (2)
Survey Sampling Formula Sheet
13 pages
Chapter 5
No ratings yet
Chapter 5
47 pages
Dispersion: (Measures of Variability)
No ratings yet
Dispersion: (Measures of Variability)
93 pages
Lecture 5
No ratings yet
Lecture 5
97 pages
Midterm Project Gec 3
No ratings yet
Midterm Project Gec 3
29 pages
Bast 503 Lect 5
No ratings yet
Bast 503 Lect 5
53 pages
of B.com Sem 1 Prabhjot Kaur 30006 On Measures of Dispersion-2
No ratings yet
of B.com Sem 1 Prabhjot Kaur 30006 On Measures of Dispersion-2
26 pages
MS Excel 280 Short Keys Guide Book
No ratings yet
MS Excel 280 Short Keys Guide Book
36 pages
Measure of Dispersion
No ratings yet
Measure of Dispersion
66 pages
Q3 DLP Math 1 Week 9
100% (1)
Q3 DLP Math 1 Week 9
7 pages
CO Distribution Cycle
No ratings yet
CO Distribution Cycle
10 pages
CHAPTER 4 Measure of Dispersion
No ratings yet
CHAPTER 4 Measure of Dispersion
76 pages
Measures of Dispersion: By: Joan Listana Justine Sanosa Martin Reverente
No ratings yet
Measures of Dispersion: By: Joan Listana Justine Sanosa Martin Reverente
11 pages
CHAPTER 4 Measure of Dispersion
No ratings yet
CHAPTER 4 Measure of Dispersion
76 pages
Dispersion 26-11-2023
No ratings yet
Dispersion 26-11-2023
41 pages
03 Dispersion and Skewness B
No ratings yet
03 Dispersion and Skewness B
42 pages
Measures of Central Tendency/ Dispersion: Anastat Lesson3 Amdelosreyes
No ratings yet
Measures of Central Tendency/ Dispersion: Anastat Lesson3 Amdelosreyes
12 pages
Dispersion
No ratings yet
Dispersion
18 pages
Measure of Variation
No ratings yet
Measure of Variation
16 pages
Dispersion
No ratings yet
Dispersion
3 pages
Unit 2 Measures of Dispersion: Structure
No ratings yet
Unit 2 Measures of Dispersion: Structure
16 pages
Breakthrough Trading Formulas
100% (1)
Breakthrough Trading Formulas
7 pages
Sonia Khondakar - Data Analytics - BBA 504 (A)
No ratings yet
Sonia Khondakar - Data Analytics - BBA 504 (A)
13 pages
DBB2102 Unit-04
No ratings yet
DBB2102 Unit-04
21 pages
Dispersion (Measures of Variability)
100% (3)
Dispersion (Measures of Variability)
42 pages
OptiTekServices Mining
No ratings yet
OptiTekServices Mining
3 pages
Statistics 1-17
No ratings yet
Statistics 1-17
18 pages
3.dispersion and Skewness-Students Notes-MAR
No ratings yet
3.dispersion and Skewness-Students Notes-MAR
29 pages
Dispersion Theory
No ratings yet
Dispersion Theory
7 pages
Measures of Disperson
No ratings yet
Measures of Disperson
17 pages
Fracture Analysis of Pressure Vessel Under Dynamic Loading and Thermal Effect PDF
No ratings yet
Fracture Analysis of Pressure Vessel Under Dynamic Loading and Thermal Effect PDF
108 pages
Disperson SkwenessOriginal
No ratings yet
Disperson SkwenessOriginal
10 pages
Measures of Variability
No ratings yet
Measures of Variability
21 pages
Newtons Laws of Motion PDF
No ratings yet
Newtons Laws of Motion PDF
43 pages
3 Graphing Quadratic Functions Worksheet
No ratings yet
3 Graphing Quadratic Functions Worksheet
9 pages
Measure of Dispersion and Skwness
No ratings yet
Measure of Dispersion and Skwness
41 pages
Mba Unit-2
No ratings yet
Mba Unit-2
2 pages
NCR STAT ANALYSIS LESSON 2 Midterm
No ratings yet
NCR STAT ANALYSIS LESSON 2 Midterm
42 pages
Module 8 Measures of Variation
No ratings yet
Module 8 Measures of Variation
6 pages
The Multiple Classical Linear Regression Model (CLRM) : Specification and Assumptions
No ratings yet
The Multiple Classical Linear Regression Model (CLRM) : Specification and Assumptions
19 pages
Manozo, Pamela L. BSA3-A 8 Task Performance 1: Spss Solution
No ratings yet
Manozo, Pamela L. BSA3-A 8 Task Performance 1: Spss Solution
3 pages
003 Measures of Dispersion
No ratings yet
003 Measures of Dispersion
6 pages
Solutions To Problems: = = ⎡ ⎣⎢ ⎤ ⎦⎥ ρ π kgm m 7 860 4 3 0 015 0 - = 0 111 - kg
No ratings yet
Solutions To Problems: = = ⎡ ⎣⎢ ⎤ ⎦⎥ ρ π kgm m 7 860 4 3 0 015 0 - = 0 111 - kg
15 pages
Measure of Dispersion Statistics
No ratings yet
Measure of Dispersion Statistics
24 pages
Unit 4
No ratings yet
Unit 4
16 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
7 pages
Discrete Structures
No ratings yet
Discrete Structures
2 pages
Energies 11 02626 PDF
No ratings yet
Energies 11 02626 PDF
16 pages
Thermodynamic TTT Metal Science 1982
No ratings yet
Thermodynamic TTT Metal Science 1982
7 pages
Study Design Comparison
No ratings yet
Study Design Comparison
6 pages
LP Arithmetic Sequence'19-'20
No ratings yet
LP Arithmetic Sequence'19-'20
4 pages
Liar by Isaac Asimov 2
No ratings yet
Liar by Isaac Asimov 2
16 pages
Dispersion
No ratings yet
Dispersion
74 pages
Vectors Plane
No ratings yet
Vectors Plane
28 pages
Chapter Four
No ratings yet
Chapter Four
27 pages
Business Statistics - KMBN104
No ratings yet
Business Statistics - KMBN104
25 pages
Unit I & Ii Qa
No ratings yet
Unit I & Ii Qa
42 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
15 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
6 pages
Sheet 2 Measures of Dispersion
No ratings yet
Sheet 2 Measures of Dispersion
10 pages
Measure of Dispersion
No ratings yet
Measure of Dispersion
17 pages
Chapter 8 - STATISTICAL Method by S. P. Gupta
No ratings yet
Chapter 8 - STATISTICAL Method by S. P. Gupta
49 pages
COM 201 LessonNote
No ratings yet
COM 201 LessonNote
8 pages
Lec 3
No ratings yet
Lec 3
18 pages
Lecture 3 - Introduction To Computer Data Processing Using Python
No ratings yet
Lecture 3 - Introduction To Computer Data Processing Using Python
22 pages
Digital Communications Over Fading Channels M.K. Simon and M.S. Alouini 2005 Book Review
No ratings yet
Digital Communications Over Fading Channels M.K. Simon and M.S. Alouini 2005 Book Review
2 pages
Chapter 04
No ratings yet
Chapter 04
18 pages
CS 1101 Unit 4
No ratings yet
CS 1101 Unit 4
3 pages
Statistical Practical in Geography
No ratings yet
Statistical Practical in Geography
3 pages
ABM 401 Lesson 9
No ratings yet
ABM 401 Lesson 9
10 pages
Measures of Variability
No ratings yet
Measures of Variability
49 pages
Extensin Education Part 1 PDF
No ratings yet
Extensin Education Part 1 PDF
71 pages
BS Unit 3
No ratings yet
BS Unit 3
22 pages
Lecture Notes 2.3
No ratings yet
Lecture Notes 2.3
7 pages
GAN-based Synthetic Medical Image Augmentation
No ratings yet
GAN-based Synthetic Medical Image Augmentation
10 pages
Textile Test Series 2 PDF
No ratings yet
Textile Test Series 2 PDF
11 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
25 pages
Chapter 4 (1) Macroeconomy
No ratings yet
Chapter 4 (1) Macroeconomy
9 pages
Chi Square Test
No ratings yet
Chi Square Test
27 pages
FRM Test Series 1
No ratings yet
FRM Test Series 1
44 pages
Project-4 MS
No ratings yet
Project-4 MS
49 pages
Measures of Disperssion and Skewness
No ratings yet
Measures of Disperssion and Skewness
18 pages
ML Unit Wise Important Questions
No ratings yet
ML Unit Wise Important Questions
2 pages
Measures of Disperson
No ratings yet
Measures of Disperson
6 pages
Heat Conduction Using Green S Functions 2nd Edition Beck Instant Download
No ratings yet
Heat Conduction Using Green S Functions 2nd Edition Beck Instant Download
47 pages
Statistics Assignment Labib
No ratings yet
Statistics Assignment Labib
16 pages
Dispersion N Range
No ratings yet
Dispersion N Range
4 pages
Measure of Dispersion-1
No ratings yet
Measure of Dispersion-1
13 pages
Descriptive Statistics: Six Sigma Thinking, #3
From Everand
Descriptive Statistics: Six Sigma Thinking, #3
Sumeet Savant
No ratings yet
Introduction To Non Parametric Methods Through R Software
From Everand
Introduction To Non Parametric Methods Through R Software
Editor IJSMI
No ratings yet