0% found this document useful (0 votes)

96 views9 pages

Measures of Spread

1) Measures of spread describe the variability or dispersion in a data set and are usually used along with measures of central tendency like the mean or median. 2) The range, quartiles, variance, absolute deviation, and standard deviation are common measures of spread that will be discussed. 3) Quartiles break a data set into quarters to show the spread. The interquartile range describes the difference between the third and first quartiles and indicates the spread of the middle half of scores.

Uploaded by

Rizza Monge

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

96 views9 pages

Measures of Spread

Uploaded by

Rizza Monge

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 9

Measures of Spread

Introduction
A measure of spread, sometimes also called a measure of dispersion, is used
to describe the variability in a sample or population. It is usually used in
conjunction with a measure of central tendency, such as the mean or median,
to provide an overall description of a set of data.

Why is it important to measure the spread of data?

There are many reasons why the measure of the spread of data values is
important, but one of the main reasons regards its relationship with measures
of central tendency. A measure of spread gives us an idea of how well the
mean, for example, represents the data If the spread of values in the data set
is large, the mean is not as representative of the data as if the spread of data
is small. This is because a large spread indicates that there are probably large
differences between individual scores. Additionally, in research, it is often
seen as positive if there is little variation in each data group as it indicates that
the similar.

We will be looking at the range, quartiles, variance, absolute

deviation and standard deviation.

Range
The range is the difference between the highest and lowest scores in a data
set and is the simplest measure of spread. So we calculate range as:

Range = maximum value - minimum value

For example, let us consider the following data set:

23 56 45 65 59 55 62 54 85 25

The maximum value is 85 and the minimum value is 23. This results in a
range of 62, which is 85 minus 23. Whilst using the range as a measure of
spread is limited, it does set the boundaries of the scores. This can be useful
if you are measuring a variable that has either a critical low or high threshold
(or both) that should not be crossed. The range will instantly inform you
whether at least one value broke these critical thresholds. In addition, the
range can be used to detect any errors when entering data. For example, if
you have recorded the age of school children in your study and your range is
7 to 123 years old you know you have made a mistake!

Quartiles and Interquartile Range

Quartiles tell us about the spread of a data set by breaking the data set into
quarters, just like the median breaks it in half. For example, consider the
marks of the 100 students below, which have been ordered from the lowest to
the highest scores, and the quartiles highlighted in red.

OrderScoreOrderScoreOrderScoreOrderScoreOrderScore
1st 35 21st 42 41st 53 61st 64 81st 74
2nd 37 22nd 42 42nd 53 62nd 64 82nd 74
3rd 37 23rd 44 43rd 54 63rd 65 83rd 74
4th 38 24th 44 44th 55 64th 66 84th 75
5th 39 25th 45 45th 55 65th 67 85th 75
6th 39 26th 45 46th 56 66th 67 86th 76
7th 39 27th 45 47th 57 67th 67 87th 77
8th 39 28th 45 48th 57 68th 67 88th 77
9th 39 29th 47 49th 58 69th 68 89th 79
10th 40 30th 48 50th 58 70th 69 90th 80
11th 40 31st 49 51st 59 71st 69 91st 81
12th 40 32nd 49 52nd 60 72nd 69 92nd 81
13th 40 33rd 49 53rd 61 73rd 70 93rd 81
14th 40 34th 49 54th 62 74th 70 94th 81
15th 40 35th 51 55th 62 75th 71 95th 81
16th 41 36th 51 56th 62 76th 71 96th 81
17th 41 37th 51 57th 63 77th 71 97th 83
18th 42 38th 51 58th 63 78th 72 98th 84
19th 42 39th 52 59th 64 79th 74 99th 84
20th 42 40th 52 60th 64 80th 74 100th 85
The first quartile (Q1) lies between the 25th and 26th student's marks,
the second quartile (Q2) between the 50th and 51st student's marks, and
the third quartile (Q3) between the 75th and 76th student's marks. Hence:

First quartile (Q1) = (45 + 45) ÷ 2 = 45

Second quartile (Q2) = (58 + 59) ÷ 2 = 58.5
Third quartile (Q3) = (71 + 71) ÷ 2 = 71

In the above example, we have an even number of scores (100 students,

rather than an odd number, such as 99 students). This means that when we
calculate the quartiles, we take the sum of the two scores around each
quartile and then half them (hence Q1= (45 + 45) ÷ 2 = 45). However, if we
had an odd number of scores (say, 99 students), we would only need to take
one score for each quartile (that is, the 25th, 50th and 75th scores). You
should recognize that the second quartile is also the median.

Quartiles are a useful measure of spread because they are much less
affected by outliers or a skewed data set than the equivalent measures of
mean and standard deviation. For this reason, quartiles are often reported
along with the median as the best choice of measure of spread and central
tendency, respectively, when dealing with skewed and/or data with outliers. A
common way of expressing quartiles is as an interquartile range. The
interquartile range describes the difference between the third quartile (Q3)
and the first quartile (Q1), telling us about the range of the middle half of the
scores in the distribution. Hence, for our 100 students:

Interquartile range = Q3 - Q1
= 71 - 45
= 26

However, it should be noted that in journals and other publications you will
usually see the interquartile range reported as 45 to 71, rather than the
calculated range.

A slight variation on this is the semi-interquartile range, which is half the

interquartile range = ½ (Q3 - Q1). Hence, for our 100 students, this would be
26 ÷ 2 = 13.
Absolute Deviation & Variance
Variation
Quartiles are useful, but they are also somewhat limited because they do not
take into account every score in our group of data. To get a more
representative idea of spread we need to take into account the actual values
of each score in a data set. The absolute deviation, variance and standard
deviation are such measures.

The absolute and mean absolute deviation show the amount of deviation
(variation) that occurs around the mean score. To find the total variability in
our group of data, we simply add up the deviation of each score from the
mean. The average deviation of a score can then be calculated by dividing
this total by the number of scores. How we calculate the deviation of a score
from the mean depends on our choice of statistic, whether we use absolute
deviation, variance or standard deviation.

Absolute Deviation and Mean Absolute Deviation

Perhaps the simplest way of calculating the deviation of a score from the
mean is to take each score and minus the mean score. For example, the
mean score for the group of 100 students we used earlier was 58.75 out of
100. Therefore, if we took a student that scored 60 out of 100, the deviation of
a score from the mean is 60 - 58.75 = 1.25. It is important to note that scores
above the mean have positive deviations (as demonstrated above), whilst
scores below the mean will have negative deviations.

To find out the total variability in our data set, we would perform this
calculation for all of the 100 students' scores. However, the problem is that
because we have both positive and minus signs, when we add up all of these
deviations, they cancel each other out, giving us a total deviation of zero.
Since we are only interested in the deviations of the scores and not whether
they are above or below the mean score, we can ignore the minus sign and
take only the absolute value, giving us the absolute deviation. Adding up all
of these absolute deviations and dividing them by the total number of scores
then gives us the mean absolute deviation (see below). Therefore, for our 100
students the mean absolute deviation is 12.81, as shown below:

Variance
Another method for calculating the deviation of a group of scores from the
mean, such as the 100 students we used earlier, is to use the variance. Unlike
the absolute deviation, which uses the absolute value of the deviation in order
to "rid itself" of the negative values, the variance achieves positive values by
squaring each of the deviations instead. Adding up these squared deviations
gives us the sum of squares, which we can then divide by the total number of
scores in our group of data (in other words, 100 because there are 100
students) to find the variance (see below). Therefore, for our 100 students, the
variance is 211.89, as shown below:
As a measure of variability, the variance is useful. If the scores in our group of data are spread
out, the variance will be a large number. Conversely, if the scores are spread closely around the
mean, the variance will be a smaller number. However, there are two potential problems with
the variance. First, because the deviations of scores from the mean are 'squared', this gives
more weight to extreme scores. If our data contains outliers (in other words, one or a small
number of scores that are particularly far away from the mean and perhaps do not represent
well our data as a whole), this can give undo weight to these scores. Secondly, the variance is
not in the same units as the scores in our data set: variance is measured in the units squared.
This means we cannot place it on our frequency distribution and cannot directly relate its value
to the values in our data set. Therefore, the figure of 211.89, our variance, appears somewhat
arbitrary. Calculating the standard deviation rather than the variance rectifies this problem.
Nonetheless, analyzing variance is extremely important in some statistical analyses, discussed
in other statistical guides.

Standard Deviation
Introduction
The standard deviation is a measure of the spread of scores within a set of
data. Usually, we are interested in the standard deviation of a population.
However, as we are often presented with data from a sample only, we can
estimate the population standard deviation from a sample standard deviation.
These two standard deviations - sample and population standard deviations -
are calculated differently. In statistics, we are usually presented with having to
calculate sample standard deviations, and so this is what this article will focus
on, although the formula for a population standard deviation will also be
shown.

When to use the sample or population standard deviation

We are normally interested in knowing the population standard deviation
because our population contains all the values we are interested in.
Therefore, you would normally calculate the population standard deviation if:
(1) you have the entire population or (2) you have a sample of a larger
population, but you are only interested in this sample and do not wish to
generalize your findings to the population. However, in statistics, we are
usually presented with a sample from which we wish to estimate (generalize
to) a population, and the standard deviation is no exception to this. Therefore,
if all you have is a sample, but you wish to make a statement about the
population standard deviation from which the sample is drawn, you need to
use the sample standard deviation. Confusion can often arise as to which
standard deviation to use due to the name "sample" standard deviation
incorrectly being interpreted as meaning the standard deviation of the sample
itself and not the estimate of the population standard deviation based on the
sample.

What type of data should you use when you calculate a standard
deviation?
The standard deviation is used in conjunction with the mean to
summarize continuous data, not categorical data. In addition, the standard
deviation, like the mean, is normally only appropriate when the continuous
data is not significantly skewed or has outliers.

Examples of when to use the sample or population standard deviation

Q. A teacher sets an exam for their pupils. The teacher wants to summarize
the results the pupils attained as a mean and standard deviation. Which
standard deviation should be used?

A. Population standard deviation. Why? Because the teacher is only

interested in this class of pupils' scores and nobody else.

Q. A researcher has recruited males aged 45 to 65 years old for an exercise

training study to investigate risk markers for heart disease (e.g., cholesterol).
Which standard deviation would most likely be used?

A. Sample standard deviation. Although not explicitly stated, a researcher

investigating health related issues will not simply be concerned with just the
participants of their study; they will want to show how their sample results can
be generalized to the whole population (in this case, males aged 45 to 65
years old). Hence, the use of the sample standard deviation.
Q. One of the questions on a national consensus survey asks for respondents'
age. Which standard deviation would be used to describe the variation in all
ages received from the consensus?

A. Population standard deviation. A national consensus is used to find out

information about the nation's citizens. By definition, it includes the whole
population. Therefore, a population standard deviation would be used.

What are the formulas for the standard deviation?

The sample standard deviation formula is:

Where,

s = sample standard deviation

= sum of...
= sample mean
n = number of scores in sample.
The population standard deviation formula is:

Where,

= population standard deviation

= sum of...
= population mean
n = number of scores in sample.

Measures of Partition and Dispersion
No ratings yet
Measures of Partition and Dispersion
51 pages
Assessment of Learning Ii: Dr. Louie D. Asuncion Instructor
No ratings yet
Assessment of Learning Ii: Dr. Louie D. Asuncion Instructor
73 pages
Report Group D
No ratings yet
Report Group D
100 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
41 pages
Variability
No ratings yet
Variability
26 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
44 pages
Dispersion Measures
No ratings yet
Dispersion Measures
23 pages
Dispersion
50% (2)
Dispersion
58 pages
STA201 Lec 04
No ratings yet
STA201 Lec 04
22 pages
Charii Attachment Report
No ratings yet
Charii Attachment Report
23 pages
Assessment of A New Born PPT LP
100% (1)
Assessment of A New Born PPT LP
35 pages
Psy 311 Lesson 8
No ratings yet
Psy 311 Lesson 8
19 pages
Measures of Dispersion Updated
No ratings yet
Measures of Dispersion Updated
38 pages
Biostatistics (Descriptive Statistics)
No ratings yet
Biostatistics (Descriptive Statistics)
30 pages
Lecture 4 Copy 1
No ratings yet
Lecture 4 Copy 1
13 pages
Have A Glowing and Soft Skin With Homeopathy Remedies - by Dr. Ranjana Gupta - Lybrate
No ratings yet
Have A Glowing and Soft Skin With Homeopathy Remedies - by Dr. Ranjana Gupta - Lybrate
6 pages
GE Math - Learning Module 9 - AY 2021 2022
No ratings yet
GE Math - Learning Module 9 - AY 2021 2022
9 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
25 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
25 pages
Chapter 5 - Measures of Variability
No ratings yet
Chapter 5 - Measures of Variability
35 pages
Measures of Variability-April19-1
No ratings yet
Measures of Variability-April19-1
88 pages
Introduction To Descriptive Statistics 2014
67% (3)
Introduction To Descriptive Statistics 2014
72 pages
Hepatobiliary Trans
100% (1)
Hepatobiliary Trans
14 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
26 pages
Group-1 Module-1 PPT
No ratings yet
Group-1 Module-1 PPT
100 pages
QTT201 Ca-2
No ratings yet
QTT201 Ca-2
14 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
23 pages
Final Measures of Dispersion DR Lotfi
No ratings yet
Final Measures of Dispersion DR Lotfi
54 pages
Of Mice and Men Essay Friendship
100% (2)
Of Mice and Men Essay Friendship
3 pages
Dispersion
No ratings yet
Dispersion
26 pages
SR NO Time Specific Objective Content Teacher Activities Student Activities A.V Aids Evaluation
No ratings yet
SR NO Time Specific Objective Content Teacher Activities Student Activities A.V Aids Evaluation
16 pages
Meas T
No ratings yet
Meas T
8 pages
Lecture 5 Notes
No ratings yet
Lecture 5 Notes
23 pages
Diagnostic Atlas of Veterinary Ophthalmology (VetBooks - Ir)
No ratings yet
Diagnostic Atlas of Veterinary Ophthalmology (VetBooks - Ir)
218 pages
ProPerMan Hand Out 4 (Two Standard Deviations)
No ratings yet
ProPerMan Hand Out 4 (Two Standard Deviations)
8 pages
Ed216 Chapter 7
No ratings yet
Ed216 Chapter 7
31 pages
Statistics 8 q2
No ratings yet
Statistics 8 q2
6 pages
BComp3 Module 5 Measures of Variability
No ratings yet
BComp3 Module 5 Measures of Variability
17 pages
R3.Descriptive Statistics
No ratings yet
R3.Descriptive Statistics
5 pages
MBA Quantitative Techniques and Analytics 03
No ratings yet
MBA Quantitative Techniques and Analytics 03
14 pages
KP COVID Report
No ratings yet
KP COVID Report
2 pages
Kest 106
No ratings yet
Kest 106
17 pages
Measures of Dispersion 1
No ratings yet
Measures of Dispersion 1
5 pages
2023 Edal 221 Lesson 4a Utilization of Assessment Data
No ratings yet
2023 Edal 221 Lesson 4a Utilization of Assessment Data
10 pages
Measures of Dispersion OR Measures of Variations
No ratings yet
Measures of Dispersion OR Measures of Variations
7 pages
Dispersion
No ratings yet
Dispersion
31 pages
Chapter 1 Episode 3 - Measure of Dispersion
No ratings yet
Chapter 1 Episode 3 - Measure of Dispersion
10 pages
3.dispersion and Skewness-Students Notes-MAR
No ratings yet
3.dispersion and Skewness-Students Notes-MAR
29 pages
Stat-L CHAPTER 4
No ratings yet
Stat-L CHAPTER 4
18 pages
Variability Final
No ratings yet
Variability Final
53 pages
Form 9 Ver 1.0-IsBAR Form Correction
No ratings yet
Form 9 Ver 1.0-IsBAR Form Correction
4 pages
Measures of Dispersion
No ratings yet
Measures of Dispersion
46 pages
Kupit 2 PDF
No ratings yet
Kupit 2 PDF
26 pages
Measures of Variability
No ratings yet
Measures of Variability
8 pages
Chapter 6 (Philoid-In)
No ratings yet
Chapter 6 (Philoid-In)
17 pages
Module 1 Assessment 2
No ratings yet
Module 1 Assessment 2
5 pages
Descriptive Statistic - Session 5
No ratings yet
Descriptive Statistic - Session 5
6 pages
Math-7 FLDP Quarter-4 Week-7
No ratings yet
Math-7 FLDP Quarter-4 Week-7
7 pages
Lesson II: Measures of Variability: Example 1
No ratings yet
Lesson II: Measures of Variability: Example 1
21 pages
Dispersion PDF
No ratings yet
Dispersion PDF
17 pages
Tech Seminar Sem 2
No ratings yet
Tech Seminar Sem 2
35 pages
Measures of Variability
No ratings yet
Measures of Variability
3 pages
This is The Statistics Handbook your Professor Doesn't Want you to See. So Easy, it's Practically Cheating...
From Everand
This is The Statistics Handbook your Professor Doesn't Want you to See. So Easy, it's Practically Cheating...
S. Deviant
4.5/5 (6)
Esophageal Stricture
No ratings yet
Esophageal Stricture
6 pages
Role of Media in The Management of Public Health Issues
No ratings yet
Role of Media in The Management of Public Health Issues
51 pages
Measures of Variability
No ratings yet
Measures of Variability
7 pages
VOLUNTEERISM
No ratings yet
VOLUNTEERISM
14 pages
Faecal Occult Blood Tests
No ratings yet
Faecal Occult Blood Tests
8 pages
Sampling in Statistics
From Everand
Sampling in Statistics
Stephanie Glen
No ratings yet
Radiography: S. Lewis, F. Mulla
No ratings yet
Radiography: S. Lewis, F. Mulla
6 pages
Unit 4 Descriptive Statistics
No ratings yet
Unit 4 Descriptive Statistics
8 pages
Central Tendency Variation Skewness Individual Performance Relationships
No ratings yet
Central Tendency Variation Skewness Individual Performance Relationships
9 pages
Author(s) Prerequisites Learning Objectives: Measures of Variability
No ratings yet
Author(s) Prerequisites Learning Objectives: Measures of Variability
17 pages
Influenza
No ratings yet
Influenza
21 pages
High Flow Nasal Cannula
No ratings yet
High Flow Nasal Cannula
6 pages
Cues Nursing Diagnosis Inference Objective Nursing Intervention Rationale Evaluation
No ratings yet
Cues Nursing Diagnosis Inference Objective Nursing Intervention Rationale Evaluation
3 pages
Chapter 3 Dispersion
No ratings yet
Chapter 3 Dispersion
12 pages
Biostats Research Work
No ratings yet
Biostats Research Work
4 pages
Bank Soal Osce Hippocrates PDF
0% (1)
Bank Soal Osce Hippocrates PDF
2 pages
Presentation 1
No ratings yet
Presentation 1
2 pages
Case Analysis Rizza L. Monge
No ratings yet
Case Analysis Rizza L. Monge
1 page
Case Analysis Rizza L. Monge
No ratings yet
Case Analysis Rizza L. Monge
1 page
SOCIETY & CULTURE Final Examination 1. What
No ratings yet
SOCIETY & CULTURE Final Examination 1. What
20 pages
ARC Intake Form 020618
No ratings yet
ARC Intake Form 020618
15 pages
Is Violence A Disease? Situating Violence Prevention in Public Health Policy and Practice
No ratings yet
Is Violence A Disease? Situating Violence Prevention in Public Health Policy and Practice
8 pages
De Potter Et Al 2022 Ultra Low Temperature Cryoablation For Atrial Fibrillation Primary Outcomes For Efficacy and Safety
No ratings yet
De Potter Et Al 2022 Ultra Low Temperature Cryoablation For Atrial Fibrillation Primary Outcomes For Efficacy and Safety
6 pages
Paradoxical Vocal Cord Motion Disorder: Past, Present and Future
No ratings yet
Paradoxical Vocal Cord Motion Disorder: Past, Present and Future
9 pages
What Are Knock Knees?
No ratings yet
What Are Knock Knees?
2 pages
HP Sukhwinder Singh India
No ratings yet
HP Sukhwinder Singh India
1 page
Which of These Diseases Cannot Be Controlled by Killing The Vector - Myschool
No ratings yet
Which of These Diseases Cannot Be Controlled by Killing The Vector - Myschool
10 pages
Dermoscopic Feature of Chondroid Syringoma: About 2 Cases
No ratings yet
Dermoscopic Feature of Chondroid Syringoma: About 2 Cases
5 pages
Life Cycle of Taenia - 2
No ratings yet
Life Cycle of Taenia - 2
3 pages
Ice Candy For Sale
100% (1)
Ice Candy For Sale
1 page
International Pig Topics Nutritional Management NovDec 2021
No ratings yet
International Pig Topics Nutritional Management NovDec 2021
1 page

Measures of Spread

Uploaded by

Measures of Spread

Uploaded by

Measures of Spread

Why is it important to measure the spread of data?

We will be looking at the range, quartiles, variance, absolute

Range = maximum value - minimum value

For example, let us consider the following data set:

Quartiles and Interquartile Range

First quartile (Q1) = (45 + 45) ÷ 2 = 45

In the above example, we have an even number of scores (100 students,

A slight variation on this is the semi-interquartile range, which is half the

Absolute Deviation and Mean Absolute Deviation

When to use the sample or population standard deviation

Examples of when to use the sample or population standard deviation

A. Population standard deviation. Why? Because the teacher is only

Q. A researcher has recruited males aged 45 to 65 years old for an exercise

A. Sample standard deviation. Although not explicitly stated, a researcher

A. Population standard deviation. A national consensus is used to find out

What are the formulas for the standard deviation?

s = sample standard deviation

= population standard deviation

You might also like