Measurement 1
Measurement 1
Measurement 1
NUMERICAL MEASURES
PART 1
◦ Measures of Central Tendency
◦ Measures of Variability
Central Tendency
In general terms, central tendency is a statistical measure that determines a
single value that accurately describes the center of the distribution and
represents the entire distribution of scores.
The goal of central tendency is to identify the single value that is the best
representative for the entire set of data.
By identifying the "average score," central tendency allows researchers to summarize or
condense a large set of data into a single value.
In addition, it is possible to compare two (or more) sets of data by simply comparing the
average score (central tendency) for one set versus the average score for another set.
Types
◦ Mean
◦ Median
◦ Mode
◦ Percentiles
◦ Quartiles
Mean
The mean is the arithmetic average of the scores.
Sample mean
Population mean
The Median
If the scores in a distribution are listed in order from smallest to
largest, the median is defined as the midpoint of the list.
The median divides the scores so that 50% of the scores in the
distribution have values that are equal to or less than the median.
7
Mode
The mode of a data set is the value that occurs with greatest frequency.
If the data have exactly two modes, the data are bimodal.
If the data have more than two modes, the data are multimodal.
The Mode
The mode is defined as the most frequently occurring category or score in the
distribution.
The mode can be determined for data measured on any scale of measurement:
nominal, ordinal, interval, or ratio.
11
Mean > Median > Mode
The pth percentile of the data set is a measurement such that after the data are
ordered from smallest to largest, at most, p% of the data are at or below this
value and at most, (100 - p)% at or above it.
16
Percentiles and Quartiles
◦ For a set of measurements arranged in increasing order, the 𝑝𝑝th percentile is a
value such that p percent of the measurements fall at or below the value and
(100−𝑝𝑝) percent of the measurements fall at or above the value
Q1 Q2 Q3
■ The first quartile, Q1, is the value for which 25% of the
observations are smaller and 75% are larger
■ Q2 is the same as the median (50% of the observations
are smaller and 50% are larger)
■ Only 25% of the observations are greater than the third
quartile
The Interquartile Range (IQR)
The IQR is Q3 – Q1 and measures the spread in the middle 50% of the data
Q1, Q3, and IQR measures are called resistant measures because they are not
influenced by the outliers in the dataset
Think of Q1 as a median in the lower half of the data and think of Q3 as a
median for the upper half of data.
Q1 = 5 and Q3 = 18
Example
we will consider the set of data: First quartile = 3.5
2, 3, 3, 4, 5, 6, 6, 7, 8, 8, 8, 9 Median = 6
Third quartile = 8
Find the Q1, Q3 and the IQR. Thus we see that the interquartile range is 8
– 3.5 = 4.5
The Boxplot or Box and
Whisker Diagram
◦ The Boxplot: A Graphical display of the data.
Example:
Example:
X Median X
minimum Q1 (Q2) Q3 maximum
12 30 45 57 70
Interquartile range
IQR= 57 – 30= 27