Numerical Summary Measures
Mekdes W. (MPH)
Numerical summary measures
A single number that quantifies a characteristic of a distribution of values.
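$$\bar{x} = \frac{\sum_{i=1}^{k} m_i f_i}{\sum_{i=1}^{k} f_i}$$
• where, m_i is the midpoint of the i-th class, f_i is the frequency of the i-th class, and k is the number of classes.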
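As a rough illustration of the grouped-data mean, the sketch below multiplies each class midpoint by its frequency and divides by the total frequency. The function name and the midpoints and frequencies are hypothetical and are not taken from the slides.

```python
def grouped_mean(midpoints, frequencies):
    """Mean of grouped data: sum(m_i * f_i) / sum(f_i)."""
    if len(midpoints) != len(frequencies):
        raise ValueError("midpoints and frequencies must have the same length")
    total = sum(frequencies)
    if total == 0:
        raise ValueError("total frequency must be positive")
    return sum(m * f for m, f in zip(midpoints, frequencies)) / total


# Hypothetical class midpoints and frequencies (illustration only).
midpoints = [5, 15, 25, 35]
frequencies = [2, 5, 8, 5]
mean_value = grouped_mean(midpoints, frequencies)
print(mean_value)
```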
a) ungrouped data
observation.
$$\tilde{x} = L_m + \left(\frac{\frac{n}{2} - F_c}{f_m}\right) W$$
• where,
• Lm = lower true class boundary of the interval containing the median
• n = total number of observations
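The grouped-data median formula can be sketched as a small function. This is an illustration under assumptions: F_c is taken to be the cumulative frequency of the classes before the median class, f_m the frequency of the median class, and W the class width. The input values are hypothetical and do not come from the slides' example.

```python
def grouped_median(L_m, n, F_c, f_m, W):
    """Median of grouped data: L_m + ((n/2 - F_c) / f_m) * W.

    L_m: lower true class boundary of the median class
    n:   total number of observations
    F_c: cumulative frequency before the median class (assumed meaning)
    f_m: frequency of the median class (assumed meaning)
    W:   class width (assumed meaning)
    """
    if f_m <= 0:
        raise ValueError("frequency of the median class must be positive")
    return L_m + ((n / 2 - F_c) / f_m) * W


# Hypothetical inputs (illustration only).
median_value = grouped_median(L_m=29.5, n=100, F_c=40, f_m=20, W=10)
print(median_value)
```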
Example. Compute the median age of 169 subjects from the
grouped data.
• Fc = 70
b) The second quartile (Q2): 50% of all the ranked observations
are less than Q2. [50th percentile] The second quartile is the
median.
c) The third quartile (Q3): 75% of all the ranked observations are
less than Q3. [75th percentile]
Percentiles
– P25: 25% of the sample values are less than or equal to this value.
P25 is the 1st quartile (25th percentile), given by the
0.25(n+1)th observation.
– P50: 50% of the sample values are less than or equal to this value.
P50 is the 2nd quartile (50th percentile), given by the
0.5(n+1)th observation.
– P75: 75% of the sample values are less than or equal to this
value. P75 is the 3rd quartile (75th percentile), given by the
0.75(n+1)th observation.
– P100: The maximum
Class exercise
1. The following data set gives birth weights in grams. Find
the 10th and 90th percentiles.
2069, 2581, 2759, 2834, 2838, 2841, 3031,
3101, 3200, 3245, 3248, 3260, 3265, 3314, 3323,
3484, 3541, 3609, 3649, 4146
Solution
10th percentile: position = 0.1(20+1) = 2.1, i.e. between the 2nd and 3rd values
• the average of the 2nd and 3rd values =
(2581+2759)/2 = 2670 g
90th percentile: position = 0.9(20+1) = 18.9, i.e. between the 18th and 19th values
• the average of the 18th and 19th values =
(3609+3649)/2 = 3629 g
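The solution can be checked with a short sketch of the p(n+1) position rule as it is applied in this exercise: when the position is not a whole number, the two neighbouring ranked values are averaged. The function name is hypothetical. Statistical packages usually interpolate instead of averaging, so their results may differ slightly from the ones here.

```python
import math


def percentile_n_plus_1(values, p):
    """Percentile at position p*(n+1), averaging the two neighbouring values."""
    data = sorted(values)
    n = len(data)
    # Round to avoid floating-point noise in positions such as 0.1 * 21.
    pos = round(p * (n + 1), 9)
    # Clamp the 1-based ranks to the valid range 1..n.
    lower = max(1, min(n, math.floor(pos)))
    upper = max(1, min(n, math.ceil(pos)))
    return (data[lower - 1] + data[upper - 1]) / 2


birth_weights = [
    2069, 2581, 2759, 2834, 2838, 2841, 3031,
    3101, 3200, 3245, 3248, 3260, 3265, 3314, 3323,
    3484, 3541, 3609, 3649, 4146,
]

p10 = percentile_n_plus_1(birth_weights, 0.10)
p90 = percentile_n_plus_1(birth_weights, 0.90)
print(p10, p90)  # 2670.0 3629.0
```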
Mode
• The mode of grouped data usually refers to the modal class, i.e. the
class with the highest frequency.
• Its value is often not unique (more than one mode is possible).
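A minimal sketch of both cases follows: the mode(s) of ungrouped values and the modal class(es) of a frequency table. It returns every tied value, which shows why the mode need not be unique. The data, the class labels, and the helper name `modal_classes` are hypothetical. `statistics.multimode` requires Python 3.8 or later.

```python
import statistics


def modal_classes(freq_by_class):
    """Return every class whose frequency equals the highest frequency."""
    top = max(freq_by_class.values())
    return [label for label, freq in freq_by_class.items() if freq == top]


# Hypothetical ungrouped data with two modes.
values = [2, 3, 3, 5, 5, 7]
modes = statistics.multimode(values)

# Hypothetical grouped data with two modal classes.
table = {"10-19": 4, "20-29": 9, "30-39": 9, "40-49": 3}
classes = modal_classes(table)

print(modes, classes)
```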
• Example –
– Range = 42-5 = 37
Properties of range
IQR = Q3 - Q1
Example: Suppose the first and third quartiles for the weights of girls
12 months of age are 8.8 kg and 10.2 kg, respectively.
Then IQR = 10.2 - 8.8 = 1.4 kg,
i.e., 50% of the infant girls weigh between 8.8 and 10.2 kg.
Example 2
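• Given the following data set (age of patients), find the IQR.
• Solution: in ascending order, the data are 18 21 23 24 24 32 42 59 (n = 8)
• Q1 = 0.25(8+1) = 2.25th value, the average of the 2nd and 3rd values = (21+23)/2 = 22
• Q3 = 0.75(8+1) = 6.75th value, the average of the 6th and 7th values = (32+42)/2 = 37
• Hence, IQR = 37 - 22 = 15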
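The IQR of 15 can be reproduced with a small sketch that locates each quartile at position p(n+1) and averages the two neighbouring ranked values, which appears to be the rule used in these slides. The function names are hypothetical. Library routines such as those in NumPy use other quantile conventions by default and may give a different IQR for the same data.

```python
import math


def quartile(values, p):
    """Quartile at position p*(n+1), averaging the two neighbouring values."""
    data = sorted(values)
    n = len(data)
    pos = round(p * (n + 1), 9)  # guard against floating-point noise
    lower = max(1, min(n, math.floor(pos)))
    upper = max(1, min(n, math.ceil(pos)))
    return (data[lower - 1] + data[upper - 1]) / 2


def iqr(values):
    """Interquartile range: Q3 - Q1."""
    return quartile(values, 0.75) - quartile(values, 0.25)


ages = [18, 21, 23, 24, 24, 32, 42, 59]
q1 = quartile(ages, 0.25)
q3 = quartile(ages, 0.75)
print(q1, q3, iqr(ages))  # 22.0 37.0 15.0
```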
Properties of IQR:
$$CV = \frac{S}{\bar{x}} \times 100$$
SD Mean CV (%)
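The coefficient of variation can be sketched as the standard deviation divided by the mean, expressed as a percentage. This sketch assumes S is the sample standard deviation (n - 1 in the denominator). The data values and the function name are hypothetical.

```python
import statistics


def coefficient_of_variation(values):
    """CV (%) = sample standard deviation / mean * 100."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("CV is undefined when the mean is zero")
    return statistics.stdev(values) / mean * 100


# Hypothetical measurements (illustration only).
weights_kg = [60, 62, 65, 68, 70]
cv_percent = coefficient_of_variation(weights_kg)
print(round(cv_percent, 2))
```

Because the CV has no units, it can be used to compare the variability of measurements recorded on different scales.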
• When the data are skewed, it is preferable to use the median and IQR as
summary statistics.
• Remark:
• The mean and median of a symmetric distribution coincide.
• When a distribution is skewed to the right, its mean is larger than its median.
• When a distribution is skewed to the left, its mean is smaller than its median.
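The relationship between the mean and the median under skewness can be illustrated with a few made-up values. The three data sets below are hypothetical and were chosen only to show the direction of the inequality.

```python
import statistics

# Hypothetical data sets (illustration only).
symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 2, 2, 3, 3, 3, 4, 20]      # long tail of large values
left_skewed = [21 - x for x in right_skewed]  # mirror image: tail of small values

for name, data in [
    ("symmetric", symmetric),
    ("right-skewed", right_skewed),
    ("left-skewed", left_skewed),
]:
    mean = statistics.mean(data)
    median = statistics.median(data)
    print(f"{name}: mean = {mean}, median = {median}")
```

The single large value in the right-skewed set pulls the mean upward while the median barely moves, which is why the median and IQR are preferred for skewed data.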
Fig. 2(a). Symmetric distribution (mean, median and mode coincide)
Fig. 2(b). Distribution skewed to the right (mode < median < mean)