Module 3.
Organizing and Summarizing Quantitative Data
Learning Outcomes
At the end of the week, the students should be able to:
1. summarize data using measures of central tendency
2. decide the appropriate measure of central tendency to be used given a data set
3. interpret numerical output to describe the distribution of data set in terms of its
variability
A. The Summation Notation
EXERCISES:
B. NUMERICAL METHODS FOR SUMMARIZING QUANTITATIVE DATA
MEASURES OF CENTRAL TENDENCY OF UNGROUPED DATA
The Measure of Central Tendency for ungrouped data are the mean, the median, the mode
and the midrange
1. The Mean of Ungrouped Data
The Mean, or the arithmetic mean or average of a set of numbers is the sum of all the
values in the data set divided by the number of observation.
2. The Median
3. The Mode
4. The mid-range
MEASURES OF CENTRAL TENDENCY OF GROUPED DATA
Recall:
Grouped Data are the data or scores that are arranged in a frequency distribution.
Frequency distribution is the arrangement of scores according to category of classes
including the frequency.
Frequency is the number of observations falling in a category.
Class mark (midpoint) is the middle value of a category or class interval.
1. Mean for Grouped Data (The Midpoint Formula)
̅ = mean value
Where: 𝒙
𝒙𝒊 = midpoint of each class or category
𝒇𝒊 = frequency of each class or category
Example 7. Consider the following data. Find the mean of the distribution.
X 25 35 45 55 65 75 85 95
f 1 2 2 3 12 14 12 4
Solution: Using the formula
X f xf
25 1 25
35 2 70
45 2 90
55 3 165
65 12 780
75 14 1050 3580
85 12 1020 = 50
95 4 380
∑ 𝒇 = 50 ∑ 𝒇𝒙 = 3580
𝑥̅ = 71.6
PROPERTIES OF MEAN
1. It measures stability. Mean is the most stable among other measures of central tendency
because every score contributes to the value of the mean.
2. The sum of each score’s distance from the mean is zero.
3. It may easily affected by extreme scores
4. It can be applied to interval level of measurement.
5. It may not be an actual score in the distribution.
6. It is very easy to compute
Example 8. The following table gives the frequency distribution of the number of orders received
each day during the past 65 days at the office of a mail-order company. Calculate the mean.
Number of Order f
10 – 12 4
13 – 15 12
16 – 18 20
19 – 21 14
22 – 24 5
25 – 27 10
n = 65
Solution:
Number of Order f x fx Solving for the mean
10 – 12 4 11 44
13 – 15 12 14 168
16 – 18 20 17 340
19 – 21 14 20 280
22 – 24
1207
5 23 115 =
25 – 27 10 26 260 65
n = 65 ∑ fx = 1,207 = 18.57
Analysis: On the average, the number of orders received during the past 65 days is 18.57
or approximately 19.
2. Median and Interquartile Range of Grouped Data
STEPS IN DETERMINING THE MEDIAN
1. Construct the cumulative frequency distribution.
2. Decide the class that contain the median. Class Median is the first class with the value of
cumulative frequency equal at least n/2.
3. Solve for the median by using the formula.
Example 9: Based on the grouped data below, find the median:
Therefore,
Analysis:
There are 25 persons who take less than 24 minutes to travel to work and another 25 persons
take more than 24 minutes to travel to work.
PROPERTIES OF MEDIAN
1. It may not be the actual observation in the set of data.
2. It can be applied in ordinal level
3. It is not affected by the extreme values because median is a positional measure.
4. The exact midpoint of the score distribution is desired.
5. There are extreme scores in the distribution.
Quartiles
Quartiles is a type of quantile which divides the number of data points into four parts, or
quarters, of more-or-less equal size. Using the same method of calculation as in the Median, we
can get Q1 and Q3 equation as follows:
Interquartile Range
Interquartile range (IQR) is the difference between the third quartile and the first quartile.
IQR = Q3 – Q1
Example 10. Based on the grouped data below, find the Interquartile Range
3. Mode
Mode is the value that has the highest frequency in a data set. For grouped data, class
mode (or, modal class) is the class with the highest frequency.
To find mode for grouped data, use the following formula:
Example 11: Based on the grouped data below, find the mode
ACTIVITY
The scores of 40 students in statistics class consist of 60 items and they are tabulated below.
Compute for the mean, median, mode and the interquartile range of the distribution