ENDATA130 Data Summarization-Computation of Measures of Variation
ENDATA130 Data Summarization-Computation of Measures of Variation
Illustration: Two data file/set with the same center (mean) but different variation away from the center.
Sample 1: No. of calls received per day for 5 days by two companies
Sample 2: Cans of paint sold per month for Brand A and Brand B
From Bluman
CONDITION:
The value of the different measures of variation is directly proportional to the extent or degree of
scattering.
Like: Range is greater, then scattering of data points is wider while when the Range is lesser, scattering
of data points is shorter.
Reqd: Range
Soln:
Mean absolute deviation or mean deviation = the sum of the absolute values of the deviation of each
values in the data file divided by the number of values in the data file.
- the average of the absolute deviations of the scores around the mean.
Let: X = specific value or score in the Data File/set = specific values of the variable
x̅ =mean
d = deviation
|𝑑|= |𝑥 − 𝑥̅ |
b.) Compute the mean absolute deviation as
∑|𝑑| ∑|𝑥− 𝑥̅ |
𝑀𝐴𝐷 = = ∑𝑓
𝑛
Illustration: Computation of MAD for Raw Data
Sample 1:
Data Set A Data Set B
1, 3, 6, 15, 45 10, 10,12, 18, 20
x̅ =14.00 x̅ =14.00
Reqd: MAD
Soln: For Data Set A
NOTE: The sum of the deviations (d) of all the data points/scores is ZERO.
For Data Set B
Σl fd l Σ l fd l
MAD = N or n or: MAD = Σ f
Classes f M fM d=M-x
̅ fd lfdl
(Days of Shipment)
1 -3 6
4–6 8
7-9 10
10 - 12 7
13 - 15 0
16 - 18 5
Variability can also be defined in terms of how close the scores in the distribution are to the middle
or center (mean) of the distribution. Using the mean as the measure of the middle or center of the
distribution, the variance is defined as the average squared difference of the scores from the mean (refer
to “squared deviation = d2). The variance is the measure of dispersion that eliminates negative signs by
squaring all deviations.
∑ 𝑑2 ∑(𝑥− 𝜇)2
𝜎2 = =
𝑁 𝑁
∑ 𝑑2 ∑(𝑥− 𝑥̅ )2
𝑠2 = =
𝑛−1 𝑛−1
lllustration: Computation of Variance for raw data
Sample 1:
Data Set A Data Set B
1, 3, 6, 15,45 10, 10,12, 18, 20
x̅ =14.00 x̅ =14.00
Reqd: Variance = s2
Soln:
For Data Set A
s2 = Σfd2 / n- 1
Illustration: Variance for Grouped Data (Frequency Distribution)
Example:
From Bluman
Given:
Classes f M d=M-x
̅ d2 fd2
(Days of Shipment)
1 -3 6 2
4–6 8 5
7-9 10 8
10 - 12 7 11
13 - 15 0 14
16 - 18 5 17
Determination of the Standard Deviation:
(x/y)n - F
Quantile Point = LB + f’ C
NOTE: The quantile class is the class whose “less than” cumulative frequency
contains the value (x/y)n.
◼ Example7:
Age of students enrolled in an adult basic mathematics subject:
◼ Example 8:
Data for the record high temperatures for each of the 50 states
Determine:
a.) D3
b.) P55
c.) Lowest value of the
25% hottest temperature
d.) Temperature
range/interval of the
middle 50% of the
recorded temperature
Exercises
1.) The nicotine contents, in milligrams, for 40 cigarettes of a certain brand were recorded as follows:
1.09 1.92 2.31 1.79 2.28 1.74 1.47 1.97 0.85 1.24
1.58 2.03 1.70 2.17 2.55 2.11 1.86 1.90 1.68 1.51
1.64 0.72 1.69 1.85 1.82 1.79 2.46 1.88 2.08 1.67
1.37 1.93 1.40 1.64 2.09 1.75 1.63 2.37 1.75 1.69
Find the
(a) mean
(b) median.
(c) MAD
(d) standard deviation
2.) Thirty automobiles were tested for fuel efficiency (in miles per gallon). This frequency distribution was
obtained. Calculate the mean and standard deviation of the fuel frequency in miles per gallons.
Class boundaries Frequency
7.5–12.5 3
12.5–17.5 5
17.5–22.5 15
22.5–27.5 5
27.5–32.5 2