2A Data Description (A)
2A Data Description (A)
Chapter 2A – Page 1
what is ab= axb=a.b=a(b)
Example
The following frequency distribution shows the lengths of metal pipes
produced on a particular day. Calculate the mean length.
Solution:
Length (cm) Class mark, x No. of metal pipes, f fx
10 – 20 15 3 45
20 – 30 20+30/2 35
7 25x7=175
30 – 40 35 10 35x10=350
40 – 50 45 16 720
50 – 60 55 34 1870
60 – 70 65 13 845
70 – 80 75
7 525
80 – 90 85 6 510
90 – 100 95 4 380
f = 100 fx = 5420
Mean = fx =
5420
= cm
f 100 5420
Interpretation:
54.20
The average length of a metal pipe is _______ cm.
mean
Chapter 2A – Page 2
Example
The following frequency distribution shows the daily production level.
Calculate the arithmetic mean.
Solution:
Production (units) f Class mark, x fx
13 – 17 2 15 30
18 – 22 22 20 440
23 – 27 10 25 250
28 – 32 14 30 420
33 – 37 3 35 105
38 – 42 4 40 160
43 – 47 6 45 270
48 – 52 1 50 50
Total 1725
Mean = fx =
1725
= 27.8226 units
f 62
Interpretation:
The average daily production is _______
27.8226
units.
Note:
The mean computed from population data is denoted by the symbol
(pronounced as miu); the mean computed from sample data is denoted by x
(pronounced as x bar).
Chapter 2A – Page 3
Median
• Median is the middle value of the distribution. It is important that the data
must be arranged in ascending or descending order of magnitude before
the median is to be located.
• The median being the middle value implies that 50% (half) of the
observations will be more than or equal to the median and 50% (another
half) of the observations will be less than or equal to the median.
Note: If the data set consists of even number of figures, then median will
be the average of the two middle values.
Example
The data below shows the number of components completed for a particular
product by 9 employees during a weekday:
6, 6, 7, 8, 4, 10, 6, 5, 5 n=9
Chapter 2A – Page 4
Example 4.5
Find the median for the distribution: 32, 35, 36, 43, 45, 45, 60, 74
Solution: n = 8 (already arranged in ascending order)
n +1 8 +1
Median = value of the th item = value of the th item
2 2
43 + 45
= value of the 4.5th item = = 44
2
(b) Grouped data
1. Calculate the cumulative frequencies to position the items in
ascending order.
n
2. Locate the median class where: median = value of the th item
2
3. Find the median using (1) ogive; (2) linear interpolation formula:
cm n
Median = Lm + − f m−1
LB UB fm 2
20-30
where 𝐿𝑚 = lower class boundary of median class
𝑐𝑚 = class size of median class
𝑓𝑚 = frequency of median class
∑ 𝑓𝑚−1 = cum. freq. of the preceding median class
before
Example
The following table shows the lengths of metal pipes produced on a
particular day. Calculate the median length.
Chapter 2A – Page 5
Solution:class boundary
Length (cm) f Cumulative f
10 – 20 3 3
20 – 30 7 10
30 – 40 10 10+7+3=30
40 – 50 16 16+10+7+3=36
50 – 60 34 70 50%
median class
60 – 70 13 83
70 – 80 7 90
80 – 90 6 96
90 – 100 4 100
Total 100
n 100
Median = value of the th item = value of the th item
2 2
= value of the 50th item
Median class boundaries: 50 – 60 class size=60-50=10
c n 10
Median = Lm + m − f m −1 = 50 + (50 − 36) = ______
54.1176 cm
fm 2 34
Interpretation:
50% of the metal pipes are having lengths less than or equal to _____
54.1176 cm
and the other 50% are having lengths more than or equal to _____
54.1176 cm.
Example
The following frequency distribution shows the daily production level.
Production (units) No. of days
13 – 17 2
18 – 22 22
23 – 27 10
28 – 32 14
33 – 37 3
38 – 42 4
43 – 47 6
48 – 52 1
Find the median using
(a) the linear interpolation method;
(b) an ogive.
Chapter 2A – Page 6
Solution:
Production(units) Class boundaries f Cumulative frequency
13 – 17 12.5 – 17.5 2 2
18 – 22 17.5 – 22.5 22 24
median 23 – 27 22.5-22.5 10 10+22+2=34
28 – 32 27.5-32.6 14 14+10+22+2=48
33 – 37 32.5 -37.5 3 51
38 – 42 37.5 – 42.5 4 55
43 – 47 42.5 – 47.5 6 61
48 – 52 47.5 – 52.5 1 62
-
Total
(b)
'<' Ogive shows the daily production for 62 days
70
60
50
Cumulative Frequency
n/2=62/2th=31^st
40
30
20
10
0
12.5 17.5 22.5 27.5 32.5 37.5 42.5 47.5 52.5
Production (unit)
Example
The following data show the number of children in 8 families:
2, 2, 3, 4, 0, 6, 2, 1
Find the mode.
Solution:
Arranging data in ascending order (optional): 0, 1, 2, 2, 2, 3, 4, 6
Mode = 2 children (occurs 3 times)
Example
Find the mode for the following data.
(1) 1, 2, 3, 4, 7, 9
(2) 1, 2, 2, 3, 4, 5, 5, 8
1. Locate the modal class (the class with the highest frequency)
2. Find the mode using (1) Histogram; (2) Formula.
Adjusted frequency
2 usages
. mode
. histogram Chapter 2A – Page 8
others-use original frequency
f m − fb
Mode = Lm + cm
2 f m − ( fb + f a )
Note:
If the formula is used to compute the mode for frequency distribution of
unequal class size, then the frequency will be replaced by the adjusted
frequency of the corresponding class.
Example
The following frequency distribution shows the lengths of metal pipes
produced on a particular day.
Chapter 2A – Page 9
Solution:
(a)
Interpretation:
The most common (or most frequent) length of metal pipe is 54.62cm.
or
Most of the metal pipes produced are having lengths 54.62 cm.
Chapter 2A – Page 10
Advantages and disadvantages of using mean, median or mode as a
measurement of central tendency
Mean
Advantages:
• It is easily understood.
• The value of each item is included in the computation; thus it has the kind
of exactness that the median and the mode lack of.
• It is well suited for further analysis.
Disadvantages:
• Its value may not correspond to the actual value, e.g. the average number
of children in each family is 2.4 children.
• It may be distorted by a few extremely high or low values, e.g. a distribution
of 5 persons’ income (in RM):
500, 200, 250, 300, 6000
The mean income per week = RM 1,450
The mean is now distorted or pulled up by one person’s income of RM6,000
(being the extreme value). The majority is only earning between RM200 –
RM500. Certainly, the mean of RM1,450 is misleading.
Median
Advantages:
• It is of use where the middle value is of significance.
• It is not affected by extreme values. Thus, it is usually preferred than mean
when there are extreme values in the distribution.
Disadvantages:
• It does not reflect the full range of values.
• It is not suitable for further analysis.
Chapter 2A – Page 11
Mode
Advantages:
• It is the more appropriate average in situations where it is useful to know
the most common value, e.g. the purchasing department in a shoe store will
order the modal shoe size.
• It is easy to find and is not influenced by extreme values.
Disadvantages:
• It ignores the dispersion around the modal value.
• It is not suitable for further analysis.
• It is not unique.
_____________________________________________________________
Question 1:
A psychologist developed a new test for adult intelligence. The test was
administered to 10 adults, and the following scores were obtained.
115 102 130 124 117 118 122 106 108 118
Chapter 2A – Page 12
Question 2:
The following table shows the number of cars sold by a second-hand car dealer
for the last 50 days.
Number of cars Number of days
1–2 8
3–4 14
5–6 18
7 – 10 6
11 and above 4
Chapter 2A – Page 13
AAMS1773 QUANTITATIVE STUDIES
TUTORIAL 2A (Measures of Central Tendency)
1. The following array shows the amounts spent (in RM) by a random sample
of 15 students at a primary school canteen:
0.50, 0.50, 0.75, 0.75, 0.75, 0.85, 0.90, 1.50, 1.90, 1.90, 2.35, 2.45,
2.71, 3.00, 3.10.
Determine the mean, median, and mode for the amounts spent. Then,
interpret your answers.
Chapter 2A – Page 14
4. A large fast-food chain store has 73 branches located at different parts of
the country. The profits of the companies at the end of the financial year are
as follows:
Profit ($ million) Number of companies
Under 2 30
2 and under 4 20
4 and under 6 10
6 and under 10 8
10 and under 15 5
(a) Draw a histogram to represent the information.
(b) Estimate the mode from the histogram.
(c) Calculate the mean, median and mode. Interpret your results.
Chapter 2A – Page 15
6. A company has produced the following cumulative frequency distribution of
ordinary shares held by a random sample of 300 shareholders.
No. of ordinary shares held (’00) No. of shareholders
Less than 10 49
Less than 20 139
Less than 30 217
Less than 40 259
Less than 50 285
Less than 60 300
(a) Draw an ogive for the above data and estimate the median. Interpret the
answer of the median.
(b) Find the sample mean.
Answers:
1. RM1.594; RM1.50; RM0.75
2. 12.1 minutes; 11.5 minutes; 5 minutes
3. (a) 13.86 min.,13.47 min.,13.11 min. (d) mean
4. (b) 1.5 ($ million) (c) 3.65 ($ million); 2.65($ million); 1.5($ million)
5. (a) 57.6 kg (b)(i) 58.25 kg (ii) 22.5%
6. (a) 21.5(’00) shares (b) 23.367(’00) shares
Chapter 2A – Page 16