0% found this document useful (0 votes)
81 views16 pages

2A Data Description (A)

The document discusses measures of central tendency, including the mean, median, and mode. [1] The mean, or average, is calculated by summing all values and dividing by the total number of items. [2] The median is the middle value when data is arranged in order. [3] Half of the observations are above the median and half are below.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
81 views16 pages

2A Data Description (A)

The document discusses measures of central tendency, including the mean, median, and mode. [1] The mean, or average, is calculated by summing all values and dividing by the total number of items. [2] The median is the middle value when data is arranged in order. [3] Half of the observations are above the median and half are below.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 16

AAMS1773 QUANTITATIVE STUDIES

CHAPTER 2A: DATA DESCRIPTION (A)

MEASURES OF CENTRAL TENDENCY (LOCATION)

The measures of central tendency is a single value that can be taken as a


summary value for that data set. Three types of often used measures of
central tendency are mean, median and mode.

Mean (arithmetic mean) – the “average”

• Mean is the most used measure of central tendency. It is the average of a


group of data. Mean is calculated by summing up all the observations in the
data set and then dividing it by the number of data.

• Computation of the mean:


total
(a) Raw data
∑𝒙 -value
𝑴𝒆𝒂𝒏 = 154+160+180/3 # mean
𝒏
where 𝑥 = individual value
𝑛 = the total number of items
 means total (summation) of
x1-x8 n=8

Example: Find the mean of 2, 2, 3, 4 0, 6, 2 and 1.


2+2+3+4+0+6+2+1 20
Solution: Mean = = =
8 8

total ,freq,class mark


(b) Grouped data
∑𝒇𝒙
𝑴𝒆𝒂𝒏 =
∑𝒇
where 𝑥 = class mark total, frequency
ub=lb/2

Chapter 2A – Page 1
what is ab= axb=a.b=a(b)

Example
The following frequency distribution shows the lengths of metal pipes
produced on a particular day. Calculate the mean length.

Length (cm) No. of metal pipes


x-class mark
10 – under 20 3
20 – under 30 7
30 – under 40 10
40 – under 50 16
50 – under 60 34
60 – under 70 13
70 – under 80 7
80 – under 90 6
90 – under 100 4
Total 100

Solution:
Length (cm) Class mark, x No. of metal pipes, f fx
10 – 20 15 3 45
20 – 30 20+30/2 35
7 25x7=175

30 – 40 35 10 35x10=350

40 – 50 45 16 720
50 – 60 55 34 1870
60 – 70 65 13 845
70 – 80 75
7 525

80 – 90 85 6 510
90 – 100 95 4 380
f = 100 fx = 5420

Mean =  fx =
5420
= cm
f 100 5420

Interpretation:
54.20
The average length of a metal pipe is _______ cm.
mean

Chapter 2A – Page 2
Example
The following frequency distribution shows the daily production level.
Calculate the arithmetic mean.

Production (units) No. of days


13 – 17 2
18 – 22 22
23 – 27 10
28 – 32 14
33 – 37 3
38 – 42 4
43 – 47 6
48 – 52 1

Solution:
Production (units) f Class mark, x fx
13 – 17 2 15 30
18 – 22 22 20 440

23 – 27 10 25 250

28 – 32 14 30 420
33 – 37 3 35 105
38 – 42 4 40 160
43 – 47 6 45 270
48 – 52 1 50 50
Total 1725

Mean =  fx =
1725
= 27.8226 units
f 62

Interpretation:
The average daily production is _______
27.8226
units.

Note:
The mean computed from population data is denoted by the symbol 
(pronounced as miu); the mean computed from sample data is denoted by x
(pronounced as x bar).

Chapter 2A – Page 3
Median

• Median is the middle value of the distribution. It is important that the data
must be arranged in ascending or descending order of magnitude before
the median is to be located.

• The median being the middle value implies that 50% (half) of the
observations will be more than or equal to the median and 50% (another
half) of the observations will be less than or equal to the median.

• Computation of the median:

(a) Raw data

1. Arrange the data into an array in ascending order of magnitude.


2. Locate the median item as:
n+1
Median = value of the th item
2
where n = no. of items in a data set.

Note: If the data set consists of even number of figures, then median will
be the average of the two middle values.

Example
The data below shows the number of components completed for a particular
product by 9 employees during a weekday:
6, 6, 7, 8, 4, 10, 6, 5, 5 n=9

Find the median.

Solution: Arrange the data in ascending order of magnitude.


4, 5, 5, 6, 6, 6, 7, 8, 10 n = 9
n+1 9+ 1
Median = value of the th item = value of the th item
2 2
= value of the 5th item = 6 components.

Chapter 2A – Page 4
Example 4.5
Find the median for the distribution: 32, 35, 36, 43, 45, 45, 60, 74
Solution: n = 8 (already arranged in ascending order)
n +1 8 +1
Median = value of the th item = value of the th item
2 2
43 + 45
= value of the 4.5th item = = 44
2
(b) Grouped data
1. Calculate the cumulative frequencies to position the items in
ascending order.
n
2. Locate the median class where: median = value of the th item
2
3. Find the median using (1) ogive; (2) linear interpolation formula:
cm  n 
Median = Lm +  −  f m−1 
LB UB fm  2 
20-30
where 𝐿𝑚 = lower class boundary of median class
𝑐𝑚 = class size of median class
𝑓𝑚 = frequency of median class
∑ 𝑓𝑚−1 = cum. freq. of the preceding median class
before

Example
The following table shows the lengths of metal pipes produced on a
particular day. Calculate the median length.

Length (cm) No. of metal pipes


10 – under 20 3
20 – under 30 7
30 – under 40 10
40 – under 50 16
50 – under 60 34
60 – under 70 13
70 – under 80 7
80 – under 90 6
90 – under 100 4
Total 100

Chapter 2A – Page 5
Solution:class boundary
Length (cm) f Cumulative f
10 – 20 3 3
20 – 30 7 10
30 – 40 10 10+7+3=30
40 – 50 16 16+10+7+3=36
50 – 60 34 70 50%
median class
60 – 70 13 83
70 – 80 7 90
80 – 90 6 96
90 – 100 4 100
Total 100
n 100
Median = value of the th item = value of the th item
2 2
= value of the 50th item
Median class boundaries: 50 – 60 class size=60-50=10
c n  10
Median = Lm + m  −  f m −1  = 50 + (50 − 36) = ______
54.1176 cm
fm  2  34

Interpretation:
50% of the metal pipes are having lengths less than or equal to _____
54.1176 cm
and the other 50% are having lengths more than or equal to _____
54.1176 cm.

Example
The following frequency distribution shows the daily production level.
Production (units) No. of days
13 – 17 2
18 – 22 22
23 – 27 10
28 – 32 14
33 – 37 3
38 – 42 4
43 – 47 6
48 – 52 1
Find the median using
(a) the linear interpolation method;
(b) an ogive.

Chapter 2A – Page 6
Solution:
Production(units) Class boundaries f Cumulative frequency
13 – 17 12.5 – 17.5 2 2
18 – 22 17.5 – 22.5 22 24
median 23 – 27 22.5-22.5 10 10+22+2=34
28 – 32 27.5-32.6 14 14+10+22+2=48
33 – 37 32.5 -37.5 3 51
38 – 42 37.5 – 42.5 4 55
43 – 47 42.5 – 47.5 6 61
48 – 52 47.5 – 52.5 1 62
-
Total

Median = value of the _____


62/2 item = value of the _____
62/2 item = value of the _____
31 item

(a) Median class boundaries: _____________


22.5-27.5
5
Median = 22.5 + (31 − 24) = 26 units
10

(b)
'<' Ogive shows the daily production for 62 days
70

60

50
Cumulative Frequency

n/2=62/2th=31^st

40

30

20

10

0
12.5 17.5 22.5 27.5 32.5 37.5 42.5 47.5 52.5
Production (unit)

From the ogive, median = 26 units.


Chapter 2A – Page 7
Mode

• Mode is the most frequently occurred value in a set of measures.


• Mode may not exist and even if it does, it may not be unique.
only one
• Computation of the mode:
(a) Raw data
Arrange the data in ascending order of magnitude. Locate the value with
the highest occurrence.

Example
The following data show the number of children in 8 families:
2, 2, 3, 4, 0, 6, 2, 1
Find the mode.

Solution:
Arranging data in ascending order (optional): 0, 1, 2, 2, 2, 3, 4, 6
Mode = 2 children (occurs 3 times)

Example
Find the mode for the following data.
(1) 1, 2, 3, 4, 7, 9
(2) 1, 2, 2, 3, 4, 5, 5, 8

Solution: 1.class boundary


(1) No mode. 2.class size
3.locate mode
(2) Two modes – 2 and 5. 4.calculate

(b) Grouped data

1. Locate the modal class (the class with the highest frequency)
2. Find the mode using (1) Histogram; (2) Formula.

Adjusted frequency
2 usages
. mode
. histogram Chapter 2A – Page 8
others-use original frequency
f m − fb
Mode = Lm +  cm
2 f m − ( fb + f a )

where 𝐿𝑚 = lower class boundary of the modal class


𝑓𝑚 = frequency of the modal class
𝑓𝑏 = frequency of the class before the modal class
𝑓𝑎 = frequency of the class after the modal class.
𝑐𝑚 = class size of the modal class

Note:
If the formula is used to compute the mode for frequency distribution of
unequal class size, then the frequency will be replaced by the adjusted
frequency of the corresponding class.

Example
The following frequency distribution shows the lengths of metal pipes
produced on a particular day.

Length (cm) No. of metal pipes


10 – under 20 3
20 – under 30 7
30 – under 40 10
40 – under 50 16
50 – under 60 34
60 – under 70 13
70 – under 80 7
80 – under 90 6
90 – under 100 4
Total 100

Find the mode for the frequency distribution using


(a) Histogram;
(b) Formula.

Chapter 2A – Page 9
Solution:
(a)

From the histogram, mode = 54.5 cm.

(b) Modal class boundaries: 50 – 60


f m − fb
Mode = Lm +  cm
2 f m − ( fb + f a )
(34−16)
= 50 + × 10
2×34−(16+13)
= 54.62 cm

Interpretation:
The most common (or most frequent) length of metal pipe is 54.62cm.
or
Most of the metal pipes produced are having lengths 54.62 cm.

Chapter 2A – Page 10
Advantages and disadvantages of using mean, median or mode as a
measurement of central tendency

Mean
Advantages:
• It is easily understood.
• The value of each item is included in the computation; thus it has the kind
of exactness that the median and the mode lack of.
• It is well suited for further analysis.
Disadvantages:
• Its value may not correspond to the actual value, e.g. the average number
of children in each family is 2.4 children.
• It may be distorted by a few extremely high or low values, e.g. a distribution
of 5 persons’ income (in RM):
500, 200, 250, 300, 6000
The mean income per week = RM 1,450
The mean is now distorted or pulled up by one person’s income of RM6,000
(being the extreme value). The majority is only earning between RM200 –
RM500. Certainly, the mean of RM1,450 is misleading.

Median
Advantages:
• It is of use where the middle value is of significance.
• It is not affected by extreme values. Thus, it is usually preferred than mean
when there are extreme values in the distribution.
Disadvantages:
• It does not reflect the full range of values.
• It is not suitable for further analysis.

Chapter 2A – Page 11
Mode
Advantages:
• It is the more appropriate average in situations where it is useful to know
the most common value, e.g. the purchasing department in a shoe store will
order the modal shoe size.
• It is easy to find and is not influenced by extreme values.
Disadvantages:
• It ignores the dispersion around the modal value.
• It is not suitable for further analysis.
• It is not unique.
_____________________________________________________________

Question 1:
A psychologist developed a new test for adult intelligence. The test was
administered to 10 adults, and the following scores were obtained.

115 102 130 124 117 118 122 106 108 118

Calculate the mean, median and mode.


(Answers: 116, 117.5, 118)

Chapter 2A – Page 12
Question 2:
The following table shows the number of cars sold by a second-hand car dealer
for the last 50 days.
Number of cars Number of days
1–2 8
3–4 14
5–6 18
7 – 10 6
11 and above 4

Calculate the mean, median and mode. Interpret your results.


(Answers: 5.22, 4.83, 4.92)

Chapter 2A – Page 13
AAMS1773 QUANTITATIVE STUDIES
TUTORIAL 2A (Measures of Central Tendency)
1. The following array shows the amounts spent (in RM) by a random sample
of 15 students at a primary school canteen:
0.50, 0.50, 0.75, 0.75, 0.75, 0.85, 0.90, 1.50, 1.90, 1.90, 2.35, 2.45,
2.71, 3.00, 3.10.
Determine the mean, median, and mode for the amounts spent. Then,
interpret your answers.

2. A manager observes the amount of time taken by his secretary to prepare


a sample of 10 business letters in the office and the results are arranged in
ascending order to the nearest minute:
5, 5, 5, 7, 9, 14, 15, 15, 16, 30.
Determine (a) mean; (b) median; and (c) mode.
Which measure of central tendency calculated above can best represent
the data in this question? Explain your answer.

3. A record on outgoing telephone calls lasting for 10 minutes or longer


produced the following table:
Length of call (nearest minute) No. of calls
10 and under 12 40
12 and under 14 109
14 and under 16 54
16 and under 18 22
18 and under 20 15
(a) Calculate the mean, median and modal values of the length of calls.
(b) Draw the histogram for the data and estimate the mode.
(c) Draw the ‘ less than’ ogive and estimate the median.
(d) If the charges for telephone call is $0.30 per minute, which of the 3
measures in part (a) above is most suitable to estimate the total
telephone charges? Explain.

Chapter 2A – Page 14
4. A large fast-food chain store has 73 branches located at different parts of
the country. The profits of the companies at the end of the financial year are
as follows:
Profit ($ million) Number of companies
Under 2 30
2 and under 4 20
4 and under 6 10
6 and under 10 8
10 and under 15 5
(a) Draw a histogram to represent the information.
(b) Estimate the mode from the histogram.
(c) Calculate the mean, median and mode. Interpret your results.

5. A survey is conducted by the administration staff in a college to study the


weight distribution of a sample of 120 students in a course.
Weight (kg) Number of students
40 – < 45 4
45 – < 50 13
50 – < 55 17
55 – < 60 40
60 – < 70 39
70 – < 80 7
(a) Construct a histogram to represent the data and estimate the mode
from it. Justify the answer of the mode by calculation.
(b) Construct a ‘<’ cumulative frequency polygon,
(i) estimate the median from it. Justify the answer of the median
by calculation.
(ii) if a student having weight 65 kg or more is classified as
overweight, estimate the percentage of overweight students.

Chapter 2A – Page 15
6. A company has produced the following cumulative frequency distribution of
ordinary shares held by a random sample of 300 shareholders.
No. of ordinary shares held (’00) No. of shareholders
Less than 10 49
Less than 20 139
Less than 30 217
Less than 40 259
Less than 50 285
Less than 60 300
(a) Draw an ogive for the above data and estimate the median. Interpret the
answer of the median.
(b) Find the sample mean.

Answers:
1. RM1.594; RM1.50; RM0.75
2. 12.1 minutes; 11.5 minutes; 5 minutes
3. (a) 13.86 min.,13.47 min.,13.11 min. (d) mean
4. (b) 1.5 ($ million) (c) 3.65 ($ million); 2.65($ million); 1.5($ million)
5. (a) 57.6 kg (b)(i) 58.25 kg (ii) 22.5%
6. (a) 21.5(’00) shares (b) 23.367(’00) shares

Chapter 2A – Page 16

You might also like