Lec 6
Lec 6
Numerical Summaries
Measures of Central Tendency
Lecture 6
1 / 38
Outline
2 / 38
Lecture Objectives
3 / 38
Measures of Central Tendency
4 / 38
The Mean
5 / 38
The Mean
Example 1
The data represent the number of days off per year for a sample of
individuals selected from nine different countries.
6 / 38
The Mean
Solution
Pn
i=1 xi 20+26+40+36+23+42+35+24+30 276
x̄ = n
= 9
= 9
=30.7
7 / 38
The Mean
Example 2
The manager of the BiLo Supermarket, gathered the following
information on the number of times a customer visits the store during a
month. The responses of 20 customers.
Using the frequency distribution below, find the mean?
class limits frequency
6 − 10 1
11− 15 2
16 −20 3
21 − 25 5
26 − 30 4
31 − 35 3
36 − 40 2
Total=20
8 / 38
The Mean
9 / 38
The Mean
Solution:
class limits frequency Mid pints f ∗ xm
5.5 − 10.5 1 8 8
10.5− 15.5 2 13 26
15.5 –20.5 3 18 54
20.5 – 25.5 5 23 115
25.5 – 30.5 4 28 112
30.5 – 35.5 3 33 99
35.5 − 40.5 2 38 76
P
Total=20 f ∗ xm = 490
P
f ∗ xm 490
x̄ = = = 24.5
n 20
10 / 38
The Median
The Median
The median is the midpoint of the data array. The symbol for the
median is MD
When the data set is ordered, it is called a data array.
Example
The number of rooms in the seven hotels is
713, 300, 618,595, 311, 401, and 292. Find the median
11 / 38
The Median
Solution:
First Arrange the data in order.
292, 300, 311, 401, 595, 618, 713
Second Find the median for the data
if the total data values is odd the median order is n+1
2
if the total data values is even the median is the average
of the two midpoint values. m1 = n2 m2 = n2 + 1
x +x
MD = m1 2 m2
the total data in this example is odd so the median order
is 7+1 th
2 = 4, the median is the four element which is 401.
12 / 38
The Median
Examble
The number of children with asthma during a specific year in eight
cities is:
13 / 38
The Median
Examble
Using the frequency table below find the median
class limits frequency
6 − 10 1
11− 15 2
16 −20 3
21 − 25 5
26 − 30 4
31 − 35 3
36 − 40 2
Total=20
14 / 38
The Median
solution
First Find the cumulative frequency.
Second Determine the median class,
which is the class with the cumulative frequency
exactly greater then the median order.
Third find the median value using the rule
median(MD) = L + fwm ∗ (0.5 ∗ n − cfb )
where
L=lower class limit of the class that contains the median
n= total frequency
cfb =the sum of frequencies (cumulative frequency)
for all classes before the median class
fm =frequency of the class interval containing the median
w= interval width
15 / 38
The Median
solution
class frequency CF
less than 10.5 1 1
less than 15.5 2 3
less than 20.5 3 6
less than 25.5 5 11
less than 30.5 4 15
less than 35.5 3 18
less than 40.5 2 20
Total=20
The median class is the class with the cumulative frequency greater
than 10 which is 11 the class is 21 − 25, applying the median rule we
get
5
MD = 20.5 + ∗ (0.5 ∗ 20 − 6) = 24.5
5
16 / 38
The Mode
The mode is the least used of the measures of central tendency and
can only be used when dealing with nominal data.
The Mode
The mode is the value that occurs most often in a data set.
17 / 38
The Mode
A data set that has only one value that occurs with the greatest
frequency is said to be unimodal.
If a data set has two values that occur with the same greatest frequency,
both values are considered to be the mode and the data set is said to be
bimodal.
If a data set has more than two values that occur with the same greatest
frequency, each value is used as the mode, and the data set is said to
be multimodal.
When no data value occurs more than once, the data set is said to have
no mode.
A data set can have more than one mode or no mode at all.
18 / 38
The Mode
Example
Find the mode for the following data sets
1 8, 9, 9, 14, 8, 8, 10, 7, 6, 9, 7, 8, 10, 14, 11, 8, 14, 11
2 10, 31, 30, 84, 20, 18, 62, 77, 33, 52
3 15, 18, 18, 18, 20, 22, 24, 24, 24, 26, 26
19 / 38
The Mode
solution
1 The mode is 8.
2 There is no mode.
3 The modes are 18 and 24.
20 / 38
The Mode
Example
Using the previous frequency table from the last example find the
mode?
class limits frequency
6 − 10 1
11− 15 2
16 −20 3
21 − 25 5
26 − 30 4
31 − 35 3
36 − 40 2
Total=20
21 / 38
The Mode
Solution
First Find the modal class.
Which the class with the highest frequency
Second find the mode value using the rule
f −fi−1
mode = L + (f −f i )+(f ∗w
i i−1 i −fi+1 )
where
L=lower class limit of the interval that contains
the mode(modal class).
fi = the frequency of the modal class.
fi−1 = the frequency of the class previous to the modal class.
fi+1 = the frequency of the class after the modal class.
w= the width of the modal class.
22 / 38
The Mode
23 / 38
The Midrange
The Midrange
The midrange is defined as the sum of the lowest and highest values in
the data set, divided by 2.
Example
For the following dataset find the midrange
2, 3, 6, 8, 4, 1
Solution
1+8
MR = 2 = 4.5
25 / 38
The Midrange
Example
Find the midrange of data for the yearly income for a sample of
employees in a company
solution
10+34.5
MR = 2 = 22.5
Notice that the midrange doesn’t represent most of the data, and this
is due to the highest value is extremely large.
26 / 38
The Weighted Mean
The weighted mean is a special case of the arithmetic mean
(average). Instead of each data point contributing equally to the
final mean, some data points contribute more “weight” than others.
If all the weights are equal, then the weighted mean equals the
arithmetic mean.
The Weighted Mean
The weighted mean of a variable X is found by multiplying each value
by its corresponding weight and dividing the sum of the products by
the sum of the weights.
Pn
w1 X1 +w2 X2 +w3 X3 +...+wn Xn wi Xi
X̄ = w1 +w2 +w3 +...+wn = Pi=1
n
i=1 wi
where
w1 , w2 , w3 , . . . , wn are the weights and X1 , X2 , X3 , . . . , Xn are the
values, and it’s used when you need to find the mean of a data set in
which not all values are equally represented.
27 / 38
The Weighted Mean
Example
A student received an A in STAT1 (72 ), a C in Comp (53 ), a B in
Mang(65 ), and a D in Math (47 ). Assuming STAT1 and Comp are 3
hours credit each, Mang and Math are 2 hours credit.what is the
student’s average
solution
applying the weighted mean rule we get
72 ∗ 3 + 53 ∗ 3 + 65 ∗ 2 + 47 ∗ 2 599
X̄ = = = 59.9
3+3+2+2 10
the student’s grade average is 59.9.
28 / 38
The Geometric Mean
P
log x
log GM =
n
29 / 38
The Geometric Mean
Example
Find the GM of 1,3 and 9
solution
p
3
GM = (1) (3) (9) = 3
30 / 38
The Geometric Mean
31 / 38
The Geometric Mean
32 / 38
Harmonic mean
Harmonic mean
The harmonic mean (HM) is defined as the number of values divided
by the sum of the reciprocals of each value.
34 / 38
Properties and Uses of Measures of Central
Tendency(The Median)
1 The median is used to find the center or middle value of a data set.
2 The median is used when it is necessary to find out whether the
data values fall into the upper half or lower half of the distribution.
3 The median is used for an open-ended distribution.
4 The median is affected less than the mean by extremely high or
extremely low values.
35 / 38
Properties and Uses of Measures of Central
Tendency(The Mode)
3 The mode can be used when the data are nominal or categorical,
such as gender.
4 The mode is not always unique. A data set can have more than
one mode, or the mode may not exist for a data set.
36 / 38
Properties and Uses of Measures of Central
Tendency(The Midrange)
37 / 38
Properties and Uses of Measures of Central
Tendency(The Geometric Mean)
38 / 38