Statistics 1 - Chapter 4
Statistics 1 - Chapter 4
Chapter 4
MEASURES OF CENTRAL TENDENCY
4.1 Introduction
Descriptive measures that are used to indicate where the center, the middle property, or the most
typical value of a set of data lies are called measures of central tendency, often referred to as
averages. We shall consider in this chapter the three most important measures of central tendency:
the mean, the median, and the mode.
The most commonly used measure of central tendency is the arithmetic mean. It is also called
the mean or the computed average. It is defined to be the sum of the values of a group of items
divided by the number of such items. The mean of a sample scores on a variable x is symbolized by
x́ (x-bar) and the mean of a population is called the µ (mu). Most of the time, researchers are forced
to estimate µ from x, since they cannot measure every item in the population.
It is appropriate to use when sample data are being used to make inferences about
populations.
It is a point which balances all the values on either side.
It is sensitive or greatly affected by the presence of extremely high or low values.
It is inappropriate to use when the distribution is highly skewed.
It cannot be computed when the distribution contains open-ended intervals.
The mean is the most commonly used, easily understood, easily calculated, and generally
recognized average.
It is the best measure to use when the distribution is normal or symmetrical.
It is a useful measure for inferential statistics.
It is also used to obtain values after each item is weighted.
For ungrouped data, the mean is computed by simply adding all the values and dividing the
sum by the total number of items.
2
x̄=
∑x
n
μ=
∑x
N
Example 4.2-1
A random sample of six cashiers in a department store shows the following balances at the
end of the day:
Ᵽ 16,640.39+26,915.50+ ⋯+20,244.12
x́=
6
Ᵽ 234,230.10
x́=
6
x́=Ᵽ 39,038.35
x́=
∑ fx
∑f
Example 4.2-2
The final grades of a student at the end of the school year are as follows:
981.6
x́=
11.5
x́=85.36
Example 4.2-3
210 210 850 360 310 310 210 210 960 210
210 210 210 210 210 310 310 360 850 960
Ᵽ 3,840
x́=
10
Data which are arranged in a frequency distribution are called grouped data. Observations
belonging to each class interval are represented by the class mark of the interval. There are
two methods we can use to compute the mean from grouped data. These are the long method
and the coded method.
A. Long Method
Example 4.2-4
X̄ =
∑ fX
n
3 , 547
X̄ =
50
B. Coded Formula
This formula requires coding and is called the coded formula for the mean. The
procedure is as follows:
1. Take the class mark of a class interval as an assumed mean. Denote this by x 0. This x 0
is set to zero (origin).
5
2. The class marks of the classes following the class containing the origin are coded
+1, +2, …The class marks prior to the class containing the origin are coded
-1, -2, … or the class marks may be expressed by the codes
X i−X o
U i= where c is the size of the class interval.
c
where
X̄ =X 0 +c ( )
∑ f i Ui
i=1
X 0 = assumed mean
n
f = frequency
n = total frequency
c = class size
Ui = 0, ± 1, ± 2, ± 3, …
Example 4.2-5
X̄ =X 0 +c ( )
∑ f i Ui
i=1
n
X̄ =71+3 (50−1 )
X̄ =71−0.06
X̄ =70.94
6
7
Exercise 4.1
21, 19, 16, 18, 19, 22, 25, 26, 35, 35, 40, 42, 41, 26, 27
2. A sample of 20 families gave the following data on the number of children per family. Find the
mean.
0, 1, 3, 4, 4, 4, 2, 2, 3, 5, 6, 7, 5, 6, 5, 3, 3, 4, 5, 0
3. A sample of 15 supermarkets reported the following prices of sugar per kilo. Find the average
price of sugar per kilo.
Ᵽ24, 24.50, 25, 24.50, 23.50, 24.25, 24.50, 24, 22.75, 23, 23.50, 23.50, 23, 24.25, 24
4. Ely Santos is working part time at a restaurant in Makati. If she works for 30 hours in one week,
how many hours per day does she work?
5. The mean height of Ana, Lucy, Pearl, and Tess is 66 inches. What is the height of Cora if the
mean height of the five girls is 70?
6. The average of four numbers is 40. The third is 12 more than the second. The second is five times
the first. The fourth is 8 less than the first. Find the numbers.
7. The six departments of a company, consisting of 22, 32, 18, 16, 10, and 12 employees have an
equal monthly salary of Ᵽ7,200; Ᵽ7,600; Ᵽ6,900; Ᵽ8,200; Ᵽ7,800; and Ᵽ7,200, respectively.
What is the mean monthly salary of all the employees of the company?
8. The distribution of ages (years) of married women in a municipality are given below:
Ages Frequency
16-22 13
23-29 16
30-36 24
37-43 19
44-50 8
The median (Md) of a set of data is a measure of central tendency that occupies the middle
position in an array of values. It is the number that divides the data into two such that 50% of the
data fall below this value and 50% fall above this value.
n+1
Md= X m where :m=
2
b. If n is even, the median is the average of the two middle values. That is,
X m+ X (m +1) n
Md= where :m=
2 2
Example 4.3-1
The amount of money a balut vendor earned on five randomly selected days are:
n+1 5+1
2. m= = =3 (since n=5 is odd)
2 2
Example 4.3-2
n 10
2. m= = =5 (since n=10 is even)
2 2
X 5+ X 6 77 +81
Md= = =79
2 2
The median of a grouped frequency distribution is essentially the x-coordinate of the point
of intersection of the “less than” and “greater than” ogives of the distribution.
n
Md=L B +c
2
( )
−F
f Md
n
Median class isthe class interval where is found
2
F=cumulative frequency for the class interval preceding themedian class
c=class ¿ ¿
f Md =frequency of the median class
Example 4.3-2
Height
(inches) f ¿ cf
61-63 2 2
64-66 5 7
67-69 12 19
70-72 15 34
73-75 8 42
76-78 5 47
79-81 3 50
Total 50
10
Solution:
n
=25 → Md class :70−72
2
n
Md=L B +c
2
( )
−F
f Md
¿ 69.5+3 ( 25−19
15 )
¿ 69.5+1.2
Md=70.7inches
The mode (Mo) is the value that appears most often in a series. A series may have more
than one mode or none at all. For grouped data, the class with the greatest frequency is called the
modal class. A distribution with only one mode is said to be unimodal. A distribution wherein there
are two classes with the greatest frequency is referred to as bimodal.
For ungrouped data, the mode is the most frequent score in the series.
Example 4.4-1
Mo=7
b) 6, 6, 6, 9, 9. 9, 9, 12, 12, 12, 12, 12, 12, 15, 15, 15, 15, 15, 15, 21, 21, 35, 35
11
Mo=12 ,15
For grouped data, the class with the greatest frequency is called the modal class. The mode
can be obtained using the formula:
d1
Mo=LB + c ( d1 +d 2 )
where LB =lower class boundary of themodal class
d 1=difference between the frequency of the modal class∧¿
the frequency of the class intervallower thanthe modal class
d 2=difference between the frequency of the modal class∧¿
the frequency of the class interval higher than the modal class
c=class ¿ ¿
Example 4.3-2
Height
(inches) f
61-63 2
64-66 5
67-69 12
70-72 15
73-75 8
76-78 5
79-81 3
Total 50
Solution:
d1
Mo=LB + c ( d1 +d 2 )
¿ 69.5+3 ( 3+73 )
¿ 69.5+0.9
Mo=70.4 inches
12
a) Normal Distribution