0% found this document useful (0 votes)
41 views47 pages

CH 3

The document discusses various measures of central tendency including the mean, median, and mode. It provides definitions and formulas for calculating the arithmetic mean, weighted mean, geometric mean, and harmonic mean. Examples are given to demonstrate calculating the means for grouped data, weighted data, and data with transformations applied. Properties of the different means are outlined including how they are affected by adding or multiplying constants to the original data values.

Uploaded by

Abera Molla
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
41 views47 pages

CH 3

The document discusses various measures of central tendency including the mean, median, and mode. It provides definitions and formulas for calculating the arithmetic mean, weighted mean, geometric mean, and harmonic mean. Examples are given to demonstrate calculating the means for grouped data, weighted data, and data with transformations applied. Properties of the different means are outlined including how they are affected by adding or multiplying constants to the original data values.

Uploaded by

Abera Molla
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 47

.

CHAPTER 3
Measure of Central Tendency
[MCT]

November 21,2022

1
Central Tendency
 The goal of central tendency is to identify
the single value that is the best
representative for the entire set of data.
 By identifying the "average score (Mean),"
central tendency allows researchers to
summarize a large set of data into a single
value.

2
Summation Notation
• The symbol ∑, that means to sum (add).
• If we denote a set of data by
X = (x1, x2, ..., xn),You will see notations such
as ∑ X. This is read as the “sum of X” &
means to find the sum of the X scores.

3
Continual of summation notation …

• Suppose scores made on quiz result for


five students in the class: 5, 7, 7, 6, and 8.
where N=5, then find sum of scores.
Solution

4
PROPERTIES OF SUMMATION

1. where k is any constant.


2. where k is any constant.

3. where a and b are any constant.

4. ++…..+

5
Properties of measures of central tendency (a typical
average should possess the following)

 It should be rigidly defined.


 It should be based on all observation under investigation.
 It should be as little as affected by extreme observations.
 It should be capable of further algebraic treatment.
 It should be as little as affected by fluctuations of
sampling.
 It should be ease to calculate and simple to understand.

6
Types of Measures of Central Ten-
dency
Summarizing Data
Give your single value or
The Mean
measure that represents The Median
an entire group of scores
The Mode

7
Continual of Types of MCT…

To describe the central tendency of a dataset,


we typically talk in terms of:
• Mean (or average)
• Median (center of the dataset)
• Mode – most frequent (repeated) value!

8
The Mean

• Arithmetic Mean
• Weighted Mean
• Geometric Mean
• Harmonic mean

9
Arithmetic Mean

Sample Population

x
x 
X N
n

10
Arithmetic Mean of Grouped Data

• if x1 , x 2 , x 3 ,.........., x k are the class marks


and f1 , f 2 , f 3 ,........, f k are the corresponding
frequencies, where the subscript ‘k’ stands for the
number of classes, then the mean is

A.M 
 fx i i

f i
11
Example: Find A.M and the rest frequencies?
If n= 75 & 20% of the students have marks between 55 and 59.

Marks(100) Number of students

40-44 7

45-49 10

50-54 22

55-59 f4

60-64 f5

65-69 6

70-74 3
solution
• Firstly, we should find the value of f4 and f5. since
20% of the students equals to f4. implies f4= 20%
*75=0.2*75=15
• And f5=75-(f1+ f2+ f3+ f4+ f6+ f7)
=75-(7+10+22+15+6+3)
=12
Then f4=15 and f5 =12.

13
Solutions:
1.First find the class marks
2.Find the product of frequency and class marks
3.Find mean using the formula
Class fi Xi (class mark) Xifi
40-44 7 42 294
45-49 10 47 470
50-54 22 52 1144
55-59 15 57 855
60-64 12 62 744
65-69 6 67 402
70-74 3 72 216
Total 75 4125
using the Then the arithmetic mean A.M=4125/75
formula for (A.M)will be A.M=55
grouped data

A.M 
fx i i

f i 14
Special properties of Arithmetic mean

1. The sum of the deviations of a set of items from their mean is always zero. i.e.
n

 ( X  X )  0.
i 1
i

2. The sum of the squared deviations of a set of items from their mean is the minimum. i.e.
n n

 ( Xi  X )   ( X  A) , A  X
i 1
2

i 1
i
2

3. If X 1 is the mean of n1 observations


If X 2 is the mean of n2 observations
.
.
If X k is the mean of nk observations
Then the mean of all the observation in all groups often called the combined mean is given by:
k

X1n 1  X2 n 2  ....  Xk n k 
Xi n i
Xc   i 1k
n 1  n 2  ... n k
n
i 1
i

15
Cont..

Example for property three:


In a class there are 30 females and 70 males. If females averaged 60 in an
examination and boys averaged 72, find the mean for the entire class.
Solutions:

Females Males
X 1  60 X 2  72
n1  30 n2  70

X 1 n1  X 2 n 2  X i ni
Xc   i 1 2
n1  n 2  ni i 1

30(60)  70(72) 6840


 Xc    68.40
30  70 100
16
4. If a wrong figure has been used when calculating the mean the correct
mean can be obtained without repeating the whole process using:

Example: An average weight of 10 students was calculated to be 65.


Latter it was discovered that one weight was misread as 40 instead of 80
k.g. Calculate the correct average weight.

Solutions:
(CorrectValue  WrongValue)
CorrectMean  WrongMean 
n
(80  40)
CorrectMean  65   65  4  69k.g.
10

17
Cont..

5. The effect of transforming original series on the mean.


a) If a constant k is added/ subtracted to/from every observation
then the new mean will be the old mean± k respectively.
b) If every observations are multiplied by a constant k then the new
mean will be k*old mean
Exercise for property five
1. The mean of n Tetracycline Capsules X1, X2, …,Xn are known to be
12 gm. New set of capsules of another drug are obtained by the
linear transformation Yi = 2Xi – 0.5 ( i = 1, 2, …, n ) then what will
be the mean of the new set of capsules.
2. Let the mean of a set of numbers is 500.
a) If 10 is added to each of the numbers in the set, then what will be the
mean of the new set?
b) If each of the numbers in the set are multiplied by -5, then what will
be the mean of the new set?
18
Weighted Mean

The Weighted mean of the positive real numbers x1,x2,


..., xn with their weight w1,w2, ..., wn is defined to be

w x
i 1
i i
x  n

w
i 1
i

19
Example: Weighted mean
 When determining the GPA of student for first
semester!
• Let a student obtained grade A, B, C for three
courses with respective credit hours 3,4,2.
• (Where A=4 Point, B= 3 point, c =2 point). Find the
student weighted grade point average (GPA).
Solutions:
= = 3.11

20
Geometric Mean
Geometric mean is defined as the positive root of the product
of observations.
• In general, the sample geometric mean is calculated by

Example1:The man gets three annual raises in his salary. At the end of first
year, he gets an increase of 4%, at the end of the second year, he gets an
increase of 6% and at the end of the third year, he gets an increase of 9% of
his salary. What is the average percentage increase in the three periods?
Solution: = 1.0631=>

21
Harmonic mean
 is suitable measure of central tendency when the data
pertains to speed, rates and price.
Let be n variant values in a set of observations, then
simple harmonic mean is given by:

 Note: SHM is used for equal distances, equal costs and


equal rates.
Weighted harmonic mean (WHM)
• WHM is used for different distance, different cost and
different rate.

22
Example
A motorist travels for three days at a rate (speed) of 480km
per day. On the first day he travels 10 hours at a rate of 48
km/h, on the second day 12hours at a rate of 40km/h, on the
third day 15hours at a rate of 32km/h. What is the average
speed?
Solution: Since the distance covered by the motorist is
equal (), so we use SHM.
• so the required average speed .

23
The mode
I) Mode for Ungrouped (Discrete) FD
 Mode is a value which occurs most frequently in a set of
values.
 The mode may not exist and even if it does exist, it may not
be unique.
 In case of discrete distribution the value having the maximum
frequency is the modal value.
Examples:
 Find the mode of 5, 3, 5, 8, 9

Mode =5
 Find the mode of 8, 9, 9, 7, 8, 2, and 5.

It is a bimodal Data: 8 and 9


 Find the mode of 4, 12, 3, 6, and 7.

No mode for this data. 24


Example: Find Mean and Mode of
raw Data
The quiz result for 6 students out of 5 was found to
be:

3,4, 1,4,2,5

Mean Mode
3.8 4;
Unimodal 25
Example: Find Mode
Age Frequency
Our example: 19 159
 20 & 21 are the most
20 219
frequently occurring age
in our sample 21 219
 Therefore the mode of 22 146
this distribution are 20 &
23 123
21
 This is a bimodal 25 83
distribution 27 48
29 16
32 20
40 14
26
CONT..

II) Mode for Grouped data


 If data are given in the shape of continuous frequency
distribution, the mode is defined as
ˆ  L  1 
X mo  w
  

 1 2 
Xˆ  the mod e of the distributi on
Where: Lmo  lower class boundaries of the modal class
w  the size of the mod al class
1  f mo  f1
 2  f mo  f 2
f mo  frequency of the mod al class
f1  frequency of the class preceeding the mod al class
f 2  frequency of the class following the mod al class

Note: The modal class is a class with the highest frequency. 27


Example :the size of farms in hectare.

find mode
Size of farms Number of farms
5-15 8
15-25 12
25-35 17
35-45 29
45-55 31
55-65 5
65-75 3

28
Solution
45
.  55 is the modal class, since it is a class with the highest frequency .

Lmo  45
w  10
 1  f mo  f1  2
 2  f mo  f 2  26
f mo  31
f1  29
f2  5

ˆ  45  10 2 
X
 2  26 
 45. 71

29
The Median
 median is the value of the variable which divides it in
to two equal halves.
 In an ordered series of data median is an observation
lying exactly in the middle of the series.
 If X1, X2, …Xn be the observations, then the numbers
arranged in ascending order will be X[1], X[2], …X[n],
where X[i] is ith smallest value. X[1]< X[2]< …<X[n]
I) Median for ungrouped data

30
Cont…

II) Median for grouped data


 If data are given in the shape of continuous frequency
distribution, the median is defined as
~ w n
X  Lmed  (  c)
f med 2
Where :
Lmed  lower class boundary of the median class.
as: w  the size of the median class
n  total number of observatio ns.
c  the cumulative frequency (less than type ) preceeding the median class.
f med  thefrequen cy of the median class.

Remark:
The median class is the class with the smallest
cumulative frequency (less than type) greater than or 31

equal to .
.
Example : Find the median of the following
distribution

32
Solution
Class Frequency Cumu.Freq(less
than type)
40-44 7 7
45-49 10 17
50-54 22 39
55-59 15 54
60-64 12 66
65-69 6 72
70-74 3 75

n 75
  37. 5
2 2
39 is the first cumulative frequency to be greater than or equal to 37. 5
 50  54 is the median class.

L  49. 5, w 5
med
n  75, c  17, f  22
med


~
X L  w ( n  c)
med f 2
med
 49. 5  5 (37. 5  17)
22
 54. 16 33
Quantiles
Quantiles are measures that divide the frequency
distribution in to different equal parts.
I) Quartiles:
Quartiles are measures that divide the frequency
distribution in to four equal parts.
Q1 is a value which has 25% items which are less
than or equal to it. Similarly Q2 has 50%items with
value less than or equal to it and Q3 has 75% items
whose values are less than or equal to it.
• Calculating quartiles for raw data

34
Quantile formula for grouped data

Remark: The quartile class (class containing Qi) is the class with the smallest cumulative
in
frequency (less than type) greater than or equal to .
4 35
Cont..

II) Deciles:
Deciles are measures that divide the frequency
distribution in to ten equal parts.
The values of the variables corresponding to these
divisions are denoted D1, D2,.. D9 often called the first,
the second,…, the ninth deciles respectively.
 Calculating Deciles for raw data

36
Calculating for deciles for grouped data

Remark:
• The decile class (class containing Di) is the class with
the smallest cumulative frequency (less than type)
greater than or equal to
37
Cont…

III) Percentiles:
 Percentiles are measures that divide the frequency

distribution in to hundred equal parts.


 The values of the variables corresponding to these divisions

are denoted P1, P2,.. P99 often called the first, the second,…,
the ninety-ninth percentile respectively.
• Calculating percentiles for raw data

38
Percentiles for grouped data

Remark: The percentile class (class containing Pi) is the class with the smallest cumulative
in
frequency (less than type) greater than or equal to .
100
39
Example for finding quartiles for raw data

 the following data shows the age of 30 sampled patients in JUSH


49,9,11,14,17,16,18,21,22,22,27,22,23,25,25,26,22,28,28,32,39,34,3
4,36,33,39,41,45,46,6
 find the lower, middle and upper quartiles for the above data.
Solution:
1st order the data (if it hasn’t been ordered)
6,9,11,14,16,17,18,21,22,22,22,22,23,25,25,26,27,28,28,32,33,34,34,36,39,
39,41,45,46,49 . Then , n =30,
Q1= ¼ (30+1) th value =7.75th value =7th value +0.75(8th value -7th value)
18+0.75(21-18) =
18+2.25 =20.25
•This implies one fourth of the patients (25%) have age below 20.25 years.40
Example for calculating grouped data
• Considering the distribution for cholesterol levels of
493 people were measured in mg/dl.
Find: A) All quartiles. B) The 7th decile C)The 90th
percentile.

41
Solution
 First find the less than cumulative frequency.
 second use the formula to calculate the required
quantiles.
Values Frequency Cum.Freq(le
ss than type)
140- 150 17 17
150- 160 29 46
160- 170 42 88
170- 180 72 160
180- 190 84 244
190- 200 107 351
200- 210 49 400
210- 220 34 434
220- 230 31 465
230- 240 16 481
240- 250 12 493

42
Cont..

a. Quartiles:
i. Q1
• determine the class containing the first quartile

43
Cont..
i. Q2
- determine the class containing the second quartile.
2*n
 246.5
4
 190  200 is the class containing the sec ond quartile .

LQ2  190 , w 10


n  493 , c  244 , f Q2 107

w 2*n
 Q2  LQ2  (  c)
f Q2 4
10
 170  ( 246.5  244)
72
 190.23

44
Cont..

a. D7
- determine the class containing the 7th decile.
7*n
 345.1
10
190  200 is the class containing the seventh decile.
LD7  190 , w 10
n  493 , c  244 , f D7 107

w 7*n
 D7  LD7  (  c)
f D7 10
10
 190  (345.1  244)
107
 199.45

45
Cont..
c. P90, determine the class containing the 90th percentile.

46
It is enough just for to day …

Question ???

47

You might also like