0% found this document useful (0 votes)

32 views8 pages

Statistical Measures

This document discusses various statistical measures used to summarize data including measures of central tendency like the mean, median, and mode. It provides examples and formulas for calculating these measures and discusses their advantages and disadvantages. Measures of dispersion are also introduced.

Uploaded by

francis Magoba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

32 views8 pages

Statistical Measures

Uploaded by

francis Magoba

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

LESSON TWO: CONTINUATION OF DESCRIPTIVE STATISTICS

STATISTICAL MEASURES

In the preceding part of the course, we studied tables and graphs as methods of organizing, visually
summarizing and displaying data. Among the highlights we sort to depict are central tendency,
dispersion, skewness (lack of symmetry), Kurtosis (degree of flatness or peakedness at the top of
a distribution). Although these techniques are extremely useful, they do not allow us to make
concise, quantitative statements that characterize a distribution as a whole. In order to do this we
rely on numerical summary measures.

Measures of Central Tendency

There are normally three commonly investigated measures of central tendency; namely mean,
mode and median.

Measures of Central Tendency

Measures of central tendency

Mean Median Mode

Arithmetic Mean Geometric Mean Harmonic Meam

Mean
The most frequently investigated characteristic of a set of data is its center. The commonly used
measure of central tendency is the arithmetic mean, or average. It is calculated by summing all the
observation in a set of data and dividing by the total number of measurements. In the table below,
we have n=13 observations. If x is used to represent FEV1, then x1 = 2.30 denotes the first in the
series of observations; x2 = 2.15, the second, and so on up through x13 =3.8. In general, xi refers to
a single measurement where i can take on any value from 1 to n. The mean of the observations in

1 n
x   xi
n i 1
the sample represented by x
Table: Forced expiratory volume in 1 second for 13 adolescents suffering from asthma

Subject Fev1{liters}
1 2.30
2 2.15
3 3.50
4 2.60
5 2.75
6 2.82
7 4.05
8 2.25
9 2.68
10 3.00
11 4.02
12 2.85
13 3.38

For the FEV 1 data, therefore, x = 2.95 liters

For discrete frequency distributions, the mean is given by;

Example2.

The following is a frequency distribution of fasting serum insulin (μU/ml)(expressed as whole

numbers) of males in some rural area.
Freq 1 9 20 32 22 23 19 20 13 10 8
μU/ml 7 9 11 13 15 17 19 21 23 25 27

The mean = 2775/160=17.34 μU/ml

For grouped frequency distributions, the mean is where xi’s are the midpoints of
the classes.
Example 3.

Cholesterol Number of men Midpoints fixi

level (fi) xi
[mg/ 100 ml ]
80-119 13 99.5 1293.50
120-159 150 135.5 20925.00
160-199 442 175.5 79339.00
200-239 299 215.5 65630.50
240-279 115 255.5 29842.50
280-319 34 295.5 10186.40
320-359 9 335.5 3055.50
360-399 5 395.5 1897.50

Total 1,069 212169

Now the mean = 198.8 mg/ 100 ml

Advantages
(i) Readily Understood
(ii) Can be treated algebraically and easy to compute
(iii)It is stable as regards sampling fluctuations

Disadvantage
(i) Affected by extreme values

Median
The median is defined as the 50th percentile of a set of measurements. It can be used as a summary
measure for ordinal data as well as discrete and continuous data. If a set of data contains a total of
n (odd) observations, the median is the middle value, or the [(n+1)/2] th largest measurement; if n
is even, the median is usually taken to be the average of the two middlemost values, the [(n/2)]th
and the [(n+1)/2]th observation. In the example considered above, the ranked 13 FEV1
measurements would be
2.15, 2.25, 2.30, 2.60, 2.68, 2.75, 2.82, 2.85, 3.00, 3.38, 3.50, 4.02, 4.05
Since n = 13, is odd, the median is the [(13+1)/2] = 7th observation or 2.82 liters. In the situation
where the FEV1 of subject 11 was recorded as 40.2 rather than 4.02, the ranking of the
measurement would change only slightly but the median would still be 2.82 liters. The median is
said to be robust; that is, it is much less sensitive to unusual data points than is the mean.

For frequency distribution, we generate cumulative frequency and the median will be the value
corresponding to a cumulative frequency of N/2.
In the example of fasting serum insulin we proceed as follows;
Example 4.
The following is a frequency distribution of fasting serum insulin (μU/ml)(expressed as whole
numbers) of males in some rural area.
μU/ml 7 9 11 13 15 17 19 21 23 25 27
Freq 1 9 20 32 22 23 19 20 13 10 8
Cf 1 10 30 62 84 107 126 146 159 169 177

The median is the value in 177/2th position = 17 μU/ml

Given a continuous (or grouped discrete) frequency distribution the median is obtained as follows
(i) Prepare a cumulative frequency table
(ii) Determine the median class by identifying the class corresponding to N/2th c.f

Estimated median =

Example 5.

Cholesterol Number of men Cumulative

level (fi) frequency
[mg/ 100 ml ]
80-119 13 13
120-159 150 163
160-199 442 605
200-239 299 904
240-279 115 1019
280-319 34 1053
320-359 9 1062
360-399 5 1067

In this case N=1067/2 = 533.5. Thus median class is 160-199.

l=lcb of median class = 159.5
fl= cumulative frequency proceeding the median class= 163
f=median class frequency = 442
c=width of median class = 40

Therefore median, m= 167.9 mg/ 100 ml

Mode
The mode of a set of data is the observation that occurs with the highest frequency and thus is not
unique. It can be used as a summary measure for all types of data.
The best measure of central tendency for a given set of data often depends on the way in which
the values are distributed. If they are symmetric and unimodal, then the mean, median and mode
will coincide. If the distribution of the values is symmetric but bimodal, then the mean and median
should be approximately the same. A bimodal distribution often indicates that the population from
which the data is taken consists of two distinct subgroups that differ in the characteristic being
measured; in this situation it might be better to report two modes rather than the mean or the
median.
When the data are not symmetric, the median is often the best measure of central tendency.
Because the mean is sensitive to extreme observations it is pulled in the direction of the outlying
data values and as a result it might end up either excessively inflated or deflated. Note that when
the data are skewed to the right the mean lies to the right of the median; when they are skewed to
the left the mean lies to the left of the median.
Regardless of the measure of central tendency used in a particular situation it can be misleading to
assume that this value is representatives of all observations.

Example 6.

Cholesterol Number of men Cumulative

level (fi) frequency
[mg/ 100 ml ]
80-119 13 13
120-159 150 163
160-199 442 605
200-239 299 904
240-279 115 1019
280-319 34 1053
320-359 9 1062
360-399 5 1067

The modal class is 160 – 199 mg/ 100 ml since it has the highest frequency, and hence the modal
value should lie in this class. An estimate of the median is obtained as follows

In this example l= lcb of the modal class =159.5

f1= frequency proceeding the modal class = 150
fm= frequency of the modal class = 442
f2=frequency immediately after the modal class = 299
c = width of the modal class =40

Therefore the mode m= 166.2 mg/100ml

Measures of Spread/dispersion/variation

Measures of central tendency give us some idea of the size of central values, while measures of
spread give us some idea of how the values of a distribution cluster around the average. If the
dispersion is small many values cluster around the mean, whereas if the dispersion is large a
considerable proportion of values are markedly different from the average. Incidentally the
importance of the measures of central tendency such as the mean and the measures of spread such
as the standard deviation can be appreciated when it is realized that for all normally distributed
variables, be they continuous or discrete, approximately 68 per cent of the values lie within one
standard deviation (S.D) of the mean, approximately 95 per cent within two S.Ds and practically
100 per cent within three S.Ds

Measures of Spread

Measures of Dispersion

Absolute Measures Relative Measures

Range Quartile Deviation Coeff. of Variat. Coeff of

Quartile Dev

Mean Deviation Standard Dev. Coff. of

mean Dev.

Absolute Measures: These are measures of spread that carry the unit of measurement.

Range
One number that can be used to describe the variability in a set of data values is known as the
range. The range of a group of measurements is defined as the difference between the largest and
the smallest observation. Its usefulness is limited since it considers only the extreme values of a
data set rather than the majority of the observations. Therefore it is highly sensitive to
exceptionally large or exceptionally small values.

Inter-quartile Range
This is a measure of variability that is not easily influenced by extreme values. It is calculated by
subtracting the 25th percentile of the data from the 75th percentile. It encompasses the middle 50
percent of the observations.
IR = Q3 – Q1
Note: when data is a grouped frequency, we follow the same procedure as that for computing the
median except that we identify the class with Q1 and Q3 then estimate Q1 and Q3 in the same
manner as we estimated the median.

Mean deviation
It is the mean of a series of deviations from the mean or the median. If we let, d, denote the
deviations from the mean, i.e di= xi - x (absolute deviations). The mean deviation, M.D is given
by:

m.d 
d1  d 2  ....  d n

 di
n n
Where n is the number of observations recorded. A small value of M.D means a low dispersion,
while a large M.D means a high dispersion. Note: When data is a grouped frequency distribution,
the xis shall be the mid values of the respective classes, di = fi xi - x  and n = ∑fi

Variance and Standard Deviation

Another commonly used measure of spread is the variance and standard deviation. The variance
quantifies the amount of variability or spread about the mean of a sample. Assuming the
observations are given by xi, i = 1,2, …, n, then the mean x =  xi/n
The variance is given by;
n
1
2 
n 1
 (x
i 1
i  x) 2

The variance is calculated by subtracting the mean of a set of data values from each of the
observations, squaring these deviations adding them up and dividing by one less than the number
of observations in the data set representing the variance by 2.

Since the standard deviation has units of measurement, it is meaning less to compare standard
deviation for two unrelated quantities.

Note: For grouped frequencies

n
1
2 
N 1
f
i 1
i
( xi  x) 2

Where xis are the mid values of the respective classes and fi the corresponding frequencies
Relative Measures: These are measures of spread that are unit free. They are useful for purposes
of comparisons where the variables of interest have different units of measurements.

Coefficient of Variation

It is possible to make comparisons among data sets representing different quantities using a
numerical summary measure known as the coefficient of variation. It relates the standard deviation
of a sample to its mean; it is the ratio of  tox multiplied by 100 and is therefore, a measure of
relative variability. Because the standard deviation and the mean share the same units of
measurement, the units cancel out and leave the coefficient of variation as a dimensionless number.
Since it is independent of measurement unit, it can be used to compare the relative variation
between any two sets of values.
C.V = S.D/Mean × 100%

Advantages and disadvantages of various measures of dispersion

The range
It is a reasonably good indication of dispersion, but will be badly affected by just one extreme
value. Care is therefore necessary when it is used.

The mean deviation (from the mean or median)

Generally this is a good measure of dispersion since all the values are used in its computation;
however, it has the disadvantage of not being soundly based mathematically.

The standard deviation

The standard deviation has the same advantages as the mean deviation, but in addition is
mathematically sound.

Skewness

Skewness refers to lack of symmetry. If a distribution is normal, the distribution is said to be

symmetrical otherwise it is asymmetric (Skewed). For a symmetric distribution, the
mean=mode=median
If the mean>Median> Mode, the distribution is skewed to the right (Positively skewed)
If the Mode>median>Mean, then the distribution is skewed to the left (Negatively skewed)

Kurtosis

Kurtosis is a measure which indicates the degree to which a curve of a frequency distribution is
peaked or flat topped. If the distribution is more peaked than the normal distribution, it is called
“Leptokurtic”. If it is more flat than the normal, it is called “Platykurtic”. The normal distribution
is “mesokurtic”.

Statistics
No ratings yet
Statistics
49 pages
Lesson 7 Measures of Central Tendency - Ungrouped Data High School
No ratings yet
Lesson 7 Measures of Central Tendency - Ungrouped Data High School
22 pages
Biostatistics: Khadeeja PK
0% (1)
Biostatistics: Khadeeja PK
27 pages
Measures of Central Tendency
100% (1)
Measures of Central Tendency
48 pages
4 - Measures of Central Tendency
No ratings yet
4 - Measures of Central Tendency
68 pages
Techniques in Geog 1 Complete
No ratings yet
Techniques in Geog 1 Complete
153 pages
Business Statistics: Measures of Central Tendency
No ratings yet
Business Statistics: Measures of Central Tendency
44 pages
Data Presentation
No ratings yet
Data Presentation
104 pages
Slides For IT SKill
No ratings yet
Slides For IT SKill
63 pages
6.descriptve PPHD
No ratings yet
6.descriptve PPHD
70 pages
Median and Mode Calculation
No ratings yet
Median and Mode Calculation
34 pages
Mean, Median, Mode
No ratings yet
Mean, Median, Mode
49 pages
Statistics
No ratings yet
Statistics
47 pages
Quantitative Analysis
No ratings yet
Quantitative Analysis
27 pages
GEE138
No ratings yet
GEE138
45 pages
Important Measures of Central Tendency Are Mean, Median and Mode
No ratings yet
Important Measures of Central Tendency Are Mean, Median and Mode
31 pages
2.3 Descriptive Numerical Summary Measures
No ratings yet
2.3 Descriptive Numerical Summary Measures
67 pages
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
No ratings yet
Measures of Central Tendency Position and Dispersion 1.Pptx 20241015 145631 0000
44 pages
LabModule - Exploratory Data Analysis - 2023ic
No ratings yet
LabModule - Exploratory Data Analysis - 2023ic
24 pages
Module 4
No ratings yet
Module 4
18 pages
Data Management
No ratings yet
Data Management
81 pages
المحاضرة رقم 3
No ratings yet
المحاضرة رقم 3
44 pages
2.descriptive Statistics
No ratings yet
2.descriptive Statistics
53 pages
Session 2 Week 1
No ratings yet
Session 2 Week 1
30 pages
Lecture 9descriptivestatistics 171204035552
No ratings yet
Lecture 9descriptivestatistics 171204035552
26 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
11 pages
ICS Week 2 - Handouts
No ratings yet
ICS Week 2 - Handouts
20 pages
3jane - Data Description Finala4
No ratings yet
3jane - Data Description Finala4
14 pages
BigDataAnalytics - Unit2
No ratings yet
BigDataAnalytics - Unit2
15 pages
23 Biostatistics
No ratings yet
23 Biostatistics
18 pages
Week 13 Central Tendency For Ungrouped Data
No ratings yet
Week 13 Central Tendency For Ungrouped Data
27 pages
Quantitative Data Analysis
No ratings yet
Quantitative Data Analysis
31 pages
Measures of Central Tendency and Dispersion
No ratings yet
Measures of Central Tendency and Dispersion
9 pages
Strategic Management Notes Strategy and
No ratings yet
Strategic Management Notes Strategy and
48 pages
Measure of Central Tendency Dispersion A
No ratings yet
Measure of Central Tendency Dispersion A
8 pages
Statistics 3: DR Taher
No ratings yet
Statistics 3: DR Taher
38 pages
Stat Handout
No ratings yet
Stat Handout
7 pages
Week 3 - Review Topic - Measures of Central Tendency and Dispersion - NEUVLE
No ratings yet
Week 3 - Review Topic - Measures of Central Tendency and Dispersion - NEUVLE
13 pages
MMW Data Management
No ratings yet
MMW Data Management
35 pages
Measures of Location.....
No ratings yet
Measures of Location.....
8 pages
Obesity
No ratings yet
Obesity
14 pages
What Is Central Tendency
No ratings yet
What Is Central Tendency
10 pages
Finals. Fmch. Measure of Central Tendency Shape of The Distribution of Dispe
No ratings yet
Finals. Fmch. Measure of Central Tendency Shape of The Distribution of Dispe
5 pages
New Microsoft Office Word Document
No ratings yet
New Microsoft Office Word Document
10 pages
1 Descriptive Statistics - Unlocked
No ratings yet
1 Descriptive Statistics - Unlocked
18 pages
Bio Statistics 3
No ratings yet
Bio Statistics 3
13 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
53 pages
Engineering Statistics: Measures of Central Tendency
No ratings yet
Engineering Statistics: Measures of Central Tendency
10 pages
7.1 Measures of Central Tendency
No ratings yet
7.1 Measures of Central Tendency
6 pages
Basic Statistics Concepts: 1 Frequency Distribution
No ratings yet
Basic Statistics Concepts: 1 Frequency Distribution
7 pages
02 - Descriptive Statistics
No ratings yet
02 - Descriptive Statistics
45 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
6 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
8 pages
Topic 5 EMPLOYEE TRAINING AND DEVELOPMENT
No ratings yet
Topic 5 EMPLOYEE TRAINING AND DEVELOPMENT
26 pages
SSC CGL Tier 2 Statistics - Last Minute Study Notes: Measures of Central Tendency
No ratings yet
SSC CGL Tier 2 Statistics - Last Minute Study Notes: Measures of Central Tendency
10 pages
Mean Median Mode
No ratings yet
Mean Median Mode
10 pages
Measures of Central Tendency
No ratings yet
Measures of Central Tendency
10 pages
Capital Structure
No ratings yet
Capital Structure
23 pages
AFRA 2010 - Apr 2025 Past Papers
No ratings yet
AFRA 2010 - Apr 2025 Past Papers
184 pages
Management Practice Lesson 2
No ratings yet
Management Practice Lesson 2
18 pages
TAXATION
No ratings yet
TAXATION
54 pages
Buying Behavior NOTES UPDATED NEW
No ratings yet
Buying Behavior NOTES UPDATED NEW
16 pages
04-Capacity Planning
No ratings yet
04-Capacity Planning
37 pages
Mba 807 7
No ratings yet
Mba 807 7
6 pages
Lesson Two
No ratings yet
Lesson Two
14 pages
Marketing Management
No ratings yet
Marketing Management
39 pages
Topic 3 HUMAN RESOURCE PLANNING
No ratings yet
Topic 3 HUMAN RESOURCE PLANNING
9 pages
Cost of Capital
No ratings yet
Cost of Capital
15 pages
Capital Budgeting Techniques
No ratings yet
Capital Budgeting Techniques
14 pages
LESSON ONE Mba808
No ratings yet
LESSON ONE Mba808
17 pages
FUNCTIONS
No ratings yet
FUNCTIONS
12 pages
AMA AUG 2024
No ratings yet
AMA AUG 2024
14 pages
Staffing function
No ratings yet
Staffing function
22 pages
Planning function
No ratings yet
Planning function
15 pages
Leading function
No ratings yet
Leading function
14 pages
Organizing function
No ratings yet
Organizing function
11 pages
Marketing Notes
No ratings yet
Marketing Notes
9 pages
Mba 807 4
No ratings yet
Mba 807 4
5 pages
Accounting standards
No ratings yet
Accounting standards
9 pages
Mba 812 (May 2016)
No ratings yet
Mba 812 (May 2016)
3 pages
Sampling Theory
No ratings yet
Sampling Theory
7 pages
Nonparametric Testing
No ratings yet
Nonparametric Testing
4 pages
Qa Test Answers
No ratings yet
Qa Test Answers
4 pages
Strategic Management Notes
No ratings yet
Strategic Management Notes
2 pages