Biostat Lecture Four

Chapter Four discusses Descriptive Statistics, focusing on Measures of Central Tendency (MCT) which include the Arithmetic Mean, Median, and Mode. It outlines the characteristics of a good MCT, the properties of each measure, and the appropriate contexts for their use based on data distribution. The chapter also introduces Measures of Dispersion, emphasizing the importance of understanding data variability alongside central tendency.

Uploaded by

birukfirdut

Chapter Four

Descriptive Statistics:

Measures of Central Tendency (MCT)
• A frequency distribution gives a general picture of the
distribution of a variable.
• However, it cannot indicate the average value or the
spread of the values.
• The tendency of statistical data to concentrate
around a certain value is called “central tendency”.
• The various methods of determining the point
about which the observations tend to concentrate
are called measures of central tendency (MCT).

Measures of Central Tendency (MCT)

• The objective of calculating an MCT is to determine
a single figure which may be used to represent
the whole data set.
• In that sense it is an even more compact
description of the statistical data than the
frequency distribution.
• Since an MCT represents the entire data set, it
facilitates comparison within one group or
between groups of data.
Characteristics of a good MCT
An MCT is good or satisfactory if it possesses the following characteristics.
1. It should be based on all the observations.
2. It should not be affected by extreme values.
3. It should be as close to the maximum number of
values as possible.
4. It should have a definite value.
5. It should be capable of further algebraic treatment.
6. It should be stable with regard to sampling.

• The most common measures of central tendency include:
– Arithmetic Mean
– Median
– Mode
– Others

1. Arithmetic Mean
a) Ungrouped data
• The arithmetic mean is the "average" of the data
set and is by far the most widely used measure of
central location. It is usually denoted by x̄.
• It is the sum of all the observations divided by the
total number of observations.
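
The definition above can be written compactly. The notation here (x_1, ..., x_n for the observations and n for their number) is assumed for illustration and may differ from the symbols on the original slide:

```latex
\bar{x} = \frac{x_1 + x_2 + \dots + x_n}{n} = \frac{1}{n}\sum_{i=1}^{n} x_i
```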

b) Grouped data
• In calculating the mean from grouped data, we assume that all values falling into a
particular class interval are located at the mid-point of the interval. It is calculated as
follows:

$$\bar{x} = \frac{\sum_{i=1}^{k} m_i f_i}{\sum_{i=1}^{k} f_i}$$

where,
k = the number of class intervals
mi = the mid-point of the ith class interval
fi = the frequency of the ith class interval
Example. Compute the mean age of 169 subjects from the
grouped data.

Class interval   Mid-point (mi)   Frequency (fi)   mi × fi
10-19            14.5               4                 58.0
20-29            24.5              66               1617.0
30-39            34.5              47               1621.5
40-49            44.5              36               1602.0
50-59            54.5              12                654.0
60-69            64.5               4                258.0
Total            --               169               5810.5

Mean = 5810.5/169 = 34.38 years
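
The grouped-mean calculation in this example can be reproduced with a few lines of Python. This is a minimal sketch: the function name `grouped_mean` and the variable names are chosen here for illustration, and the mid-points and frequencies are copied from the table.

```python
# Grouped-data mean: every value in a class is treated as if it sat
# at the class mid-point, so the mean is sum(m_i * f_i) / sum(f_i).
midpoints = [14.5, 24.5, 34.5, 44.5, 54.5, 64.5]  # m_i from the table
frequencies = [4, 66, 47, 36, 12, 4]              # f_i from the table


def grouped_mean(midpoints, frequencies):
    """Return the frequency-weighted mean of the class mid-points."""
    total = sum(m * f for m, f in zip(midpoints, frequencies))
    n = sum(frequencies)
    return total / n


mean_age = grouped_mean(midpoints, frequencies)
print(round(mean_age, 2))  # 5810.5 / 169, i.e. 34.38
```

Recomputing the quotient this way is a quick check on hand arithmetic: 5810.5/169 rounds to 34.38.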

When the data are skewed, the mean is “dragged” in
the direction of the skewness.

• In extreme cases, all but one of the sample points can lie
on one side of the arithmetic mean. In this case the mean is
a poor measure of central location because it does not reflect the center of
the sample.
Properties of the Arithmetic Mean
• For a given set of data there is one and only one arithmetic
mean (uniqueness).
• Easy to calculate and understand (simple).
• Influenced by each and every value in a data set.
• Greatly affected by extreme values.
• In the case of grouped data, if any class interval is open-ended,
the arithmetic mean cannot be calculated, because that class has no mid-point.

2. Median
a) Ungrouped data
• The median is the value which divides the data set into two equal
parts.
• If the number of values is odd, the median will be the
middle value when all values are arranged in order of
magnitude.
• When the number of observations is even, there is no
single middle value but two middle observations.
• In this case the median is the mean of these two middle
observations, when all observations have been arranged in
the order of their magnitude.

• The median is a better description (than the mean) of the
majority when the distribution is skewed.
• Example
– Data: 14, 89, 93, 95, 96
– Skewness is reflected in the outlying low value of 14
– The sample mean is 77.4, lower than four of the five values
– The median is 93
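
A short Python sketch makes the contrast concrete. It uses the standard-library `statistics` module on the five values from the example; the second list, with the outlier dropped, is added here only for illustration.

```python
from statistics import mean, median

scores = [14, 89, 93, 95, 96]   # data from the example; 14 is the low outlier
without_outlier = scores[1:]    # illustration only: same data with 14 removed

print(mean(scores), median(scores))                    # 77.4 93
print(mean(without_outlier), median(without_outlier))  # 93.25 94.0
```

Removing the single outlier moves the mean by almost 16 units but the median by only 1, which is why the median describes the bulk of a skewed sample better.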

b) Grouped data
• In calculating the median from grouped data, we
assume that the values within a class interval are
evenly distributed through the interval.
• The first step is to locate the class interval that
contains the median, using the following procedure.
• Find n/2, then take the first class interval whose cumulative
frequency is greater than or equal to n/2.
• Then, use the following formula.

$$\tilde{x} = L_m + \left(\frac{\frac{n}{2} - F_c}{f_m}\right) W$$

where,
Lm = lower true class boundary of the interval containing the median
Fc = cumulative frequency of the interval immediately before the median
class interval (the row just above it in the table)
fm = frequency of the interval containing the median
W = class interval width
n = total number of observations
Example. Compute the median age of 169
subjects from the grouped data.

Class interval   Mid-point (mi)   Frequency (fi)   Cum. freq
10-19            14.5               4                 4
20-29            24.5              66                70
30-39            34.5              47               117
40-49            44.5              36               153
50-59            54.5              12               165
60-69            64.5               4               169
Total                             169

n/2 = 169/2 = 84.5

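• n/2 = 84.5 falls in the 3rd class interval (30-39), the first whose cumulative frequency (117) is at least 84.5
• Lower true class boundary = 29.5, upper true class boundary = 39.5, so W = 10
• Frequency of the class (fm) = 47
• (n/2 – Fc) = 84.5 – 70 = 14.5

• Median = 29.5 + (14.5/47) × 10 = 32.59 ≈ 33 years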
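
The steps of this example can be sketched in Python as follows. The function name `grouped_median` is hypothetical, and the lower true class boundaries (9.5, 19.5, ...) are derived here from the class limits on the assumption that ages are recorded in whole years, in line with the 29.5 used above.

```python
def grouped_median(lower_boundaries, frequencies, width):
    """Interpolated median for grouped data with equal-width classes.

    Assumes values are spread evenly inside each class interval.
    """
    n = sum(frequencies)
    if n == 0:
        raise ValueError("need at least one observation")
    half = n / 2
    cumulative = 0  # F_c: cumulative frequency before the current class
    for lower, f in zip(lower_boundaries, frequencies):
        if cumulative + f >= half:  # first class whose cum. freq. reaches n/2
            return lower + (half - cumulative) / f * width
        cumulative += f
    raise ValueError("median class not found")


lower_boundaries = [9.5, 19.5, 29.5, 39.5, 49.5, 59.5]  # assumed true boundaries
frequencies = [4, 66, 47, 36, 12, 4]                    # f_i from the table

median_age = grouped_median(lower_boundaries, frequencies, width=10)
print(round(median_age, 2))
```

The loop stops at the 30-39 class because 70 + 47 = 117 is the first cumulative frequency that reaches 84.5, so the result is 29.5 + (14.5/47) × 10.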

Properties of the median
• There is only one median for a given set of data
(uniqueness).
• The median is easy to calculate.
• The median is a positional average and hence it is
insensitive to very large or very small values.
• The median can be calculated even in the case of
open-ended intervals.
• It is determined mainly by the middle points and
is less sensitive to the remaining data points, so it
does not use all the information in the data (weakness).

3. Mode

• The mode is the most frequently occurring value among
all the observations in a set of data.
• It is not influenced by extreme values.
• It is possible to have more than one mode or no mode.
• It is not a good summary of the majority of the data.

a) Ungrouped data
• It is the value which occurs most frequently in a set of
values.
• If all the values are different there is no mode. On the
other hand, a set of values may have more than one
mode.

• Example
• Data are: 1, 2, 3, 4, 4, 4, 4, 5, 5, 6
• Mode is 4
• Example
• Data are: 1, 2, 2, 2, 3, 4, 5, 5, 5, 6, 6, 8
• There are two modes – 2 & 5
• This distribution is said to be “bi-modal”
• Example
• Data are: 2.62, 2.75, 2.76, 2.86, 3.05, 3.12
• No mode, since all the values are different
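
The three examples above can be checked with a small helper. This is a sketch: the function name `modes` is made up for illustration, and it follows the slides' convention that a data set in which every value occurs once has no mode.

```python
from collections import Counter


def modes(values):
    """Return all most-frequent values, or [] if no value repeats."""
    if not values:
        return []
    counts = Counter(values)
    top = max(counts.values())
    if top == 1:
        return []  # all values differ: no mode (convention used in the slides)
    return sorted(v for v, c in counts.items() if c == top)


print(modes([1, 2, 3, 4, 4, 4, 4, 5, 5, 6]))          # [4]
print(modes([1, 2, 2, 2, 3, 4, 5, 5, 5, 6, 6, 8]))    # [2, 5]  (bi-modal)
print(modes([2.62, 2.75, 2.76, 2.86, 3.05, 3.12]))    # []      (no mode)
```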

b) Grouped data
• To find the mode of grouped data, we usually refer to
the modal class, where the modal class is the class
interval with the highest frequency.
• If a single value for the mode of grouped data must
be specified, it is taken as the mid-point of the modal
class interval.

$$\hat{x} = L + w\left(\frac{f_2}{f_0 + f_2}\right)$$

where
L = lower boundary of the modal class
f0 = the frequency of the class next below the modal
class in value
f2 = the frequency of the class next above the modal class
in value
w = length of the interval of the modal class
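
As a hedged illustration, the sketch below applies an interpolation of the form x̂ = L + w·f2/(f0 + f2), which is the form suggested by the symbols defined above (L, f0, f2, w). The choice of the age table from the earlier grouped-data examples, the boundary 19.5, and the function name `grouped_mode` are assumptions made here, not part of the slides. Other textbooks interpolate with a different formula that also uses the modal-class frequency.

```python
def grouped_mode(lower_boundary, f_below, f_above, width):
    """Mode estimate L + w * f2 / (f0 + f2) for the modal class.

    f_below (f0) and f_above (f2) are the frequencies of the classes
    adjacent to the modal class; at least one of them must be positive.
    """
    if f_below + f_above == 0:
        raise ValueError("both neighbouring frequencies are zero")
    return lower_boundary + width * f_above / (f_below + f_above)


# Age table: modal class is 20-29 (frequency 66), neighbours have 4 and 47.
mode_age = grouped_mode(lower_boundary=19.5, f_below=4, f_above=47, width=10)
print(round(mode_age, 2))
```

Because the class above the modal class is much fuller than the class below it, the estimate is pulled toward the upper end of the 20-29 interval rather than its mid-point of 24.5.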

Properties of mode
• It is not affected by extreme values
• It can be calculated for distributions with open-ended
classes
• Often its value is not unique
• The main drawback of the mode is that often it does not
exist

Which measure of central tendency is best with a
given set of data?

• Two factors are important in making this decision:
– The scale of measurement (type of data)
– The shape of the distribution of the
observations

• The mean can be used for discrete and continuous data.
• The median is appropriate for discrete and continuous
data as well, but can also be used for ordinal data.
• The mode can be used for all types of data, but may be
especially useful for nominal and ordinal measurements.
• For discrete or continuous data, the “modal class” can be
used.

(a) Symmetric and unimodal distribution: Mean, median,
and mode should all be approximately the same.

[Figure: symmetric curve with mean, median, and mode marked at the same point]

(b) Bimodal: Mean and median should be about the
same, but may take a value that is unlikely to occur; reporting
the two modes might be best

(c) Skewed to the right (positively skewed): Mean is
sensitive to extreme values, so median might be more
appropriate

[Figure: right-skewed curve with mode, median, and mean marked]

(d) Skewed to the left (negatively skewed): Same as (c)

[Figure: left-skewed curve with mode, median, and mean marked]

Measures of Dispersion

Consider the following two sets of data:

A: 177 193 195 209 226   Mean = 200
B: 192 197 200 202 209   Mean = 200

Two or more sets may have the same mean and/or median but they
may be quite different.
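
A quick Python sketch shows how different the two sets are despite sharing a mean. It uses the standard-library `statistics` module; `stdev` is the sample standard deviation (divisor n − 1), and the variable names are chosen here for illustration.

```python
from statistics import mean, stdev

set_a = [177, 193, 195, 209, 226]
set_b = [192, 197, 200, 202, 209]

for name, data in (("A", set_a), ("B", set_b)):
    spread = max(data) - min(data)  # range = maximum - minimum
    print(name, mean(data), spread, round(stdev(data), 2))
```

Both sets have a mean of 200, but the range of A is 49 against 17 for B, and the sample standard deviation of A is roughly three times that of B.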

These two distributions have the same mean,
median, and mode

Measures of Dispersion
• MCTs alone are not enough to give a clear
understanding of the distribution of the data.
• Measures of dispersion quantify the variation or
dispersion of a set of data around its central
location.
• Dispersion refers to the variety exhibited by the
values of the data.
• The amount of dispersion is small when the values are
close together and large when they are widely scattered.

Measures of Dispersion
Other synonymous terms:
– “Measure of Variation”
– “Measure of Spread”
– “Measure of Scatter”

• Measures of dispersion include:
– Range
– Variance
– Standard deviation
– Coefficient of variation
– Standard error
– Others

1. Range (R)
• The difference between the largest and smallest
observations in a sample.

• Range = Maximum value – Minimum value

• Example –
– Data values: 5, 9, 12, 16, 23, 34, 37, 42
– Range = 42-5 = 37
• A data set with a higher range exhibits more variability

Properties of range
• It is the simplest, crude measure and can be easily
understood
• It takes into account only two values, which makes it
a poor measure of dispersion
• Very sensitive to extreme observations
• The larger the sample size, the larger the
range tends to be

2. Variance (σ², S²)
• Variance is used to measure the dispersion of values
relative to the mean.
• The variance is the average of the squares of the
deviations taken from the mean.
• When values are close to their mean (narrow range) the
dispersion is less than when they are scattered over a
wide range.
– Population variance = σ²
– Sample variance = S²

a) Ungrouped data
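
The formulas for this slide did not survive extraction. The standard definitions, which agree with the (n − 1) divisor used in the grouped-data formula and in the degrees-of-freedom discussion, are given below. The symbols N (population size), μ (population mean), and x_i (the ith observation) are notation assumed here.

```latex
\sigma^2 = \frac{\sum_{i=1}^{N} (x_i - \mu)^2}{N}
\qquad\text{and}\qquad
S^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n - 1}
```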

Degrees of freedom
• In computing the variance there are (n-1) degrees of
freedom because only (n-1) of the deviations are
independent of each other.
• The last one can always be calculated from the others,
because the deviations from the mean always sum to zero.
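
The dependence among the deviations can be seen numerically. The sketch below reuses data set A from the dispersion introduction (177, 193, 195, 209, 226) purely as an illustration; the variable names are made up here.

```python
data = [177, 193, 195, 209, 226]
x_bar = sum(data) / len(data)           # 200.0

deviations = [x - x_bar for x in data]  # [-23.0, -7.0, -5.0, 9.0, 26.0]

# Deviations from the mean sum to zero, so the last one is fixed
# once the other n - 1 are known.
last_from_others = -sum(deviations[:-1])
print(sum(deviations), last_from_others, deviations[-1])
```

Only four of the five deviations are free to vary, which is why the sample variance divides by n − 1 = 4 here rather than by 5.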

b) Grouped data

$$S^2 = \frac{\sum_{i=1}^{k} (m_i - \bar{x})^2 f_i}{\sum_{i=1}^{k} f_i - 1}$$

where
mi = the mid-point of the ith class interval
fi = the frequency of the ith class interval
k = the number of class intervals
x̄ = the sample mean

Properties of Variance:
• The main disadvantage of variance is that its unit
is the square of the unit of the original
measurement values.
• The variance gives more weight to extreme
values than to those near the
mean, because the deviations are squared.
• The drawbacks of variance are overcome by the
standard deviation.

4. Standard deviation (σ, S)
• It is the square root of the variance.
• This produces a measure having the same scale as
that of the individual values.

$$\sigma = \sqrt{\sigma^2} \quad\text{and}\quad S = \sqrt{S^2}$$

Example. Compute the variance and SD of the age of 169
subjects from the grouped data.

Class interval   mi     fi    (mi-Mean)   (mi-Mean)²   (mi-Mean)² × fi
10-19            14.5    4    -19.88      395.2144      1580.8576
20-29            24.5   66     -9.88       97.6144      6442.5504
30-39            34.5   47      0.12        0.0144         0.6768
40-49            44.5   36     10.12      102.4144      3686.9184
50-59            54.5   12     20.12      404.8144      4857.7728
60-69            64.5    4     30.12      907.2144      3628.8576
Total                  169               1907.2864     20197.6336

Mean = 5810.5/169 = 34.38 years
S² = 20197.63/(169 − 1) = 120.22
SD = √S² = √120.22 = 10.96
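
The grouped variance and SD can be recomputed in Python as a check on the hand calculation. This is a minimal sketch using the mid-points and frequencies from the table; it works with the unrounded mean 5810.5/169 (about 34.38), so the last digits can differ slightly from a table built on a rounded mean.

```python
import math

midpoints = [14.5, 24.5, 34.5, 44.5, 54.5, 64.5]  # m_i
frequencies = [4, 66, 47, 36, 12, 4]              # f_i

n = sum(frequencies)
mean_age = sum(m * f for m, f in zip(midpoints, frequencies)) / n

# Sum of squared deviations of the mid-points from the mean, weighted by frequency
sum_sq = sum(f * (m - mean_age) ** 2 for m, f in zip(midpoints, frequencies))

variance = sum_sq / (n - 1)  # sample variance S^2 (divisor n - 1)
sd = math.sqrt(variance)     # sample standard deviation S

print(round(mean_age, 2), round(variance, 2), round(sd, 2))
```

The script gives a mean of about 34.38 years, a variance of about 120.22, and an SD of about 10.96 years.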

Properties of SD
• The SD has the advantage of being expressed in
the same units of measurement as the mean.
• SD is considered to be the best measure of
dispersion and is used widely because of the
properties of the theoretical normal curve.
• However, if the units of measurement of the variables
in two data sets are not the same, then their
variability can't be compared by comparing the
values of SD.
SD vs Standard Error (SE)
• SD describes the variability among individual
values in a given data set.
• SE describes the variability among the means of
separate samples, that is, how much the sample mean
varies from one sample to another.
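
The slide does not give a formula for the SE. A commonly used estimate of the standard error of the mean is S/√n, and the sketch below assumes that formula. It plugs in S ≈ 10.96 and n = 169 from the age example; the function name `standard_error` is hypothetical.

```python
import math


def standard_error(sd, n):
    """Estimated standard error of the mean: sample SD divided by sqrt(n)."""
    if n <= 0:
        raise ValueError("n must be positive")
    return sd / math.sqrt(n)


se_age = standard_error(10.96, 169)  # sqrt(169) = 13
print(round(se_age, 2))
```

Under this assumption the SE is far smaller than the SD (about 0.84 versus 10.96 years), because sample means vary much less than individual values do.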

5. Coefficient of variation (CV)
• When two data sets have different units of
measurement, or their means differ substantially in
size, the CV should be used as a measure of
dispersion.
• It is the best measure for comparing the variability of
two sets of observations.
• The data set with the smaller coefficient of variation is considered
more consistent.

• CV = (Standard Deviation/Mean) × 100.
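
The CV formula translates directly into code. The sketch below applies it to the two data sets from the dispersion introduction (A and B, both with mean 200). The function name `coefficient_of_variation` and the use of the sample SD from the `statistics` module are choices made here for illustration.

```python
from statistics import mean, stdev


def coefficient_of_variation(data):
    """CV in percent: (sample SD / mean) * 100."""
    return stdev(data) / mean(data) * 100


set_a = [177, 193, 195, 209, 226]
set_b = [192, 197, 200, 202, 209]

cv_a = coefficient_of_variation(set_a)
cv_b = coefficient_of_variation(set_b)
print(round(cv_a, 2), round(cv_b, 2))
```

Set B has the smaller CV (about 3.1% against about 9.2% for A), so by the criterion above it is the more consistent of the two. Because the CV is a ratio of two quantities in the same units, it is unit-free, which is what allows comparison across different measurement scales.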
