Week7 - Measures of Central Tendency

Uploaded by

bonolobadire447

Week 7

Measures of Central
Tendency/Location
Review

• Distinguish between population and sample, parameter and statistic; good sampling methods: simple random sample, stratified sample, etc.
• Frequency distributions, summarizing data using graphs, describing the center, variation, distribution, outliers, and changing characteristics over time in a data set
Objectives
By the end of this lesson, you must be able to:
1. Distinguish between the measures of central tendency: arithmetic mean, median, mode and midrange.
2. Compute the mean, median and mode for grouped and ungrouped data.
3. Interpret the mean, median and mode.
4. Describe the effect of outliers on the measures of central tendency.
5. Describe quartiles and percentiles.
6. Construct and describe box plots for normal and skewed distributions.
Summary Definitions

• Measures of central tendency/location: the extent to which all the data values group around a typical or central value (concentration)
• Central location is the middle value of concentration of the data
• Non-central location measures identify values “off the centre”
Round-off Rule for Measures of
Center/Location

Carry one more decimal place than is present in the original set of values
Critical Thinking

• Think about whether the results are reasonable.
• Think about the method used to collect the sample data.
Measures of Central Tendency/Location

• The value at the center or middle of a data set
• These measures tend to lie near the center of a distribution when the data are arranged according to magnitude.
• If the frequency distribution is symmetric and unimodal, then all three measures of central tendency (mean, median and mode) will be equal
Measures of Central
Tendency/Location
The most common measures of central tendency are:
1. Mean/arithmetic mean/average
2. Median/second quartile/middle quartile/50th percentile
3. Mode (most frequent value)
All three can be used to analyse numerical data. The mode is the only one that can be used for nominal (unordered categorical) data.
Symmetric distribution
Arithmetic Mean

• The measure of center obtained by adding the values and dividing the total by the number of values
• What most people call an average.
Notation

Σ denotes the sum of a set of values.
x is the variable usually used to represent the individual data values.
n represents the number of data values in a sample (sample size).
N represents the number of data values in a population.
Notation

x̄ is pronounced ‘x-bar’ and denotes the mean of a set of sample values:
$\bar{x} = \frac{\sum x}{n}$
μ is pronounced ‘mu’ and denotes the mean of all values in a population:
$\mu = \frac{\sum x}{N}$
Mean For Ungrouped Data

Find the mean age of 5 statistics students: 22, 22, 26, 24, 23

$\bar{x} = \frac{\sum x}{n} = \frac{22 + 22 + 26 + 24 + 23}{5} = \frac{117}{5} = 23.4$

Interpretation?
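As a quick check of the arithmetic above, here is a minimal Python sketch of the same calculation. The function name `sample_mean` is illustrative and does not come from the slides.

```python
def sample_mean(values):
    """Arithmetic mean: add the values, then divide by how many there are."""
    if not values:
        raise ValueError("the mean of an empty data set is undefined")
    return sum(values) / len(values)


ages = [22, 22, 26, 24, 23]  # ages of the 5 statistics students
mean_age = sample_mean(ages)
print(mean_age)  # 117 / 5 = 23.4
```

One possible interpretation: the five students are, on average, 23.4 years old, even though no individual student has exactly that age.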
Mean for Grouped Data
Consider grouped data below, how do we
calculate the mean?
Age group    Frequency (f)
20-24        5
25-29        8
30-34        2
35-39        3
40-44        2
Calculating Mean from a Frequency Distribution - Grouped data
• Calculate the midpoint of each group and assume that all sample values in each class are equal to the class midpoint. Use the variable x for the class midpoint.
• Multiply the midpoint by the frequency in each group
• The mean will therefore be:
Mean = (sum of midpoints × frequencies) / (sum of frequencies)
$\bar{x} = \frac{\sum (f \cdot x)}{\sum f}$
Mean Grouped Data….
Age group     Frequency (f)   Midpoint of age group (x)   Frequency × midpoint (f·x)
20-24         5               22                          110
25-29         8               27                          216
30-34         2               32                          64
35-39         3               37                          111
40-44         2               42                          84
Total (sum)   20                                          585
Mean Grouped Data…
Therefore,
$\bar{x} = \frac{\sum (f \cdot x)}{\sum f} = \frac{585}{20} = 29.25 \approx 29.3$
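The grouped-data steps (class midpoint, midpoint times frequency, divide by total frequency) can be sketched in Python as follows. The function name `grouped_mean` and the tuple layout are assumptions made for illustration only.

```python
def grouped_mean(classes):
    """Estimate the mean from (lower_limit, upper_limit, frequency) tuples.

    Every value in a class is assumed to equal the class midpoint.
    """
    total_fx = 0.0  # running sum of f * x
    total_f = 0     # running sum of f
    for lower, upper, freq in classes:
        midpoint = (lower + upper) / 2
        total_fx += freq * midpoint
        total_f += freq
    if total_f == 0:
        raise ValueError("total frequency must be positive")
    return total_fx / total_f


age_groups = [(20, 24, 5), (25, 29, 8), (30, 34, 2), (35, 39, 3), (40, 44, 2)]
grouped_mean_age = grouped_mean(age_groups)
print(grouped_mean_age)  # 585 / 20 = 29.25, reported as 29.3 to one decimal place
```

Because the midpoints stand in for the real ages, this is an estimate of the mean rather than the exact value that the raw data would give.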
Mean
Advantages
• Sample means drawn from the same population
tend to vary less than other measures of center
• Takes every data value into account
Disadvantage
• Is sensitive to every data value: one extreme value can affect it dramatically, so it is not a resistant measure of center (it is affected by OUTLIERS). The mean tends to follow the outlier, so in a skewed distribution it is pulled toward the long tail.
• Not used for categorical data
Skewness
Exercises

• Given the following data on the age of patients in years:
2, 5, 17, 8, 25, 20, 35, 70, 15, 45, 52, 68, 70, 55, 66, 82, 37, 59, 22, 19.
a) Construct a frequency distribution (classes = 8)
b) Use your frequency distribution to calculate the mean.
Median

• The middle value when the original data values are arranged in order of increasing (or decreasing) magnitude
• Denoted by x̃ (pronounced ‘x-tilde’)
• Is not affected by an extreme value; it is a resistant measure of the center
• Cannot be calculated for nominal (unordered categorical) data
Median for Ungrouped Data

First sort the values (arrange them in order). Then:
1. If the number of data values is odd, the median is the number located in the exact middle of the list.
2. If the number of data values is even, the median is found by computing the mean of the two middle numbers.
Median for Ungrouped
Data
• The location of the median when the values are in numerical order (smallest to largest):
Median position = $\frac{n + 1}{2}$ position in the ordered data
• If the number of values is odd, the median is the middle number
• If the number of values is even, the median is the average of the two middle numbers
Note that $\frac{n + 1}{2}$ is not the value of the median, only the position of the median in the ranked data
Example

• The durations of stay in hospital x are:
6, 6, 6, 1, 1, 2, 4, 4, 4, 2, 10, 38, 80, 3, 3, 4, 5, 6, 7, 8, 10.
• Calculate the median:
Arrange the numbers in ascending or descending order.
The middle number is the median. With n = 21 it is in position (21 + 1)/2 = 11,
i.e. 1, 1, 2, 2, 3, 3, 4, 4, 4, 4, 5, 6, 6, 6, 6, 7, 8, 10, 10, 38, 80
Therefore the median duration of stay = 5
Example

• For an even number of values:
1, 1, 2, 2, 3, 3, 4, 4, 4, 5, 6, 6, 6, 6, 7, 8, 10, 10, 38, 80.
• The median is the average of the two middle numbers,
i.e. (5 + 6)/2 = 5.5
Interpretation?
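The odd and even cases can be checked with a short Python sketch. The function name `median_of` is illustrative. It sorts the data and then picks the middle value, or the mean of the two middle values.

```python
def median_of(values):
    """Median: the middle value of the sorted data, or the mean of the two middle values."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("the median of an empty data set is undefined")
    mid = n // 2  # 0-based index of the value at ranked position (n + 1) / 2 when n is odd
    if n % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2


stays_odd = [6, 6, 6, 1, 1, 2, 4, 4, 4, 2, 10, 38, 80, 3, 3, 4, 5, 6, 7, 8, 10]  # n = 21
stays_even = [1, 1, 2, 2, 3, 3, 4, 4, 4, 5, 6, 6, 6, 6, 7, 8, 10, 10, 38, 80]    # n = 20

print(median_of(stays_odd))   # 5   (11th ranked value)
print(median_of(stays_even))  # 5.5 (mean of the 10th and 11th ranked values)
```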
Median…

• The median is less often used than the mean.
• However, the median is more stable if the data are asymmetrical.
The median is not affected by outliers
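To see this resistance concretely, the sketch below recomputes both measures for the hospital-stay example with and without its two extreme stays (38 and 80). It uses Python's standard `statistics` module. The variable names are illustrative.

```python
from statistics import mean, median

stays = [1, 1, 2, 2, 3, 3, 4, 4, 4, 4, 5, 6, 6, 6, 6, 7, 8, 10, 10, 38, 80]
trimmed = [s for s in stays if s < 38]  # drop the two extreme stays (38 and 80)

print(mean(stays), median(stays))      # mean 10, median 5
print(mean(trimmed), median(trimmed))  # mean about 4.8, median 4
```

Dropping the two extremes cuts the mean from 10 to about 4.8, while the median only moves from 5 to 4.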
Median for Grouped Data
Age group    Frequency (f)
20-24        5
25-29        8
30-34        2
35-39        3
40-44        2

• Determined by finding the age group at which we have 50% of the sample above and 50% below
• This can be done using frequency counts or cumulative frequency
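One way to locate that age group is to accumulate the frequencies and stop at the first class whose cumulative frequency reaches half the sample. The sketch below assumes that n/2 convention, which the slides do not spell out. The name `median_class` is illustrative.

```python
from itertools import accumulate

age_labels = ["20-24", "25-29", "30-34", "35-39", "40-44"]
frequencies = [5, 8, 2, 3, 2]


def median_class(labels, freqs):
    """Return the first class whose cumulative frequency reaches n / 2 (assumed convention)."""
    half = sum(freqs) / 2
    for label, cumulative in zip(labels, accumulate(freqs)):
        if cumulative >= half:
            return label
    raise ValueError("frequencies must contain at least one positive count")


print(list(accumulate(frequencies)))           # [5, 13, 15, 18, 20]
print(median_class(age_labels, frequencies))   # 25-29
```

With n = 20, half the sample is 10, and the cumulative count first reaches 10 in the 25-29 group, so the median age lies in that class.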
Midrange

The value midway between the maximum and minimum values

$\text{Midrange} = \frac{\text{maximum value} + \text{minimum value}}{2}$
Midrange

• Sensitive to extremes (outliers) because it uses only the maximum and minimum values.

• It is rarely used
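A small Python sketch, reusing the hospital-stay data from the median example, shows how strongly the extremes drive the midrange. The function name `midrange` is illustrative.

```python
def midrange(values):
    """Midrange: the value midway between the maximum and the minimum."""
    if not values:
        raise ValueError("the midrange of an empty data set is undefined")
    return (max(values) + min(values)) / 2


stays = [1, 1, 2, 2, 3, 3, 4, 4, 4, 4, 5, 6, 6, 6, 6, 7, 8, 10, 10, 38, 80]
print(midrange(stays))  # (80 + 1) / 2 = 40.5
```

The midrange of 40.5 is far from the median of 5 for the same data, because the single stay of 80 sets the maximum.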
Mode
• The value that occurs with the greatest frequency

• A data set can have one, more than one, or no mode

Bimodal: two data values occur with the same greatest frequency
Multimodal: more than two data values occur with the same greatest frequency
No mode: no data value is repeated

• Mode is the only measure of central tendency that can be used with nominal data.
• If the data have outliers, use the median or mode
Mode.

e.g. if the durations of stay in ward x for patients are as shown below:
3, 4, 5, 7, 2, 3, 5, 9, 1, 5, 3, 5.

• Rearranging the numbers: 1, 2, 3, 3, 3, 4, 5, 5, 5, 5, 7, 9
The mode = 5.
Example

a. 5.40 1.10 0.42 0.73 0.48 1.10 Mode is 1.10

b. 27 27 27 55 55 55 88 88 99
Bimodal - 27

& 55
c. 1 2 3 6 7 8 9 10
No Mode
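The three cases above can be reproduced with a short Python sketch that counts occurrences. The function name `modes` is illustrative. It returns an empty list when no value is repeated, matching the "no mode" convention used on these slides.

```python
from collections import Counter


def modes(values):
    """Return every value tied for the greatest frequency, or [] if nothing repeats."""
    counts = Counter(values)
    if not counts:
        return []
    top = max(counts.values())
    if top == 1:  # no value is repeated, so there is no mode
        return []
    return sorted(value for value, count in counts.items() if count == top)


print(modes([5.40, 1.10, 0.42, 0.73, 0.48, 1.10]))        # [1.1]
print(modes([27, 27, 27, 55, 55, 55, 88, 88, 99]))        # [27, 55]
print(modes([1, 2, 3, 6, 7, 8, 9, 10]))                   # []
print(modes([3, 4, 5, 7, 2, 3, 5, 9, 1, 5, 3, 5]))        # [5]
```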
Unimodal distribution
Multimodal distribution
Critical Thinking

• When the mean and median are not close to each other in terms of their value, it’s a good idea to report both and let the reader interpret the results from there.
• Also, as a general rule, be sure to ask for the median if you are only given the mean.
Non-Central Location Measures

1. Quartiles
2. Percentiles
Quartiles

• Quartiles split the ordered data into 4 segments with an equal number of values per segment

25%  |  25%  |  25%  |  25%
     Q1      Q2      Q3

• The first quartile, Q1, is the value for which 25% of the observations are smaller and 75% are larger
• Q2 is the same as the median (50% of the observations are smaller and 50% are larger)
• Only 25% of the observations are greater than the third quartile, Q3
Locating Quartiles
To find a quartile: rank/order the data and determine the value in the appropriate position in the ranked data, where
First quartile position: Q1 = (n+1)/4 ranked value
Second quartile position: Q2 = (n+1)/2 ranked value
Third quartile position: Q3 = 3(n+1)/4 ranked value
Calculation Rules

• When calculating the ranked position use the following rules:
– If the result is a whole number then it is the ranked position to use
– If the result is a fractional half (e.g. 2.5, 7.5, 8.5, etc.) then average the two corresponding data values.
– If the result is not a whole number or a fractional half then round the result to the nearest integer to find the ranked position.
Locating Quartiles- Example

11 12 13 16 16 17 18 21 22
(n = 9)
Q1 is in the (n+1)/4 position; (9+1)/4 = 2.5 position of the ranked data. So use the value halfway between the 2nd and 3rd values:
Q1 = (12+13)/2 = 12.5

Use the formula to find Q2 and Q3

Q1 and Q3 are measures of non-central location
Q2 = median, is a measure of central location
Locating Quartiles- Example

11 12 13 16 16 17 18 21 22
(n = 9)
Q1 is in the (n+1)/4 position; (9+1)/4 = 2.5 position of the ranked data. So use the value halfway between the 2nd and 3rd values:
Q1 = (12+13)/2 = 12.5
Q2 is in the (n+1)/2 position; (9+1)/2 = 5th position
So Q2 = median = 16
Q3 is in the 3(n+1)/4 position; 3(9+1)/4 = 7.5 position
So Q3 = (18+21)/2 = 19.5
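The position formulas and the three calculation rules can be sketched in Python as below. The helper names `value_at_position` and `quartiles` are illustrative. Statistical software often uses other quartile conventions, so library results may differ slightly from this hand method.

```python
def value_at_position(ordered, position):
    """Read a value from sorted data at a 1-based ranked position, using the slide rules."""
    whole = int(position)
    fraction = position - whole
    if fraction == 0:      # whole number: use that ranked position
        return ordered[whole - 1]
    if fraction == 0.5:    # fractional half: average the two neighbouring values
        return (ordered[whole - 1] + ordered[whole]) / 2
    # anything else: round to the nearest ranked position
    return ordered[round(position) - 1]


def quartiles(values):
    """Return (Q1, Q2, Q3) using positions (n+1)/4, (n+1)/2 and 3(n+1)/4."""
    ordered = sorted(values)
    n = len(ordered)
    if n < 2:
        raise ValueError("need at least two values to locate quartiles")
    return tuple(value_at_position(ordered, k * (n + 1) / 4) for k in (1, 2, 3))


data = [11, 12, 13, 16, 16, 17, 18, 21, 22]
print(quartiles(data))  # (12.5, 16, 19.5)
```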
Percentiles

11 12 13 16 16 17 18 21 22

A percentile is a data point below which a given percentage of data points in the distribution fall.
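Read the other way round, the definition says that a value's percentile is the percentage of data points lying below it. A minimal Python sketch of that reading follows. The function name `percent_below` is illustrative, and other textbooks and software define percentiles slightly differently.

```python
def percent_below(values, x):
    """Percentage of data points that are strictly smaller than x."""
    if not values:
        raise ValueError("need at least one data point")
    return 100 * sum(1 for v in values if v < x) / len(values)


data = [11, 12, 13, 16, 16, 17, 18, 21, 22]
print(percent_below(data, 16))  # 3 of 9 values lie below 16, about 33.3
print(percent_below(data, 21))  # 7 of 9 values lie below 21, about 77.8
```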
Critical Thinking

Compare these two data sets:

1. What are the mean, median and mode?
2. What is the midrange?

• 199, 200, 201
• 0, 200, 400
Critical Thinking

• What if two sets of data have about the same average and the same median? Does that mean that the data are all the same?
• For example, the data sets 199, 200, 201, and 0, 200, 400 both have the same average, which is 200, and the same median, which is also 200. Yet they have very different amounts of variability
• The first data set has a very small amount of variability compared to the second.
• Therefore, in addition to center we also measure variability.
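The comparison can be checked with a short Python sketch. The range (maximum minus minimum) is used here only as one simple stand-in for variability. The slides do not name a specific measure at this point, so that choice and the function name `summarize` are assumptions.

```python
from statistics import mean, median


def summarize(values):
    """Collect the center measures from this lesson plus the range as a rough spread."""
    return {
        "mean": mean(values),
        "median": median(values),
        "midrange": (max(values) + min(values)) / 2,
        "range": max(values) - min(values),
    }


set_a = [199, 200, 201]
set_b = [0, 200, 400]

print(summarize(set_a))  # mean 200, median 200, midrange 200.0, range 2
print(summarize(set_b))  # mean 200, median 200, midrange 200.0, range 400
```

Every measure of center agrees across the two sets, and neither set has a mode because no value repeats. Only the spread tells them apart.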
END WEEK 7
