0% found this document useful (0 votes)
2 views25 pages

Interpolation For Median and Quartiles - Lesson

The document provides a comprehensive guide on using interpolation to estimate median, quartiles, and percentiles in grouped data. It includes teacher notes, key points, and various pedagogical strategies for teaching these concepts, along with examples and practice questions. The resource is designed for both teachers and students, facilitating independent practice and understanding of statistical measures.

Uploaded by

Medha Joji
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views25 pages

Interpolation For Median and Quartiles - Lesson

The document provides a comprehensive guide on using interpolation to estimate median, quartiles, and percentiles in grouped data. It includes teacher notes, key points, and various pedagogical strategies for teaching these concepts, along with examples and practice questions. The resource is designed for both teachers and students, facilitating independent practice and understanding of statistical measures.

Uploaded by

Medha Joji
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 25

Interpolation to Estimate

Median, Quartiles &


Percentiles
Jamie Frost
www.drfrost.org
@DrFrostMaths

Contact the resource team:


[email protected]
@DrFrostResource

Dr Frost Learning is a registered


Last modified:1st August 2024 charity in England and Wales (no
1194954)
Teacher Notes
Prerequisite
Knowledge
• Median from listed data.
• Cumulative frequency graphs.

Throughout the slides, this symbol refers to a web link.


Unless
Key: otherwise specified, this will be to some functionality
within DF.
Key Points Solution step – All slides include
click to reveal pedagogical detail in the
! To be written ‘Notes’ section for each
in books Question/Discussion slide.
Dr Frost Learning is a registered
Prompt charity in England and Wales (no
Using the Dr Frost online platform
TEACHERS STUDENTS
Generate a Start an
random independent
worksheet practice involving
involving skills in skills in this
this PowerPoint PowerPoint.
(for printing or
online task
setting).
drfrost.org/w/73 drfrost.org/p/73
3 3
Clicking this box takes you to a single question practice for a
drfrost.org/s/123a
subskill to allow you further Test Your Understanding opportunities. (e.g.
drfrost.org/s/123a)
Skills in this Lesson
544 Interpolation to determine median, quartiles and percentiles
544a State the class interval within which a given quartile lies.
544b State the class interval within which a given decile or percentile lies.
544c Estimate frequency for part of a group in a grouped frequency table.
544d Estimate the median of grouped data using interpolation.
544e Estimate the median of grouped data where intervals have gaps.
544f Determine quartiles or percentiles of grouped data using interpolation.
544g Estimate the interquartile range from a grouped frequency table using interpolation.
544h Estimate the inter-decile or inter-percentile range from a grouped frequency table
Dr Frost Learning is a registered
using interpolation. charity in England and Wales (no
How to use these slides
Though many slides in this resource will have titles specific to the topic, the slide titles in the
table below are used consistently within DFL resources for specific pedagogical purposes.
Any atypical use of a slide type, including any change of animation* or intended use, will be
outlined in the Teacher Notes for the slide.
Slide Title Explanation Default Animations*
To be used as a prior knowledge check or to review
Recap prerequisite knowledge. Can be used as a starter or as part of Green click-to-reveal boxes.
the main lesson.
To be used to highlight key concepts or theorems. This could
Usually in sequence with
The Big include the ‘why’ of the topic - including “real-life” contextual
some green click-to-reveal
Idea scenarios, or putting into context of other mathematical
boxes.
concepts (past and future).
Solution animates in
Example To be modelled by the teacher.
sequence.
Green click-to-reveal boxes.
Test Your
To be completed by students and used for Assessment for For multi-step answers,
Understandi
Learning, primarily using mini-whiteboards. reveal in parts or click final
ng
answer to reveal full solution.
To be used as ‘Example’ &‘Test Your Understanding’ above, Example animates in
To be used as ‘Example’ &‘Test Your Understanding’ above,
Example within the same slide to provide scaffold via visible modelled sequence,
Examplefollowed
animates byinTYU
Example within the same slide to provide scaffold via visible modelled
Problem solution. question with
sequence. Clickgreen click-to-
the header to
Problem solution.
Pair TYU column is blank initially, to focus attention on example. reveal boxes for solution
reveal TYU question, then
Pair TYU column is blank initially, to focus attention on example.
Reveal question by clicking ‘Test Your Understanding’ steps.
green click-to-reveal boxes.
banner.
To be used as fluency practice. Multiple questions in rapid Green click-to-reveal boxes.
Quickfire succession,
To be used for calculations
as fluency that can
practice. be completed
Multiple questions mentally.
in rapid For multi-step
Green answers,
click-to-reveal boxes.
Questions
Quickfire Often used forfor
succession, shorter questions/
calculations that formulae or to isolate
can be completed a small
mentally. reveal in parts oranswers,
For multi-step click final
Questions Often used for shorterpart of the method.
questions/ formulae or to isolate a small line toin
reveal reveal
partsfull solution.
or click final
part of the method. line to reveal full solution.
To be used as a diagnostic question. Multiple choice questions,
Multi-choice with
To be plausible
used distractors,
as a diagnostic to allow
question. teachers
Multiple
Dr Frost to diagnose
choice
Learning is questions, Arrowinpoints
a registered charity Englandto answer,
and Wales on (no
Estimating Frequencies
We are assuming the cats are
The table shows the heights of equally distributed between
various cats. and cm, so that there are
between cm and between cm.
Height (cm) Frequenc
y halfway

cm cm cm
Estimate the number of cats with cats cats cats
a height:
halfway
a Below cm
8

Cats up to
this height
b Above cm
c Below cm
4

a
Heigh
b cm cm
t
c 8 +15=𝟐𝟑 This is known as linear interpolation,
because if we were plotting this as a
cumulative frequency graph (i.e. the running
total of cats up to a specific height), the
graph forms a straight line.
Test Your Understanding drfrost.org/ 544c
s/

1 The grouped frequency table


shows the times taken for seconds is of the way
students to complete a puzzle. across the interval.

Time (secs) Frequenc Therefore, include:


y • of students
?
• All of the students

Estimate the number of students


with a time above seconds.
Mean vs Measures of Position

We understand mean as an
‘average’ that considers all
values, but is skewed by
extreme values.

of data of data of data

Salary
($)
Mea 𝑷 𝟗𝟗
Media n 𝑸𝟑 We can also have percentiles. The
n 99th percentile (written ) is the value
along the data. When ‘the ’is used in
the media, it’s referring to people in
the ‘top percentile’ for salary, i.e.
Median is a measure of above .
position because it allows us Another measure of position is
to get the value a certain quartiles, which gives us the
position, in this case along the value , , and along the data.
data.
along data (lower quartile)
along data (upper quartile)
Names for Quantiles
Quantiles represent general positions across the data when
ordered.

Quartiles
𝑄1 𝑄2 𝑄3
of data 25 % 25 % 25 %

Deciles
𝐷1 𝐷 2 𝐷 3 𝐷 4 𝐷5 𝐷 6 𝐷7 𝐷 8 𝐷 9
10 %10 %10 %10 %10 %10 %10 %10 %10 %10 %

Percentil
es
𝑃1 𝑃2 𝑃3 𝑃 98 𝑃 99

1 1 1 1 1 1
%%%%%%
… … 1 1
%%
What Item To Use For Listed Data?

Items Position of Media


median? n ?
3rd
? ?
2nd/3rd
? ?
4th
? ?
5 /6th
th

Can you think of a rule to find the position


of the median given the number of
values, ?

! To find the position of the median for listed data,


calculate :
- If a decimal, round up.
- If whole, use the midpoint between this
item and the one after.
Which Item To Use For Grouped Data?

IQ of L6Ms2 () Frequency ()
If the data is grouped, what
item do we use for the
median?
th
item

! To estimate the median of


grouped data, calculate , then
use linear interpolation.

Important point: Unlike with listed values, for grouped data, do not round in
any way. For example, if we were reading off a value from a cumulative
frequency graph and there were values, for the median we’d read across the th
item mark, not halfway between the 50th and 51st.
Quickfire Questions
What position do we use for the
1 median? 4 7
Lengths: cm, cm, cm, … Score Fre Weights: kg, kg, …
q

? : 18th
Position
?
Position:
6th 8
?
Position: Volume (ml) Freq
2 5th
Lengths: m, m, m, … 5
Ages: , …

Position:?
12th/13th
3 Position:?30th/31st ? 6.5th
Position:
Age Freq
6 9
Score Freq
Weights: kg, kg, …

?
Position: Position:? 9th/10th
8.5th
Position?: 10.5th
Example Test Your
Understanding
In what interval does the 2 In what interval does the
upper quartile lie? lower quartile lie?

IQ of L6Ms2 () Frequency () Weights () in kg Freq ()

The upper
th
item quartile, , is
th
item
along the
data.
The 12.75th item doesn’t The lower quartile lies
lie within the first items. within ?
But is within the next .

The upper quartile lies


within
drfrost.org/ 544a
s/
Example Test Your
Understanding
In what interval does the 3 In what interval does the
8th decile lie? 34th percentile lie?

Height () in m Freq () Height () in m Freq ()

The 8th decile,


th
item , is along the th
item The 34th
percentile , is
data.
along the data.

The 99.2th item doesn’t lie It doesn’t lie in the first


within the first items. items, but does in the next
(as it takes us up? to the 49th
But is within the next . item)

The 8th decile lies within The 34th percentile lies


within
drfrost.org/ 544b
s/
Linear Interpolation
Height of tree Freq C.F
(m)
Cumulative Frequency

Using a cumulative frequency


graph, we know we can
estimate the median by drawing
a suitable line.
How could we read
off this value exactly
Height of trees using a suitable
(m) calculation?
We could find the fraction of the way along the line
segment using the frequencies, then go this same
fraction along the class interval.
Linear Interpolation
Height of tree Freq C.F
(m)
Cumulative Frequency

Using linear
interpolation, estimate
the median.
Step 1: Identify the interval in which
the median item, here the th value, lies.

Step 2: Write the relevant data needed


Height of trees to make the calculation. We
(m) recommend below.
Frequency up Item number we’re Frequency by
until this
interval
55 75
? ? interested in. 100? end of this
interval
Height at start
of interval.
m
? 𝑄
? 2 Height by m
?
end of
interval.
Linear Interpolation
Using linear Height of tree Freq C.F
interpolation, estimate (m)
the median.

Frequencies
go at the top. 𝟐𝟎
𝟒𝟓
𝟐𝟓
Frequency up Item number we’re Frequency by
𝟒𝟓
until this
interval
55 75 interested in. 100 end of this
interval
Height at start
of interval.
m 𝟐𝟓
𝑄2 Height by m
end of
𝟒𝟓 interval.
And heights at the
What fraction of the way of the
th bottom. You may
across the class interval are wish to put units to
way
avoid confusing with
we?
your frequencies.
Step 3: We therefore go

( )
the same fraction of the
20
way between m to m. 𝑄 2=0.6 + × 0.05 =𝟎 .𝟔𝟐𝟐 𝐦
45
Further Examples
Using linear interpolation, estimate the
median.
Weight of cat Freq C.F. Time (s) Freq C.F.
(kg)

? th item
16 ? th item
10
Median class 𝟑 ≤ 𝒘? <𝟒 Median class 𝟏𝟐 ≤ 𝒕<𝟏𝟒
?
interval: interval:
10
? 16
? 18
? 7
? 10
? 20
?

kg
? 𝑄 2 kg
? ?s 𝑄2 ?s

𝟔
Fraction along ? RIP Pippin❤️ Estimate of median:
interval: 𝟖 2012-2021
Estimate of ? secs (to
median: 3sf)
𝟑+ ( 𝟔
𝟖 )
×𝟏 ?=𝟑 .𝟕𝟓k
g
Test Your Understanding drfrost.org/ 544d
s/

4 The data shows the heights of


trees. Use linear interpolation
Lies in interval.
to estimate the median.
Height () in m Freq () C.F.

49 62 103
m 𝑄2 m
?

Top tip: Adding a m


Cumulative
Frequency column
can help with linear
interpolation
What’s Different About The Intervals Here?
Weight of cat to nearest Frequency
You can spot this by
kg either being aware of
the word
‘rounded’/’nearest’ in
the question, or where
the endpoints of the
intervals don’t match,
i.e. ‘have gaps’.
Because the weights are rounded to the nearest kg, a
weight of kg for example would appear in the kg
interval.
What interval does this actually represent?
10 − 12

9 .5 − 12.5
Lower class Upper
boundary class
Class width = boundary
Quickfire Boundaries/Class Width
1 2
Distance travelled (in m) … Time taken (to the …
nearest second)

Lower class boundary ?


Lower class boundary ?
Class width ?
Class width ?
3 4
Weight (to the nearest … Speed (in mph) …
kg)

Lower class boundary ?


Lower class boundary ?
Class width ?
Class width ?
Example Test Your
Understanding
Use linear interpolation to 5 Use linear interpolation to
estimate the median time. estimate the median volume.
Volume () to Freq ()
Time (to Frequency () C.F
C.F nearest ml
nearest hour) .
.

23.5th
25th item. item.
Lies in interval.
Lies in interval. 12 23.5 36
𝑄 2?
20 25 32 ml ml

h 𝑄2 h 𝑄 2=399.5+ ( 11.5
24 )
×200ml
𝑄 2=5.5+
5
12 (
× 5m ) drfrost.org/ 544e
s/
Interpolation with Quartiles
We can use the same process to determine quartiles (and
therefore the interquartile range). Simply use the correct item
number.
Use linear interpolation to a th item
estimate:
a
class interval:2.5 ≤ 𝑡 <5.5
b the lower quartile
c the upper quartile 12 12.5 20
the interquartile range h 𝑄1 h
Time (to Frequency ()
C.F
(8 )
nearest hour) 0.5
. 𝑄1 =2.5+ ×3 =𝟐 .𝟔𝟗h

b th
item

10.5 ≤ 𝑡<30.5
class interval:

32 37.5 50
h 𝑄3 h
c h
𝑄 3=10.5+ ( 5.5
18 )
×20 =𝟏𝟔 .𝟔𝟏
h
Test Your Understanding drfrost.org/ 544f
s/
544g

6 Use linear interpolation to a st


item
estimate:
a the lower quartile
class interval: 5 ≤ h< 8
b the upper quartile
17 31? 49
c the interquartile range
m 𝑄1 m
Height () in m Freq () C.F.
𝑄1 =5+ ( 14
32 )
×3 =𝟔 .𝟑𝟏
m

b rd
item

class interval: 8 ≤ h <10


49 93? 103
m 𝑄3 m

𝑄 3=8 + ( 44
54 )
× 2 =𝟗 .𝟔𝟑
m

c m ?
Inter-percentile and Inter-decile Ranges
Interquartile Range

𝑄1 𝑄2 𝑄3
of data 25 % 25 % 25 %
2nd to 8th Interdecile
Range
𝐷1 𝐷 2 𝐷 3 𝐷 4 𝐷5 𝐷 6 𝐷7 𝐷 8 𝐷 9
10 %10 %10 %10 %10 %10 %10 %10 %10 %10 %

15th to 85th Interpercentile


Range 𝑃 98 𝑃 99
𝑃1 𝑃2 𝑃3
1 1 1 1 1 1
%%%%%%
… … 1 1
%%

Just as we can calculate the interquartile range to mean the


range of the middle of data, we can also use deciles and
percentiles to find the range of a more general middle
Test Your Understanding drfrost.org/
s/
544h

7 Use linear interpolation to items


estimate the 3rd to 7th th
item
interdecile range of the st
item
following data:
Mass () to the Freq ()
nearest gram C.F 5 9 20
.
g 𝐷3 g

g
?

20 21 27
g 𝐷7 g

g to 3sf

You might also like