CHPT20 Histograms and Frequency Distribution Diagrams
CHPT20 Histograms and Frequency Distribution Diagrams
li..i:
20 Histosrams
Histogram
a nd frequency
d istribution
(ontinuous
Class interval
Frequency "
:
Grouped o
Frequency tables
Frequency density
rdodal class
d lagrams
Cumulative{reguency,,
Cumulative f requency':curve
Quartiles
t lnterquartile range
t Percentiles
You have already coilected, organised, summarised and displayed different sets of data using pie
charts, bar graphs and line graphs. In this section you are going to work with numerical data
(sets of data where the class intervals are numbers) to learn how to draw frequency distribution
diagrams called histograms and cumulative frequency curves.
Histograms are useful for visuaily showing patterns in large sets of numerical data. The shape of the
graph allows you see where most of the measurements are located and how spread out they are.
20.l Histograms
A histogram is a specialised graph that looks a lot like a bar chart but is normally used to
the distribution ofcontinuous or grouped data.
Frequency
Age in years
Notice that:
r The horizontal scale is continuous and each column is drawn above a particular
Continuous data was introduced in class interval.
chapter 4. { r The frequency ofthe data is shown by the area ofthe bars.
I There are no spaces between the bars on the graph because the horizontal scale is
continuous. (Ifthe frequency relating to a class interval is 0, you wont draw a bar in
that class, so there will be a gap in the bars in that case.)
The table and histogram below show the heights of trees in a sample from a forestry site.
&) 25 < h <3om Find the tallest bar and read the class interval from the horizontal scale.
k} 6l Find the frequency for each class with heights of 20 m or more and add them together.
{i0 The horizontal scale of a histogram is continuous, so the class intervals are also continuous. The inequality symbols
prevent the same height of tree falling into more than one group. For example, without the symbols a tree of height S m
could go into two groups and thus be counted twice.
{:f The frequency for the class interval 50 < h < 55 is zero, so no bar is drawn.
Joy-Anne did an experiment in her class to see what mass of raisins (in grams) the I
students could hold in one hand. Here are her results. Y
I
l
r8r8202222222223i i
2324242s25252s25F r.
2526262730303135!]
I
=
t--=: how to draw grouped (b) What is the modal class (the mode) of this data?
i
F
'Ilm{,tEuri.Fcy- tables in chapter 4. <(
(c) Draw a histogram to show her results. F
I
ffi
tr6-20t3tt
16-20 3
inthetable. i
'l 21-25
21-25 I 1414 I r
I
lzo-sol5ll 26-30 5
ffi
| -'31-35
::______________ 2 i
l
(b) 21-25. of
I
.ruEG;,. .,,,.,,
The modal class is lt is actually not possible to find the mode
grouped data because you do not have the
I
l.
- j', :-rapter l2 that mode is individual values within each group. lnstead, I
llrrr, - .r''=:-ent reSUlt. d you find the class interval that has the- greatest
fi
frequency. This is called the'modal class'
-r
_
(Extended students learned this in chapter 1 ,l
2).
l
i
0.5 < m <1.5 1.5 < m <2-5 2.5 < m <3.5 3.5 < m <4.5 4.5<m<55
1 T2 3l t6 0
0<t<2
2<t<4
4<t <6
6</<8
8<t<10
L0<t<12
t2< t <r4
14<t<16
(a) How many calls did she make altogether?
(b) What is the most common length of a call?
(c) Draw a histogram to show this distribution.
(d) Make a new frequency table of these results using the class intervals given opposite.
Shamiela cut 30 pieces of ribbon, which she estimated were each about 30 cm long. Her sister
measured them and got the following actual lengths in centimetres:
29.t 30.2 30.5 31.1 32.0 31.3 29.8 29.5 31.6 32.4
32.t 30.2 3I.7 3t.9 32.t 29.9 32.r 31.4 28.9 29.8
3t.2 31.2 30.5 29.7 30.3 30.4 30.1 31.1 28.8 29.5
(a) Draw a suitable frequency distribution table for this data. Use an equal class interval.
(b) Construct a histogram to show your distribution.
(c) How accurately did Shamiela estimate? Give a reason for your answer.
The French Traffic Police recorded the number of vehicles speeding on a stretch of highway
careful of discrete groups o{
on a Friday night. Draw a histogram to show this data.
data; the raw data is
so can take any value
the groups.
(rA =
frequencydensity
.ffiS
Frequency density takes into account the frequency relative to the size of the class interval,
making it more fair when comparing different sized intervals.
First work out the frequency density by adding columns to your frequency distribution
table like this:
Frequency
density I
25 30 35 40
Height in cm
300<n<350
350<n<400
400<n<500
(a) Use the table to calculate the frequency density for each class.
(b) Construct a histogram to display the results. Use a scale of l cm = 100 sweets on the
horizontal axis and a scale of 1 cm = 0.2 units on the vertical axis.
The table shows the mass of young children visiting a chnic (to the nearest kg).
Draw a histogram to illustrate the data.
6<m<9
9 <m<12
12<m <18
18<m<21
2L<m<30
The table shows the distribution of the masses of the actors in a theatre group.
Draw a histogram to show the data.
6O<m<63
63 <m <64
64< m<65
65< m <66
66< m <68
683m<72
t-iving rnsffls
Percentage body fat of soldiers 4 A group of on-duty soldiers underwent fitness tests in which their percentage body fat was
calculated. The fitness assessor drew up this histogram ofthe results.
Frequency
densr[.
Speed (km/h)
(a) Is it easy to see how many vehicles travelled above or below the speed limit?
Give a reason for your answer.
(b) The traffic officer claims the graph shows that most people stick to the speed limit.
Is he correct? Give a reason for your answer.
(c) His colleagues want to know exactly how many vehicles travel below or above the speed
(i) Reconstruct this frequency table. Round frequencies to the nearest whole number.
0<s<50
50<s<65
65<s<80
80<s<95
95<s<110
110<s< 125
125Ss<180
(ii) FIow many vehicles were below the minimum speed limit?
(d) What percentage of vehicles in this sample were exceeding the maximum speed limi8
2O.2 Cumulativefrequency
Sometimes you may be asked questions such ast
In statistics you can use a cumulative frequency table or a cumulative frequency curve to
Cumulative means'increasing as
answer questions about data up to a particular class boundary. You cart also use the cumulati
more is added'.
frequencies to estimate and interpret the median and the value of other positions of a data set-
Unit 5: Data handling
r
0<h<5
5<h<10
l0<h<15
15<h<25
25<h<50
<Wl,:,.,
ln chapter 12, median classes were {b}: ls <h<2s The heights are given lor 25O flowers, so the median height must
introduced for grouped data. You
a:.: .. : : be the mean of the height of the t 25th and 'l26th flower. lf you
will see that cumulative frequency look at the cumulative frequency you can see that this value falls
curves will enable you to estimate into the fourth height class (the l25th and I26th are both greater
the median when the number of than l 2o but less than 200)'
data is large. { i
:l
50-18=32'l Subtract the cumulative frequency at 3O minutes, 18, from the total
frequency.
''.
42 - 28 = 14 Subtract the cumulative frequency at 4O minutes, 28, from that at
.rr 60 minutes, 42.
2 The table shows the amount of money, $x, spent on books by a group of studentr,
0<x<10
l0 <x<20
20 <x<30
30<x<40
40<x<50
50<x<60
(a) Calculate an estimate of the mean amount of money per student spent on booh.
(b) Use the information in the table above to find the values ofp, 4 and r in the
cumulative frequency table.
This cumulative frequency table shows the distribution of the masses of the children
attending a clinic.
0 < m 3lO-O
0 <m<20.0
0 < m <30.0
A<m<40.0
0<rn(50.0
0 <m<60.0
Quartiles
In chapter L2youfound the range (the biggest value - the smallest value) to see how dispersed
various sets of data were. The range, however, is easily affected by outliers (extreme or unusual
values), so it is not always the best measure of how the data is spread out.
The data shown on a cumulative frequency curve can be divided into four equal groups
called quartiles to find a measure of spread called the interquartile range, which is more
representative than the range because it is not afected by extremes.
The cumulative frequency curve on the next page shows the marks obtained by 64 students in a
test. These are listed below:
r 48 students scored less than 15 marks. 15 marks is the upper quartile or third quartile Qr.
r 32 students scored less than 13 marks. 13 marks is the second quartile Qr, or median mark.
o 16 students scored less than 11 marks. 11 marks is the lower quartile or first quartile Q,.
(Dont forget that you need to move right to the curve and down to the horizontal axis to find
the values ofthe percentiles.)
The percentile range is the difference between given percentiles. In the example above,
this is Pr, - P,o.
In chapter 12, percentiles were first introduced but only the 25th and 75th percentiles were used
to introduce the interquartile range. A question was posed at the start of section 12.5 on page 240
All those candidates above the 80th percentile will be offered an interview What does this
mean?' The following worked example shows you how to answer this question.
The, cr:rmiilativefrequenry curve: ghows the test results of 200 candidates who have
applied foi a, post at Fashklddler's. Only those whci scere above the SOth percentile will
bercalled {or an interview; What is the lowest score,that can, be,obtai.ned to receive an
interview letter?
Candidate test Siores
2A0 l1ltr:i
J-1 i lj
180
160'
140
t20
Cum-ulative
100
frequency
80
q0
'40
2A
20 30
Test score
Exercise 20.4 The lengths of 32 metal rods were measured and recorded on this
curve. Use the graph to find an estimate for:
35
30
25
Cumulative
frequency
0 20 tt 40 4s
,."J,3,.-,"
This cumulative frequenry cuwe compares the results 120 students obtained on two maths
120
110
100
90
80
70
CumulativeUO
trequency
50
40
30
20
10
0
0 1020 40 50 60 70 80
Marks
5Thiscumulativefrequencycurveshowsthemassesof500l2-year-oldgirls(inkg).
Mass of 12-Year-old girls
400
300
Cumulative
freguencY
rOO
35 40
Mass (kg)
Thiscumulativefrequencytablegivesthespeedsof200carstravellingonthehighwayfrom
ctty'
Kuala Lumpur International Airport into the
60Ss<70
60Ss<80
60<s<90
60Ss<100
60<s< 110
603s<120
50Ss<130
60<s<140