Class 31 - Statistics 2
Class 31 - Statistics 2
Class: #31
Histogram
A histogram displays the shape and spread of continuous sample data. The columns of a
Worked Example 1
The table below shows the grouped data of the age of 29 persons.
1-10 3
11-20 5
21-30 12
31-40 7
41-50 2
Note: Here are two common ways that students mistakenly draw a Histogram.
Scale
𝑥-axis: 1 cm = 5 years
𝑦-axis: 1 cm = 1 person
Note: When drawing a histogram, here are some useful tips:
- First, determine the upper class boundary and lower class boundary for each class
- On the graph, insert a small dot at the UCB and LCB values for each interval based
on the frequency so you would be able to easily draw the height of the columns.
- Draw the vertical lines first and then the horizontal lines.
Frequency Polygon
A frequency polygon is a graph constructed by using lines to join the midpoints of each
Worked Example 1
The table below shows the grouped data of the age of 29 persons.
1-10 3 5.5
11-20 5 15.5
21-30 12 25.5
31-40 7 35.5
41-50 2 45.5
Scale
𝑥-axis: 1 cm = 10 years
𝑦-axis: 1 cm = 2 persons
Frequency
Age (years)
Line Graph
A line graph is a type of chart used to visualize the value of something over time.
Worked Example 1
The table below shows the number of cars manufactured from 1998-2001.
Scale
𝑥-axis: 1 cm = 1 year
𝑦-axis: 1 cm = 5 000 cars
Frequency (1000’s of cars)
Years
(b) During which two years had the largest increase in the number of manufactured
Solution:
(b) The largest increase in the number of manufactured cars occurred from 1998 to
1999.
= 10,000 cars
10 000+20 000+15 000+17 000
(c) Average amount of cars manufactured per year = 4
= 15,500 cars
Cumulative Frequency
➢ the sum of the class and all classes below it in a frequency distribution.
Worked Example
1-10 2
11-20 9
21-30 25
31-40 30
41-50 10
51-60 4
Question: What is the lower class boundary of the second class interval?
Answer: 10.5
Question: What is the upper class boundary of the last class interval?
Answer: 60.5
Question: What is the class width of the intervals?
Answer: The first class interval is 1-10. Using the first class interval,
= 10.5 − 0.5
= 10
Question: What is the lower class limit of the 4th class interval?
Answer: 31
(a) Fill out the cumulative frequency column of the data given above.
1-10 2 2
Note: The value
11-20 9 11
obtained in the last
21-30 25 36 row should be equal
to the total number
31-40 30 66
stated in the question.
41-50 10 76
51-60 4 80
(b) Draw a cumulative frequency curve using the information provided above.
Note: When drawing a cumulative frequency curve, here are some useful tips:
- First, determine the upper class boundary for each class interval of the grouped
data given as these values will be used to plot the cumulative frequency curve.
- In this case, the 𝑥-axis will represent age and 𝑦-axis is almost always used to
- The shape of the curve looks like a ‘stretched 𝑆’ and is called an ogive.
- Use a small ‘×’ to plot the points and a fine pencil to connect them.
1-10 2 2 10.5
11-20 9 11 20.5
21-30 25 36 30.5
31-40 30 66 40.5
41-50 10 76 50.5
51-60 4 80 60.5
Title: Cumulative Frequency Graph showing the age of 80 persons
Cumulative
Frequency
Scale
𝑥-axis: 1 cm = 5 years
𝑦-axis: 1 cm = 5 persons
Age (years)
(c) What is the median age?
Equation:
IQR = 𝑄3 − 𝑄1
𝑄3 = 3rd quartile
𝑄1 = 1st quartile
The IQR (interquartile range) describes the middle 50% of values when ordered
3(𝑛+1) 3(80+1)
The 3rd quartile occurs at the = = 60.75th value
4 4
OR
3𝑛 3(80)
The 3rd quartile occurs at the = = 60th value
4 4
OR
𝑛 80
The 1st quartile occurs at the 4 = = 20th value
4
∴ IQR = 𝑄3 − 𝑄1
= 38 − 25
= 13 years
IQR
SIQR =
2
13
= 2
= 6.5 years
Using the cumulative frequency curve, it can be deduced that 76 persons are under
4
= 80
1
= 20 or 0.05 or 5%
Cumulative
Frequency Scale
𝑥-axis: 1 cm = 5 years
𝑦-axis: 1 cm = 10 persons
Age (years)
Solution:
𝑛+1 60+1 61
The median value occurs at the = = = 30.5th value
2 2 2
OR
𝑛 60
The median value occurs at the 2 = = 30th value
2
Solution:
3𝑛 3(60)
The 3rd quartile occurs at the = = 45th value
4 4
Solution:
𝑛 60
The 1st quartile occurs at the 4 = = 15th value
4
dispersion of a set of values. A low standard deviation indicates that the values tend to
be close to the mean of the set, while a high standard deviation indicates that the values
➢ The standard deviation tells me how much the individual numbers are deviating
Example:
The tables below show the amount of weekly spending money between two students.
Notice that the average of both students, Jeremiah and Cromwell are equal to each
other. However, in Cromwell’s situation, the average does not represent how much
money he receives for the week. We can use the standard deviation to show how much
- “spread”
- “deviates”
- “mean”
Example:
Diagram 1:
Standard deviation = 5
Diagram 2:
Standard deviation = 1
For diagram 1, suppose that the standard deviation is 5 and for diagram 2, suppose that
the standard deviation is 1. The standard deviation of diagram 1 is larger than the
data, it means that the values of diagram 1 were more ‘spread out’ or distributed over
its range than the values in diagram 2. In diagram 1, they deviated further from the