Make A Grouped Frequency Distribution (Quantitative Variable)
Make A Grouped Frequency Distribution (Quantitative Variable)
The table and a histogram will be displayed in the Session Window. The midpoint of each class is
displayed, not the class limits or boundaries. “TEMPERATURES,” the column name, is truncated
to eight characters for display only.
2–3
Histograms, Frequency After the data have been organized into a frequency distribution, they can be presented
Polygons, and Ogives in graphic forms. The purpose of graphs in statistics is to convey the data to the viewer
Objective 2. Represent data
in pictorial form. It is easier for most people to comprehend the meaning of data pre-
in frequency distributions sented graphically than data presented numerically in tables or frequency distributions.
using histograms, frequency This is especially true if they have little or no statistical knowledge.
polygons, and ogives. Statistical graphs can be used to describe the data set or analyze it. Graphs are
also useful in getting the audience’s attention in a publication or a speaking presenta-
tion. They can be used to discuss an issue, reinforce a critical point, or summarize a
data set. They can also be used to discover a trend or pattern in a situation over a pe-
riod of time.
Section 2–3 Histograms, Frequency Polygons, and Ogives 45
Frequency 4
1
x
5.5 10.5 15.5 20.5 25.5 30.5 35.5 40.5
Class boundaries
(a) Histogram
4
Frequency
1
x
8 13 18 23 28 33 38
Class midpoints
(b) Frequency polygon
20
18
16
Cumulative frequency
14
12
10
8
6
4
2
x
5.5 10.5 15.5 20.5 25.5 30.5 35.5 40.5
Class boundaries
(c) Cumulative frequency graph
46 Chapter 2 Frequency Distributions and Graphs
The Histogram
The histogram is a graph that displays the data by using vertical bars of various heights to represent the
frequencies.
Example 2–4 Construct a histogram to represent the data shown below for the record high tempera-
tures for each of the 50 states (see Example 2–2).
Historical Note Class boundaries Frequency
Graphs originated when 99.5–104.5 2
ancient astronomers drew 104.5–109.5 8
the position of the stars in
the heavens. Roman
109.5–114.5 18
surveyors also used 114.5–119.5 13
coordinates to locate 119.5–124.5 7
landmarks on their maps.
124.5–129.5 1
The development of
statistical graphs can be 129.5–134.5 1
traced to William Playfair
(1748–1819), an engineer/
Solution
drafter who used graphs STEP 1 Draw and label the x and y axes. The x axis is always the horizontal axis,
to present economic data
pictorially.
and the y axis is always the vertical axis.
STEP 2 Represent the frequency on the y axis and the class boundaries on the x axis.
STEP 3 Using the frequencies as the heights, draw vertical bars for each class. See
Figure 2–2.
Figure 2–2 y
Histogram for 18
Example 2–4
15
Frequency
12
3
x
0
99.5° 104.5° 109.5° 114.5° 119.5° 124.5° 129.5° 134.5°
Temperature
As the histogram shows, the class with the greatest number of data values (18) is
109.5–114.5, followed by 13 for 114.5–119.5. The graph also has one peak with the data
clustering around it.
The Frequency Polygon Another way to represent the same data set is by using a frequency polygon.
The frequency polygon is a graph that displays the data by using lines that connect points plotted for
the frequencies at the midpoints of the classes. The frequencies are represented by the heights of the
points.
Section 2–3 Histograms, Frequency Polygons, and Ogives 47
The next example shows the procedure for constructing a frequency polygon.
Example 2–5 Using the frequency distribution given in Example 2–4, construct a frequency polygon.
Solution
STEP 1 Find the midpoints of each class. Recall that midpoints are found by adding
the upper and lower boundaries and dividing by 2.
99.5 104.5 104.5 109.5
102 107
2 2
STEP 2 Draw the x and y axes. Label the x axis with the midpoint of each class, and
then use a suitable scale on the y axis for the frequencies.
STEP 3 Using the midpoints for the x values and the frequencies as the y values, plot
the points.
STEP 4 Connect adjacent points with straight lines. Draw a line back to the
x axis at the beginning and end of the graph, at the same distance that
the previous and next midpoints would be located, as shown in
Figure 2–3.
48 Chapter 2 Frequency Distributions and Graphs
Figure 2–3 y
Frequency Polygon for 18
Example 2–5
15
Frequency
12
3
x
0
102° 107° 112° 117° 122° 127° 132°
Temperature
The frequency polygon and the histogram are two different ways to represent
the same data set. The choice of which one to use is left to the discretion of the
researcher.
The Ogive The third type of graph that can be used represents the cumulative frequencies for the
classes. This type of graph is called the cumulative frequency graph or ogive. The cu-
mulative frequency is the sum of the frequencies accumulated up to the upper bound-
ary of a class in the distribution.
The ogive is a graph that represents the cumulative frequencies for the classes in a frequency distribution.
Example 2–6 Construct an ogive for the frequency distribution described in Example 2–4.
Solution
STEP 1 Find the cumulative frequency for each class.
Class boundaries Cumulative frequency
99.5–104.5 2
104.5–109.5 10
109.5–114.5 28
114.5–119.5 41
119.5–124.5 48
124.5–129.5 49
129.5–134.5 50
STEP 2 Draw the x and y axes. Label the x axis with the class boundaries. Use
an appropriate scale for the y axis to represent the cumulative frequen-
cies. (Depending on the numbers in the cumulative frequency columns,
scales such as 0, 1, 2, 3, . . . , or 5, 10, 15, 20, . . . , or 1000, 2000,
Section 2–3 Histograms, Frequency Polygons, and Ogives 49
3000, . . . can be used. Do not label the y axis with the numbers in the
cumulative frequency column.) In this example, a scale of 0, 5, 10, 15, . . .
will be used.
STEP 3 Plot the cumulative frequency at each upper class boundary, as shown in
Figure 2–4. Upper boundaries are used since the cumulative frequencies
represent the number of data values accumulated up to the upper boundary
of each class.
Figure 2–4
y
Plotting the Cumulative
50
Frequency for
45
Example 2–6
40
35
Cumulative
Frequency
30
25
20
15
10
5 x
0
99.5° 104.5° 109.5° 114.5° 119.5° 124.5° 129.5° 134.5°
Temperature
STEP 4 Starting with the first upper class boundary, 104.5, connect adjacent points
with straight lines, as shown in Figure 2–5. Then extend the graph to the
first lower class boundary, 99.5, on the x axis.
Figure 2–5 y
Ogive for Example 2–6 50
45
40
35
Cumulative
Frequency
30
25
20
15
10
5 x
0
99.5° 104.5° 109.5° 114.5° 119.5° 124.5° 129.5° 134.5°
Temperature
Cumulative frequency graphs are used to visually represent how many values are
below a certain upper class boundary. For example, to find out how many record high
temperatures are less than 114.5, locate 114.5 on the x axis, draw a vertical line up un-
til it intersects the graph, and then draw a horizontal line at that point to the y axis. The
y axis value is 28, as shown in Figure 2–6.
50 Chapter 2 Frequency Distributions and Graphs
Figure 2–6 y
Finding a Specific 50
Cumulative Frequency 45
40
35
Cumulative
Frequency
30
28
25
20
15
10
5 x
0
99.5° 104.5° 109.5° 114.5° 119.5° 124.5° 129.5° 134.5°
Temperature
The steps for drawing the three types of graphs are shown in the Procedure Table.
Procedure
Procedure Table
Table
Relative Frequency The histogram, the frequency polygon, and the ogive shown previously were con-
Graphs structed by using frequencies in terms of the raw data. These distributions can be
converted into distributions using proportions instead of raw data as frequencies. These
types of graphs are called relative frequency graphs.
Graphs using relative frequencies instead of frequencies are used when the propor-
tion of data values that fall into a given class is more important than the actual number
of data values that fall into that class. For example, if one wanted to compare the age
distribution of adults in the city of Philadelphia, Pennsylvania, with the age distribution
of adults of Erie, Pennsylvania, one would use relative frequency distributions. The rea-
son is that since the population of Philadelphia is 1,478,002 and the population of Erie
is 105,270, the bars using the actual data values for Philadelphia would be much taller
than those for the same classes for Erie.
To convert a frequency into a proportion or relative frequency, divide the frequency
for each class by the total of the frequencies. The sum of the relative frequencies will al-
ways be 1. These graphs are similar to the ones that use raw data as frequencies, but the
values on the y axis are in terms of proportions. The next example shows the three types
of relative frequency graphs.
Section 2–3 Histograms, Frequency Polygons, and Ogives 51
Example 2–7 Construct a histogram, frequency polygon, and ogive using relative frequencies for the
distribution (shown here) of the miles 20 randomly selected runners ran during a given
week.
Class Cumulative
boundaries Frequency frequency
5.5–10.5 1 1
10.5–15.5 2 3
15.5–20.5 3 6
20.5–25.5 5 11
25.5–30.5 4 15
30.5–35.5 3 18
35.5–40.5 2 20
20
Solution
STEP 1 Convert each frequency to a proportion or relative frequency by dividing the
frequency for each class by the total number of observations.
For class 5.5–10.5, the relative frequency is 201 0.05.
For class 10.5–15.5, the relative frequency is 202 0.10.
For class 15.5–20.5, the relative frequency is 203 0.15.
And so on.
STEP 2 Using the same procedure, find the relative frequencies for the cumulative
frequency column. The relative frequencies are shown here.
Cumulative
Class Relative relative
boundaries Midpoints frequency frequency
5.5–10.5 8 0.05 0.05
10.5–15.5 13 0.10 0.15
15.5–20.5 18 0.15 0.30
20.5–25.5 23 0.25 0.55
25.5–30.5 28 0.20 0.75
30.5–35.5 33 0.15 0.90
35.5–40.5 38 0.10 1.00
1.00
STEP 3 Draw each graph as shown in Figure 2–7. For the histogram and ogive, use
the class boundaries along the x axis. For the frequency polygon, use the
midpoints on the x axis. The scale on the y axis uses proportions.
When analyzing histograms and frequency polygons, look at the shape of the curve.
For example, does it have one peak or two peaks, or is it relatively flat, or is it
U-shaped? Are the data values spread out on the graph, or are they clustered around the
52 Chapter 2 Frequency Distributions and Graphs
Figure 2–7 y
Graphs for Example 2–7 0.25
0.20
Relative frequency
0.15
0.10
0.05
x
0
5.5 10.5 15.5 20.5 25.5 30.5 35.5 40.5
Miles
(a) Histogram
y
0.25
0.20
Relative frequency
0.15
0.10
0.05
x
0
8 13 18 23 28 33 38
Miles
(b) Frequency polygon
y
1.00
Cumulative relative frequency
0.80
0.60
0.40
0.20
x
0
5.5 10.5 15.5 20.5 25.5 30.5 35.5 40.5
Miles
(c) Ogive
Section 2–3 Histograms, Frequency Polygons, and Ogives 53
center? Are there data values in the extreme ends? These may be outliers. (See Section
3–4 for an explanation of outliers.) Are there any gaps in the histogram, or does the fre-
quency polygon touch the x axis somewhere other than the ends? Finally, are the data
clustered at one end or the other, indicating a skewed distribution? (See Section 3–2 for
an explanation of skewness.)
For example, the histogram for the record high temperatures shown in Figure 2–2
(page 46 ) shows a single peaked distribution, with the class 109.5–114.5 containing the
largest number of temperatures. The distribution has no gaps, and there are fewer tem-
peratures in the highest class than in the lowest class.
Exercises
2–19. For 108 randomly selected college applicants, the 2–23. Thirty automobiles were tested for fuel efficiency, in
following frequency distribution for entrance exam scores miles per gallon (mpg). The following frequency
was obtained. Construct a histogram, frequency polygon, distribution was obtained. Construct a histogram, frequency
and ogive for the data. (The data for this exercise will be polygon, and ogive for the data. (The data for this exercise
used for Exercise 2–31.) will be used for Exercise 2–33.)
Class limits Frequency Class boundaries Frequency
90–98 6 7.5–12.5 3
99–107 22 12.5–17.5 5
108–116 43 17.5–22.5 15
117–125 28 22.5–27.5 5
126–134 9 27.5–32.5 2
Applicants who score above 107 need not enroll in a 2–24. Construct a histogram, frequency polygon, and
summer developmental program. In this group, how many ogive for the data in Exercise 2–14 and analyze the results.
students do not have to enroll in the developmental (The data in this exercise will be used for Exercise 2–34.)
program? 2–25. In a class of 35 students, the following grade
2–20. For 75 employees of a large department store, the distribution was found. Construct a histogram, frequency
following distribution for years of service was obtained. polygon, and ogive for the data. (A 4, B 3, C 2,
Construct a histogram, frequency polygon, and ogive for D 1, F 0.) (The data in this exercise will be used for
the data. (The data for this exercise will be used for Exercise 2–35.)
Exercise 2–32.) Grade Frequency
Class limits Frequency 0 3
1–5 21 1 6
6–10 25 2 9
11–15 15 3 12
16–20 0 4 5
21–25 8
A grade of C or better is required for the next level course.
26–30 6
Were the majority of the students able to meet this
requirement?
A majority of the employees have worked for how many
years or less? 2–26. In a study of reaction times of dogs to a specific
stimulus, an animal trainer obtained the following data,
2–21. Construct a histogram, frequency polygon, and
given in seconds. Construct a histogram, frequency
ogive for the data in Exercise 2–11 and analyze the results.
polygon, and ogive for the data and analyze the results.
2–22. Construct a histogram, frequency polygon, and (The histogram in this exercise will be used for Exercises
ogive for the data in Exercise 2–12 and analyze the results. 2–36, 3–16, and 3–68.)
54 Chapter 2 Frequency Distributions and Graphs
Class limits Frequency 2–35. For the data in Exercise 2–25, construct a
histogram, frequency polygon, and ogive, using
2.3–2.9 10
relative frequencies. What proportion of the students
3.0–3.6 12 cannot meet the requirement for enrollment in the
3.7–4.3 6 next course?
4.4–5.0 8
2–36. The animal trainer in Exercise 2–26 selected another
5.1–5.7 4
group of dogs that were much older than the first group and
5.8–6.4 2 measured their reaction times to the same stimulus.
Construct a histogram, frequency polygon, and ogive for
2–27. Construct a histogram, frequency polygon, and the data.
ogive for the data in Exercise 2–15 and analyze the results.
Class limits Frequency
2–28. To determine their lifetimes, 80 randomly selected
batteries were tested. The following frequency distribution 2.3–2.9 1
was obtained. The data values are in hours. Construct a 3.0–3.6 3
histogram, frequency polygon, and ogive for the data and 3.7–4.3 4
analyze the results. 4.4–5.0 16
5.1–5.7 14
Class boundaries Frequency
5.8–6.4 4
63.5–74.5 10
74.5–85.5 15
Analyze the results and compare the histogram for this
85.5–96.5 22 group with the one obtained in Exercise 2–26. Are there
96.5–107.5 17 any differences in the histograms? (The data in this
107.5–118.5 11 exercise will be used for Exercise 3–16 and 3–68.)
118.5–129.5 5
*2–37. Using the following histogram:
a. Construct a frequency distribution; include class limits,
2–29. Construct a histogram, frequency polygon, and class frequencies, midpoints, and cumulative frequencies.
ogive for the data in Exercise 2–16 and analyze the results. b. Construct a frequency polygon.
c. Construct an ogive.
2–30. For the data in Exercise 2–18, construct a histogram
for the home run distances for each player and compare y
them. Are they basically the same, or are there any 7
noticeable differences? Explain your answer. 6
Frequency
2–33. For the data in Exercise 2–23, construct a histogram, *2–38. Using the results from Exercise 2–37, answer the
frequency polygon, and ogive, using relative frequencies. following questions.
What proportion of the automobiles had a fuel efficiency of a. How many values are in the class 27.5–30.5?
17.5 miles per gallon or higher? b. How many values fall between 24.5 and 36.5?
c. How many values are below 33.5?
2–34. For the data in Exercise 2–14, construct a histogram,
d. How many values are above 30.5?
frequency polygon, and ogive, using relative frequencies.
Section 2–3 Histograms, Frequency Polygons, and Ogives 55
Example TI2–1
Plot a histogram for the following data from Examples 2–2 and 2–4.
112 100 127 120 134 118 105 110 109 112
110 118 117 116 118 122 114 114 105 109
107 112 114 115 118 117 118 122 106 110
116 108 110 121 113 120 119 111 104 111
120 113 120 117 105 110 118 112 114 114
Set the Window values as follows:
Xmin 100
Xmax 135
Ymin 5
Ymax 20
Input Input
Section 2–3 Histograms, Frequency Polygons, and Ogives 57
Press TRACE and use the arrow keys to determine the number of values in each group.
Output
Excel presents both a table and a chart on the new worksheet ply. It decides “bins” for the his-
togram itself (here it picked a bin size of seven units), but you can also define your own bin range
on the data worksheet. Here is the histogram with Excel-selected bins.
58 Chapter 2 Frequency Distributions and Graphs
Histogram for
Example XL2–1
2–4 In addition to the histogram, the frequency polygon, and the ogive, several other types
of graphs are often used in statistics. They are the Pareto chart, the time series graph,
Other Types of Graphs and the pie graph. Figure 2–8 shows an example of each type of graph.
Figure 2–8 y
How people get to work
Other Types of Graphs 30
Used in Statistics
25
20
Frequency
15
10
x
0
Auto Bus Trolley Train Walk
(a) Pareto chart
y
Temperature over a 9-hour period Marital status of employees
at Brown’s department store
60°
55° Married
Temperature
50%
50°
Widowed
5%
45° Single
Divorced 18%
27%
40°
x
0
12 1 2 3 4 5 6 7 8 9
Time
(b) Time series graph (c) Pie graph