0% found this document useful (0 votes)
96 views22 pages

MATH 121 Chapter 2 Frequency Distribution Graphs

Uploaded by

gjbabocado23
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
96 views22 pages

MATH 121 Chapter 2 Frequency Distribution Graphs

Uploaded by

gjbabocado23
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 22

MODULE 1.

2
Chapter
2 Frequency Distribution and Graphs
Learning Objectives
After completing this chapter, the students will able to:
o Define some basic terms in formulation of frequency distribution.
o Organize data into a frequency distribution.
o Compute for the midpoints of each class on a frequency distribution table.
o Compare and contrast raw data and frequency distribution.
o Construct a stem-and-leaf plot.
o Represents frequency distribution graphically using histogram, frequency polygons and
cumulative frequency polygon (ogive).
o Draw the data using pareto chart, bar chart, pie chart, time series graph, pictograph, and
scatter plot.
o Give the importance of graphs in statistics.
o Construct the different graphs and charts.

Chapter Outline
2.1 Introduction
2.2 Defining Some Terms
2.3 Constructing Frequency Distribution
Categorical Frequency Distribution
Determining Class Interval
Grouped Frequency Distribution
2.4 Stem-and-Leaf Plot
2.5 Graphing Frequency Distribution
Histogram
Frequency Polygon
Cumulative Frequency Polygon (or Ogive)
2.6 Other Types of Graphs
Pareto Chart
Bar Chart
Pie Chart
Time Series Graph
Pictograph
Scatter Plot
2.7 Guideline for Developing Good Graphs/Charts

Statistics: The only Science that enables different experts using the same figure to
draw different conclusions.
– Evan Esar

Page | 1
2.1 Introduction

When conducting a statistical research, investigation or study, the research must


gather data for the particular variable under investigation. To describe the situations, and
draw inferences about events, the researcher must organize the data gathered in some
meaningful way. The easiest way and widely used of organizing data is to construct a
frequency distribution. A frequency distribution is a grouping of the data into categories
showing the number of observations in each of the non-overlapping classes.
After organizing data, the next move of the researcher is to present the data so
they can be understood easily by those who will benefit from reading the study. The most
useful method of presenting data is by constructing graphs and charts. These are number
of ways to plot graphs and charts, and each one has a specific purpose.
The chapter discussed how to organize data by constructing frequency distribution
and how to present data by constructing graphs and charts.

2.1 Defining Some Terms

Before we get started in constructing frequency distribution, we must define some


terms that are essential to understand deeper the nature of data that are displayed in a
frequency distribution.

• Raw data is the data arranged neither in ascending nor descending order.
• Range is the difference of the highest value and the lowest value in a distribution.
• Frequency distribution is the organization of data in a tabular form, using
mutually exclusive classes showing the number of observation in each.
• Class Limits (or Apparent Limits) is the highest and lowest values describing a
class.
• Class Boundaries (or Exact Limits) is the upper and lower values of a class for
group frequency distribution whose values has additional decimal place more than
the class limits and end with the digit 5.
• Class Interval (or Class Width) is the distance between the class lower boundary
and the class upper boundary and it is denoted by the symbol i.
• Frequency (F) is the number of values in a specific class of frequency
distribution.
• Relative Frequency (RF) is the value obtained when the frequencies in each class
of the frequency distribution is divided by the total number of values.
• Percentage is obtained by multiplying the relative frequency by 100%.
• Cumulative Frequency (CF) is the sum of the frequencies accumulated up to the
upper boundary of a class in a frequency distribution.
• Class Midpoint (or Class Mark)is the point halfway between the class limits of
each class and is representative of the data within that class.

Page | 2
2.3 Constructing Frequency Distribution

A grouped frequency distribution is used when the range of the data set is large; the
data must be grouped into classes whether it is categorical data or interval data. For
interval data, the classes are more than one unit in width. The procedure in constructing
the frequency distribution is discussed in the succeeding sections.

A. Categorical Frequency Distribution


The categorical frequency distribution is used to organize nominal-level or ordinal-
level type of data. Some examples where we can apply this distribution are gender,
business type, political affiliation, and others.

Example: Twenty applicants were given a performance evaluation appraisal. The data
set is

High High High Low Average


Average Low Average Average Average
Low Average Average High High
Low Low Average High High

Construct a frequency distribution for the data

Solution:

Step 1: Construct a table as shown below.

Class Tally Frequency Percent


High
Average
Low

Step 2: Tally the raw data

Class Tally Frequency Percent


High IIII-II
Average IIII-III
Low IIII

Step 3:
Convert the tallied data into numerical
frequencies.

Page | 3
Class Tally Frequency Percent
High IIII-II 7
Average IIII-III 8
Low IIII 5
Step 4:Determine the percentage. The percentage is computed using the formula:
f
% = n x100 %, where f frequency of the class and n total number of values.

Class Tally Frequency Percent Found by:


High IIII-II 7 35 (7 ÷ 20) x 100
Average IIII-III 8 40 (8 ÷ 20) x 100
Low IIII 5 25 (5 ÷ 20) x 100
Total 20 100

For the sample, more applicants received an average of performance rating.

B. Determining Class Interval


Generally the number of classes for a frequency distribution table varies from 5 to 20,
depending primarily on the number of observations in the data set. It is preferably to have
more classes as the size of a data set increases. The decision about the number of classes
depends on the method used by the researcher.

1. Rule 1.To determine the number of classes is to use the smallest positive integer k
such that 2k ≥ n, where n is the total number of observations. Using Formula 2 -1we
can obtain the ideal class interval.
𝑅𝑎𝑛𝑔𝑒 HV − LV
Suggested Class Interval (i) = Number of classes = (Formula 2-1)
𝑘

where: HV = Highest value in a data set LV = Lowest value in a data set


k = number of classes i = suggested class interval

2. Rule 2. Another way to determine the class interval we can apply formula 2-2.
Range
Suggested Class Interval = 1+3.322 (logarithm of total frequencies) (Formula 2 -2)

3. Rule 3. Another guideline to determine the class interval is to have an ideal number
of classes, then apply Formula 2-3.

Highest Value – Lowest Value


Suggested Class Interval = (Formula 2 -3)
Number of Classe

Page | 4
C. Grouped Frequency Distribution

Example 1: Suppose a researcher wished to do study on the monthly salary (in ₱


thousands) of call center agents of selected call center companies. The research first
would have to collect the data by asking each call center agents about their monthly
salary. The data collected in original form is called raw data. In this case, the data are

P18.80 P22.00 P23.40 P24.30 P27.00 P27.90 P31.00 P26.00 P20.80 P17.00
20.00 22.60 23.40 24.50 27.00 29.30 32.10 26.10 21.00 17.30
20.25 22.75 23.70 24.70 27.40 30.10 33.70 26.30 21.60 17.80
18.40 21.90 23.00 23.85 26.80 27.80 30.80 25.00 20.40 15.50
18.70 21.90 23.20 24.10 26.90 27.90 30.90 25.20 20.50 15.70
17.95 21.75 22.90 23.70 26.50 27.50 30.60 24.75 20.25 14.10
18.35 21.80 22.90 23.70 26.50 27.60 30.75 25.00 20.30 14.30
20.20 22.80 23.50 24.60 27.30 29.50 32.90 26.20 21.30 17.40

Construct a frequency distribution using Rule 1 and determine the following:


a. Range e. Relative frequencies
b. Interval f. Percentages
c. Class limits g. Cumulative frequencies
d. Class boundaries h. Midpoints

Solution:
Step 1:Arrange the new data in ascending or descending order. In this particular example
we will arrange raw data in ascending order. This will make it easier for us to
tally the data.

P14.10 P17.95 P20.25 P21.75 P22.90 P23.70 P24.75 P26.50 P27.50 P30.60
14.30 18.35 20.30 21.80 22.90 23.70 25.00 26.50 27.60 30.75
15.50 18.40 20.40 21.90 23.00 23.85 25.00 26.80 27.80 30.80
15.70 18.70 20.50 21.90 23.20 24.10 25.20 26.90 27.90 30.90
17.00 18.80 20.80 22.00 23.40 24.30 26.00 27.00 27.90 31.00
17.30 20.00 21.00 22.60 23.40 24.50 26.10 27.00 29.30 32.10
17.40 20.20 21.30 22.75 23.50 24.60 26.20 27.30 29.50 32.90
17.80 20.25 21.60 22.80 23.70 124.70 26.30 27.40 30.10 33.70

Step 2:Determine the classes.

• Find the highest and lowest value.


Highest Value (HV) = 33.70 and Lowest Value (LV) = 14.10

• Find the range.


Range = Highest Value (HV) – Lowest Value (LV) = 33.70 – 14.10 = 19.60

Page | 5
• Determine the number of classes.
The objective is to use just enough classes. We can determine the number
of classes (k) using “2 to the k rule”. This will enable us to select the smallest
number (k) for the number of classes such that 2k (raised to the power of k) is
greater than the number of observations (n). Using our example, there are 80
call center agents (or n = 80). If we apply k = 6, which means we would use 6
classes, then 2k = 26 = 64, somewhat less than 80. Thus, 6 is not enough
classes. If we try k = 7, then 2k = 27 = 128, which is greater than 80.
Therefore, the recommended number of classes is 7.

• Determine the class interval (or width).

Generally the class interval (or width) should be equal for all classes. The
classes must cover all the values in the raw data (that is, from lowest to highest).
Class interval is generated using the formula:
Range HV – LV 19.60
Suggested Class Interval=Number of Classes = - = 2.80 ≈3
𝑘 7

Note: Round the value of the interval up to the nearest whole number if there is a remainder.

• Select a starting point for the lowest class limit.

The starting point can be the smallest data value or any convenient
number less than the smallest data value. In our case 14 is used.

• Set the individual class limit.

We need to add the interval (or width) to the lowest score taken as the
starting point to obtain the lower limit of the next class. Keep adding until we
reach the 7 classes, as reflected 14, 17, 20, 23, 26, 29, and 32.
To obtain the upper class limits, we need to subtract one unit to the lower
limit of the second class to obtain the upper class. That is, 17 – 1 = 16. Then
add the interval (or width) to each upper limit to obtain all the upper limits.

Class Limits
14 – 16
17 – 19
20 – 22
23 – 25
26 – 28
29 – 31
32 – 34

Page | 6
• Set the class boundaries in each class. To obtain the class boundaries, we
need to subtract 0.5 from each lower class limit and add 0.5 to each upper
class limit.

Class Limits Class Boundaries


14 – 16 13.5 – 16.5
17 – 19 16.5 – 19.5
20 – 22 19.5 – 22.5
23 – 25 22.5 – 25.5
26 – 28 25.5 – 28.5
29 – 31 28.5 – 31.5
32 – 34 31.5 – 34.5

Step 3:Tally the raw data.

Class Limits Class Boundaries Tally


14 – 16 13.5 – 16.5 IIII
17 – 19 16.5 – 19.5 IIII-IIII
20 – 22 19.5 – 22.5 IIII-IIII-IIII-I
IIII-IIII-IIII-IIII-
23 – 25 22.5 – 25.5
III
26 – 28 25.5 – 28.5 IIII-IIII-IIII-II
29 – 31 28.5 – 31.5 IIII-III
32 – 34 31.5 – 34.5 III

Step 4:Convert the tallied data into numerical frequencies.

Class Limits Class Boundaries Tally Frequency


14 – 16 13.5 – 16.5 IIII 4
17 – 19 16.5 – 19.5 IIII-IIII 9
20 – 22 19.5 – 22.5 IIII-IIII-IIII-I 16
23 – 25 22.5 – 25.5 IIII-IIII-IIII-IIII-III 23
26 – 28 25.5 – 28.5 IIII-IIII-IIII-II 17
29 – 31 28.5 – 31.5 IIII-III 8
32 – 34 31.5 – 34.5 III 3

Page | 7
Step 5:Determine the relative frequency. It can be found by dividing each frequency by
the total frequency

Class Limits Frequency Relative Frequency Found by


14 – 16 4 0.05 4 ÷ 80
17 – 19 9 0.11 9 ÷ 80
20 – 22 16 0.20 16 ÷ 80
23 – 25 23 0.29 23 ÷ 80
26 – 28 17 0.21 17 ÷ 80
29 – 31 8 0.10 8 ÷ 80
32 – 34 3 0.04 3 ÷ 80

Step 6:Determine the percentage. It can be found by multiplying 100% in each relative
frequency.

Class Limits Frequency Percentage Found by


14 – 16 4 5 (4 ÷ 80) x 100
17 – 19 9 11 (9 ÷ 80) x 100
20 – 22 16 20 (16 ÷ 80) x 100
23 – 25 23 29 (23 ÷ 80) x 100
26 – 28 17 21 (17 ÷ 80) x 100
29 – 31 8 10 (8 ÷ 80) x 100
32 – 34 3 4 (3 ÷ 80) x 100
Total 80 100

Step 7:Determine the cumulative frequencies. The cumulative frequency can be found by
adding the frequency in each class to the total frequencies of the classes
preceding that class.

Class Limits Frequency Cumulative Frequency Found by


14 – 16 4 4 4
17 – 19 9 13 4+9
20 – 22 16 29 4+9+16
23 – 25 23 52 4+9+16+23
26 – 28 17 69 4+9+16+23+17
29 – 31 8 77 4+9+16+23+17+8
32 – 34 3 80 4+9+16+23+17+8+3

Page | 8
Step 8:Determine the midpoints. The midpoint can be found by getting the average of the
upper limit and lower limit in each class.

Class Limits Frequency Midpoints Found by


14 – 16 4 15 (14+16) ÷ 2
17 – 19 9 18 (17+19) ÷ 2
20 – 22 16 21 (20+22) ÷ 2
23 – 25 23 24 (23+25) ÷ 2
26 – 28 17 27 (26+28) ÷ 2
29 – 31 8 30 (29+31) ÷ 2
32 – 34 3 33 (32+34) ÷ 2

Example 2: SJS Travel Agency, a nationwide local travel agency, offers special rates on
summer period. The owner wants additional information on the ages of those people
taking travel tours. A random sample of 50 customers taking travel tours last summer
revealed these ages.

18 29 42 57 61 67 37 49 53 47
24 34 45 58 63 70 39 51 54 48
28 36 46 60 66 77 40 52 56 49
19 31 44 58 62 68 38 50 54 48
27 36 46 59 64 74 39 51 55 48

Construct a frequency distribution using Rule 2 and determine the following:


a. Range e. Relative frequencies
b. Interval f. Percentages
c. Class limits g. Cumulative frequencies
d. Class boundaries h. Midpoints

Solution:

Step1:Arrange the raw data in ascending order.

18 29 37 42 47 49 53 57 61 67
19 31 38 44 48 50 54 58 62 68
24 34 39 45 48 51 54 58 63 70
27 36 39 46 48 51 55 59 64 74
28 36 40 46 49 52 56 60 66 77

Page | 9
Step2:Determine the classes.

• Find the highest and lowest value.


Highest Value (HV) = 77 and Lowest Value (LV) = 18

• Find the range.


Range = Highest Value (HV) – Lowest Value (LV) = 77 – 18 = 59

• Determine the class interval or width.

Class interval is generated using Formula 2-2:


Range
Suggested Class Interval =1+3.322 (logarithm of total frequencies)
77 – 18 59
= 1+3.322(log 50) = 1+3.322(1.698970004)
59
= 6.643978354 = 8.88 ≈ 9

• Select a starting point for the lowest class limit. The lowest value in the data set is
18, this will also serve as our starting point.

• Set the individual class limit. We will add 9 to each lower class limit until
reaching the number of classes (18, 27, 36, 45, 54, 63, and 72). To obtain the
upper class limits, we need to subtract one unit to the lower limit of the second
class to obtain the upper limit of the first class. That is, 27 – 1 = 26. Then add the
interval (or width) to each upper limit to obtain all the upper limits (26, 35, 44, 53,
62, 71, and 80).

Class Limits
18 – 26
27 – 35
36 – 44
45 – 53
54 – 62
63 – 71
72 – 80

• Set the class boundaries in each class. To obtain the class boundaries, we need to
subtract 0.5 from each lower class limit and add 0.5 to each upper class.

Page | 10
Class Limits Class Boundaries
18 – 26 17.5 – 26.5
27 – 35 26.5 – 35.5
36 – 44 35.5 – 44.5
45 – 53 44.5 – 53.5
54 – 62 53.5 – 62.5
63 – 71 62.5 – 71.5
72 – 80 71.5 – 80.5

Step3:Tally the raw data.

Class Limits Class Boundaries Tally


18 – 26 17.5 – 26.5 III
27 – 35 26.5 – 35.5 IIII
36 – 44 35.5 – 44.5 IIII-IIII
45 – 53 44.5 – 53.5 IIII-IIII-IIII
54 – 62 53.5 – 62.5 IIII-IIII-I
63 – 71 62.5 – 71.5 IIII-I
72 – 80 71.5 – 80.5 II

Step 4:Convert the tallied data into numerical frequencies.

Class Limits Class Boundaries Tally Frequency


18-26 17.5-26.5 III 3
27-35 26.5-35.5 IIII 5
36-44 35.5-44.5 IIII-IIII 9
45-53 44.5-53.5 IIII-IIII-IIII 14
54-62 53.5-62.5 IIII-IIII-I 11
63-71 62.5-71.5 IIII-I 6
72-80 71.5-80.5 II 2

Step 5:Determine the relative frequency.

Class Limits Frequency Relative Frequency Found by


18-26 3 0.06 3÷50
27-35 5 0.10 5÷50
36-44 9 0.18 9÷50
45-53 14 0.28 14÷50
54-62 11 0.22 11÷50
63-71 6 0.12 6÷50
72-80 2 0.04 2÷50

Page | 11
Step 6: Determine the percentage.

Class Limits Frequency Percentage Found by


18-26 3 6 (5÷50) x 100
27-35 5 10 (5÷50) x 100
36-44 9 18 (9÷50) x 100
45-53 14 28 (14÷50) x 100
54-62 11 22 (11÷50) x 100
63-71 6 12 (6÷50) x 100
72-80 2 4 (2÷50) x 100
Total 50 100

Step 7:Determine the cumulative frequencies

Class Limits Frequency Cumulative Frequency Found by


18-26 3 3 3
27-35 5 8 3+5
36-44 9 17 3+5+9
45-53 14 31 3+5+9+14
54-62 11 42 3+5+9+14+11
63-71 6 48 3+5+9+14+11+6
72-80 2 50 3+5+9+14+11+6+2

Step 8:Determine the midpoints.

Class Limits Frequency Midpoints Found by


18-26 3 22 (18+26) ÷ 2
27-35 5 31 (27+35) ÷ 2
36-44 9 40 (36+44) ÷ 2
45-53 14 49 (45 + 53) ÷ 2
54-62 11 58 (54 + 62) ÷ 2
63-71 6 67 (63 + 71) ÷ 2
72-80 2 76 (72 + 80) ÷ 2

2.4 Stem-and-Leaf Plot

A statistician named John Tukey introduced the stem-and-leaf plot. The objective
of this method is to some extent overcomes the loss of actual observations brought about
by the histogram. The advantage of the stem-and-leaf plot over the histogram is that we
can see the actual observations.
The stem is the leading digit or digits and the leaf is the trailing digit. The stem is
placed at the first column and the leaf at the second column.

Page | 12
Example 1: SJS Travel Agency, a nationwide local travel agency, offers special rates on
summer period. The owner wants additional information on the ages of people taking
travel tours. A random sample random sample of 50 customers taking travel tours last
summer revealed these ages.
18 29 37 42 47 49 53 57 61 67
19 31 38 44 48 50 54 58 62 68
24 34 39 45 48 51 54 58 63 70
27 36 39 46 48 51 55 59 64 74
28 36 40 46 49 52 56 60 66 77
Construct a stem-and-leaf plot.
Solution:
The stems (leading digits0 for the raw data are 1, 2, 3, 4, 5, 6, 7. The leaves for each
stem (trailing digit) are recorded at the same row and are rank-ordered to form a stem-
and-leaf plot.
Stem Leaf
1 8, 9
2 4, 7, 8, 9
Tens digit 3 1, 4, 6, 6, 7, 8, 9, 9,
Units digit
(leading digits) 4 0, 2, 4, 5, 6, 6, 7, 8, 8, 8, 9, 9
(trailingdigits)
5 0, 1, 1, 2, 3, 4, 4, 5, 6, 7, 8, 8, 9
6 0, 1, 2, 3, 4, 6, 7, 8
7 0, 4, 7

2.5 Graphing Frequency Distribution

When the data set contains large number of values, making conclusions from an
ordered array or stem-and-leaf plot is often difficult. We will need graphs or charts in
such situations. There are a number of graphs or charts to visually show numerical data.
These include histogram, frequency polygon, and cumulative frequency(ogive).
In this section, we discussed several graphical methods that are used for interval data.
The most important of these graphical methods is the histogram. Histogram is a powerful
graphical technique used to summarize interval data, but it also helps explain an
important aspect of probability.

A. Histogram
A histogram is a graph in which classes are marked on the horizontal axis (x-axis)
and the class frequencies represents on the vertical axis (y-axis). The height of the bars
represents the class frequencies, and the bars are drawn adjacent to each other.
Nevertheless, the histogram focuses on the frequency of each class and sacrifices
whatever information was contained in the actual observations.

Page | 13
B. Frequency Polygon

A frequency polygon is a graph that displays the data using points which are
connected by lines. The frequencies are represented by the heights of the points at the
midpoints of the classes. The vertical axis represents the frequency of the distribution
while the horizontal axis represents the midpoints of the frequency distribution.

C. Cumulative Frequency Polygon


The cumulative frequency polygon or ogive (read as oh’-jive) is a graph that displays
the cumulative frequencies for the classes in a frequency distribution. The vertical axis
represents the cumulative frequency of the distribution while the horizontal axis the upper
class boundaries (real upper limits) of the frequency distribution.

Example: As shown below is the is the frequency distribution in the Example 1 in


Section 2.3.

Class Limits Class Boundaries Midpoints Frequency cf


14-16 13.5-16.5 15 4 4
17-19 16.5-19.5 18 9 13
20-22 19.5-22.5 21 16 29
23-25 22.5-25.5 24 23 52
26-28 25.5-28.5 27 17 69
29-31 28.5-31.5 30 8 77
32-34 31.5-34.5 33 3 80
Construct a histogram, frequency polygon, and cumulative frequency polygon. What
conclusions can you reached based on the information presented in the histogram.
Solution:

a. Constructing a histogram
Step 1: Find the midpoints of each class.
Step 2: Draw and label the x-axis and y-axis.
Step3: Represent the frequency on the y-axis and the midpoints on the x-axis.
Step 4: Use the frequency to represent the height and draw the vertical bars.
The class frequencies are scaled along the vertical axis and the class midpoints
along the horizontal axis. From Figure 2.1 we note that there are 4 employees in the
₱15,000 class midpoints or ₱14,000-16,000. Therefore, the height of the column for class
₱14,000-₱16,000 is 4. Applying the same thing to other classes we shall obtain the graph
below.

As the histogram shows, the class with the greatest number of data values (23) is
₱ 23,000-₱25,000, followed by 17 for ₱26,000-₱28,000. The graph also has one peak
with the data clustering around it.

Page | 14
Histogram for Call Center Agents' Salary
25
Frequency 20
15
10
5
0
15 18 21 24 27 30 33
Salary (in Thousand)

Figure 2.1: Histogram for Call Center Agents’ Salary

b. Constructing a Frequency Polygon


Step 1: Find the midpoints of each class.
Step 2: Draw and label the x-axis and y-axis
Step 3: Represent the frequency on the y-axis and the midpoints on the x-axis.

Step 4: Connect the adjacent points with the line segments. Draw a line back to the x-
axis at the beginning and end of the graph.

Frequency Polygon for Call Agents' Salary


25
20
Frequency

15
10
5
0
15 18 21 24 27 30 33
Salary (in Thousand)

Figure 2.2 Frequency Polygon for Call Center Agents’ Salary

c. Constructing a Cumulative Frequency Polygon (ogive)


Step 1: Find the cumulative distribution of the data set.

Page | 15
Class Limits Class Boundaries Frequency cf
14-16 13.5-16.5 4 4
17-19 16.5-19.5 9 13
20-22 19.5-22.5 16 29
23-25 22.5-25.5 23 52
26-28 25.5-28.5 17 69
29-31 28.5-31.5 8 77
32-34 31.5-34.5 3 80

Step 2: Draw and label the x-axis and y-axis.

Step 3: Represent the frequency on the y-axis and the upper class boundaries on the x-
axis.
Step 4: Connect the adjacent points with line segments.

Ogive for Call Center Agents' Salary


100
Cumulative
Frequency

50

0
16.5 19.5 22.5 25.5 28.5 31.5 34.5
Real Limit (Salary in Thousand)

Figure 2.3 Ogive for Call Center Agents’ Salary

2.6 Other Types of Graphs/Charts

As we discussed in the previous section, the only allowable calculations on


nominal data is to count the frequency of each value of the variable. We can graphically
display the counts in three ways: pareto charts, bar charts, and pie charts. This section
also includes on how to graphically display time series graph, pictograph and scatter plot.

Page | 16
A. Pareto Chart
A pareto chart is a graph used to represent a frequency distribution for a
categorical data (or nominal-level) and frequencies are displayed by the heights of
vertical bars, which are arranged in order from highest to lowest.

B. Bar Chart (Bar Graph)


A bar chart is similar to bar histogram. The bases of rectangles are arbitrary
intervals whose centers are the codes. The height of each rectangle represents the
frequency of that category. It is also applicable for categorical data (nominal-level).

C. Pie Chart (Circle Graph)


A pie chart is a circle divided into portions that represent the relative frequencies
(or percentages) of the data belonging to different categories. The data in a pie chart
should be categorical or nominal-level.

D. Time Series Graph


A time series graph represents data that occur over specific period of time under
observation. In addition, it shows for a trend or pattern on the increase or decrease
over the period of time.
E. Pictograph (Pictogram)
A pictograph immediately suggest the nature of the data being shown. It is a
combination of the attention-getting quality and the accuracy of the bar chart.
Appropriate pictures arranged in a row (sometimes in a column) present the quantities
for comparison.

F. Scatter Plot
A scatter plotis used to examine possible relationships between two numerical
variables. The two variables are plot in x-axis and y-axis.
Now we will illustrate how to construct the pareto chart, bar chart, pie chart, time
series graph, pictograph, and scatter plot using the succeeding examples.

Example 1: Using the information in the table about the favourite snacks of 870 youths,
construct a pareto chart, bar chart, and pie chart.

Products Sales
Junk Foods 135
Candy 250
Ice Cream 185
Chocolate 210
Others 90

Page | 17
Solution:
a. Constructing a Pareto Chart

Step 1: Arrange the data from highest to lowest according to frequency.

Products Sales
Candy 250
Chocolate 210
Ice Cream 185
Junk Foods 135
Others 90

Step 2: Draw and label the x-axis (Products) and y-axis (Sales).

Step 3: Construct the chart by arranging the frequency from highest to lowest and from
left to right. Make a bar with the same width and draw the height corresponding
to the frequencies. Figure 2.4 shows the Pareto Graph on the favorite snacks of
youth.

Favorite Snacks
300
Sales (in Millions)

200

100

0
Candy Chocolate Ice Cream Junk Foods Others
Products

Figure 2.4 Pareto Chart for Example 1

b. Constructing a Bar Graph

Step 1: Draw and label the x-axis (Products) and y-axis (Sales).

Step 2:Make a bar with the same width and draw the height corresponding to the
frequencies. Figure 2.5 shows the Bar Chart on the favorite snacks of the youth.

Page | 18
Favorite Snacks
300

Sales (in Millions)


250
200
150
100
50
0
Junk Foods Candy Ice Cream Chocolate Others
Products

Figure 2.5:Bar Chart for Example 1

c. Constructing a Pie Chart

Step 1:Since there are 360 ˚ in a circle, the frequency of each class must be converted into
a proportional part of a circle. This conversion is done by applying the formula

f
Degrees=(n) (360˚)

where f = frequency of each class, and


n = sum of frequencies.

Hence the following conversions are obtained. The degrees should total to 360˚.

250 135
Candy (870) (360˚) =103˚ Junk Foods ( 70 ) (360˚) = 56˚

210 90
Chocolate (870) (360˚) = 87˚ Others (870) (360˚) = 37˚

185
Ice Cream (870) (360˚) = 77˚

Step 2: Each frequency must also converted to a percentage and has a total of 100%.
This percentage can be done using by applying the formula

𝑓
Percentage = (𝑛) (100%)

where f = frequency of each class, and n = sum of frequencies.

Page | 19
250 135
Candy (870) (100%) = 29% Junk Foods ( 70 ) (100%) = 16%

210 90
Chocolate (870) (100%) = 24% Others (870) (100%)= 10%

185
Ice Cream (870) (100%) = 21%

Step 3:Using a protractor graph each section and write its name and appropriate
percentage, as shown in Figure 2.6

Others Favorite Snacks


10% Candy
29%
Junk foods
16%

Ice Cream Chocolate


21% 24%

Figure 2.6: Pie Chart for Example 1

Example 2: Using the information in the table below about the dollar to peso exchange
rate from January to December of 2010, construct a time series graph.

Month January February March April May June


Peso/US Dollar 41 42 43 46 44 45
Exchange Rate
Month July August September October November December
Peso/US Dollar 43 42 45 44 45 43
Exchange Rate

Solution:

Step 1: Draw and label the x-axis and y-axis.


Step 2: Label the x-axis for months and y-axis for Peso per US Dollar.
Step 3: Plot each point according to the table.
Step 4: Draw a line segments connecting adjacent points.

Page | 20
Peso-US Dollar Exchange Rate

48

Peso per US Dollar


46
44
42
40
38
Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec
Months

Figure 2.7:Time Series Graph for Example 2

Example 3: The VSAS Realty Inc. is a real estate who develops household in Rizal
province. The information in the table show the number of house
construction from 2005 to 2009. Construct a pictograph.

Year 2006 2007 2008 2009 2010


No. of house 400 250 600 550 700

Solution:
Step 1: Draw and label the x-axis and y-axis.
Step 2: Label the x-axis for years and y-axis for Number of Houses.
Step 3: Draw a house that represent the number of houses.

800
700
600 Figure 2.8: Pictograph
500 for Example 3
no. of houses

400
300
200
100
2006 2007 2008 2009 2010
Year
Legend: = 100 houses
Page | 21
Example 4:The owner of the chain of halo-halo stores would like to study the effect of
atmospheric temperature on sales during the summer season. A random
sample of 12 days is selected with the given as follows:

Day 1 2 3 4 5 6 7 8 9 10 11 12
Temperature(˚F) 79 76 78 84 90 83 93 94 97 85 88 82
Total Sales 147 143 147 168 206 155 192 211 209 187 200 150

Put the data on a scatter diagram.

Solution:

Step 1: Draw and label the x-axis and y-axis.


Step 2: Label the x-axis for Temperature (˚F) and y-axis for Sales.
Step 3: Plot the points of each ordered pairs in a Cartesian Coordinate System.

225
200
175
150
Sales (Y)

125
100
75
50
25
0
0 15 30 45 60 75 90
Temperature (X)

2.7 Guidelines for Developing Good Graphs / Charts

Good graphical displays tell what the data are conveying. Sadly many graphs or
charts shown in newspapers and magazines are misleading, incorrect, or complicated that
must not be used. In order to correctly develop a good graphs/charts there are some
guidelines that needs to bear in minds such as

1. The graphs/chart should include a title.


2. The scales for al axes should be included.
3. The scale on the y-axis should start at zero.
4. The graph/chart should not disfigure the data.
5. The x-axis and y-axis should be properly labeled.
6. The graph/chart should not contain unnecessary decorations.
7. The simplest possible graph/chart should be used for any data set.

Page | 22

You might also like