Module 3 PDF
Module 3 PDF
Example 1.
In an exam of 100 items the ten students scored as follows: 23, 60, 79, 32,
47, 74, 52, 70, 80, and 36.
The array can be done as follows: 23, 32, 36,47, 52, 57, 60, 70, 74, 79,
and 80.
“The highest score is 80 and the lowest score is 23. Only seven
students scored above 50% of the 100 items of examination. Some three
students obtained scores below the 50% passing.”
Stem leaf plot- is a table which sorts the data according to certain pattern.
This is a process of separating a number into two parts. For a two digit number,
the stem comprises the tens digit and the leaf consist of the units digits. While for
a three digit number the stem consist of the hundred and the leaf comprise the
unit digits.
Example 2.
41, 37, 41, 46, 50, 47, 36, 44, 51, 37,
38, 44, 56, 30, 34, 28, 40, 38, 36, 49,
43, 46, 31, 31, 43, 29, 31, 33, 46, 37, and 35.
Solution: Draft (record the units digits by line so not to miss any single
observation)
Stem : Leaves
2 : 8 9
5 : 01 6
Stem Leaves
2 : 89
3 : 01113456677788
4 : 011334466679
5 : 016
“The 31 day record shows that the automobile muffler shop installed
as high as 56 automobile mufflers. The lowest was 28 mufflers. The plot
also manifest that mostly the shop was able to installed as much as 30 to
50 automobile mufflers.”
Example 3.
List down the data that corresponds to the following items of the stem-and
leaf plots:
Age Frequency
22 3
21 5
20 5
19 11
18 6
Total 30
60-62 5
63-65 18
66-68 42
69-71 27
72-74 8
Total 100
Let us define some terms necessary for constructing frequency distribution table.
Class Interval – refers to the grouping bounded by the lower limit and the upper
limit.
A class interval 11-15 would mean to include the values 11, 12, 13, 14,
and 15. Where 11 and 15 are called limits and 11 refers to the lower limit (LL)
while 15 represents upper limit (UL).
Class Mark or Class Midpoint – refers to the representative of every class
interval. This value can be obtained by adding the lower limit and upper limit then
divided by 2. It is denoted by x.
Example 5.
For the class interval 11 – 15 the class mark or midpoint is derived by
(11+15)/2 = 13
Example 6.
For the class interval 20 – 23 the class mark or midpoint is
(20+23)/2 = 21.5
Class Boundaries – refers to the true boundaries of a class interval.
Example 7.
For a class interval 16 – 20, the true boundaries or true limit 15.5 – 20.5
Class size – refers to the length or width of the class interval.
Example 8.
For a class interval 11 – 15 the class size is 5.
For a class interval 20 – 23 the class size is 4.
To compute the class size one can use any of the following formulas:
c = upper class boundary – lower class boundary (formula 1)
c = upper limit – lower limit + 1 (formula 2)
Frequency Distribution Table – is a table comprising the class intervals with their
associated frequencies.
Table 3.3
Test Score of 50 students in Statistics
53 30 58 67 46
41 25 44 42 40
31 42 15 73 47
39 51 33 79 56
64 43 65 80 54
47 35 25 54 52
20 44 58 52 48
62 46 44 43 55
66 37 65 48 75
67 36 63 51 40
Steps in construction the Group Frequency distribution for the Math 313
test scores:
Step 4. Tally the frequencies for each class, until the highest value is reached.
Class Interval
LL – UL Tally Freq
15 – 21 II 2
22 – 28 II 2
29 – 35 IIII 4
36 – 42 IIII-III 8
43 – 49 IIII-IIII-I 11
50 – 56 IIII-IIII 9
57 – 63 IIII 4
64 – 70 IIII-I 6
71 – 77 II 2
78 – 84 II 2
C=7 n=50
CI f RF%
15-21 2 2/50= 4%
22-28 2 4%
29-35 4 8%
36-42 8 16%
43-49 11 22%
50-56 9 18%
57-63 4 8%
64-70 6 12%
71-77 2 4%
78-84 2 4%
n= 50
15 – 21 2 2 50
22 – 28 2 4 48
29 – 35 4 8 46
36 – 42 8 16 42
43 – 49 11 27 34
50 – 56 9 36 23
57 – 63 4 40 14
64 – 70 6 46 10
71 – 77 2 48 4
78 – 84 2 50 2
C=7 n=50
Collected data remains useless and invalid unless presented effectively for
analysis and interpretations. Data are presented in three general methods as
enumerated below:
1. Textual Method
By textual or paragraph method, the reader gets information by merely
reading the gathered data. Readers are directed to pay particular attention on
specific data such as comparisons, contrasts, synthesis, generalizations, or
findings. This method, however, when employed alone elicit its boredom. It is a
weak means of showing the quantitative comparisons or relations among
quantitative or numerical data attractively and interestingly. This method of
presenting data is the simple when there are only a few numbers to be
presented. However, it is not desirable to include a big mass of quantitative data
in a “text” or paragraph as the presentation becomes almost incomprehensible.
Example 12.
The present enrollment of the Don Hilario High School is 19,900 students.
About 12,077 or 63.56% are female and 6923 or 36.44% are male. Of the 19,900
students, 950 or 5% are scholars sponsored by different benefactors.
Tabular Method
The tabular method is a more effective way of showing relationships or
comparisons of numerical data. It gives a more precise, systematic and orderly
presentation of data in rows and columns. This makes comparison of figures
easy and comprehensive.
Example 13.
https://www.google.com/url?sa=i&url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fwww.slideshare.net%2F
Sujarvs%2Fpresentation-of-data-ppt&psig=AOvVaw3SbOyzDq-
1_mAXjq5msOCG&ust=1599989876438000&source=images&cd=vfe&ved=0CAI
QjRxqFwoTCIjv_OGo4-sCFQAAAAAdAAAAABAN
Semi-tabular Method
The semi-tabular method employs both the textual and tabular methods. This
method is used only if there are few figures to be tabulated. The tables are
followed by narrative explanations to make the facts more understandable.
3. Graphical Presentation
The use of graphs is the most effective method of presenting statistical
result or findings. It gives the relationships of data in pictorial form. Presentations
of facts are made attractive and meaningful especially if colors are made and
pictures are used, making it easy for important information to be grasped by the
reader. It is only limitations are as follows:
Graphs are not as precise as the tables.
1. Graphs require more skillful and time to prepare.
2. Graphs can only be made after data have been presented in
tabular forms.
Kinds of graphs are as follows:
1. Line or curve graphs – frequency polygon
https://www.google.com/url?sa=i&url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttp%2Fonlinestatbook.com%2F2
%2Fgraphing_distributions%2Ffreq_poly.html&psig
2. Bar Graph
The bar graph is used to represent discrete data, so instead of being
joined, like in the histogram the bars are separated. The length of each bar also
represents the frequency within the given class. The width of the bar is arbitrary.
The bars, however, must be of the same width. Thus, the bar graph is almost the
same as the histogram the only difference is that the bars of the histogram are
joined.
https://www.google.com/url?sa=i&url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fpsa.gov.ph%2Fcontent
%2Fphilippine-population-density-based-2015-census-population&psig
https://www.excel-easy.com/examples/column-chart.html
3.Pie graph
https://www.google.com/search?q=pie+graph+example&sxsrf
https://www.google.com/url?sa=i&url=https%3A%2F%2Ffanyv88.com%3A443%2Fhttps%2Fpsa.gov.ph%2Fcontent%2Fphilippine-
population-density-based-2015-census-population&psig
Step 1. Lay of the x-axis and the y-axis to represent the class intervals and the
class
Frequencies, respectively.
Step 2. Determine the most appropriate scale for the class frequencies. A well
balanced histogram must have a height that is 60%, or 75% of its width and
seldom below 50%.
Step 3. Draw each bar with a height equal to the class frequency of each class
boundary. Class marks may replace the class boundaries. So just enter the base
of each bar on its corresponding class mark.
Step 4. Label the x and y-axis correctly.
To illustrate a histogram, let us use the frequency distribution for the 50 scores.
Frequency Distribution of the 50 scores data in Table 3.3
CI f CM (x)
15-21 2 18
22-28 2 25
29-35 4 32
36-42 8 39
43-49 11 46
50-56 9 53
57-63 4 60
64-70 6 67
71-77 2 74
78-84 2 81
C=7 n=50
Step 1. Represent the x-axis by using the class marks of the intervals.
Step 2. Represent the y-axis using the frequencies of the class intervals.
Step 3. Plot each class frequency directly above the class marks of its class
interval.
Step 4. Connect the points by means of lines.
Step 5. Create an additional class at both end of the distribution to close the
polygon.
CI f CM (x)
15-21 2 18
22-28 2 25
29-35 4 32
36-42 8 39
43-49 11 46
50-56 9 53
57-63 4 60
64-70 6 67
71-77 2 74
78-84 2 81
C=7 n=50
Constructing the Cumulative Frequency Polygon
The cumulative frequency polygon is the graph of a cumulative frequency
distribution.
To construct the graph, the steps are as follows:
Step 1. Represent the units on the x-axis by the exact upper limit of each class
interval.
Step 2. Represent the y-axis by the cumulative frequency.
Step 3. Plot the point above the exact upper limit, which corresponds to its
Cumulative frequency.
Step 4. Connect the points by straight lines.
To illustrate a cumulative frequency polygon, let us use the 50 score data
and change it into s frequency cumulative distribution by adding a column for
cumulative frequency or cf. If the raw frequencies are converted to percentages
the cumulative frequencies will sum up to 100 percent. This is done by dividing
every cumulative frequency by the total number of cases and then multiplying the
result by 100. These polygon or ogive curve. This graph is used to find the
percentage of cases less than or greater than a given value.
CI f CB <cf >cf
15-21 2 14.5-21.5 2 50
22-28 2 21.5-28.5 4 48
29-35 4 28.5-35.5 8 46
36-42 8 35.5-42.5 16 42
43-49 11 42.5-49.5 27 34
50-56 9 49.5-56.5 36 23
57-63 4 56.5-63.5 40 14
64-70 6 63.5-70.5 46 10
71-77 2 70.5-77.5 48 4
78-84 2 77.5-84.5 50 2
CI f CB <cf >cf
15-21 2 14.5-21.5 2 50
22-28 2 21.5-28.5 4 48
29-35 4 28.5-35.5 8 46
36-42 8 35.5-42.5 16 42
43-49 11 42.5-49.5 27 34
50-56 9 49.5-56.5 36 23
57-63 4 56.5-63.5 40 14
64-70 6 63.5-70.5 46 10
71-77 2 70.5-77.5 48 4
78-84 2 77.5-84.5 50 2
Greater than Cumulative Frequency Polygon
(Greater than 0give)