Signed Learning Material No. 4A Data Management
Signed Learning Material No. 4A Data Management
Contact Details
Contact Number: 09167422338/09184922805
E-mail Address: [email protected]
Web Address: [Type your web address or social media link]
Consultation Schedule
Every Friday 2 -5 pm
In addition, students will undergo pre-test and post-test short-examination. The pre-test questionnaires will
be given at the start of each rating period (Prelims, Midterms, Finals) while the post-test questionnaires will
be given at the end of each rating period. The results of the assessment will serve as one of the key indicators
that determine the effectiveness of this learning material. Thus, exemplifying honesty and rectitude in this
particular undertaking are highly appreciated and commendable.
Always keep connected and updated with announcements and relevant information concerning this course.
Lastly, do not hesitate to ask for assistance and raise your concerns to your instructor / professor.
B. Mathematics as a Tool
Introduction
Mathematics is a powerful tool for global understanding and communication. Using it, students can make
sense of the world and solve complex and real problems. Rethinking math in a global context offers students
a twist on the typical content that makes the math itself more applicable and meaningful for students. For
students to function in a global context, math content needs to help them get to global competence, which is
understanding different perspectives and world conditions, recognizing that issues are interconnected across
the globe, as well as communicating and acting in appropriate ways. In math, this means reconsidering the
typical content in a typical ways and showing students how the world consists of situations, events and
phenomena that can be sorted out using the right math tools. In this learning material, you will find out how
mathematics is applied as a powerful tool in our nature.
Another way is to classify data into four levels of measurement such as nominal, ordinal, interval
and ratio. The nominal level of measurement is the lowest of the four ways to characterize data.
Nominal data deals with names, categories, or labels. Data at the nominal level is qualitative. Colors
of eyes, yes or no responses to a survey, and favorite breakfast cereal all deal with the nominal level
of measurement.
Ordinal level of measurement ranks qualitative data. Winners in a pageant and the academic
rank of teachers are examples of ordinal data. Interval level of measurement deals with data that
can be ordered, and in which differences between the data does make sense. Data at this level does
not have a starting point. The Fahrenheit and Celsius scales of temperatures are both examples of
data at the interval level of measurement. The fourth and highest level of measurement is the
ratio level. Data at the ratio level possess all of the features of the interval level, in addition to a zero
value. Examples are weight, the time to answer a quiz and the number of absences of students in a
class.
A relative frequency is the ratio (fraction or proportion) of the number of times a value of the data
occurs in the set of all outcomes to the total number of outcomes. To find the relative frequencies,
divide each frequency by the total number of students in the sample. Relative frequencies can be
written as fractions, percent, or decimals.
Cumulative relative frequency is the accumulation of the previous relative frequencies. To find the
cumulative relative frequencies, add all the previous relative frequencies to the relative frequency for
the current row.
Twenty-five employees were given a blood test to determine their blood type. Construct a
frequency distribution for the data.
Raw Data: A, B, B, AB, O O, O, B, AB, B B, B, O, A, O A, O, O, O, AB AB, A, O, B, A
When the range of the data is large, grouped frequency distributions are used. The smallest
and largest possible data values in a class are the lower- and upper-class limits. Class
boundaries separate the classes. To find a class boundary, average the upper-class limit of one
class and the lower-class limit of the next class. The class width can be calculated by subtracting
successive lower-class limits (or boundaries) or successive upper-class limits (or boundaries).
The class midpoint Xm can be calculated by averaging upper and lower class limits (or
boundaries).
Example 2
The following data represent the record high temperatures for each of the 50 states. Construct
a grouped frequency distribution for the data.
112 100 127 120 134 118 105 110 109 112 110 118 117 116 118 122 114
114 105 109 107 112 114 115 118 117 118 122 106 110 116 108 110 121
113 120 119 111 104 111 120 113 120 117 105 110 118 112 114 114
Step 2. Determine the no. of classes k, k = 1 + 3.32 log n, where n is the no. of population.
k = 1 + 3.32 log n; 1+ 3.32 log 50 = 6.64
𝑅 34
Step 3. Obtain the class size or class width, i, i = = = 5.12 or 5
𝑘 6.64
Step 4. Make the classes. For convenience sake, we will choose the lowest data value, 100
for the first lower class limit. The subsequent lower-class limits are found by adding the width
to the previous lower-class limits.
Step 5. Tally the data.
Table 2. Grouped Frequency Distribution of Recorded High Temperature of 50 States
Graphical Presentation
The graphical presentation on the other hand, is a visual presentation of the data. A graph is a tool
that helps you learn about the shape or distribution of a sample. The graph can be a more effective
way of presenting data than a mass of numbers because we can see where data clusters and where
there are only a few data values. Graphs are commonly used in oral presentation. There are several
forms of graphs to use like the pie chart, pictograph, bar graph, line graph, histogram, frequency
polygon and box-plot.
1. Histogram
A histogram is a graph that displays the data by using vertical bars of various heights to represent
the frequencies of the classes. The horizontal axis is labeled with what the data represents. The
vertical axis is labeled either frequency or relative frequency. Histograms use class boundaries
and frequencies of the classes.
Example
Construct a histogram to represent the data for the record high temperatures for each of the
50 states.
2. Frequency Polygon
Frequency polygon is a graph that displays the data by using lines that connect points plotted for
the frequencies at the class midpoints. The frequencies are represented by the heights of the
points and the class midpoints are represented on the horizontal axis. Frequency polygons use
class midpoints and frequencies of the classes.
Example
Using the example above, construct a frequency polygon.
3. Box Plot
A box plot is also known as box-and-whisker plots or box-whisker plots. It shows how far the
extreme values are from most of the data. A box plot is constructed from five values: the minimum
value, the 1st quartile, the median, the 3rd quartile, and the maximum value. We use these values
to compare how close other data values are to them. To construct a box plot, use a horizontal or
vertical number line and a rectangular box. The smallest and largest data values label the
endpoints of the axis. The 1st quartile marks one end of the box and the 3rd quartile marks the
other end of the box. The "whiskers" extend from the ends of the box to the smallest and largest
data values. The median or 2nd quartile can be between the 1st & 3rd quartiles, or it can be one,
or the other, or both. The box plot gives a good, quick picture of the data.
Example
Consider the dataset. 1; 1; 2; 2; 4; 6; 6.8; 7.2; 8; 8.3; 9; 10; 10; 11.5 The 1st quartile is 2, the
median is 7, and the 3rd quartile is 9. The smallest value is 1, and the largest value is 11.5. The
following image shows the constructed box plot.
4. Pareto Chart
A Pareto chart is a bar graph. The lengths of the bars represent frequency or cost (time or money),
and are arranged with longest bars on the left and the shortest to the right.
3. Pie Chart
A Pie Chart (or Pie Graph) is a special chart that uses "pie slices" to show relative sizes of data.
The chart is divided into sectors, where each sector shows the relative size of each value.
Mean (𝒙ഥ) is the set of scores or observations being added and divided by the number of
scores. It is also called as average or arithmetic mean.
∑𝑥
𝑥̅ = where: 𝑥̅ = sample mean
𝑛
∑ 𝑥 = sum of the scores or observation
n = number of scores or observation
∑𝑥 9+8+10+7+7+8+12+9+!0+11 91
Solution: 𝑥̅ = = = = 9.1
𝑛 10 10
The average age of 10 pupils in a certain school is 9.1.
Example 2
The data below are the number of employees in a certain store. Find the mean.
3, 6, 5, 4, 7, 8,10
∑𝑥 3+6+5+4+7+8+10 43
Solution: 𝑥̅ = = = = 6.14
𝑛 7 7
The average number of employees is 6.14.
ഥ𝒘
Weighted Mean 𝒙 of a given group of data is the average of the means of all the
groups.
∑ 𝒘𝒙
ഥ𝒘 =
𝒙 where w = weight of the score
∑ 𝒘
X = the score
∑ 𝒘 = sum of the weight
Example
There are 1,000 notebooks sold at ₱ 10 𝑒𝑎𝑐ℎ ; 500 notebooks at ₱ 20 𝑒𝑎𝑐ℎ ; 500 notebooks
at ₱ 25 𝑒𝑎𝑐ℎ and 100 notebooks at ₱25 𝑒𝑎𝑐ℎ. Solve for the weighted mean.
Solution:
∑ 𝒘𝒙 ₱ 𝟑𝟓 𝟓𝟎𝟎
ഥ𝒘 =
𝒙 = = ₱16.90
∑ 𝒘 𝟐 𝟏𝟎𝟎
Median (𝒙 ) is the value found at the middle when the data are arranged in an array
form. If there are two middle values, the average is taken.
Example 2
The following are the sizes of shoes of 6 male basketball players.
10, 12, 9, 11, 13, 11.5
11+11.5 22.5
Solution: 9, 10, 11, 11.5, 12,13 get the average; = =11.25
2 2
The median score is 11.25 which means that 50% of the sizes in the distribution are lower
than 11.25, those are 11, 10 and 9; and 50% are greater than 11.25, those are 13, 12 and
11.5 which mean three scores are below 11.25 and three scores are above 11.25.
Mode ( 𝒙ෝ) is the value that occurs most often in a data set. It can be unimodal, bimodal,
trimodal or multimodal. Unimodal is a distribution of scores consisting of one mode.
Bimodal consist of two modes, trimodal with three modes and multimodal with more than
two modes.
Example 2
Find the mode of the data set: 7, 9, 11, 15, 17, 18,18,19, 24, 24, 26
Answer: ______________________
2. Find the mean of the following data: 10, 12, 13, 15, 16, 19, 30
Solution:
Answer: ______________________
3. Find the Grade point average (GPA) of Camille for the first semester, school year 2020-2021.
4. The number of rooms in the seven hotels in Metro Manila is 713, 300, 618, 595, 311, 401, and 292.
Find the median.
Solution:
Answer: ______________________
5. The number of typhoons that have occurred in the Philippines over an 6-year period follows. Find
the median.
684, 764, 656, 702, 856, 1303
Solution:
Answer: ______________________
Answer: ______________________
7. Find the mode for the number of coal employees per county for 10 selected counties in
Southwestern Pennsylvania.
110, 731, 1031, 84, 20, 118, 1162, 1977, 103, 752
Solution:
Answer: ______________________
References:
https://asiasociety.org/education/understanding-world-through-math
Manlulu, E.A. and Hipolito, L.M.M. (2019) A Course Module for Mathematics in the Modern World.
Bluman, Allan, Elem. Statistics 7th edition
Barbara, Illowsky, et al, Introductory Statistics, OpenStax Rice University, c. 2018
Dr. Patricia B. Licuanan, TEACHING GUIDE FOR SENIOR HIGH SCHOOL Statistics and Probability, c. 2016
Dr. Kathy Chu, et al., Elementary Statistics, c. 2013
Daligdig, Romeo M., Mathematics in the Modern World, Lorimar Publishing Inc., c. 2019
Gabuyo, Yonardo A., Assessment of Learning 1, Rex Book Store, c. 2012
www.mathsisfun.com/definitions/pie-chart-pie-graph.html