Module 4 Data Management Introduction To Statistics
Module 4 Data Management Introduction To Statistics
Mathematics in
the Modern
World
Course
Modules
Week 7
MODULE 4
Data Management: Introduction to Statistics
4.1 Introduction
When we hear the word Statistics, the first thing that comes to mind is set
of numerical figures, such as your monthly allowance, the number of hours you
spend in school, the number of hours you spend on Facebook, your vital
statistics, etc.
However, the study of statistics is not limited to knowing and memorizing
numerical figures. This module will give us a better understanding of what
Statistics is about. Discussion on how some of its processes are done is also
included.
4.2 Learning Outcomes
After finishing this module, you are expected to:
Statistics is the branch of science that deals with the collection, presentation,
organization, analysis, and interpretation of data.
Why are all processes involved in Statistics important? Statistics has the
ability to provide us with tools we need to convert raw data into information that
we can use to make sensible decisions and intelligent choices.
Page 1 of 15
People from various fields of interest need to obtain information to answer
different types of problems. Nowadays, we do this by performing a statistical
inquiry. This will allow us to answer problems with clearer understanding of a
particular collection of information.
Usually, the population of interest may be too large that it becomes too
expensive and time-consuming to collect data from every element of the
population. Thus, we have no other option but to get the data we need from only
a subset of the population. We use the term sample to refer to this subset of the
population.
In any statistical inquiry, we study certain characteristics or attributes of
the elements in the population, which we call variables. Just like in algebra, we
denote variables with letters of the English alphabets. We refer to these
characteristics as variables because their realized values may vary for the
different elements in the sample or population.
Page 2 of 20
Example 2. Below are illustrations of variables together with their possible
values.
Example 4.
A summary measure that we are familiar with is the proportion. The
proportion is the quotient obtained when we divide the magnitude of a part by
the magnitude of the whole. Suppose that among the 35 students, 28 claimed
that they own a cellular phone. We can now compute for the proportion of
students in the population with cellular phones.
Page 3 of 19
The proportion of students in our population with cellular phones is an
example of a parameter because it is a summary measure describing a
characteristic of the population.
Suppose we take a sample of 10 students from this class. Among the 10
students in the sample, 7 own cellular phones. We cannot compute the
proportion of students in the population with cellular phones but we can
compute for (read as “ hat”), where is the proportion of students in the
sample with cellular phones, as follows:
Page 4 of 20
elements of a selected sample. In this case, the analysis using descriptive
statistics will only allow us to describe the sample. The methods used in
descriptive statistics will not allow us to generalize about the population using
sample data.
Example 5. Below is an illustration of application and restriction of descriptive
statistics.
Given the daily sales performance for a product for the previous year, we
can draw a line chart or a column chart to emphasize the upward/downward
movement of the series. Likewise, we can use descriptive statistics to calculate
a quantity index per quarter to compare the sales per quarter for the previous
year.
Inferential Statistics includes all the techniques used in analyzing the sample data
that will lead to generalizations about a population from which the sample came from.
It consists of performing hypothesis testing, determining relationships among variables,
and making predictions.
Page 5 of 19
4.3.2.1 Quantitative and Qualitative Variables or Data
In doing a report or research, initially, we have to define the variables
relevant to the data. There are two major classifications of variables: qualitative
and quantitative.
1. Discrete variables are values that are obtained by counting. The results
are whole numbers. For example, the number of students in the room.
Page 6 of 20
2. Ordinal or Rank Data. In this set, numbers can be ordered or ranked,
but a specific difference in the levels cannot be determined. For
example, the performance rating can be represented by numbers as
illustrated below:
3. Interval Data. In this set, numbers can be ordered and has exact
difference between any two units but has no meaningful zero or starting
point. For example, Temperature is an interval data since they can be
ordered, there is an exact difference between two degrees, but the zero
does not mean the starting point since there can be temperatures below
zero.
4. Ratio Data. This set is the highest level of measurement and allows for
all basic arithmetic operations, including division and multiplication.
Data at this level can be ordered, has exact difference between units,
and has a meaningful zero. Things that are counted are usually ratio
level, for example, business data, such as cost, revenue and profit.
In the case where data are not properly gathered, the consequences are as
follows:
Page 7 of 19
2. inability to repeat and validate the study
3. distorted findings resulting in wasted resources
4. misleading other researchers to pursue unproductive ways of
investigation
5. compromising decisions for public policy
6. causing harm to human participants and animal subjects
Primary data are data documented by a primary source. The data collectors themselves
documented this data.
Page 8 of 20
c. Bureau of Agricultural Statistics (BAS) is a primary source of data on
agriculture and livestock.
d. The University Registrar’s Office is a primary source of student records.
Example 9. The following are examples of secondary data.
a. The United Nations’ compiled data for its yearbook, which were originally
gathered by government statistical agencies of different countries.
b. A medical researcher’s documented data for his research paper, which
were originally collected by the Department of Health.
c. The documented data of a student for his thesis, which were originally
collected by the Department of Labor and Employment.
4.3.2.4b Surveys
DEFINITION 5.8. (survey, census, sample survey)
The survey is a method of collecting data on the variable of interest by asking people
questions. When data came from asking all the people in the population, then this is
called a census. On the other hand, when data came from asking a sample of people
from a well-defined population, then this is called a sample survey.
Page 9 of 19
In an experiment, there are different types of variables:
Page 10 of 20
it happens. For example, a researcher may use the observation method to study
the behavior patterns in panic situations like a big fire, the landslide in Itogon,
Benguet, or the destruction of structures when Typhoon Yolanda hit Tacloban
City.
It is also practical to use observation method when the subjects under study
cannot express their sentiments or are unable to speak. For example,
researchers often use the observation method to study the behavior of animals
in the wild, or the behavior of newborn babies in the nursery.
Page 11 of 19
Example 12. Here is an illustration of textual presentation.
Excerpts taken from the Isabela Covid-19 Case Updates.
“As of 4PM today, the Department of Health reports a total number of COVID19
cases at 290,190, after 3,475 newly-confirmed cases were added to the list of COVID-
19 patients.
DOH likewise announces 400 recoveries. This brings the total number of recoveries
to 230,233.
Twenty-eight duplicates were removed from the total case count. Of these, 19 were
recovered cases.
Moreover, 13 cases previously reported as recovered were reclassified as death
(12) and active (1) cases after final validation.”
From the illustration given, the paragraphs showed and highlighted only the
most important figures. Few numbers were included and minute details or a
large quantity of data were not presented. If we want to refer to other details of
the data, then it would be more appropriate to use tabular presentation.
Page 12 of 20
Example 13.
The population in the Philippines for the census years 1975 to 2000 is as
follows:
1975 42,070,660
1980 48,098,460
1990 60,703,206
1995 68,616,536
2000 76,498,735
SMART 10,080,112
GLOBE 8,800,000
PILTEL 2,867,085
EXTELCOM 29,896
Total 22,509,560
Page 13 of 19
The following presents the different parts of a formal statistical table:
number that identifies the position of the table in a
Table number
sequence
Source note gives the name of the agency that collected the data
Page 14 of 20
Example 14.
Below is an example of a formal statistical table.
Page 15 of 19
4.3.3.3 Graphical Presentation
Graphical presentation of data portrays numerical figures or relationships
among variables in pictorial form. Some statistical charts used in this type of
presentation is given in the following table:
Type of
Description Example
Chart
Line Chart • Useful for presenting historical
data
• Effective in showing movement
of a series over time
• Appropriate when comparing
two or more time series data
and trends over time
Page 16 of 20
Pictograph • Like a horizontal bar chart that
uses symbols or pictures
instead of bars
• The purpose is to get the
attention of the readers
Page 17 of 19
3. To help the researcher in making credible decisions based on
quantitative data or arguments.
4.8 References:
Beaver, B.M. and Beaver R.J. (1999). Introduction to Probability and Statistics.
10th ed. New York: Duxbury Press.
Bluman, A. (1998) Elementary Statistics: A Step by Step Approach. 3 rd ed.
McGraw-Hill Book Co.
Deuna, Melecio C. (1996), Elementary Statistics for Basic Education. Quezon
City: Phoenix Publishing House, Inc.
Febre, F.A. and Virginia F. Cawagas (Consultant)(1987) Introduction to
Statistics. Metro Manila, Pheonix Publishing House, Inc.
Reyes, C.Z. and Saren, L.L. (2003). Metro Manila. M.G. Reprographics.
Spiegel, M. and Stephens, L. (1999). Schaum’s Outline Theory and Problems
in Probability and Statistics. 3rd. Edition. Singapore: McGraw-Hill
Book Company.
Page 18 of 20
Thorndike,R.M. & Dinnel,D.L. (2002)Basic Statistics for the Behavioral
Sciences.Prentice Hall,Inc.
Triola, Mario (1995) Elementary Statistics. New York: Addison-Wesley
Publishing Company.
Most, .M.M., Craddick, S., Crawford, S., Redican, S., Rhodes, D., Rukenbrod,
F., Laws, R. (2003). Dietary quality assurance processes of the
DASHSodium controlled diet study. Journal of the American Dietetic
Association, 103(10): 1339-1346.
Web Sources:
http://lsc.cornell.edu/wp-content/uploads/2016/01/Why-study-
statistics.pdf
Page 19 of 19