Reviewer
Reviewer
ENGAGE
Do you know that statistics are happening in our everyday life? But sometimes
we are not aware that it is happening everywhere. For example playing basketball,
number of children in the community, people living in a certain barangay, total
population in the Philippines and covid 19 pandemic that we are experiencing now.
These are just a few events that statistics is always present. This is just only a few data
that we have and out of this data statistical methods will be applied. As you learn
statistics you will appreciate how to use different statistical tools to treat the data.
EXPLORE
Statistical knowledge helps you use the proper methods to collect the data,
employ the correct analyses, and effectively present the results. Statistics is a crucial
process behind how we make discoveries in science, make decisions based on data,
and make predictions.
Statistics is the
Collection
Calculation
Summarization
Presentation
Analyzation
Interpretation
Branches of Statistics
Descriptive Statistics aims at summarizing and presenting data in the form
which will make them easier to analyze and interpret.
Inferential Statistics it aims at drawing and making decision on the population
based on evidence obtained from a sample.
Classification of statistics
Measurement
- Refers to the process of assigning meaningful numbers (or Labels) to
Individual persons based on the degree to which they possess particular
characteristics.
1. Nominal scale consists of a finite set of possible values or categories that have
unordered scales.
Example; cancer, accidents, gender (male, female) blood type, nationality,
occupation, civil status and so on. In this scale of measurement, there is no
natural order of categories.
2. Ordinal scale consists of a finite set of possible values or categories which have
ordered scales.
Example: pain level (none, mild, moderate, severe) social status, socio –
economic status and so on. In the ordinal scale there is a natural ordering of the
categories.
Ordinal scale ranks the categories clearly, but absolute distance between
categories are unknown. The numbers have limited meaning. The real
differences between adjacent ranks may not be equal.
3. Interval scale is generally measured on a continuum and differences between
any two numbers on the scale that are of known size.
Example: tons of gravel, number of covid positive, income, age, and so on.
An important property of interval scale is that there is no true zero point. That is,
the value “ 0 “ is arbitrary and does not reflect absence of the attribute.
4. Ratio scale like the interval scale is also measured on a meaningful continuum.
The distinction is that ratio scale has a meaningful zero point.
Example; Weight in kilograms, height, age and so on.
The Ratio scale is used when not only the order and interval size are important,
but also the ratio between two measurements is meaningful. This scale of
measurement is the highest or most precise scale.
variables
Classification of Variables
SUMMATION NOTATION
https://www.cliffsnotes.com/study-guides/algebra/algebra-ii/sequences-and-series/summation-
notation
A simple method for indicating the sum of a finite (ending) number of terms in a
sequence is the summation notation. This involves the Greek letter sigma, Σ. When
using the sigma notation, the variable defined below the Σ is called the index of
summation. The lower number is the lower limit of the index (the term where the
summation starts), and the upper number is the upper limit of the summation (the term
where the summation ends). Consider
This is read as “the summation of (2 k + 3) as k goes from 2 to 7.” The replacements for
the index are always consecutive integers.
Example 1:
Write out the terms of the following sums; then compute the sum.
1.
2.
3.
1.
2.
Example 2:
1. 8 + 11 + 14 + 17 + 20
2.
This is an arithmetic series with five terms whose first term is 8 and whose common
difference is 3. Therefore, a 1 = 8 and d = 3. The nth term of the corresponding
sequence is
Since there are five terms, the given series can be written as
1.
This is a geometric series with six terms whose first term is and whose
Since there are six terms in the given series, the sum can be written
as
Classification of Data
1. Internal data refers to those data that relate to the activities within the
organization collecting the data.
For example: Department of health, it is the agency that is tasked to collect
the data. Philippines Statistics Authority, it is the agency responsible for
socio-demographic data, DOST and so on.
2. External data refers to the data that relates to the activities outside the
organization collecting data. All data obtained from computerized databases,
books, periodicals, government documents, and the like are considered as
external data. The data further classified into either statistical or non-
statistical.
3. Statistical data are those published data of the government, institutions,
companies, and associations which involve figures, tables and graphs.
4. Non-statistical area information which does not involve figures, tables and
graphs, like periodicals, books, and pamphlets.
SOURCES OF DATA
The sources of the data must be treated accurately. In order to ensure the
accuracy of data, one must know the sources of data. There are two sources of data.
These include primary source and secondary source.
1. Primary source refers to data that comes from the original sources and is
collected especially for the task at hand.
1. Mailed questionnaires are one of the most popular means of collecting data,
however it is difficult to design and the most criticized method. This method of
data collection may be employed if we have the names and addresses of the
intended respondents as the ones filling the forms.
2. Interview is a method of data collection that is primarily used to gain an
understanding of the underlying reasons and motivations for people’s
attitudes, preferences or behavior where there is a face-to-face conversation
between the interviewer and the interviewee.
3. Observation refers to the method of extracting data which involves recording
the behavioral patterns of people, objects and events in a systematic manner.
In this method, the information gathered may be documented using cameras,
video tape recorders, laboratory diagnostic apparatus, among others.
4. Telephone interview is an alternative form of personal interview. It is
considered as the most popular method in provinces or cities where almost all
residents have personal telephone.
EXPLAIN
Statistics helps you use the proper methods to collect the data, employ the correct analyses,
and effectively present the results. Statistics is a crucial process behind how we make
discoveries in science, make decisions based on data, and make predictions.
Statistics play a vital role in every field of human activity. Statistics helps in
determining the existing position of per capita income, unemployment, population and
every fields of endeavor.
Learning Objectives:
ENGAGE
The figure above will show you the different tabular and graphical presentation of
data. From the presentation we interpret and give a conclusion to the data. It also helps
us analyze the data.
In this module you will be able to give emphasis to significant figures and
appropriate when there are few figures to be presented. It is concise and easy to
understand, it facilitates analysis of categories of the given variables and presents data
in more detail. The raw data is collected at random and they have not been organized or
processed numerically for use. It is data in its original form. The frequency distribution is
a useful way to present data if the formation of a frequency distribution should neither
be too small nor too large. Also the ogive graph that represents the cumulative
frequencies of the classes. It is constructed by joining with lines a series of points which
are the class marks or mid points of the classes as against less than or greater than
cumulative frequencies.
EXPLORE
1. Data is incomprehensible when the large quantitative data are included in the
paragraph
2. Paragraph involving many figures can be tiresome to most readers when the same
words are repeated many times
TABULAR PRESENTATION
There are ten essential parts of a statistical table. These include the following: table
number, table title, column spanner, stub head, stub, column heads, body, divider,
footnotes, and source note.
Table Number refers to the relative position of the table within a series. It is placed on
the same line as the opening of the tile, separated from the title proper by a period.
Numbers should be omitted for a single table. Tables should be numbered in a
continuous Arabic numerals beginning with 1.
Table Title refers to a brief statement about the table presented. All beginning letters of
the words in the title must be capitalized and the rest are in lower case. It should be
concise and the key variables must be shown in the table. It should never be more than
two lines. Periods are left out at the end of the title. If the title is two lines long, it must
be single-spaced. It should always go above the table.
Stub head refers to the heading in the table that is placed above the leftmost column.
The column is the stub column. This column usually lists the independent variable. The
data that follow the stub column are known as the stub. All other column headings are
simply referred to as column heads.
Body is the main part of the table, which contains the quantitative information. It is the
actual data in a table occupying the columns, for example, percentages, frequencies,
statistical test results, means, “N” (number of samples), among others.
Dividers are lines that frame the top and bottom of the table and, or mark the different
parts of a table. They are often used for division or emphasis within the body of a table.
Footnote is any statement or note inserted at the foot or bottom of the table. You may
use table notes to explain anything in your table that is not self-explanatory. While basic
symbols and abbreviations like SD for standard deviation, N for sample size, and % for
percentage, are commonly used, you may have other technical terms or other issues
that you wish to explain.
Source Note refers to the specific source of the statistics. It is introduced by the word
“Source”. Thus, source notes may be included to acknowledge the origin of the data.
This is placed beneath the footnote.
GRAPHICAL PRESENTATION
Bar graph is a graph consisting of bars of the same sizes, which are drawn vertically or
horizontally for the purpose of comparing values to each other. Horizontal bar graph
usually used for qualitative variables.
Pie chart is a graph used to show how a whole is divided into its component parts. The
sum parts of the whole should be 100%. The pie chart is sometimes called the circle
graph. This kind of graph is needed to show percentages effectively. Angle of each
wedge “slice” is determined by multiplying the percentage contribution of the component
by 3.6. To highlight a specific component, a slice may be “exploded” or extended. This
kind of graph is most appropriate for nominal data. Different colors of the slice of the pie
chart can be applied to emphasize.
Component bar is a graph made of bar representing the whole which is further divided
into smaller rectangles representing the parts wherein the area of each smaller
rectangle is proportional to the relative contribution of the component to the whole. This
component bar is preferable over the pie chart in situations where the compositions of
two or more groups are to be compared. Like the pie chart, different colors can be
applied to the components to emphasize differences between parts of the whole.
Line graph is a graph used for displaying data that changes continuously over time.
The time is chronologically arranged on the horizontal axis and the relevant values are
indicated on the vertical axis. Variations in the data are indicated by a series of line
segments formed by joining consecutive points.
RAW DATA
Raw data is collected at random; they have not been organized or processed
numerically for use. It is data in its original form.
For example:
With the above raw data, it takes time to find the highest and lowest observations. To
make sense of the data, we have to arrange the observations from highest to lowest or
vice versa.
ARRAY
An array is an arrangement of observations in a given data according to their
magnitude from highest to lowest or lowest to highest.
95 92 89 85 80 79
94 92 88 83 80 79
93 91 86 83 80 77
92 90 86 82 80 76
92 90 86 81 79 75
Having organized the data in an array, it is easier to find the highest to the lowest
observation. Also the frequencies of each observation can be easily and quickly
determined. Aside from that, it is easier to find mode, median and all measures of
position. (refer to the next module)
FREQUENCY DISTRIBUTION
A useful way to present the data is the formation of a frequency distribution.
Frequency distribution refers to the number of observations that fall within a certain
range of data. To organize data into a frequency distribution, we need to pick some
convenient class intervals and tabulate the number of each individual observation that
falls into a particular interval. There is no clear-cut frequency distribution should neither
be too small nor too large; it should not be more than 20 and not less than 7 to avoid
laborious tabulation and erroneous grouping.
Example 1:
Solution:
Find the Highest Score first then find the Lowest Score
H – L = 95 – 75 = 20
H–L 95 – 75
Find the interval =--------------- = ----------------- = 2
10 10
HISTOGRAM
FREQUENCY POLYGON
Example;
The “less than” and the “greater than” cumulative frequency distribution of 30
freshmen students of statistics @ Universidad de Zamboanga randomly chosen or
selected from section A and B. The following data are given below.
EXPLAIN
From the presentation of data you were able to view how each data was
presented and frequency distribution you were able to understand the different methods
of presenting data. The data were presented in different ways with respect to the kind of
gathered data. It can be a textual, tabular, bar graph, pie chart, component bar and line
graph.
Text, tables, and graphs for data and information presentation are very
powerful communication tools. They can make an article easy to understand, attract and
sustain the interest of readers, and efficiently present large amounts of complex information. May
19, 2017