STS Reviewer
STS Reviewer
Computer Software
Recall
Imagine you've just spent weeks, months, or even years
gathering data for a research project, and now you want Statistics is the science of collecting, organizing,
to analyze it all to find out what it means. If the data summarizing, and analyzing information to draw
conclusions or answer questions.
seems too massive to handle, then you use computer
It provides procedure in data collection,
software to deal with the data and make sure the results
presentation, organization, and interpretation to
are useful and informative.
have a meaningful idea.
SPSS
Data
SPSS (Statistical Package for the Social Sciences)
The information referred to the definition is the data.
is perhaps the most widely used statistics software
According to the Merriam Webster dictionary, data
package within human behavior research. SPSS
are “factual information used as a basis for
offers the ability to easily compile descriptive
reasoning, discussion, or calculation”
statistics, parametric and non-parametric analyses,
as well as graphical depictions of results through
the graphical user interface (GUI). Types of Statistics
Descriptive Statistics
R It basically consists of organizing and summarizing data.
R is a free statistical software package that is Descriptive statistics describe data through numerical
widely used across both human behavior research summaries, tables, and graphs.
and in other fields. While R is a very powerful Examples:
software, it also has a steep learning curve, 1. The average score of a volleyball player for the past 10
requiring a certain degree of coding. games
2. Birth rate in rural areas in the Philippines
3. Enrollment record of all colleges in BSU – TNEU Lipa Quantitative variables or numerical variables are
Campus variables that take on numerical values representing
an amount or quantity. These numerical values
Inferential Statistics should answer the question how much or how
It is the logical process that involves generalizing many.
from a sample to the population from which the sample was Some examples of qualitative variables are height,
selected and assessing the reliability of such generalizations. weight, distance, salary, etc.
It is also called as statistical inference or inductive statistics. Variables can also be classified into two according
Examples: to purpose whether experimental or mathematical.
Basic Sampling Design be selected. For a survey using in-person interviews, the
The goal in sampling is to obtain individuals for a study in sampling frame might be a list of all street addresses.
such a way that accurate information about the population
can
be obtained.
Definitions
• Observation unit - An object on which a measurement is
taken. This is the basic unit of observation, sometimes
called an element. In studying human populations,
observation units are often individuals.
• Target population - The complete collection
of observations we want to study.
• Sampled population - The collection of all possible
observation units that might have been chosen in a
sample; the population from which the sample was taken.
• Sample - A subset of a population.
• Sampling unit - A unit that can be selected for a sample.
We may want to study individuals, but do not have a list
of all individuals in the target population. Instead,
households serve as the sampling units, and the
observation units are the individuals living in the
households.
• Sampling frame - A list, map, or other specification of
sampling units in the population from which a sample may
Population is a group to which the results of the
study are intended to apply. A sample is a group in
a research study on which information is obtained.
One of the most important steps in the research
process is to select the sample of individuals who
will participate as a part of the study.
Sampling refers to the process of selecting these
individuals.
e. Voluntary Sampling
d. Cluster Sampling
It is a technique when a sample is composed of
It is a process of selecting clusters from a respondents who are self-select (volunteered) into the
population which is very large or widely spread out over a study/survey. Most of the time, the respondents have a
wide geographical area strong interest in the topic of the study.
EXAMPLE: EXAMPLE:
The Fuds Administration (FA) wants to know if there are Consider a news show asks their viewers to participate in an
high levels of aflatoxin in Gagaraya’s Cracker Nut. The on-line poll. The samples are viewers who have chosen
FA head took a random sample of batches of the said themselves and not the survey administrator.
cracker nut and all bags in the chosen batches are included
in the sample. Measure of Central Tendency
- A measure of central tendency, commonly referred to as
Non-random Sampling or Non-probability Sampling
an average, is a single value that represents a data set.
o It is a sampling procedure where samples selected Its purpose is to locate the center of a data set.
in a deliberate manner with little or no attention to
randomization. Samples are obtained haphazardly,
There are three different measures of central tendency:
selected purposively or are taken as volunteers. The
mean, median, mode.
probabilities of selection are unknown. They
should not be used for statistical inference.
a. Convenience Sampling Mean
It is a process of selecting a group of individuals The mean, or arithmetic mean, is the most frequently
who are conveniently available for a study. used measure of central tendency. It is the only
EXAMPLE: common measure in which all values play an equal
role meaning to determine its values you would need
A researcher may only include close friends and clients to be
to consider all the values of any given data set.
included in the sample population
It is appropriate to determine the central tendency of
an interval or ratio data.
b. Purposive Sampling The symbol , called “x bar”, is used to represent the
It is a process of selecting based from judgement to mean of a sample and the symbol μ, called “mu”, is
select a sample which the researcher believed, based on used to denote the mean of a population.
prior information, will provide the data they need.
Properties of Mean Mode
- A set of data has only one mean. The mode is the value in a data set that appears most
- Mean can be applied for interval and ratio data. frequently. Like the median and unlike the mean, the
- All values in the data set are included in computing the extreme values in a data set do not affect the mode.
mean.
- The mean is very useful in comparing two or more data sets. A data set that has only one value that occur the
- Mean is most appropriate in symmetrical data. greatest frequency is said to be unimodal.
- Mean is affected by the extreme small or large values
(outliers) on a data set. If the data has two values with the same greatest
frequency, both values are considered the mode and the data
Mean can be computed as: set is bimodal.
𝑆𝑢𝑚 𝑜𝑓 𝑎𝑙𝑙 𝑣𝑎𝑙𝑢𝑒𝑠 If a data set have more than two modes, and the data set is
MEAN = 𝑁𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑣𝑎𝑙𝑢𝑒𝑠 said to be multimodal.
Sample Mean There are also some cases when data set values
have the same number frequency, when this occur, the data
set is said to be no mode.
Formula: (Position)
A. If n is odd, the median is the middle-ranked value.
𝑛𝑘
QUARTILE PERCENTILES
𝑛𝑘
+ 0.5 + 0.5
B. If n is even, the median is the average of the two
𝑛+1 4 100
middle ranked values. Q k= PK=
2
Median (Rank Value) =
𝑛𝑘
+ 0.5
DECLES
10
D K=
If the resulting positioning is an INTEGER, then the
particular numerical observation to that point is chosen for
the quartile.
Measure of Dispersion
Spread of data values from the average
Dispersion is the difference between the actual value
and the average value.
Range
- Difference of highest and lowest value.
- (low value – lesser the variability or malapit sa mean)
Standard Deviation
- Describes the difference between data values and
mean
-
- calculated as the square root of variance.
Variance
- Squared measure of standard deviation.
Whatever you do, work at it with all your heart, as working for the Lord, not for human masters.
- Colossians 3:23