2FAQ in STATISTICS. September 2 2018
2FAQ in STATISTICS. September 2 2018
STATISTICS
WILFREDO P. MARIÑO
Top 10 FAQ’s in STATISTICS
Population
Census
Parameters
Sample
Survey
Statistics
“Statistics is the mathematics of
collection, organization, and interpretation
of numerical data, especially the analysis
of population characteristics by inference
from sampling.“
American Heritage Dictionary
2. WHAT CONSTITUTES AN ADEQUATE
OR SUFFICIENT SIZE FOR A SAMPLE?
N= number of population
n= number of sample
e= error tolerance
SLOVIN’S FORMULA
N=1500
e= 5%= 0.05
Province N n
Cavite 250 53 250 x 0.21 = 53
𝑛 316
𝑘= = =0.21
𝑁 1500
SAMPLE SIZE FORMULA
n = sample size
z = value from the normal table at a given
confidence level (at 95% c.l., z = 1.96)
p = % that an event would occur
q = % that event would not occur
e = margin of error (MOE)
W C
A N
Z V L
A D GE
I
N S H RP
U T V
Z
B E H
K N Q
T W Z
A F KG
MQ E L
D
H
A F KG G
MQ E L F Q E
D
H D
Graphical Male
Female
Summary Measures
Central Interquartile
Tendency Range
Median Coefficient of
Variation
Mean Mode
5. HOW DO WE SELECT THE
APPROPRIATE STATISTICAL TOOLS?
1. Is there a significant difference in the LET
performance of students of URS in terms of sex?
......Independent t-test!
2. How do the stress level of URS Graduate School
students compare when grouped according to age?
……ANOVA!
3. In an intact group experiment, how do the pretest
scores and posttest scores differ?
.....Dependent t-test!
5. HOW DO WE SELECT THE
APPROPRIATE STATISTICAL TOOLS?
The following are the primary considerations;
•The statistical test is based on the problem stated
for which the research is intended;
•Sampling design and technique used in
determining the subject and respondents are most
of the times considered in determining the statistical
test to be used in the study;
• denoted by Ho
• the statement being tested
• it represents what the experimenter
doubts to be true
Alternative Hypothesis
• denoted by Ha or H1
• Is the statement that must be true if the
null hypothesis is false
• the operational statement of the theory
that the experimenter believes to be true
and wishes to prove
• Is sometimes referred to as the research
hypothesis
Test Statistics
• A random variable whose value is
calculated from the sample data
and is used in making the
decision “fail to reject Ho” or
“reject Ho”. These are the
computed z-value, t-value, F-
value, chi-square value, and
others.
Test Statistics
Critical Value
• It is the value or values that separate the
critical region from the values of the test
statistic that would not lead to rejection
of the null hypothesis. This is based on
the level of significance and degree of
freedom. We need a table of statistics to
determine the critical value that can be
found at the appendices of statistics
books.
Critical Value
Level of Significance
• It represents the chance of rejecting
a true null hypothesis.
• Most of the times we will set the
significance criterion at 0.05.
• This means that if there is no effect
in the population, the measured
effect in the sample must be so large
as to occur less than 5% of the time
by chance alone.
Probability Value
• It is the probability that the test
statistic could be the value or a more
extreme value (in the direction of the
alternative hypothesis) when the null
hypothesis is true.
• It is the probability of obtaining
sample mean difference as far apart
as we have, if the null hypothesis
were true.
Probability Value
Decision Rule
7. WHICH IS CORRECT, “ACCEPT HO” OR “FAIL TO
REJECT HO”
Alpha at 0.05
9. WHAT IS THE CONCLUSION IF THE
DECISION IS REJECT OR FAIL TO REJECT
HO?
Reject Ho
There is a sufficient evidence at the alpha level of
significance to show that…(the meaning of
alternative hypothesis).”
Fail to reject Ho
There is no sufficient evidence at the alpha level
of significance to show that…(the meaning of the
alternative hypothesis.”
As shown, there is a sufficient evidence at the 0.05 level of
significance to show that there is a significant difference between
the performance of learners before and after their exposure to the
intervention with respect to Lesson 1 and Lesson 3.
On the other hand, there is no sufficient evidence to show that
there is a significant difference between the performance of
learners before and after their exposure to the intervention with
respect to lesson 2,
As shown, there is no sufficient evidence at
the 0.05 level of significance to show that there
is a significant difference on the performance of
teachers in the Division of Antipolo City in terms
of sex.
As shown, there is a sufficient evidence
at the 0.05 level of significance to show that
there is a significant difference on the
performance of teachers in the Division of
Antipolo City in terms of district.
As shown, there is a sufficient evidence at the 0.05
level of significance to show that high school grade,
college grade and family income are predictors of the
LET Score.
On the other hand, there is no sufficient evidence to
show that age is a determinant of the Let Score.
Based on the result, the prediction equation will be,
LET= 8.373+0.286 HSG+0.566 CG -0.146 Age+ 0.374 FI
The coefficients indicates that on the average, each
point in high school grade was associated with a LET
score increase of 0.286. Every point in college grade
would have 0.566 increase in LET score. Each year in
age would have a 0.146 decrease in LET score, and
every peso in family income would mean 0.374 increase
in LET score.
10. WHAT ARE THE DIFFERENT
STATISTICAL SOFTWARES THAT
CAN BE UTILIZE IN DATA ANALYSIS?
• SAS • IBM SPSS
• OCTAVE • Minitab
• PSPP • SAS
• ADaMSoFT • JMP
• R • Genstat
• pbdR • STATISTICA
• Shogun • STATA
• CSPro • SigmaStat
• CumFreq • Unistat
†AD MAJOREM DEI
GLORIAM†
LET’S TRY