Ae9 Final Module
Ae9 Final Module
• The generally accepted levels are 0.10, 0.05, and 0.01. What happen to the defendant if the jury made type I
• Be as rigorous as possible. and type II error?
Two Types of Error
Answer:
A type I error is like putting an innocent person in
jail.
A type II error is like letting a guilty person go free.
3. Determine the Test Distribution to Use.
Reminders: Determine the appropriate statistical test to
It is important to note that we want to set be used.
( α ) before we start our study because the ✦ Dependent Sample t - Test
Type I error is the more ‘grevious’ error to
make. ✦ Independent Sample t - Test
Performing statistical analysis using statistical Reject the null hypothesis if the test statistic is not within
software such as Excel, SPSS, R, Minitab, SAS, the range specified by the confidence interval.
etc. ✦ Using Traditional Approach
Reject Ho if the computed value of the test statistic falls in
5. Make Statistical Decision the region of rejection.
✦ Using P-value Approach
✦ Using confidence interval Reject the null hypothesis if the computed p-value is less
than or equal to the set significance level , otherwise do not
✦ Using p-value approach reject the null hypothesis.
Example: If the level of significance (α = 0.05),
✦ Using traditional method P-value Decision
0.01 Reject H0
0.05 Reject H0
0.10 Failed to Reject H0
Traditional Approach One-tailed and Left tailed One-tailed and Right tailed
Ha : μ1 < μ2 Ha : μ1 > μ2
Rejection of region Rejection Region
or critical region is Rejection Region
the set of all values of
the test statistic
which will lead to the -2 0 2 -2 0 2
rejection of H0.
Acceptance Region is Two-tailed
the set of all values of Ha : μ1 ≠ μ2
the test statistic that Rejection Region
Rejection Region
leads the researcher to
retain H0.
-2 0 2
Use "=DEVSQ( )”
function in excel
∑ i ( n+1−i
STEP 3: Calculate b as follows: b = a x − xi)
i=1
n is the number of
observation
If n is even:
n
m=
2
If n is odd:
n−1
m=
2
Since n is even in this
SS means Sum of Square example, m=8. That’s
why we used a1 to a8
Shapiro - Wilk Table
We choose this
interval in the table of
Shapiro - Wilk,
because our n=16 and
our test statistic
(W=0.955) is within
this interval.
Result
Assumptions
Dependent Sample t - Test 1. Your dependent variable should be measured at
the interval or ratio level (i.e., they are
The dependent sample t-test (also called continuous).
the paired t-test or paired-samples t-test) 2. Your independent variable should consist of two
compares the means of two related groups categorical, "related groups" or "matched pairs”.
to determine whether there is a statistically 3. There should be no significant outliers in the
significant difference between these differences between the two related groups.
means. 4. The distribution of the differences in the
H0 : μ1 ≥ μ2 and Ha : μ1 < μ2 dependent variable between the two related
H0 : μ1 ≤ μ2 and Ha : μ1 > μ2 groups should be approximately normally
H0 : μ1 = μ2 and Ha : μ1 ≠ μ2 distributed.
Example: 1. State the Null and Alternative
A teacher is interested to know if the new learning program Hypothesis
will help to increase the number of correct remembered
Null hypothesis: Ho : μ1 ≥ μ2
words. 10 Subjects learn a list of 50 words. Learning
performance is measured using a recall test. The new learning program will not help to increase
After the first test all subjects the number of correct remembered words.
are instructed how to use the Alternative hypothesis: Ha : μ1 < μ2
learning program and then
The new learning program will help to increase the
learn a second list of 50 words.
Learning performance is again number of correct remembered words.
measured with the recall test. In 2. Set the Level of Significance or Alpha
the following table the number
of correct remembered words Level (α)
are listed for both tests. α = 0.05
Reject Ho
Exercises:
6. Draw Conclusion Apply the procedure in testing the hypothesis.
Professor Rhea measured the time (in second) required to
catch a falling meter sticks for 10 randomly selected
There is sufficient evidence to support that the new students' dominant hand and non-dominant hand. Professor
learning program help to increase the number of Rhea claims that the reaction time in an individual's
correct remembered words. dominant hand is less than the reaction time in
their non-dominant hand.
Proper Presentation of Results Test the claim at the level
of significance. The data
obtained are presented:
Result
Independent Sample t - Test
The independent sample t - test allows
researchers to evaluate or to compare the mean
difference between two populations using the data
from two separate samples. It is used to test
whether population means are significantly
different from each other, using the means from
randomly drawn samples.
H0 : μ1 ≥ μ2 and Ha : μ1 < μ2
H0 : μ1 ≤ μ2 and Ha : μ1 > μ2
H0 : μ1 = μ2 and Ha : μ1 ≠ μ2
Assumptions Example:
1. Your dependent variable should be measured on a Researchers wanted to know whether there was a difference in
continuous scale (i.e., it is measured at the interval or comprehension among students learning a computer program
ratio level). based on the style of the text. They randomly divided 18
2. Your independent variable should consist of two students into two groups of 9 each. The researchers verified
categorical, independent groups. that the 18 students were similar in terms of educational level,
3. You should have independence of observations, which age, and so on. Group 1 individuals learned the software using
means that there is no relationship between the visual manual (multimodal
observations in each group or between the groups instruction), while Group 2
themselves. individual learned the software
4. There should be no significant outliers. using textual manual (Unimodal
5. Your dependent variable should be approximately instruction). The following data
normally distributed for each group of the independent represent scores the students
variable. received on an exam given to them
6. There needs to be homogeneity of variances. they studied from the manuals.
1. State the Null and Alternative 3. Determine the Test
Hypothesis
Null hypothesis: Ho : μ1 = μ2
Distribution to Use.
There is no significant difference between the scores of the Dependent Variable:
students learning computer program using textual and
visual style. Scores
Alternative hypothesis: Ha : μ1 ≠ μ2 Independent Variable:
There is significant difference between the scores of the
students learning computer program using textual and Style of the Text (Visual and Textual)
visual style.
2. Set the Level of Significance or Alpha Since we are comparing the means of two
Level (α) independent groups, we will use the
α = 0.05 independent sample t-test.
Determine if the
variances are equal
or not equal.
Using p-value approach: If pvalue ≤ α , reject Ho, 4. Calculate Test Statistic or
otherwise failed to reject Ho
Ho: Equal Variances Assumed p - value.
Ha: Equal Variances Not Assumed Click “Data”, then click “Data Analysis”
Failed to
Reject Ho
Since we failed to reject Ho, we will proceed to t-test: Two
Sample Assuming Equal Variances. Polytechnic University of the Philippines
College of Science
Department of Mathematics and Statistics
Result
5. Make Statistical Decision 6. Draw Conclusion
Using p-value approach: If pvalue ≤ α , reject Ho, There is no enough evidence to support that
otherwise failed to reject Ho there is a difference in comprehension among
students learning a computer program based on
the style of the text.
Proper Presentation of Results
Failed to
Reject Ho
Exercises:
Apply the procedure in testing the hypothesis.
Twenty participants were given a list of 20 words to
process. The 20 participants were randomly assigned to
one of two treatment conditions. Half were instructed to
count the number of vowels in each word (shallow
processing). Half were instructed to judge whether the
object described by each word would be useful if one
were stranded on a desert island (deep processing).
After a brief distractor task, all subjects were given a
surprise free recall task. Did the instruction affect the
level of recall?The number of words correctly recalled
was recorded for each subject. Here are the data:
Result
Failed to
Reject Ho
E q u a l
Variances
Assumed
Failed to
Reject Ho
E q u a l
Variances
Assumed
Using p-value approach: If pvalue ≤ α , reject Ho,
otherwise failed to reject Ho
Ho: Equal Variances Assumed
Ha: Equal Variances Not Assumed
Failed to
Reject Ho
E q u a l
Variances
Assumed
Reject Ho
Exercises:
6. Draw Conclusion Apply the procedure in testing the hypothesis.
There is enough evidence to support that the A teacher is concerned about the level of
distribution of exam scores of students in knowledge possessed by PUP students regarding
mathematics is different for each city. Philippine history. Students completed a high
school senior level standardized history exam.
Proper Presentation of Results Academic major of the students was also recorded.
Data in terms of percent correct is recorded below
for 24 students. Is there a significant difference
between the levels of knowledge possessed by PUP
students regarding Philippine history when
grouped according to their academic major?
Result
X X X
r = -1 r = -.6 r =0
If r is positive, the correlation is direct. Y Y
If r is negative, the correlation is inverse.
r = .6 r=1
Reminders: Assumptions
• Correlation does not imply causation. 1. Your two variables should be measured at the
• Watch out for hidden (lurking) variables. interval or ratio level (i.e., they are
continuous).
Lurking Variable
2. There is a linear relationship between your
• A variable that is not included as an explanatory two variables.
or response variable in the analysis but can affect
the interpretation of relationships between 3. There should be no significant outliers.
variables.
4. Your variables should be approximately
• Can falsely identify a strong relationship between normally distributed.
variables or it can hide the true relationship.
Significance Testing of Pearson r Example:
Test Statistic: A dietetics student wanted to look at the
df relationship between calcium intake and
t=r knowledge about calcium in sports
1 − r2
where: science students. Table shows the data
df = degrees of freedom she collected. Is there a relationship
between calcium intake and knowledge
r = correlation coefficient of Pearson r
about calcium in sports science
Note:
students?
df = n − 2
df
t=r
1 − r2
Result
df = n − 2
5. Make Statistical Decision 6. Draw Conclusion
There is sufficient evidence to conclude that there
Using p-value approach: If pvalue ≤ α , is significant relationship between the calcium
reject Ho, otherwise failed to reject intake and knowledge about calcium in sports
Ho Strong and science students.
D i r e c t Proper Presentation of Results
Correlation
Reject Ho
Exercises:
Apply the procedure in testing the hypothesis.
Example: Reminders:
1. A doctor who knows that hypertension depends
on smoking habits can tell his smoking patients what
they should do. The word contingency refers to
dependence, but this is only a
2. If the traffic condition (light, moderate, heavy,
standstill) is found to be dependent on vehicle plate statistical dependence and cannot be
numbers (odd, even) a traffic officer may decide to used to establish a direct cause-and-
revise traffic law enforcement. effect link between the two variables in
3. If poverty status of households is found to be question.
correlated with family size, government ought to
adopt a viable poverty management program
1. State the Null and Alternative
Example: Hypothesis
Null hypothesis:
Educators are always looking for novel ways in
which to teach statistics to undergraduates as part Gender is independent with the preferred type of
of a non-statistics degree course (e.g., psychology). learning medium.
With current technology, it is possible to present Alternative hypothesis:
how-to guides for statistical programs online Gender is dependent with the preferred type of
instead of in a book. However, different people learning medium.
learn in different ways. An educator would like to
know whether gender (male/female) is associated
2. Set the Level of Significance or Alpha
with the preferred type of learning medium (online Level (α)
vs. books). Use “Data_Example and Exercises file”. α = 0.0.5
Grand Total
Column Total
Reject Ho
6. Draw Conclusion
There is sufficient evidence to conclude that there
gender is associated with the preferred type of
learning medium.
Proper Presentation of Results
Result