Module 6 - Non-Parametric Statistics
Module 6 - Non-Parametric Statistics
P U P 1
COURSE OUTCOME & TOPICS
Parametric Non-Parametric
STAT 20053 Evaluate hypotheses for a particular Evaluate hypotheses for the entire
parameter, usually the population mean. population distribution.
STATISTICAL
Quantitative data Quantitative, ranked, qualitative data
ANALYSIS with
SOFTWARE Require assumptions about the Require no assumptions (“distribution-
APPLICATION distributional characteristics of the free”) so used with non-normal
population distribution (e.g., normal distributions and when variances of the
distribution, homogeneity of variances) groups are not equal.
More powerful than non-parametric Generally use to compute.
tests when assumptions are met.
P U P
3
PARAMETRIC VS
NON-PARAMETRIC TESTS
STAT 20053
Parametric Non-Parametric
STATISTICAL Two Independent Samples Independent t-Test Mann-Whitney U
ANALYSIS with
Dependent/Paired Samples Paired t-Test Wilcoxon Signed-Rank
SOFTWARE
APPLICATION More than Two Samples One-way ANOVA Kruskal-Wallis
Nominal Samples N/A Chi-Square
P U P
4
MANN-WHITNEY U TEST
P U P
5
MANN-WHITNEY U TEST
Test Statistic
STAT 20053 𝑈=
𝑅 − 𝜇𝑅
𝜎𝑅
STATISTICAL where
𝑛1 𝑛1 + 𝑛2 + 1
ANALYSIS with 𝜇𝑅 =
2
SOFTWARE
𝑛1 𝑛2 𝑛1 + 𝑛2 + 1
APPLICATION 𝜎𝑅 =
12
P U P
6
MANN-WHITNEY U TEST
P U P
7
MANN-WHITNEY U TEST
P U P
8
MANN-WHITNEY U TEST
P U P
9
WILCOXON SIGNED-RANK TEST
When the samples are dependent, as they would be in a before-and-after test using
the same subjects, the Wilcoxon signed-rank test can be used in place of the t test for dependent
STAT 20053 samples. Again, this test does not require the condition of normality. It uses the following
hypotheses:
𝐻0 : The matched pairs have differences that come from a population with a median equal to zero.
STATISTICAL
ANALYSIS with 𝐻1 : The matched pairs have differences that come from a population with a nonzero median.
SOFTWARE
APPLICATION Test Statistic
𝑛(𝑛 + 1)
𝑤𝑠 −
𝑧= 4
𝑛(𝑛 + 1)(2𝑛 + 1)
24
where
• 𝑛 = number of pairs where difference is not 0
• 𝑤𝑠 = smaller sum in absolute value of signed ranks
P U P
10
WILCOXON SIGNED-RANK TEST
Requirements
1. Your dependent variable should be measured at a continuous level (i.e., they are interval or ratio
STAT 20053 variables) or ordinal level.
2. Your independent variable should consist of two categorical, "related groups" or "matched pairs".
STATISTICAL 3. There is no requirement that the two populations have a normal distribution or any other
ANALYSIS with particular distribution.
SOFTWARE
APPLICATION Steps in Minitab:
1. Test for normality of the distribution, if needed.
2. The difference scores that you need when running a Wilcoxon signed-rank test in Minitab are not
automatically calculated. Therefore, you need to run the Calc > Calculator... procedure.
3. Click Stat > Nonparametrics > 1-Sample Wilcoxon
4. Input the needed variables, etc.
5. Click OK
P U P
11
WILCOXON SIGNED-RANK TEST
P U P
12
WILCOXON SIGNED-RANK TEST
P U P
13
KRUSKAL-WALLIS TEST
The analysis of variance uses the F test to compare the means of three or more
populations. The assumptions for the ANOVA test are that the populations are normally
STAT 20053 distributed and that the population variances are equal. When these assumptions cannot be met,
the nonparametric Kruskal-Wallis test, sometimes called the H test, can be used to compare
three or more means.
STATISTICAL
𝐻0 : The samples come from populations with equal medians.
ANALYSIS with
SOFTWARE 𝐻1 : The samples come from populations with medians that are not all equal.
APPLICATION Test Statistic
12 𝑅12 𝑅22 𝑅𝑘2
𝐻= + + ⋯+ − 3(𝑁 + 1)
𝑁 𝑁+1 𝑛1 𝑛2 𝑛𝑘
where
• 𝑁 = total number of observations in all samples combined
• 𝑘 = number of samples
• 𝑅 = sum of ranks for Sample k calculated with the procedure that follows
• 𝑛 = number of observations in Sample k
P U P
14
KRUSKAL-WALLIS TEST
Requirements
STAT 20053 1. We have at least three independent samples, all of which are randomly selected.
2. This test works best if each sample has at least five observations.
STATISTICAL 3. There is no requirement that the populations have a normal distribution or any
other particular distribution.
ANALYSIS with
SOFTWARE
APPLICATION Steps in Minitab:
1. Test for normality of the distribution, if needed.
2. Click Stat > Nonparametrics > Kruskal-Wallis
3. Select Responses are in one column for all factor levels
4. Input the needed variables, etc.
5. Click OK
P U P
15
KRUSKAL-WALLIS TEST
P U P
16
CHI-SQUARE TEST
When data can be tabulated in table form in terms of frequencies, several types of
hypotheses can be tested by using the chi-square test.
STAT 20053
The Chi-Square tests are tests based on hypothesis-testing problems arise for
frequency, or count data. These are tests that are concerned with the testing of statistical
STATISTICAL hypotheses about multiple population proportions.
ANALYSIS with A multinomial experiment is an experiment that meets the following conditions:
SOFTWARE 1. The number of trials is fixed and trials are independent.
APPLICATION 2. Each trial must have all outcomes classified into exactly one of several different
categories.
3. For a 2 by 2 table, all expected frequencies > 5.
Hypotheses:
𝐻0 : The row and the column criteria are independent of each other.
𝐻1 : The row and the column criteria are not independent of each other.
P U P
17
CHI-SQUARE TEST
Test Statistic
STAT 20053 The formula for the test value for the independence test is as follows:
𝑘
𝑜𝑖 − 𝑒𝑖 2
STATISTICAL 𝜒2 =
𝑒𝑖
ANALYSIS with 𝑖=1
SOFTWARE The symbols 𝑜𝑖 and 𝑒𝑖 represent the observed and expected frequencies, for
APPLICATION the ith cell.
The chi-square independence test can be used to test the
independence of two variables for data expressed in contingency table.
Group Category A Category B
A 𝑜1 𝑜2
B 𝑜3 𝑜4
P U P
18
CHI-SQUARE TEST
P U P
19
CHI-SQUARE TEST
P U P
20
STAT 20053
STATISTICAL QUESTIONS?
ANALYSIS with
SOFTWARE
APPLICATION
You may reach at the ff channels during Worksheet
Consultation Hours:
• Google Classroom
• Facebook Group
• Discord
P U P
21