Dunn Test PDF

Download as pdf or txt
Download as pdf or txt
You are on page 1of 6
At a glance
Powered by AI
Dunn's test performs multiple comparisons after a Kruskal-Wallis test to test for stochastic dominance among groups. It makes pairwise comparisons based on Dunn's test statistic to test if one group's CDF crosses another. It can be considered a test for median differences under certain assumptions.

Dunn's test computes pairwise comparisons after a Kruskal-Wallis test of stochastic dominance among groups. It tests the null hypothesis that the probability of a value from one group being larger than a value from another group equals one half. This corresponds to the null hypothesis of the Wilcoxon-Mann-Whitney rank sum test.

The interpretation of stochastic dominance requires the assumption that the CDF of one group does not cross the CDF of another. It also assumes the data can be treated as continuous and the distributions are identical except for differences in location.

Package dunn.

test
February 9, 2017
Version 1.3.3
Date 2017-02-08
Title Dunn's Test of Multiple Comparisons Using Rank Sums
Author Alexis Dinno <[email protected]>
Maintainer Alexis Dinno <[email protected]>
Description Computes Dunn's test (1964) for stochastic dominance and reports the re-
sults among multiple pairwise comparisons after a Kruskal-Wallis test for stochastic domi-
nance among k groups (Kruskal and Wallis, 1952). The interpretation of stochastic domi-
nance requires an assump-
tion that the CDF of one group does not cross the CDF of the other. 'dunn.test' makes k(k-
1)/2 multiple pairwise comparisons based on Dunn's z-test-statistic approximations to the ac-
tual rank statistics. The null hypothesis for each pairwise comparison is that the probabil-
ity of observing a randomly selected value from the first group that is larger than a randomly se-
lected value from the second group equals one half; this null hypothesis corre-
sponds to that of the Wilcoxon-Mann-Whitney rank-sum test. Like the rank-
sum test, if the data can be assumed to be continuous, and the distributions are assumed identi-
cal except for a difference in location, Dunn's test may be understood as a test for median differ-
ence. 'dunn.test' accounts for tied ranks.
License GPL-2
Encoding UTF-8
NeedsCompilation no
Repository CRAN
Date/Publication 2017-02-09 07:01:57

R topics documented:
dunn.test . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
homecare . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5

Index 6

1
2 dunn.test

dunn.test Dunns Test

Description
Performs Dunns test of multiple comparisons using rank sums

Usage
dunn.test (x, g=NA, method=p.adjustment.methods, kw=TRUE, label=TRUE,
wrap=FALSE, table=TRUE, list=FALSE, rmc=FALSE, alpha=0.05)

p.adjustment.methods
# c("none", "bonferroni", "sidak", "holm", "hs", "hochberg", "bh", "by")

Arguments
x a numeric vector, or a list of numeric vectors. Missing values are ignored. If the
former, then groups must be specified using g.
g a factor variable, numeric vector, or character vector indicating group. Missing
values are ignored.
method adjusts the p-value for multiple comparisons using the Bonferroni, idk, Holm,
Holm-idk, Hochberg, Benjamini-Hochberg, or Benjamini-Yekutieli adjust-
ment (see Details). The default is no adjustment for multiple comparisons.
kw if TRUE then the results of the Kruskal-Wallis test are reported.
label if TRUE then the factor labels are used in the output table.
wrap does not break up tables to maintain nicely formatted output. If FALSE then
output of large tables is broken up across multiple pages.
table outputs results of Dunns test in a table format, as qualified by the label and
wrap options.
list outputs results of Dunns test in a list format.
rmc if TRUE then the reported test statistics and table are based on row minus col-
umn, rather than the default column minus row (i.e. the signs of the test statistic
are flipped).
alpha the nominal level of significance used in the step-up/step-down multiple com-
parisons procedures (Holm, Holm-idk, Hochberg, Benjamini-Hochberg, and
Benjamini-Yekutieli).

Details
dunn.test computes Dunns test (1964) for stochastic dominance and reports the results among
multiple pairwise comparisons after a Kruskal-Wallis test for stochastic dominance among k groups
(Kruskal and Wallis, 1952). The interpretation of stochastic dominance requires an assumption that
the CDF of one group does not cross the CDF of the other. dunn.test makes m = k(k-1)/2 multiple
dunn.test 3

pairwise comparisons based on Dunns z-test-statistic approximations to the actual rank statistics.
The null hypothesis for each pairwise comparison is that the probability of observing a randomly se-
lected value from the first group that is larger than a randomly selected value from the second group
equals one half; this null hypothesis corresponds to that of the Wilcoxon-Mann-Whitney rank-sum
test. Like the rank-sum test, if the data can be assumed to be continuous, and the distributions are
assumed identical except for a difference in location, Dunns test may be understood as a test for
median difference. dunn.test accounts for tied ranks.
dunn.test outputs both z-test-statistics for each pairwise comparison and the p-value = P(Z>|z|)
for each.
Several options are available to adjust p-values for multiple comparisons, including methods to
control the family-wise error rate (FWER) and methods to control the false discovery rate (FDR):
"none" no adjustment is made.
"bonferroni" the FWER is controlled using Dunns (1961) Bonferroni adjustment, and adjusted
p-values = max(1, pm).
"sidak" the FWER is controlled using idks (1967) adjustment, and adjusted p-values = max(1,
1 - (1 - p)^m).
"holm" the FWER controlled using Holms (1979) progressive step-up procedure to relax control
on subsequent tests. p values are ordered from smallest to largest, and adjusted p-values = max[1,
p(m+1-i)], where i indexes the ordering. All tests after and including the first test to not be rejected
at the alpha/2 level are not rejected.
"hs" the FWER is controlled using the Holm-idk adjustment (Holm, 1979): another progressive
step-up procedure but assuming dependence between tests. p values are ordered from smallest to
largest, and adjusted p-values = max[1, 1 - (1 - p)^(m+1-i)], where i indexes the ordering. All tests
after and including the first test to not be rejected at the alpha/2 level are not rejected.
"hochberg" the FWER is controlled using Hochbergs (1988) progressive step-down procedure to
increase control on successive tests. p values are ordered from largest smallest, and adjusted p-
values = max[1, p*i], where i indexes the ordering. All tests after and including the first to be
rejected at the alpha/2 level are rejected.
"bh" the FDR is controlled using the Benjamini-Hochberg adjustment (1995), a step-down proce-
dure appropriate to independent tests or tests that are positively dependent. p-values are ordered
from largest to smallest, and adjusted p-values = max[1, pm/(m+1-i)], where i indexes the ordering.
All tests after and including the first to be rejected at the alpha/2 level are rejected.
"by" the FDR is controlled using the Benjamini-Yekutieli adjustment (2011), a step-down procedure
appropriate to depenent tests. p-values are ordered from largest to smallest, and adjusted p-values
= max[1, pmC/(m+1-i)], where i indexes the ordering, and the constant C = 1 + 1/2 + . . . + 1/m.
All tests after and including the first to be rejected at the alpha/2 level are rejected.
Because the sequential step-up/step-down tests rejection decisions depend on both the p-values and
their ordering, those tests rejected using "holm", "hs", "hochberg", "bh", or "by" at the alpha/2
level have an asterisk placed immediately after the adjusted p-value in the output.

Value
dunn.test returns:

chi2 a scalar of the Kruskal-Wallis test statistic adjusted for ties.


4 dunn.test

Z a vector of all m of Dunn z test statistics.


P a vector of p-values corresponding to Z.
P.adjust a vector of p-values corresponding to Z, but adjusted for multiple comparisons
as per method (P = P.adjust if method="none").
comparisons a vector of strings labeling each pairwise comparison, as qualified by the rmc
option, using either the variable values, or the factor labels or (or factor values
if unlabeled). These labels match the corresponding position in the Z, P, and
P.adjust vectors.

Author(s)
Alexis Dinno (<[email protected]>)

References
Benjamini, Y. and Hochberg, Y. 1995. Controlling the False Discovery Rate: A Practical and Pow-
erful Approach to Multiple Testing. Journal of the Royal Statistical Society. Series B (Method-
ological). 57: 289300.
Benjamini, Y. and Yekutieli, D. 2001. The control of the false discovery rate in multiple testing
under dependency. Annals of Statistics. 29: 11651188.
Dunn, O. J. 1961. Multiple comparisons among means. Journal of the American Statistical
Association. 56: 5264.
Dunn, O. J. 1964. Multiple comparisons using rank sums. Technometrics. 6: 241252.
Hochberg, Y. 1988. A sharper Bonferroni procedure for multiple tests of significance. Biometrika.
75:800802.
Holm, S. 1979. A simple sequentially rejective multiple test procedure. Scandinavian Journal of
Statistics. 6: 6570.
Kruskal, W. H. and Wallis, A. 1952. Use of ranks in one-criterion variance analysis. Journal of
the American Statistical Association. 47: 583621.
idk, Z. 1967. Rectangular confidence regions for the means of multivariate normal distribu-
tions. Journal of the American Statistical Association. 62: 626633.

Examples
## Example cribbed and modified from the kruskal. test documentation
## Hollander & Wolfe (1973), 116.
## Mucociliary efficiency from the rate of removal of dust in normal
## subjects, subjects with obstructive airway disease, and subjects
## with asbestosis.
x <- c(2.9, 3.0, 2.5, 2.6, 3.2) # normal subjects
y <- c(3.8, 2.7, 4.0, 2.4) # with obstructive airway disease
z <- c(2.8, 3.4, 3.7, 2.2, 2.0) # with asbestosis
dunn.test(x=list(x,y,z))

x <- c(x, y, z)
g <- factor(rep(1:3, c(5, 4, 5)),
labels = c("Normal",
homecare 5

"COPD",
"Asbestosis"))
dunn.test(x, g)

## Example based on home care data from Dunn (1964)


data(homecare)
attach(homecare)
dunn.test(occupation, eligibility, method="hs", list=TRUE)

## Air quality data set illustrates differences in different


## multiple comparisons adjustments
attach(airquality)
dunn.test(Ozone, Month, kw=FALSE, method="bonferroni")
dunn.test(Ozone, Month, kw=FALSE, method="hs")
dunn.test(Ozone, Month, kw=FALSE, method="bh")
detach(airquality)

homecare Occupation and Home Care Eligibility

Description
Occupation and home care eligibility for 383 patients medically elgibile for home care.

Usage
homecare

Format
A data frame containing two variables, occupation and eligibility, over 383 observations.

Source
Dunn, O. J. (1964). Multiple comparisons using rank sums. Technometrics, 6(3):241252s. Page
244, Table 1

References
The data have been adapted from a study of a group of patients entering the Los Angeles County
General Hospital during the years 1959-61.
Index

Topic datasets
homecare, 5
Topic htest
dunn.test, 2
Topic nonparametric
dunn.test, 2
Topic stats
dunn.test, 2

Dunns test (dunn.test), 2


dunn.test, 2

homecare, 5

p.adjustment.methods (dunn.test), 2

You might also like