5th International Conference on Computer Sciences and Automation Engineering (ICCSAE 2015)

Power analysis for testing two independent groups of likert-type data

Zun-xiong Liu1,a & Hao Chen2,b*

1 School of Information Engineering, East China Jiaotong University, P.R. China
2 School of Information Engineering, East China Jiaotong University, P.R. China
a email: [email protected], b email: [email protected], * corresponding author

Keywords: two-sample problem, nonparametric test, power of test

Abstract. The one-sample location problem tests whether the center of a population is equal to a
known value. In many practical situations, however, the question of interest is whether two samples
differ significantly, that is, whether two populations can be regarded as the same. The aim of this
article is to compare the power of three nonparametric two-sample tests: the two-sample rank
(Mann-Whitney) test, the Smirnov test (two-sample Kolmogorov-Smirnov test) and the two-sample
Cramér-von Mises test. Their relative performance is examined so that a suitable two-sample
method can be chosen for a given situation. Simulation results indicate that no single test is best
for every sample distribution, but in most instances the Cramér-von Mises test performs best.
Moreover, the Kolmogorov-Smirnov test generally outperforms the Mann-Whitney test across the
sample distributions, sample sizes and effect sizes considered.

Introduction
Categorical data are widely used in education, psychology, economics and social research.
Likert-type data, for example, are used in social attitude surveys and psychological tests. This
article examines whether two independent samples of such data follow the same distribution [2].
Measurement on a continuous scale is sometimes not available, in particular for variables
concerning feelings, attitudes, or opinions. Researchers therefore create rating instruments
based on ordered categories, with which feelings, attitudes or opinions can be described. Rensis
Likert's dissertation introduced a new attitude-scaling technique based on a survey of student
attitudes.
We compare tests for two independent samples. Three statistical methods, the Mann-Whitney test,
the Kolmogorov-Smirnov test [3] and the Cramér-von Mises test [1], are examined for robustness
and statistical power under different conditions using simulated Likert-type data sets. To find
the appropriate test for each condition, the empirical Type I error rates of all tests are
compared [4].
The following sections define the alternative distributions and describe the simulation and linear
interpolation technique used to approximate the powers. The number of categories is restricted to
10 to allow reasonable examples of the differently shaped alternative distributions. The results of
the power studies are then presented, and a summary of the most powerful test statistic(s) for each
of the specified alternative distributions is given in the Conclusion and Recommendations section.

Note: This article is supported by the National Natural Science Foundation of China (No. 71361009)
and the Social Science Foundation of the State Education Ministry (No. 13YJC63019).

© 2016. The authors - Published by Atlantis Press

Proposed method

A. Mann-Whitney test
The Mann-Whitney test assesses whether two independent samples come from continuous
distributions with equal medians.
It is carried out with the R function wilcox.test {stats}, which performs one- and two-sample
Wilcoxon tests on vectors of data; the latter is also known as the 'Mann-Whitney' test.
Data: The data consist of two independent samples, one of size n, X1, X2, ..., Xn, and the other
of size m, Y1, Y2, ..., Ym. Let N = n + m.
Test statistic:

$$T = \frac{\sum_{i=1}^{n} R(X_i) - n\,\frac{N+1}{2}}{\sqrt{\dfrac{nm}{N(N-1)}\sum_{i=1}^{N} R_i^{2} - \dfrac{nm(N+1)^{2}}{4(N-1)}}}$$

Observations with exactly equal values in the two samples are treated as ties and receive the
average of the ranks they span. R(·) denotes the rank assigned to an observation in the combined
sample.
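As an illustration only, the statistic above can be computed in a few lines of Python (the study
itself used R's wilcox.test). The sketch below assumes that tied observations receive mid-ranks
and that a standard normal approximation is used for the p-value; the names midranks,
mann_whitney_t and two_sided_p are hypothetical helpers, not functions from any package.

```python
import math


def midranks(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next sorted value equals the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        average_rank = (start + end) / 2 + 1  # mean of ranks start+1 .. end+1
        for k in range(start, end + 1):
            ranks[order[k]] = average_rank
        start = end + 1
    return ranks


def mann_whitney_t(x, y):
    """Tie-corrected standardized rank-sum statistic T for samples x and y."""
    n, m = len(x), len(y)
    N = n + m
    ranks = midranks(list(x) + list(y))
    rank_sum_x = sum(ranks[:n])           # sum of R(X_i) over the first sample
    sum_sq = sum(r * r for r in ranks)    # sum of R_i^2 over all N observations
    variance = n * m / (N * (N - 1)) * sum_sq - n * m * (N + 1) ** 2 / (4 * (N - 1))
    # Note: variance is zero when every observation is identical; T is then undefined.
    return (rank_sum_x - n * (N + 1) / 2) / math.sqrt(variance)


def two_sided_p(t):
    """Two-sided p-value under the standard normal approximation (an assumption)."""
    return math.erfc(abs(t) / math.sqrt(2))
```

For x = (1, 2, 3) and y = (4, 5, 6), which contain no ties, the variance term reduces to
nm(N+1)/12 = 5.25 and T is about -1.96.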

B. Two-sample Kolmogorov-Smirnov test

The two-sample Kolmogorov-Smirnov test assesses whether two independent samples come from the
same continuous distribution by measuring the distance between their empirical distribution
functions.
It is carried out with the R package 'dgof'. This package contains a proposed revision to the
stats::ks.test() function and the associated ks.test.Rd help page. With one minor exception, it
does not change the existing behavior of ks.test(), and it adds features necessary for doing
one-sample tests with hypothesized discrete distributions.
Data: The data consist of two independent samples, one of size n, X1, X2, ..., Xn, and the other
of size m, Y1, Y2, ..., Ym.
Test statistic: S1(x) is the empirical distribution function of the sample X1, X2, ..., Xn, while
S2(x) is the empirical distribution function of the sample Y1, Y2, ..., Ym.
Define the test statistic T1 as the maximum vertical distance between the two empirical
distribution functions:

$$T_1 = \sup_x \left| S_1(x) - S_2(x) \right|$$
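A minimal sketch of this statistic in Python is given below for illustration; it is not the
'dgof' implementation used in the study. Because the empirical distribution functions only change
at observed values, it is enough to evaluate the gap at each distinct observation. The function
names ecdf and ks_statistic are hypothetical.

```python
def ecdf(sample, x):
    """Empirical distribution function: share of observations <= x."""
    return sum(1 for value in sample if value <= x) / len(sample)


def ks_statistic(x, y):
    """Largest vertical gap between the two empirical distribution functions."""
    # Both step functions are constant between observed values, so the
    # supremum is attained at one of the distinct pooled observations.
    points = sorted(set(x) | set(y))
    return max(abs(ecdf(x, p) - ecdf(y, p)) for p in points)
```

For x = (1, 1, 2, 3) and y = (3, 4, 5, 5), the largest gap occurs at the values 2 and 3 and
equals 0.75.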

C. Two-sample Cramér-von Mises test

The two-sample Cramér-von Mises test assesses whether two independent samples come from the
same continuous distribution by measuring the goodness of fit between their empirical
distribution functions.
It is carried out with the R package 'cramer', which performs the Cramér test for the two-sample
problem. Both univariate and multivariate data are possible. For calculation of the critical value,
Monte Carlo bootstrap methods and eigenvalue methods are available. For the bootstrap, ordinary
and permutation methods can be chosen, as well as the number of bootstrap replicates.
Data: The data consist of two independent samples, one of size n, X1, X2, ..., Xn, and the other
of size m, Y1, Y2, ..., Ym.
Test statistic:

$$T_2 = \frac{mn}{(m+n)^{2}} \sum_{\substack{x = X_i \\ x = Y_j}} \left[ S_1(x) - S_2(x) \right]^{2}$$

where S1(x) is the empirical distribution function of the sample X1, X2, ..., Xn, S2(x) is the
empirical distribution function of the sample Y1, Y2, ..., Ym, and n and m are the sizes of the
two samples.
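For illustration, the statistic T2 can be sketched in Python as follows. This is a direct
transcription of the formula, with the sum taken over every observation in both samples (repeated
values counted each time); it is not the 'cramer' package routine used in the study, and the
function names are hypothetical.

```python
def ecdf(sample, x):
    """Empirical distribution function: share of observations <= x."""
    return sum(1 for value in sample if value <= x) / len(sample)


def cvm_statistic(x, y):
    """Two-sample Cramér-von Mises statistic T2 for samples x and y."""
    n, m = len(x), len(y)
    # Sum the squared gap between the two empirical distribution functions,
    # evaluated at each X_i and each Y_j.
    total = sum((ecdf(x, v) - ecdf(y, v)) ** 2 for v in list(x) + list(y))
    return m * n / (m + n) ** 2 * total
```

For x = (1, 2) and y = (3, 4), the squared gaps are 0.25, 1, 0.25 and 0, so T2 = (4/16)(1.5) =
0.375.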

Simulation experiments of power

A. Distribution of samples
The simulations were run in R to compare the Type I error rates and the power of the tests under
each distribution. Five marginal distributions on a five-point Likert scale were selected; their
probability distributions are as follows [5][6]:
Table 1: Five marginal distributions for the 5-point response scale

5-point scale   Uniform   Moderate Skew   Highly Skew   Symmetric   Bimodal
1               0.2000    0.2400          0.6561        0.0625      0.3276
2               0.2000    0.4117          0.2906        0.2500      0.1471
3               0.2000    0.2636          0.0496        0.3750      0.0506
4               0.2000    0.0766          0.0046        0.2500      0.1471
5               0.2000    0.0081          0.0001        0.0625      0.3276
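Drawing simulated responses from these marginal distributions can be sketched as below. The
example is in Python rather than R and is only an illustration; the dictionary MARGINALS and the
function draw_likert are hypothetical names. The probabilities are copied from Table 1 as printed.
random.choices rescales weights by their total, so a column that does not sum to exactly 1 as
printed (the Highly Skew column sums to 1.001) needs no manual adjustment.

```python
import random

# Category probabilities for responses 1..5, copied from Table 1.
MARGINALS = {
    "uniform":       [0.2000, 0.2000, 0.2000, 0.2000, 0.2000],
    "moderate_skew": [0.2400, 0.4117, 0.2636, 0.0766, 0.0081],
    "highly_skew":   [0.6561, 0.2906, 0.0496, 0.0046, 0.0001],
    "symmetric":     [0.0625, 0.2500, 0.3750, 0.2500, 0.0625],
    "bimodal":       [0.3276, 0.1471, 0.0506, 0.1471, 0.3276],
}


def draw_likert(name, size, rng):
    """Draw `size` responses on the 1..5 scale from the named marginal."""
    return rng.choices([1, 2, 3, 4, 5], weights=MARGINALS[name], k=size)
```

A call such as draw_likert("bimodal", 30, random.Random(2015)) returns a list of 30 integers
between 1 and 5; the seed value is arbitrary.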

B. Sample size
The power of each test was estimated by Monte Carlo simulation. A total of twelve pairs of sample
sizes were examined for the two groups being compared. The sample sizes chosen for this study
were (10, 10), (10, 30), (10, 50), (30, 30), (30, 50), (30, 100), (50, 50), (50, 100), (50, 300),
(100, 100), (100, 300), and (300, 300).

C. Power and levels of effect size

Statistical power is defined as the probability of rejecting the null hypothesis given that the
alternative hypothesis is true. To evaluate the statistical power of the tests, the effect size
must be specified. The effect size refers to the magnitude of the departure from the null
hypothesis under the alternative. A nonzero effect size means that the alternative hypothesis
holds and the null hypothesis of equality is false, so that there is a real difference between the
two groups being tested. In this study, effect sizes of 0.30 and 0.50 are examined.
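The Monte Carlo estimate of power can be sketched as follows: draw many pairs of samples under the
alternative, test each pair, and report the share of rejections. The text does not state how the
effect size is imposed on the second group, so the sketch leaves both samplers to the caller. It
also substitutes a permutation p-value for the p-values returned by the R functions used in the
study, and uses a simple mean difference as a placeholder statistic; all function names and
default values are hypothetical.

```python
import random


def mean_difference(x, y):
    """Placeholder statistic; any two-sample statistic can be passed instead."""
    return abs(sum(x) / len(x) - sum(y) / len(y))


def permutation_p_value(x, y, statistic, n_perm, rng):
    """Share of random relabelings whose statistic is at least the observed one."""
    observed = statistic(x, y)
    pooled = list(x) + list(y)
    n = len(x)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if statistic(pooled[:n], pooled[n:]) >= observed:
            hits += 1
    # Counting the observed labeling itself keeps the p-value above zero.
    return (hits + 1) / (n_perm + 1)


def estimate_power(draw_x, draw_y, statistic, alpha=0.05, n_rep=200, n_perm=199, seed=1):
    """Estimate the rejection rate at level alpha over n_rep simulated data sets."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(n_rep):
        x, y = draw_x(rng), draw_y(rng)
        if permutation_p_value(x, y, statistic, n_perm, rng) <= alpha:
            rejections += 1
    return rejections / n_rep
```

Passing the same sampler for both groups gives an estimate of the empirical Type I error rate
instead of the power. The estimate is only as precise as n_rep allows, so larger values are
preferable when run time permits.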

D. Significance level
In this study, the significance level (α) is set to 0.05. At nominal α = 0.05, a test is
considered robust if its observed Type I error rate lies within 25% of this value, i.e., from
0.0375 to 0.0625.
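The robustness interval follows from 0.05 × (1 - 0.25) = 0.0375 and 0.05 × (1 + 0.25) = 0.0625.
A small Python sketch of this check is shown below for illustration; the function names and the
rounding tolerance eps are assumptions, not part of the study.

```python
def robust_interval(alpha=0.05, tolerance=0.25):
    """Lower and upper bounds for a Type I error rate to count as robust."""
    return alpha * (1 - tolerance), alpha * (1 + tolerance)


def is_robust(type_one_rate, alpha=0.05, tolerance=0.25, eps=1e-12):
    """True if the observed rate lies within the interval, bounds included."""
    low, high = robust_interval(alpha, tolerance)
    # eps absorbs floating-point rounding so the bounds themselves pass.
    return low - eps <= type_one_rate <= high + eps
```

An observed rate of 0.04 would count as robust under this criterion, while 0.07 would not.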

Results of experiments
The power of the three tests is shown in the following figures.

A. Uniform distribution

Fig.1 Uniform distribution(effs=0.3) Fig.2 Uniform distribution(effs=0.5)

With the effect size set to 0.3, Figure 1 shows that the Cramér-von Mises test performs worst
when both sample sizes are below 50. As the sample sizes grow, the power of the Cramér-von Mises
test rises sharply and becomes the highest, followed by the KS test and then the Mann-Whitney
test.
When the effect size is 0.5, the power curves of the three tests in Figure 2 are approximately
three parallel lines: the power of the Cramér-von Mises test is highest, followed by the KS test
and finally the Mann-Whitney test.

B. Moderate Skew distribution

Fig.3 Moderate skew distribution(effs=0.3) Fig.4 Moderate skew distribution(effs=0.5)

When the effect size equals 0.3, Figure 3 shows no clear ordering among the curves of the three
tests for the first four sample-size pairs, (10, 10), (10, 30), (10, 50) and (30, 30). As the
sample size grows, the Cramér-von Mises test performs better than the other two tests, and the
Mann-Whitney test performs worst.
When the effect size is 0.5, the power of the Cramér-von Mises test is clearly highest, followed
by the KS test and finally the Mann-Whitney test.

C. Highly Skew distribution

Fig.5 Highly skew distribution(effs=0.3) Fig.6 Highly skew distribution(effs=0.5)

Figures 5 and 6 show that the statistical power of the Cramér-von Mises test is superior to that
of the KS test and the Mann-Whitney test at both effect sizes, 0.3 and 0.5. The KS test ranks
second among the three tests.

D. Symmetric distribution

Fig.7 Symmetric distribution(effs=0.3) Fig.8 Symmetric distribution(effs=0.5)

In Figure 7, for small sample sizes such as (10, 10) and (10, 50), the statistical power of the
Cramér-von Mises test is lower than that of the KS test and even lower than that of the
Mann-Whitney test. As the sample size increases, however, the power of the Cramér-von Mises test
recovers and becomes the highest.
In Figure 8, the Cramér-von Mises test has the highest power, followed by the KS test and finally
the Mann-Whitney test.

E. Bimodal distribution
Figure 9 shows that, in the simulation studies, the Kolmogorov-Smirnov test performs best under
effect size 0.3. The same pattern is observed for effect size 0.5, as shown in Figure 10. When the
effect size is 0.3, the statistical power of the Cramér-von Mises test is the lowest, lower even
than that of the Mann-Whitney test. When the effect size increases to 0.5, this gap narrows, but
the underlying problem remains. We conclude that the Cramér-von Mises test is not suitable for the
bimodal distribution.

Fig.9 Bimodal distribution(effs=0.3) Fig.10 Bimodal distribution(effs=0.5)

Conclusion and Recommendations

This study clearly indicates that statistical power increases as the effect size and the sample
size increase. It also shows that none of these three tests can realistically be recommended to the
applied econometrician as having higher power in all situations. When the effect size is 0.3 or 0.5,
the two-sample Cramér-von Mises test generally performs better than the other two tests. When the
alternative distribution is bimodal, however, the Cramér-von Mises test performs badly. The
difference between the Cramér-von Mises test and the KS test is subtle: the two have similar
capacities as goodness-of-fit tests, but the former seems to make better use of the data.
The Mann-Whitney test is not well suited to goodness-of-fit testing for Likert-type data.
The results and the summary in the table below can at least give the applied econometrician some
guidance on the choice among alternative goodness-of-fit test statistics with respect to power.
Table 2: General summary of the power of three tests

Alternative distribution   General ranking of power
Uniform                    CVM > KS > MW
Moderate Skew              CVM > KS > MW
Highly Skew                CVM > KS > MW
Symmetric                  CVM > KS > MW
Bimodal                    KS > MW > CVM

In future work, the number of simulation replications should be increased, and effect sizes of 0.1
or 0.7 could be examined for a deeper study.

References
[1] Choulakian, V., Lockhart, R.A. and Stephens, M.A. (1994). Cramér-von Mises statistics for
discrete distributions. The Canadian Journal of Statistics, 22, 125-137.
[2] Steele, M. and Chaseling, J. (2006). Powers of discrete goodness-of-fit test statistics for a
uniform null against a selection of alternative distributions. Communications in Statistics -
Simulation and Computation, 35, 1067-1075.
[3] Pettitt, A.N. and Stephens, M.A. (1977). The Kolmogorov-Smirnov goodness-of-fit statistic with
discrete and grouped data. Technometrics, 19, 205-210.
[4] Conover, W.J. (1998). Practical nonparametric statistics. John Wiley & Sons.
[5] Steele, M. (2002). The power of categorical goodness-of-fit test statistics [D]. Griffith
University, Brisbane, Australia.
[6] Gibbons, J.D. and Chakraborti, S. (1992). Nonparametric statistical inference (3rd ed.). New
York: M. Dekker.
