BES – LAB 7

Non-parametric Tests 2
1. Objectives
- Explain the R procedures to conduct and check assumptions for the Sign test and Wilcoxon
signed-rank sum test.
- Understand and interpret the R output.
2. Procedures
The Wilcoxon signed-rank sum test is a non-parametric approach used to compare paired data when
the differences are not normally distributed. To run it, use the same code as for the
Mann-Whitney-Wilcoxon (MWW) test, but add the argument paired=TRUE:
> wilcox.test(x, y, alternative="two.sided", paired=TRUE, exact=NULL,
correct=TRUE) # data are saved in two different numeric vectors
> wilcox.test(outcome ~ grouping variable, data = name of data frame,
alternative = "two.sided", paired = TRUE) # data are saved in a data frame
For the Sign test, we must install the package PASWR so that we can use the function SIGN.test:
> install.packages("PASWR")
> library(PASWR)
> SIGN.test(x, y, alternative="two.sided")
Don’t forget to check the assumptions to decide which test to use. We prefer the Sign test for ordinal
(ranked) data, and the Wilcoxon signed-rank sum test for quantitative data (with non-normal differences).
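As a minimal sketch of the two-vector call, the following uses made-up paired measurements. The vectors `before` and `after` are hypothetical and do not come from any lab data file. The values were chosen so that no difference is zero and no two absolute differences tie, which lets R compute an exact p-value. Some recent R releases are reported to reject paired = TRUE in the formula interface, so the sketch uses only the two-vector form.

```r
# Hypothetical paired measurements (illustrative only, not lab data)
before <- c(125, 118, 140, 132, 121, 150, 137, 129)
after  <- c(120, 121, 131, 130, 114, 138, 138, 123)

# Paired, two-sided Wilcoxon signed-rank test on two numeric vectors
res <- wilcox.test(before, after, alternative = "two.sided", paired = TRUE)

print(res$statistic)  # V = 32 for these values
print(res$p.value)    # exact p-value, since there are no ties or zero differences
```

The result is an object of class "htest", so the statistic and the p-value can be pulled out by name instead of being read off the printed output.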
3. Exercises
Exercise 1. A test was conducted for two overnight mail delivery services. Two samples of identical
deliveries were set up so that both delivery services were notified of the need for a delivery at the same
time. The hours required to make each delivery are stored in the Overnight data file. Do the data suggest a
difference in the delivery times for the two services? Use a 0.05 level of significance for the test.
We’re going to work with the file Overnight.csv, so first import it into R:
> overnight <- read.table("Overnight.csv", header = TRUE, sep = ",",
stringsAsFactors = FALSE)
> head(overnight) # show the first few rows of the data frame
> str(overnight) # show the structure of the data frame
The next step is to check the assumptions to see which test is applicable in this case. From the
output of the code above, we know that the data are quantitative and the two samples are matched. Let’s
check the normality of the differences with the help of the stem-and-leaf display and the Q-Q plot.
> diff <- overnight$Service1 - overnight$Service2 # paired differences
> stem(diff)
> qqnorm(diff, main = "QQ plot of differences")
> qqline(diff)
The outputs are given below.

The decimal point is 1 digit(s) to the right of the |

-0 | 444421
0 | 1111
0 | 8

Based on the R outputs, it is not reasonable to assume that the differences in delivery times are normally
distributed. As a result, we must use a nonparametric test instead of the parametric one (the t-test for
matched pairs). In more detail: we are comparing 2 populations; the samples are matched; the data are
quantitative, but the differences between paired observations cannot be assumed to be normal. We
therefore apply the Wilcoxon signed-rank sum test.
Question 1. Set up the hypotheses for the test. What type of test is this?
Now we apply wilcox.test() to produce the R output for this problem. Because we are
asked to test for a significant difference between the 2 groups, we choose alternative="two.sided";
and because the samples are paired, we set paired=TRUE.
> ex1 <- wilcox.test(overnight$Service1, overnight$Service2,
alternative = "two.sided", paired = TRUE, correct = TRUE)
> ex1
Wilcoxon signed rank test with continuity correction

data: overnight$Service1 and overnight$Service2

V = 22, p-value = 0.3489
alternative hypothesis: true location shift is not equal to 0
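
The statistic V in this output is the sum of the ranks of the positive differences. A minimal sketch of that calculation follows, using hypothetical delivery times (the vectors `service_a` and `service_b` are made up and are not the contents of Overnight.csv). It ranks the absolute differences by hand and compares the result with what wilcox.test() reports.

```r
# Hypothetical paired delivery times in hours (illustrative only)
service_a <- c(24, 26, 30, 22, 29, 31, 25, 28)
service_b <- c(27, 25, 24, 26, 22, 23, 30, 26)

d <- service_a - service_b   # paired differences: -3, 1, 6, -4, 7, 8, -5, 2
d <- d[d != 0]               # zero differences are dropped before ranking
r <- rank(abs(d))            # rank the absolute differences (1 = smallest)

V_manual <- sum(r[d > 0])    # sum of ranks attached to positive differences
V_r <- unname(wilcox.test(service_a, service_b, paired = TRUE)$statistic)

print(V_manual)  # 24
print(V_r)       # 24, the same value computed by wilcox.test()
```

The ranks of the negative differences sum to 12 here, and the two sums together give n(n + 1)/2 = 36 for n = 8 non-zero differences, which is a quick check on a hand calculation.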

Question 2. What is your conclusion in this case?

Exercise 2. Vendors of prepared food are very sensitive to the public’s perception of the safety of the
food they sell. Food sold at outdoor fairs and festivals may be less safe than food sold in restaurants
because it is prepared in temporary locations and often by volunteer help. What do people who attend
fairs think about the safety of the food served? One study asked this question of people at a number of
fairs in the Midwest: How often do you think people become sick because of food they consume that is
prepared at outdoor fairs and festivals? The variable “sfair” contains the responses to this question
about the safety of food served at outdoor fairs and festivals. The variable “srest” contains
responses to the same question asked about food served in restaurants. The possible responses were:
1 = very rarely; 2 = once in a while; 3 = often; 4 = more often than not; and 5 = always. In all, 303
people answered the question. We suspect that restaurant food will appear safer than food served
outdoors at a fair. Do the data give good evidence for this suspicion? Conduct the appropriate test with
significance level 𝛼 = 0.05.
The data are stored in the file foodsafety.csv, so import it into R and check the first few subjects as follows.
  subject hfair sfair sfast srest gender
1       1     4     1     1     1      1
2       2     4     2     4     2      1
3       3     2     2     2     2      1
4       4     4     2     2     1      1
5       5     2     3     1     3      1
6       6     1     2     2     2      1
Check the assumptions and choose the appropriate technique.
Question 3. Are the two samples independent or matched? What type of data are represented? Should
we conduct the parametric test in this case?
After answering the above questions, you will see that the best choice is the Sign test in the PASWR
package. Using SIGN.test() gives the following results:
> ex2 <- SIGN.test(foodsafety$sfair, foodsafety$srest, alternative = "greater")
> ex2
Dependent-samples Sign-Test

data: foodsafety$sfair and foodsafety$srest

S = 137, p-value < 2.2e-16
alternative hypothesis: true median difference is greater than 0
95 percent confidence interval:
0 Inf
sample estimates:
median of x-y
0
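The idea behind the sign test can be sketched in base R without PASWR: count the positive differences among the non-zero ones and compare that count with a Binomial(n, 0.5) distribution. The ratings below are hypothetical, not taken from foodsafety.csv, and dropping tied pairs is the usual convention assumed here. The result may not match SIGN.test() output in every detail.

```r
# Hypothetical ordinal ratings on a 1-5 scale (illustrative only)
fair <- c(3, 2, 4, 2, 3, 5, 2, 3, 4, 2, 3, 1)
rest <- c(2, 2, 3, 1, 3, 4, 3, 2, 2, 2, 2, 1)

d <- fair - rest
d <- d[d != 0]               # tied pairs carry no sign information, so drop them

n_nonzero <- length(d)       # 8 pairs with a non-zero difference
n_pos     <- sum(d > 0)      # 7 of them are positive

# One-sided sign test: is a positive difference more likely than a negative one?
res <- binom.test(n_pos, n_nonzero, p = 0.5, alternative = "greater")
print(res$p.value)           # 9/256, about 0.035
```

Only the signs of the differences are used, which is why the test suits ordinal responses such as these ratings.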
Question 4. Give your interpretation of this output.
Exercise 3. The file french.csv presents the scores on a test of understanding of spoken French for a group
of executives before and after an intensive French course.
(a) Show the assignment of ranks and the calculation of the signed rank statistic T⁺ for testing
whether the mean improvement in scores after the course differs from 0.
(b) Now use R to implement the Wilcoxon signed-rank procedure to reach a conclusion about the
impact of the language course. State the hypotheses in words and report the statistic T⁺, its p-value, and
your conclusion. Remember to check all the assumptions.

Here are the outputs.

  Executive Pretest Posttest
1         1      32       34
2         2      31       31
3         3      29       35
4         4      10       16
5         5      30       33
6         6      33       36

The decimal point is 1 digit(s) to the right of the |

-0 | 66666
-0 | 33333322211
0 | 000
0 | 6

Wilcoxon signed rank test with continuity correction

data: french$Pretest and french$Posttest

V = 14.5, p-value = 0.003257
alternative hypothesis: true location shift is not equal to 0

Exercise 4. A student organization surveyed both current students and recent graduates to obtain
information on the quality of teaching at a particular university. An analysis of the responses provided
the following teaching-ability rankings stored in Professors.csv. Do the rankings given by the current
students agree with the rankings given by the recent graduates? Use 𝛼 = 0.1 to draw your conclusion.

  Professor Current.Students Recent.Graduates
1         1                4                6
2         2                6                8
3         3                8                5
4         4                3                1
5         5                1                2
6         6                2                3

'data.frame': 10 obs. of 3 variables:

$ Professor : int 1 2 3 4 5 6 7 8 9 10
$ Current.Students: int 4 6 8 3 1 2 5 10 7 9
$ Recent.Graduates: int 6 8 5 1 2 3 7 9 4 10

Dependent-samples Sign-Test

data: professors$Current.Students and professors$Recent.Graduates

S = 4, p-value = 0.7539
alternative hypothesis: true median difference is not equal to 0
95 percent confidence interval:
-2.000000 2.675556
sample estimates:
median of x-y
-1
