
STATS 101B Discussion 4/14

Navin Souda

2024-04-15
Multiple Testing I
▶ So we run ANOVA and find statistically significant differences
in the effects/means
▶ But this tells us nothing about which specific effects or means differ
▶ Isn’t that what the t-tests are for?
▶ Sort of, but we have to be careful about our significance level
▶ For testing a single effect (i.e. a binary factor), we usually use
a significance level of 0.05 - practically, this means that we
have an at most 5% chance (on average) of rejecting the null
hypothesis when the null hypothesis is actually true
▶ But as we include more tests, each of them separately has up to a
5% chance of a false rejection - the probability of them all being
correct simultaneously could actually be much less than 95%
▶ Some boring but important math: let $A_i$ be the event that test $i$ is
correct, for $i = 1, \ldots, p$
▶ Then the probability that at least one test fails is
$P((\cap_i A_i)^c) = 1 - P(\cap_i A_i)$, and if all the tests are
independent, then $1 - P(\cap_i A_i) = 1 - \prod_i P(A_i) = 1 - (1 - \alpha)^p$
Multiple Testing II

▶ i.e. as $p$ increases, the probability of at least one test being
incorrect gets closer and closer to one (unless the tests are
completely dependent)
▶ This leads us to the concept of multiple testing correction,
which aims to adjust α so that either $P((\cap_i A_i)^c)$ (known as
the family-wise error rate (FWER)) or a slightly different
quantity, the false discovery rate, is controlled at α across all the
tests simultaneously
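
A quick R sketch of the formula above - under the independence
assumption, the chance of at least one false rejection grows fast
with the number of tests:

alpha <- 0.05
p <- c(1, 5, 10, 20, 50)
round(1 - (1 - alpha)^p, 3)  # FWER if nothing is corrected
## 0.050 0.226 0.401 0.642 0.923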
Multiple Testing Correction I
▶ Bonferroni Method
▶ The simplest and most conservative method, makes no
assumptions about the tests
▶ With p tests, just run each test at significance level α/p
▶ This will often end up giving us a FWER strictly below α (i.e. the
method is conservative), but as mentioned it is very simple
▶ Will skip the math but feel free to ask in OH/through email if
interested
▶ pairwise.t.test() with p.adjust.method = "bonferroni" in
R (see the sketch after the Tukey HSD bullets below)
▶ Tukey’s Honestly Significant Differences (HSD)
▶ Calculate a standardized statistic for the difference between
each pair of means, which will tell us whether that particular
difference is significant
▶ $T = \dfrac{\bar{x}_{i.} - \bar{x}_{j.}}{\sqrt{\frac{MS_{\text{within}}}{2}\left(\frac{1}{n_i} + \frac{1}{n_j}\right)}}$
▶ $\bar{x}_{i.}$ and $n_i$ are the mean and size of group $i$; $MS_{\text{within}}$ is the
within-group mean square found by one-way ANOVA
Multiple Testing Correction II

▶ This follows a distribution called the studentized range
distribution; you will need R or a table to find the p-value
▶ TukeyHSD() in R
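
As a minimal sketch of both corrections in R (using the dat data
frame - response, treatment1 - from the example that follows):

# Bonferroni: pairwise t-tests with p-values adjusted for the
# number of comparisons
pairwise.t.test(dat$response, dat$treatment1,
                p.adjust.method = "bonferroni")

# Tukey HSD: studentized-range-based comparisons of all pairs,
# computed from a fitted one-way ANOVA
fit1 <- aov(response ~ treatment1, data = dat)
TukeyHSD(fit1, conf.level = 0.95)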
Basic Factorial Design
▶ In a factorial design, we have two or more treatments and an
observation for each combination of the treatment groups (or
rather, we randomly assign each experimental unit to a
treatment combination)
▶ The general concepts are the same as with a single factor, but
we have to consider the interaction between the factors as well
▶ We can look for interactions visually using an interaction plot
(example later), and also test their significance using ANOVA
▶ Going to ignore the details of calculating the ANOVA table,
can see lecture slides for the derivation - essentially it boils down
to finding the between-group SS for each treatment individually,
as well as for the combination of treatments for the interaction
(see the sketch after this list)
▶ Model assessment
▶ This is basically the same as it was in 101A - we’re looking for
independent, normally distributed residuals with constant
variance - the usual tools such as residual plots, leverage, and
Cook’s distance will be useful
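
As a sketch, the full factorial model with interaction is one line in
R; the * operator expands to both main effects plus their
interaction (dat is the data frame from the example that follows):

# response ~ treatment1 * treatment2 is shorthand for
# treatment1 + treatment2 + treatment1:treatment2
fit2 <- aov(response ~ treatment1 * treatment2, data = dat)
summary(fit2)                          # ANOVA table, incl. interaction row
model.tables(fit2, type = "effects")   # estimated effects by level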
Example

We’re going to perform an experiment with 2 treatments - the first
has 4 levels, and the second has 3. We want to determine whether
these treatments have an effect on our response, as well as whether
they interact with each other. Finally, we want to determine which
groups of each treatment cause the greatest difference in response.

## treatment1 treatment2 response
## 1 1 a 401.6951
## 2 1 b 504.5292
## 3 1 c 408.6511
## 4 2 a 361.1613
## 5 2 b 351.1951
## 6 2 c 347.7072
Example (cont.)
[Plot: response (y, ~250-500) by treatment1 (x, levels 1-4)]
Example (cont.)
[Plot: response (y, ~250-500) by treatment2 (x, levels a-c)]
Example (cont.)
We might want to start by determining whether it is useful to consider
the interaction term. Based on the following plots, does it seem like the
interaction effect would be significant? Why or why not?
[Interaction plot: response (y, ~300-500), one line per treatment1 level (1-4)]
Example (cont.)
[Interaction plot: response (y, ~300-500) vs. treatment1 (x, levels 1-4), one line per treatment2 level (a-c)]
Example (cont.)

We probably want to start by determining whether it is useful to
consider the interaction term. Based on the preceding plots, does it
seem like the interaction effect would be significant? Why or why
not?
A: We probably should include the interaction term, as it is clear
that for different groups of treatment 1, the groups of treatment 2
have varying responses (as seen in the first plot); group 1 of
treatment 1 in particular seems to interact with treatment 2.
Extra Q: What would we expect the plots to look like if there was
no interaction?
A: The lines corresponding to different levels would be “parallel”,
i.e. the effect of treatment 2 is the same regardless of the level of
treatment 1 and vice versa
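
A minimal sketch of how such a plot can be drawn in base R
(assuming the dat data frame from this example):

# one line per level of treatment1; roughly parallel lines
# suggest little or no interaction
with(dat, interaction.plot(x.factor = treatment2,
                           trace.factor = treatment1,
                           response = response))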
Example (cont.) I

Now that we’ve decided to include the interaction term, let’s fit
our ANOVA model. Based on the resulting output, what can we
say about the significance of our treatments with regard to the
response? Which levels of each treatment are significant?

## Df Sum Sq Mean Sq F value Pr(>F)
## treatment1 3 166409 55470 177.112 < 2e-16 ***
## treatment2 2 4907 2453 7.833 0.00241 **
## treatment1:treatment2 6 16040 2673 8.536 5.05e-05 ***
## Residuals 24 7517 313
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' '
Example (cont.) II
## Tables of effects
##
## treatment1
## treatment1
## 1 2 3 4
## 99.54 19.11 -35.06 -83.59
##
## treatment2
## treatment2
## a b c
## -9.948 16.385 -6.437
##
## treatment1:treatment2
## treatment2
## treatment1 a b c
## 1 -21.35 47.61 -26.26
## 2 8.77 -24.14 15.37
## 3 -6.74 -10.56 17.31
## 4 19.33 -12.91 -6.42
Example (cont.)

Now that we’ve decided to include the interaction term, let’s fit
our ANOVA model. Based on the resulting output, what can we
say about the significance of our treatments with regard to the
response? Which levels of each treatment are significant?
A: Both treatments are significant, as well as their interaction. But
the ANOVA table alone can’t tell us which specific levels differ - for
that we need the pairwise comparisons on the following slides.
Example (cont.) I
According to the Tukey HSD output, which levels of treatment 1
and treatment 2 have significant differences? Does the Bonferroni
adjustment agree with the Tukey HSD? Why or why not?

## Tukey multiple comparisons of means
## 95% family-wise confidence level
##
## Fit: aov(formula = response ~ treatment1 + treatment2 +
##
## $treatment1
## diff lwr upr p adj
## 2-1 -80.42544 -103.43923 -57.41165 0.00e+00
## 3-1 -134.59933 -157.61312 -111.58554 0.00e+00
## 4-1 -183.12940 -206.14319 -160.11561 0.00e+00
## 3-2 -54.17389 -77.18768 -31.16010 5.80e-06
## 4-2 -102.70396 -125.71775 -79.69017 0.00e+00
## 4-3 -48.53007 -71.54386 -25.51628 3.01e-05
Example (cont.) II
## Tukey multiple comparisons of means
## 95% family-wise confidence level
##
## Fit: aov(formula = response ~ treatment1 + treatment2 +
##
## $treatment2
## diff lwr upr p adj
## b-a 26.333460 8.290942 44.375977 0.0035479
## c-a 3.511213 -14.531304 21.553731 0.8785812
## c-b -22.822246 -40.864764 -4.779729 0.0113900

##
## Pairwise comparisons using t tests with pooled SD
##
## data: dat$response and dat$treatment1
##
## 1 2 3
Example (cont.) III
## 2 1.5e-05 - -
## 3 3.9e-10 0.0032 -
## 4 1.5e-13 1.6e-07 0.0095
##
## P value adjustment method: bonferroni

##
## Pairwise comparisons using t tests with pooled SD
##
## data: dat$response and dat$treatment2
##
## a b
## b 1 -
## c 1 1
##
## P value adjustment method: bonferroni
Example (cont.)

According to the Tukey HSD output, which levels of treatment 1
and treatment 2 have significant differences? Does the Bonferroni
adjustment agree with the Tukey HSD? Why or why not?
A: For treatment 1, all levels are significantly different. For
treatment 2, levels a and b, and levels b and c, are significantly
different. For treatment 1, the Bonferroni procedure and the Tukey
HSD agree, but not for treatment 2. This is likely because the
pairwise t-tests use a pooled SD that ignores treatment 1 and the
interaction, so their error variance is badly inflated.
Extra Q: Write a line of R code that we could use to identify the
significantly different interaction terms.
A: (returns a logical vector, TRUE for significantly different pairs)
TukeyHSD(example_model, "treatment1:treatment2")$`treatment1:treatment2`[, "p adj"] < 0.05
Example (cont.)
Comment on the residuals of the model.
[Diagnostic plots: Residuals vs Fitted, Q-Q Residuals, Scale-Location, and Constant Leverage: Residuals vs Factor Levels (treatment1); observations 15, 23, and 35 flagged]

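These four panels are the default diagnostics that plot() produces
for a fitted aov object; as a sketch (using the fit2 object assumed in
the earlier factorial-design sketch):

par(mfrow = c(2, 2))  # arrange the four panels in a 2x2 grid
plot(fit2)            # Residuals vs Fitted, Q-Q, Scale-Location, and
                      # (constant leverage) Residuals vs Factor Levels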