MITx: Statistics, Computation & Applications

Statistics Refresher
Lecture 3: Multiple Hypothesis Testing

Caroline Uhler (MIT) MITx: Statistics, Computation & Applications Lecture 3 1/9
Some quotes and research findings

Giovannucci et al., Journal of the National Cancer Institute 87 (1995):

Intake of tomato sauce (p-value of 0.001), tomatoes (p-value of 0.03),
and pizza (p-value of 0.05) reduces the risk of prostate cancer;
but, for example, tomato juice (p-value of 0.67), cooked spinach
(p-value of 0.51), and many other vegetables are not significant.

"Orange cars are less likely to have serious damages that are discovered
only after the purchase."

Jelly Beans and Acne

The problem of selective inference

http://imgs.xkcd.com/comics/significant.png
Wonder-syrup

randomized group of 1000 people

measure 100 variables before and after taking the syrup: weight,
blood pressure, etc.

perform a paired t-test with a significance level of 5%

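V := # false significant tests. If the syrup has no effect on any variable and the tests are independent: V ∼ Binomial(100, 0.05)

⇒ on average, 5 of the 100 variables show a significant effect even though the syrup does nothing!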
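
The arithmetic behind this slide can be checked with a short sketch. It assumes, as the Binomial model does, that all 100 null hypotheses are true and that the tests are independent; the helper name `binom_pmf` is ours, not part of the lecture.

```python
from math import comb

def binom_pmf(k: int, n: int, p: float) -> float:
    """P(V = k) for V ~ Binomial(n, p)."""
    return comb(n, k) * p**k * (1 - p) ** (n - k)

m, alpha = 100, 0.05  # number of tests and per-test significance level

# Expected number of false significant tests: E[V] = m * alpha.
expected_false_positives = m * alpha

# Probability of at least one false significant test: 1 - P(V = 0).
prob_at_least_one = 1 - binom_pmf(0, m, alpha)

print(f"E[V]      = {expected_false_positives:.1f}")
print(f"P(V >= 1) = {prob_at_least_one:.4f}")
```

Under these assumptions, at least one spurious "effect" is almost guaranteed, which is why the per-test level of 5% is not a meaningful guarantee for the study as a whole.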

Different protection levels

Compute p-values using methods that control either:

family-wise error rate (FWER) ≤ α, where
FWER = P(at least one false significant result)

false discovery rate (FDR) ≤ α, where
FDR = expected fraction of false significant results among all significant results
(the fraction is taken to be 0 when there are no significant results)
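
To make the two definitions concrete, here is a minimal Monte Carlo sketch that estimates both error rates when every test is run at level α with no correction. Writing V for the number of false significant results and R for the total number of significant results, it estimates FWER as the frequency of V ≥ 1 and FDR as the average of V / max(R, 1). The setup is an assumption made for illustration only: 90 true nulls with Uniform(0, 1) p-values, and 10 real effects whose p-values are drawn as U**8 to skew them toward 0.

```python
import random

def error_rates(m0, m1, alpha, n_rep=2000, seed=0):
    """Estimate (FWER, FDR) for uncorrected testing at level alpha.

    m0: number of true null hypotheses, m1: number of real effects.
    """
    rng = random.Random(seed)
    any_false = 0   # counts experiments with at least one false rejection
    fdp_sum = 0.0   # running sum of the false discovery proportion V / R
    for _ in range(n_rep):
        # True nulls: p-values are Uniform(0, 1); each rejection is false.
        v = sum(rng.random() <= alpha for _ in range(m0))
        # Real effects (hypothetical model): p = U**8, skewed toward 0.
        s = sum(rng.random() ** 8 <= alpha for _ in range(m1))
        r = v + s
        any_false += v >= 1
        fdp_sum += v / max(r, 1)  # proportion is 0 when nothing is rejected
    return any_false / n_rep, fdp_sum / n_rep

fwer, fdr = error_rates(m0=90, m1=10, alpha=0.05)
print(f"estimated FWER = {fwer:.3f}, estimated FDR = {fdr:.3f}")
```

In every single experiment the false discovery proportion is at most 1 and is 0 whenever V = 0, so FDR ≤ FWER always holds. Controlling FWER is therefore the stricter requirement.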

Corrections for multiple testing
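Bonferroni correction:
Reject H0 when: m · p-value ≤ α,
where m is the total number of hypothesis tests performed
Bonferroni correction implies FWER ≤ α (under any dependence between the tests)

Holm-Bonferroni correction:
Sort p-values in increasing order: p(1) ≤ · · · ≤ p(m)
Starting at i = 1, reject the i-th hypothesis as long as (m − i + 1) · p(i) ≤ α; stop at the first i where this fails and retain all remaining hypotheses (more power than Bonferroni)
Holm-Bonferroni correction implies FWER ≤ α

Benjamini-Hochberg correction:
Sort p-values in increasing order: p(1) ≤ · · · ≤ p(m)
Find the largest k with m · p(k) / k ≤ α and reject the hypotheses belonging to p(1), ..., p(k)
Benjamini-Hochberg correction implies FDR ≤ α (for independent or positively dependent tests)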
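
The three corrections can be sketched in a few lines of plain Python. This is an illustrative implementation of the standard procedures, not code from the lecture: Holm is written as a step-down rule (stop at the first failure) and Benjamini-Hochberg as a step-up rule (reject everything up to the largest rank that passes). The function names and the five example p-values are made up for the demonstration.

```python
def bonferroni(pvals, alpha=0.05):
    """Reject H0 for test j when m * p_j <= alpha."""
    m = len(pvals)
    return [m * p <= alpha for p in pvals]

def holm(pvals, alpha=0.05):
    """Step-down: walk through sorted p-values, stop at the first failure."""
    m = len(pvals)
    order = sorted(range(m), key=lambda j: pvals[j])  # indices by increasing p
    reject = [False] * m
    for rank, j in enumerate(order, start=1):
        if (m - rank + 1) * pvals[j] > alpha:
            break
        reject[j] = True
    return reject

def benjamini_hochberg(pvals, alpha=0.05):
    """Step-up: reject the k smallest p-values, k = largest rank that passes."""
    m = len(pvals)
    order = sorted(range(m), key=lambda j: pvals[j])
    k = 0
    for rank, j in enumerate(order, start=1):
        if m * pvals[j] / rank <= alpha:
            k = rank
    reject = [False] * m
    for j in order[:k]:
        reject[j] = True
    return reject

# Hypothetical p-values, deliberately unsorted.
pvals = [0.03, 0.001, 0.2, 0.014, 0.012]
print("Bonferroni:", sum(bonferroni(pvals)), "rejections")
print("Holm:      ", sum(holm(pvals)), "rejections")
print("BH:        ", sum(benjamini_hochberg(pvals)), "rejections")
```

On this example Bonferroni rejects 1 hypothesis, Holm rejects 3, and Benjamini-Hochberg rejects 4, which matches the ordering of power suggested by the slide: Holm is never less powerful than Bonferroni, and controlling FDR instead of FWER allows more rejections still.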
Commonly accepted practice

No correction for multiple testing when generating hypotheses (but report the number of tests performed)

FDR ≤ 10% in exploratory analysis or screening

balance between high power and a low number of false significant results

FWER ≤ 5% in confirmatory analysis

e.g., Food and Drug Administration (FDA)

References

Lecture by Yoav Benjamini, THE expert on multiple testing issues:

http://simons.berkeley.edu/talks/yoav-benjamini-2013-12-11a
