QRM - Week 3 Lecture - Canvas

The document discusses comparing more than two groups or conditions using analysis of variance (ANOVA). It covers assumptions, what to do if assumptions are violated, interpretation and follow-up analyses, and reporting of results. It also briefly discusses within-subjects ANOVA and the different post hoc tests that can be used as follow-ups.

COMPARING MORE THAN 2 GROUPS/CONDITIONS
Sharon Morein
[email protected]
Comparing 3 groups or more
This lecture:
Between-subjects ANOVA
• Assumptions
• What to do if assumptions are violated?
• Interpretation and follow-up analyses
• Reporting results

Within-subjects ANOVA (just a little bit today, more next week)
• What to do if assumptions are violated?
Independent ANOVA
• Can compare 2 means* and more (3, 4 etc.)
• Multiple t-tests inflate type I error [with α = .05 and k comparisons: FWE = 1 − (0.95)^k] → too liberal (see the numeric sketch below)
• Workhorse – certainly within experimental design

• ANOVA omnibus test (F)
• H0: μ1 = μ2 = μ3 = … = μk
• H1: at least one mean differs from at least one other
• Usually not particularly useful by itself – needs follow-up tests
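A minimal numeric sketch of the familywise-error inflation mentioned above (Python; the values of α and k are only illustrative):

```python
# Familywise error rate for k independent tests, each at alpha = .05:
# FWE = 1 - (1 - alpha)^k
alpha = 0.05
for k in (1, 3, 6, 10):
    fwe = 1 - (1 - alpha) ** k
    print(f"k = {k:2d} comparisons -> FWE = {fwe:.3f}")
# k = 3 -> .143, k = 6 -> .265, k = 10 -> .401  (far above the nominal .05)
```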
Example: 4 different treatments for MDD (so k = 4)
[Figure: score distributions for the treatment groups (Group 1, Group 2, Group 3, …)]
Example – influence of drug on libido
• Each observation is comprised of 3 components:
  • Grand mean
  • Increment or decrement of the specific group mean
  • Noise/error (individual differences, measurement error, etc.)
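Written as an equation (a standard way to express this decomposition; the symbols τ_j for the group effect and ε_ij for the error are conventional notation, not from the slide):

$x_{ij} = \underbrace{\bar{x}_{\text{grand}}}_{\text{grand mean}} + \underbrace{\tau_j}_{\text{group } j \text{ effect}} + \underbrace{\epsilon_{ij}}_{\text{noise/error}}$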
Partitioning of variance
• Model: systematic/explained variance (+ error variance)
• Residual: error (unexplained variance)

$SS_T = SS_M + SS_R$
$df_T = df_M + df_R$

$SS_M = \sum n_i(\bar{x}_i - \bar{x}_{grand})^2$,  $df_M = k - 1$
$SS_R = \sum (x_{ij} - \bar{x}_i)^2$,  $df_R = N - k$
$SS_T = \sum (x_{ij} - \bar{x}_{grand})^2$,  $df_T = N - 1$
About a name: it’s all about the variances
Logic
• We calculate how much variability there is between all scores
Total Sum of squares (SST).
• We then calculate how much of this variability can be explained
by the ‘model’ we fit to the data
Model Sum of Squares (SSM) [variability due to the experimental
manipulation]
• And how much of this variability cannot be explained by our
‘model’,
Residual Sum of Squares (SSR) [variability due to error, e.g.,
individual differences in performance]
• We compare the amount of variability explained by the ‘model’
(experiment) to the error in the model (individual differences)
• This ratio is called the F-ratio.
• If the model explains much more variability than it leaves unexplained,
then the experimental manipulation has had a significant effect
on the outcome (DV).
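A minimal sketch of this partition-and-ratio logic in Python (the scores are hypothetical; scipy's f_oneway is used only as a cross-check):

```python
import numpy as np
from scipy import stats

# Hypothetical scores for three groups (e.g., three treatment conditions)
groups = [np.array([4., 5., 6., 5.]),
          np.array([7., 8., 6., 7.]),
          np.array([9., 8., 10., 9.])]

all_scores = np.concatenate(groups)
grand_mean = all_scores.mean()
k, N = len(groups), all_scores.size

# Partition the total variability
ss_t = np.sum((all_scores - grand_mean) ** 2)                       # SST
ss_m = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)   # SSM (model)
ss_r = sum(np.sum((g - g.mean()) ** 2) for g in groups)             # SSR (residual)

ms_m = ss_m / (k - 1)        # MSM
ms_r = ss_r / (N - k)        # MSR
F = ms_m / ms_r

print(f"SST = {ss_t:.2f} = SSM {ss_m:.2f} + SSR {ss_r:.2f}")
print(f"F({k - 1}, {N - k}) = {F:.2f}")

# Same F from scipy's one-way ANOVA
F_check, p = stats.f_oneway(*groups)
print(f"scipy: F = {F_check:.2f}, p = {p:.4f}")
```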
DF
• Degrees of Freedom (df) are the number of values that are free
to vary.
• In general, the df are one less than the number of values used
to calculate the SS*
• Example: how many values are free to vary in a group of 10
observations if I HAVE to have a mean of 70? [ans=9]
• A mathematical restriction that needs to be put in place when
estimating one statistic from an estimate of another [you lose
one df for each parameter estimated prior to estimating the
(residual) standard deviation]
$MS_M = \frac{SS_M}{df_M}$,  $MS_R = \frac{SS_R}{df_R}$

• Mean square – we don’t want to be influenced by the number of observations, so divide each SS by its df (extrapolating to the population)
F ratio – general behaviour

• Has 2 df (df_M and df_R): $F = \frac{MS_M}{MS_R}$
• The larger the F, the more variance the model explains relative to the error variance
• F < 1 means the model explains less variance than it leaves unexplained (never significant)
• For two groups, $F(1, df_x) = t(df_x)^2$ (shown numerically below)
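A quick numerical illustration of the F = t² relationship for two groups (Python; the data are hypothetical, any two-group dataset would do):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
a = rng.normal(0.0, 1.0, size=20)   # hypothetical group 1
b = rng.normal(0.5, 1.0, size=20)   # hypothetical group 2

t, p_t = stats.ttest_ind(a, b)      # independent t-test, df = 38
F, p_f = stats.f_oneway(a, b)       # one-way ANOVA, F(1, 38)

print(t ** 2, F)    # identical up to rounding
print(p_t, p_f)     # identical p-values
```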
Assumptions
• Errors: $\epsilon_{ij} \sim NID(0, \sigma^2_\epsilon)$
• Normal distribution within groups
• Independent distribution of observations
• Homogeneity of variance (HOV)
  • Variances for each experimental group do not differ (can be tested with Levene’s test, but bear in mind the n…)

[FYI: sometimes you will see the assumptions stated on the observations (x_ij) and sometimes on the errors (ε_ij) – since one is a linear transformation of the other, the assumptions are the same]
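A minimal sketch of the HOV check with Levene's test, assuming scipy is available (the group scores are hypothetical):

```python
from scipy import stats

# Hypothetical scores for three groups
g1 = [4, 5, 6, 5, 7]
g2 = [7, 8, 6, 7, 9]
g3 = [9, 8, 10, 9, 12]

# Levene's test of homogeneity of variance (H0: group variances are equal)
W, p = stats.levene(g1, g2, g3, center='median')
print(f"Levene W = {W:.2f}, p = {p:.3f}")
# A significant p suggests an HOV violation, but with very small or very large n
# the test can be under- or over-powered, hence the "bear in mind the n" caveat.
```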
Violation of Assumptions
When group sizes are equal – ANOVA is ‘relatively’ robust to violations of normality and HOV [see also p. 536]
• Relatively robust (in terms of type I and II error control)
• But not to the assumption of independence
  • Observations between groups HAVE to be independent, with no correlation between them allowed: otherwise the actual type I error rate can climb really fast (well above the nominal α, even past .5)!
When group sizes are unequal –
• Normality violation: F is affected in non-predictable ways
• HOV violation:
  • Too conservative if the larger group has the larger variance
  • Too liberal if the larger group has the smaller variance (reduced α control)
Alternatives and corrections
• Transform the data

• Corrected F tests [get around some of the unequal-n problem]
  • HOV violation: Welch F (slightly better for power) → can be followed by bootstrapped 95% CIs
  • Brown-Forsythe (not popular)

• Non-parametric option
  • Kruskal-Wallis: just like a Mann-Whitney but with more than 2 groups
  • Combine everyone and order (rank) them: if the sums of ranks are similar, the groups likely don’t differ
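A sketch of the Welch F in Python. It assumes the pingouin package (its welch_anova function takes long-format data); the data frame and column names here are hypothetical:

```python
import pandas as pd
import pingouin as pg   # assumption: pingouin is installed and provides welch_anova

# Hypothetical long-format data: one score per row, with a group label
df = pd.DataFrame({
    "group": ["placebo"] * 5 + ["low"] * 5 + ["high"] * 5,
    "score": [4, 5, 6, 5, 7,  7, 8, 6, 7, 9,  9, 8, 10, 9, 12],
})

# Welch's F: does not assume homogeneity of variance
print(pg.welch_anova(data=df, dv="score", between="group"))
```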
Results and follow-up
Omnibus: lets you in, but… not enough
• Multiple t-tests
  • A bad idea – we need control of type I error, otherwise: inflation!
• Post hoc tests
  • Not planned (no hypothesis)
  • Compare (all*) (pairs* of) means
• Contrasts / planned comparisons
  • Hypothesis driven
  • Planned a priori
  • Orthogonal
• Trend analysis – for ordinal IVs only
Contrasts & Orthogonal Comparisons
• Breaking down the variability between groups selectively
• ONLY K-1 comparisons (K= total number of groups in ANOVA)
• Why “contrast”? We end up with two ‘means’ – weights sum to 0 so
always F(1,X)
• Simple (pairwise) versus complex contrasts (e.g., B and A below)

Orthogonal (independent) are ‘cleanest’ but depends on purpose


How to figure if 2 contrasts are orthogonal? multiplying the weights will
result in 0 [linear transformation of weights does not change contrast]
• If there are more than 3 groups (K>3), there will be different sets of
orthogonal contrasts

• Options to select from


Placebo Lo dose Hi dose Σ
• Can create your own
• Non-orthogonal Contrast 2 -1 -1 0
A
Be sure to justify
Contrast 0 1 -1 0
B
A*B 0 -1 1 0
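A quick check of the orthogonality rule using the weights from the table above (Python/numpy; a minimal sketch):

```python
import numpy as np

# Contrast weights over (Placebo, Lo dose, Hi dose)
A = np.array([2, -1, -1])   # placebo vs. the two drug groups (complex contrast)
B = np.array([0,  1, -1])   # lo dose vs. hi dose (simple, pairwise contrast)

print(A.sum(), B.sum())     # each contrast's weights sum to 0
print(np.dot(A, B))         # elementwise products summed = 0 -> A and B are orthogonal
```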
A special kind of contrast - trend analysis
• Only makes sense when k groups (IV) vary on ordinal (or
above) scale
• e.g., study duration, stimulus duration or retention time
• e.g., class size
• e.g., drug dosage

• Number of levels
• Linear trend only with 2 and more
• Quadratic trend only with 3 and more
• Random vs. fixed effects (round 1)
• Different calculations
• Strong implications in some domains
(e.g., imaging)
Post-hoc Tests
• The Big Issue: too little or too much type I error control
• Too liberal vs. not enough power/ too conservative
• Per contrast vs. set of contrasts (family of contrasts)
• Many post hoc tests (e.g., Jamovi has 5, SPSS has 18 options!) – still lots of
disparity of opinions
• Some are very Liberal (LSD, N-K)
• Some intermediate (Bonferroni, Tukey, Holm)
• Some Conservative (Scheffe- all possible contrasts, including complex ones)
• Current (QRM) recommendations:
Assumptions (reasonably) met with equal n’s:
• Tukey HSD (Honest Significant Difference)
Safe Option with small k:
• Bonferroni, Holm
Unequal Variances:
• Games-Howell
Unequal Sample Sizes:
• Gabriel’s (small n), Hochberg’s GT2 (large n).
Post-hoc Tests

$\alpha_{Bonferroni} = \frac{\alpha}{\text{number of tests}}$

• Bonferroni
  • Select some pairwise comparisons (let’s say n of them)
  • New αn = 0.05 / total number of tests made, e.g., 0.05 / 3 = .0167
  • Essentially t-tests, each evaluated at the new level αn
• Tukey
  • All pairwise comparisons
  • Tends to be conservative with unequal n’s
• Scheffe
  • All possible simple and complex comparisons
  • Most conservative, reduced power
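A sketch of two of these options in Python, assuming statsmodels is available; the scores and group labels are hypothetical:

```python
import numpy as np
from statsmodels.stats.multicomp import pairwise_tukeyhsd

# Hypothetical scores and group labels (equal n per group)
scores = np.array([4, 5, 6, 5, 7,  7, 8, 6, 7, 9,  9, 8, 10, 9, 12], dtype=float)
groups = np.repeat(["placebo", "low", "high"], 5)

# Tukey HSD: all pairwise comparisons with familywise alpha = .05
print(pairwise_tukeyhsd(endog=scores, groups=groups, alpha=0.05))

# Bonferroni by hand: each of the 3 pairwise t-tests is judged at .05 / 3
alpha_per_test = 0.05 / 3   # ≈ .0167
```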
Effect sizes in ANOVAs
• Eta-squared (biased: uses sums of squares and is a function of what else we have in the model)
  • Proportion of variance attributable to the effect
  • Sample specific, overestimates the effect size

  $\eta^2 = r^2 = \frac{SS_M}{SS_T} = \frac{SS_{effect}}{SS_{total}}$

• Partial eta-squared
  • How much of the variance in scores is accounted for by the effect
  • Proportion of variance attributable to the effect (+ error)

  $\eta_p^2 = \frac{SS_{effect}}{SS_{effect} + SS_{error}}$

• Omega squared
  • Variance accounted for by the effect in the population

  $\omega^2 = \frac{SS_M - df_M \cdot MS_R}{SS_T + MS_R}$

• Intraclass correlation [not commonly used in traditional ANOVA designs]
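A small worked example of these effect-size formulas (the SS and df values are hypothetical):

```python
# Effect sizes from the ANOVA sums of squares (hypothetical values)
ss_m, ss_r, df_m, df_r = 48.0, 36.0, 2, 12    # SS_model, SS_residual and their df
ss_t = ss_m + ss_r                            # one-way design: SS_total
ms_r = ss_r / df_r                            # MS_residual

eta_sq   = ss_m / ss_t                               # eta-squared
omega_sq = (ss_m - df_m * ms_r) / (ss_t + ms_r)      # omega-squared (population estimate)
print(f"eta^2 = {eta_sq:.3f}, omega^2 = {omega_sq:.3f}")
# omega^2 < eta^2, reflecting the correction for sample-specific overestimation
```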
Non-parametric option to ANOVA
Kruskal–Wallis test
Common reporting:
“Judgements of the aesthetic quality of wine were significantly affected by quality expectations, H(2) = 9.66, p = .022.”

3 groups (value and rank for each observation):

          Group A          Group B          Group C
          value   rank     value   rank     value   rank
           6.4     11       2.5     2        1.3     1
           6.8     12       3.7     3        4.1     4
           7.2     13       4.9     5.5      4.9     5.5
           8.4     18       5.4     8        5.5     9
           9.1     19       8.1     14
           9.7     21
            …       …        …       …        …       …
Sum ranks          131              58               42
Avg ranks          13.1             6.44             5.25

• Follow-up
  • Post hocs
  • Direct comparisons between groups (prevent α inflation!)
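A sketch of the rank-then-test logic in Python using scipy. The values are only the rows visible in the table above, so the result will not reproduce the reported H(2) = 9.66 (the full dataset is not shown here):

```python
import numpy as np
from scipy import stats

# Hypothetical wine ratings from three expectation groups (partial data)
a = [6.4, 6.8, 7.2, 8.4, 9.1, 9.7]
b = [2.5, 3.7, 4.9, 5.4, 8.1]
c = [1.3, 4.1, 4.9, 5.5]

# Rank everyone together (average ranks for ties), then compare the rank sums
ranks = stats.rankdata(np.concatenate([a, b, c]))
print(ranks[:len(a)].sum(), ranks[len(a):len(a) + len(b)].sum(), ranks[-len(c):].sum())

# Kruskal-Wallis H test
H, p = stats.kruskal(a, b, c)
print(f"H(2) = {H:.2f}, p = {p:.3f}")
```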
Repeated Measures (within subjects)
• Same participants contribute to different means
• Extension of the dependent t-test, where there are more than 2 conditions/means to compare
• Minimizes/shrinks the error variance
• Advantages and disadvantages
  • Useful when controlling for individual differences, more sensitive
  • More economical/efficient (time and ££££)
  • Carryover effects (practice, fatigue, etc.) → experimental design essential

• Hypotheses – same as for the between-subjects ANOVA
Partitioning of variance
Variation between individuals ($SS_{subjects}$) is partitioned out of the error term:

$SS_T = SS_M + SS_R$, where $SS_R$ splits into $SS_{subjects}$ and $SS_{error}$
$df_T = df_M + df_W$

$SS_{error(residual)} = SS_{Total} - SS_M - SS_{Subjects}$

$F = \frac{MS_M}{MS_{Res}}$

Shrinkage achieved!
Partitioning of variance
Variation among individuals: we’ve got multiple measures for each person.

$SS_M = \sum n_i(\bar{x}_i - \bar{x}_{grand})^2$,  $df_M = k - 1$
$SS_R = \sum (x_{ij} - \bar{x}_i)^2$,  $df_R = N - k$
$SS_{subjects} = \sum (\bar{x}_{subject} - \bar{x}_{grand})^2$,  $df_{subjects} = n - 1$
$SS_W = \sum (x_{ij} - \bar{x}_{subject})^2$ (variation of each score around that person’s own mean)
$SS_{error/residual} = SS_{Total} - SS_M - SS_{Subjects}$,  $df_{error} = (k - 1)(n - 1)$
$SS_T = \sum (x_{ij} - \bar{x}_{grand})^2$,  $df_T = N - 1$

$F = \frac{MS_M}{MS_{Res}}$
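A minimal sketch of this repeated-measures partition in Python (hypothetical scores, 5 participants × 3 conditions):

```python
import numpy as np

# Hypothetical scores: rows = participants, columns = conditions
scores = np.array([[ 8., 7., 6.],
                   [ 9., 8., 6.],
                   [ 7., 6., 4.],
                   [ 8., 8., 5.],
                   [10., 8., 7.]])
n, k = scores.shape
grand = scores.mean()

ss_total    = np.sum((scores - grand) ** 2)
ss_model    = n * np.sum((scores.mean(axis=0) - grand) ** 2)   # conditions (SS_M)
ss_subjects = k * np.sum((scores.mean(axis=1) - grand) ** 2)   # individual differences
ss_error    = ss_total - ss_model - ss_subjects                # shrunken error term

df_model, df_error = k - 1, (k - 1) * (n - 1)
F = (ss_model / df_model) / (ss_error / df_error)
print(f"F({df_model}, {df_error}) = {F:.2f}")
```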
Assumptions
• Normality
• Independence of observations (except the repeated measurements from the same participant)
• HOV doesn’t make sense here
• Sphericity (a less restrictive form of compound symmetry)
  • Need at least 3 means for sphericity to be an issue
  • The differences between each pair of means (treatment levels) need to fulfil HOV
• To assess sphericity – Mauchly’s test (H0: the variances of the differences between conditions are equal)
  • If Mauchly’s is n.s. (and depending on n) – all good
  • But if Mauchly’s p < .05…
What if sphericity is violated?
Or we don’t trust Mauchly’s test:
Need to “pay a price” – correct the df
• Greenhouse-Geisser estimate
• Huynh-Feldt estimate

• Non-parametric alternative (Friedman’s test)
Non-parametric alternative to repeated measures ANOVA
e.g., if your data is ordinal:
Friedman’s ANOVA
• “The preference of participants did not significantly differ between the three chocolate brands, χ2(2) = 0.20, p = .91.”

       Original Measure        Ranked Measure
sub     A      B      C         A      B      C
 1      5      2      2         3     1.5    1.5
 2     4.5     4      5         2      1      3
 3      2      1      1         3     1.5    1.5
 …
10      2      5      1         2      3      1
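A sketch of Friedman's test in Python using scipy; the ratings are only the four participants visible in the table above, so the result will not reproduce the reported χ2(2) = 0.20 exactly:

```python
from scipy import stats

# Hypothetical preference ratings: one list per brand, one entry per participant
A = [5.0, 4.5, 2.0, 2.0]
B = [2.0, 4.0, 1.0, 5.0]
C = [2.0, 5.0, 1.0, 1.0]

# Friedman's test ranks the three brands within each participant
chi2, p = stats.friedmanchisquare(A, B, C)
print(f"chi2(2) = {chi2:.2f}, p = {p:.3f}")
```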
