0% found this document useful (0 votes)

175 views11 pages

Chapter 8 Variation Partitioning - Workshop 10 - Advanced Multivariate Analyses in R

Multivariate analysis_variation partitioning

Uploaded by

RAMZI Azeddine

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

175 views11 pages

Chapter 8 Variation Partitioning - Workshop 10 - Advanced Multivariate Analyses in R

Multivariate analysis_variation partitioning

Uploaded by

RAMZI Azeddine

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Chapter 8 Variation partitioning

Variation partitioning is a type of analysis that combines RDA and partial RDA to divide the
variation of a response variable among two, three or four explanatory data sets. For example,
you might want to partition the variation in a community matrix among a set of abiotic
environmental variables, and a set of biotic variables. You could also partition this community
variation among small-scale or large-scale variables, to test the effect of spatial scale on your
community.

Figure 8.1: The basic structure of variation partitioning.

The results of variation partitioning analyses are traditionally represented by a Venn diagram,
in which the percentage of explained variance by each explanatory data set is reported. In a
case where we are partitioning the variation among two explanatory matrices, the result could
be represented as follows:
Figure 8.2: Representing variation partitioning results.
Here,

Fraction [a + b + c] is the explained variance by X1 and* X2 together, calculated using

a RDA of Y by X1 + X2 .
Fraction [d] is the unexplained variance by X1 and X2 together, obtained from the same
RDA as above.
Fraction [a] is the explained variance by X1 only, calculated using a partial RDA of Y by
X1|X2 (controlling for X2 ).
Fraction [c] is the explained variance by X2 only, calculated using a partial RDA of Y by
X2|X1 (controlling for X1 ).
Fraction [b] is calculated by subtraction, i.e. b = [a + b] + [b + c] − [a + b + c]. Because
[b] is not the result of an RDA, it cannot be tested for significance. It can also be negative,
which indicates that the response matrix is better explained by the combination of X1 and
X2 than by either matrix on its own.

8.1 Variation partitioning in R

To demonstrate how variation partitioning works in R , we will partition the variation of fish
species composition between chemical and topographic variables. The varpart() function
from vegan makes this easy for us.

# Partition the variation in fish community composition

spe.part.all <- varpart(spe.hel, env.chem, env.topo)

spe.part.all$part # access results!

## No. of explanatory tables: 2
## Total variation (SS): 14.07
## Variance: 0.50251
## No. of observations: 29
##

## Partition table:
## Df R.squared Adj.R.squared Testable
## [a+c] = X1 7 0.60579 0.47439 TRUE
## [b+c] = X2 3 0.41526 0.34509 TRUE
## [a+b+c] = X1+X2 10 0.73414 0.58644 TRUE
## Individual fractions

## [a] = X1|X2 7 0.24135 TRUE

## [b] = X2|X1 3 0.11205 TRUE
## [c] 0 0.23304 FALSE
## [d] = Residuals 0.41356 FALSE
## ---

## Use function 'rda' to test significance of fractions of interest

You can then visualise the results with the plot() function.

# plot the variation partitioning Venn diagram

plot(spe.part.all,

Xnames = c("Chem", "Topo"), # name the partitions

bg = c("seagreen3", "mediumpurple"), alpha = 80, # colour the circles
digits = 2, # only show 2 digits
cex = 1.5)
The chemical variables explain 24.1% of the variation in fish species composition, the
topography variables explain 11.2% of the variation in fish species composition, and these two
variable groups jointly explain 23.3% of the variation in fish species composition.

Be careful when reporting results of variation partitioning! The shared fraction

[b] does not represent an interaction effect of the two explanatory matrices.
Think of it as an overlap between X1 and X2 . It represents the shared
fraction of variation explained when the two are included in the model,
meaning it is the portion of variation that cannot be attributed to X1 or X2

separately. In other words, the variation partitioning cannot disentangle the

effects of chemistry and topography on 23.3% of the variation in the fish
community composition.
8.2 Significance testing

The output from the varpart() function reports the adjusted R

2
for each fraction, but you will
notice that the table does not include any test of statistical significance. However, the
Testable column identifies the fractions that can be tested for significance using the function
anova.cca() , just like we did with the RDA!

X1 [a+b]: Chemistry without controlling for topography

# [a+b] Chemistry without controlling for topography

anova.cca(rda(spe.hel, env.chem))

## Permutation test for rda under reduced model

## Permutation: free
## Number of permutations: 999
##
## Model: rda(X = spe.hel, Y = env.chem)
## Df Variance F Pr(>F)

## Model 7 0.30442 4.6102 0.001 ***

## Residual 21 0.19809
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

X2 [b+c] Topography without controlling for chemistry

# [b+c] Topography without controlling for chemistry

anova.cca(rda(spe.hel, env.topo))
## Permutation test for rda under reduced model
## Permutation: free
## Number of permutations: 999
##
## Model: rda(X = spe.hel, Y = env.topo)

## Df Variance F Pr(>F)
## Model 3 0.20867 5.918 0.001 ***
## Residual 25 0.29384
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

X1 | X2 [a] Chemistry alone

# [a] Chemistry alone

anova.cca(rda(spe.hel, env.chem, env.topo))

## Permutation test for rda under reduced model

## Permutation: free
## Number of permutations: 999

##
## Model: rda(X = spe.hel, Y = env.chem, Z = env.topo)
## Df Variance F Pr(>F)
## Model 7 0.16024 3.0842 0.001 ***
## Residual 18 0.13360
## ---

## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Recognize this? It’s a partial RDA!

X2 | X1 [c] Topography alone

# [c] Topography alone

anova.cca(rda(spe.hel, env.topo, env.chem))
## Permutation test for rda under reduced model
## Permutation: free
## Number of permutations: 999
##
## Model: rda(X = spe.hel, Y = env.topo, Z = env.chem)

## Df Variance F Pr(>F)
## Model 3 0.064495 2.8965 0.001 ***
## Residual 18 0.133599
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

All of the testable fractions in the variation partitioning are statistically significant!

8.3 Challenge 3

Partition the variation in the mite species data according to substrate variables ( SubsDens ,
WatrCont ) and significant spatial variables.

What proportion of the variation is explained by substrate variables? By space?

Which individual fractions are significant?
Plot your results!

Load the spatial variables:

data("mite.pcnm")

Recall some useful functions:

ordiR2step()
varpart()

anova.cca(rda())
plot()
8.3.1 Challenge 3: Solution

Step 1: Forward selection of significant spatial variables.

There are a lot of spatial variables in this dataset (22!). We should select the most important
ones, to avoid overloading the model.

# Step 1: Forward selection!

# Write full RDA model with all variables

full.spat <- rda(mite.spe.hel ~ ., data = mite.pcnm)

# Forward selection of spatial variables

spat.sel <- ordiR2step(rda(mite.spe.hel ~ 1, data = mite.pcnm),
scope = formula(full.spat), R2scope = RsquareAdj(full.spat)$adj.r.squared,

direction = "forward", trace = FALSE)

spat.sel$call

## rda(formula = mite.spe.hel ~ V2 + V3 + V8 + V1 + V6 + V4 + V9 +
## V16 + V7 + V20, data = mite.pcnm)

Step 2: Group variables of interest.

# Step 2: Group variables of interest.

# Subset environmental data to retain only substrate

# variables
mite.subs <- subset(mite.env, select = c(SubsDens, WatrCont))

# Subset to keep only selected spatial variables

mite.spat <- subset(mite.pcnm, select = names(spat.sel$terminfo$ordered))
# a faster way to access the selected variables!

Step 3: Partition the variation in species abundances.

# Step 3: Partition the variation in species abundances.
mite.part <- varpart(mite.spe.hel, mite.subs, mite.spat)
mite.part$part$indfract # access results!

## Df R.squared Adj.R.squared Testable

## [a] = X1|X2 2 NA 0.05901929 TRUE
## [b] = X2|X1 10 NA 0.19415929 TRUE

## [c] 0 NA 0.24765221 FALSE

## [d] = Residuals NA NA 0.49916921 FALSE

What proportion of the variation is explained by substrate variables? 5.9%

What proportion of the variation is explained by spatial variables? 19.4%

Step 4: Which individual fractions are significant?

[a]: Substrate only

# Step 4: Significance testing [a]: Substrate only

anova.cca(rda(mite.spe.hel, mite.subs, mite.spat))

...
## Model: rda(X = mite.spe.hel, Y = mite.subs, Z = mite.spat)

## Df Variance F Pr(>F)
## Model 2 0.025602 4.4879 0.001 ***
## Residual 57 0.162583
...

[c]: Space only

# [c]: Space only

anova.cca(rda(mite.spe.hel, mite.spat, mite.subs))
...
## Model: rda(X = mite.spe.hel, Y = mite.spat, Z = mite.subs)
## Df Variance F Pr(>F)
## Model 10 0.10286 3.6061 0.001 ***
## Residual 57 0.16258

...

Step 5: Plot the variation partitioning results.

# Step 5: Plot
plot(mite.part,
digits = 2,

Xnames = c("Subs", "Space"), # label the fractions

cex = 1.5,
bg = c("seagreen3", "mediumpurple"), # add colour!
alpha = 80) # adjust transparency
So, what can we say about the effects of substrate and space on mite species
abundances?

Hint: Why is the model showing such an important effect of space?

Space explains a lot of the variation in species abundances here: 19.4% (p = 0.001) of the
variation is explained by space alone, and 24.8% is jointly explained by space and substrate.
Substrate only explains ~6% (p = 0.001) of the variation in community composition across
sites on its own! Also note that half of the variation is not explained by the variables we
included in the model (look at the residuals!), so the model could be improved.

This large effect of space could be a sign that some spatial ecological process is
important here (like dispersal, for example). However, it could also be telling us that we
are missing an important environmental variable in our model, which itself varies in space!

All the content of the workshop series is under a Creative Commons Attribution-NonCommercial-
ShareAlike 4.0 International License.

Solution Multivariate
No ratings yet
Solution Multivariate
12 pages
SAT Sunday
No ratings yet
SAT Sunday
6 pages
Chapter 7 Partial Redundancy Analysis - Workshop 10 - Advanced Multivariate Analyses in R
No ratings yet
Chapter 7 Partial Redundancy Analysis - Workshop 10 - Advanced Multivariate Analyses in R
8 pages
RENR 690 - Geostatistics Lab
No ratings yet
RENR 690 - Geostatistics Lab
6 pages
Box, Hunter and Hunter: Statistics Package
No ratings yet
Box, Hunter and Hunter: Statistics Package
21 pages
7exploatory Data Analysis
No ratings yet
7exploatory Data Analysis
33 pages
R Code For Linear Regression Analysis 1 Way ANOVA
No ratings yet
R Code For Linear Regression Analysis 1 Way ANOVA
8 pages
Questions With No Solutions
No ratings yet
Questions With No Solutions
20 pages
Research Methodology and Biostatistics Unit II Part 3, F-Test ANOVA - One Way and Two Way Classification
No ratings yet
Research Methodology and Biostatistics Unit II Part 3, F-Test ANOVA - One Way and Two Way Classification
18 pages
HW 4 Andalib
No ratings yet
HW 4 Andalib
23 pages
Example Report
No ratings yet
Example Report
22 pages
Exploratory Data Analysis: 2.1 Objectives
No ratings yet
Exploratory Data Analysis: 2.1 Objectives
23 pages
R
No ratings yet
R
6 pages
Rcourse Partviz
No ratings yet
Rcourse Partviz
9 pages
Solutions - Lab 4 - Assumptions & Multiple Comparisons: Learning Outcomes
No ratings yet
Solutions - Lab 4 - Assumptions & Multiple Comparisons: Learning Outcomes
23 pages
4-Lecture 04
No ratings yet
4-Lecture 04
34 pages
Geor1 INTRODUCCION
No ratings yet
Geor1 INTRODUCCION
36 pages
07 - Diversity - Stats in R
No ratings yet
07 - Diversity - Stats in R
25 pages
Module - 5
No ratings yet
Module - 5
9 pages
P7.01 Anova Gia Ara 2016 2017
No ratings yet
P7.01 Anova Gia Ara 2016 2017
15 pages
Lab Geo Statistics
100% (1)
Lab Geo Statistics
8 pages
Codes Workshop
No ratings yet
Codes Workshop
13 pages
Anova
No ratings yet
Anova
7 pages
ANOVA in R
No ratings yet
ANOVA in R
7 pages
Anova
No ratings yet
Anova
8 pages
Anova: Anuj Rana (Telecom & IT) Aditi Arora (Infrastructure
No ratings yet
Anova: Anuj Rana (Telecom & IT) Aditi Arora (Infrastructure
11 pages
Legendre Scale Talk PDF
No ratings yet
Legendre Scale Talk PDF
71 pages
Solution 1
No ratings yet
Solution 1
14 pages
Industrial Statistics - A Computer Based Approach With Python
No ratings yet
Industrial Statistics - A Computer Based Approach With Python
140 pages
Regvari
No ratings yet
Regvari
35 pages
Spatial Statistics in R
No ratings yet
Spatial Statistics in R
29 pages
Lesson 6 - Graphs of Function and Empirical Graphs
No ratings yet
Lesson 6 - Graphs of Function and Empirical Graphs
20 pages
Report Stats PDF
No ratings yet
Report Stats PDF
23 pages
Solutions Modernstatistics
No ratings yet
Solutions Modernstatistics
144 pages
Document (26) - Copy 2
No ratings yet
Document (26) - Copy 2
17 pages
Single Variable Visualization 2: Keunwoo Kim
No ratings yet
Single Variable Visualization 2: Keunwoo Kim
20 pages
Two Variable Statistics
No ratings yet
Two Variable Statistics
12 pages
Statistical Analysis of Micro Biomed at A With R
No ratings yet
Statistical Analysis of Micro Biomed at A With R
43 pages
Package MFD': February 26, 2024
No ratings yet
Package MFD': February 26, 2024
93 pages
MA304 - Lecture 4
No ratings yet
MA304 - Lecture 4
60 pages
Analysis of Variance Table
No ratings yet
Analysis of Variance Table
19 pages
Essay Applied Math
No ratings yet
Essay Applied Math
58 pages
Week 8
No ratings yet
Week 8
13 pages
Comprehensive Guide To Data Visualization - R
No ratings yet
Comprehensive Guide To Data Visualization - R
1 page
Anova
No ratings yet
Anova
13 pages
Journal of Chemometrics - 2018 - Brereton - Introduction To Analysis of Variance
No ratings yet
Journal of Chemometrics - 2018 - Brereton - Introduction To Analysis of Variance
4 pages
Unit 3 DS
No ratings yet
Unit 3 DS
30 pages
BALAJI - Module - 6 7 AOV, CHI
No ratings yet
BALAJI - Module - 6 7 AOV, CHI
8 pages
04.11. Assignment 1 Write-Up
No ratings yet
04.11. Assignment 1 Write-Up
15 pages
Chapter 12 - One-Way Analysis of Variance - One-Way ANOVA
No ratings yet
Chapter 12 - One-Way Analysis of Variance - One-Way ANOVA
34 pages
MBC W1-2 Notes
No ratings yet
MBC W1-2 Notes
21 pages
T L.L. T P L U N C: Forrest W. Young & Carla M. Bann
No ratings yet
T L.L. T P L U N C: Forrest W. Young & Carla M. Bann
12 pages
Spatial Statistics in R
No ratings yet
Spatial Statistics in R
29 pages
Topic - Chapter 11 - ANOVA
No ratings yet
Topic - Chapter 11 - ANOVA
1 page
Data Mining - R Assignment: Konstantinos Stavrou (70134) 11/11/2012
No ratings yet
Data Mining - R Assignment: Konstantinos Stavrou (70134) 11/11/2012
13 pages
Ds 1
No ratings yet
Ds 1
22 pages
MSE Game For EPO Bigeye Tuna
No ratings yet
MSE Game For EPO Bigeye Tuna
4 pages
Creating Filters - Quarto
No ratings yet
Creating Filters - Quarto
3 pages
Quarto Word Templates Apply Different Style To Table Section - General - Posit Community
No ratings yet
Quarto Word Templates Apply Different Style To Table Section - General - Posit Community
2 pages
Basiloneetal.201713386 37426 1 PB
No ratings yet
Basiloneetal.201713386 37426 1 PB
14 pages
Data Science With Python Workflow
No ratings yet
Data Science With Python Workflow
1 page
CAPAM Good Practices Pre-Announcement
No ratings yet
CAPAM Good Practices Pre-Announcement
1 page
Data Manipulation With Dplyr
100% (1)
Data Manipulation With Dplyr
39 pages
Create Elegant Data Visualisations Using The Grammar of Graphics - Ggplot2
No ratings yet
Create Elegant Data Visualisations Using The Grammar of Graphics - Ggplot2
5 pages
Portfolio Output No. 17: Reflections On Personal Relationships
100% (1)
Portfolio Output No. 17: Reflections On Personal Relationships
2 pages
Management Representative
No ratings yet
Management Representative
3 pages
Curriculum 2018-00016 1
No ratings yet
Curriculum 2018-00016 1
2 pages
Catalogue 2018 PDF
No ratings yet
Catalogue 2018 PDF
124 pages
Implemention of Mailbox Function Using LPC2148
No ratings yet
Implemention of Mailbox Function Using LPC2148
5 pages
Checkpoint Monitor
No ratings yet
Checkpoint Monitor
30 pages
2024 2025 Final Examinations Timetable Numbers 11.11.2024
No ratings yet
2024 2025 Final Examinations Timetable Numbers 11.11.2024
14 pages
Revision of Ct and Sample Mid Course Test 30-9-2018 Đã Chuyển Đổi
No ratings yet
Revision of Ct and Sample Mid Course Test 30-9-2018 Đã Chuyển Đổi
14 pages
Administration Robbins, Coulter (Tenth Ed.) - Chapter 11
No ratings yet
Administration Robbins, Coulter (Tenth Ed.) - Chapter 11
32 pages
Calcium
No ratings yet
Calcium
3 pages
Book of Abstracts
No ratings yet
Book of Abstracts
493 pages
Combined Convection and Radiation
No ratings yet
Combined Convection and Radiation
9 pages
Shear Force & Bending Moment (Level 0)
No ratings yet
Shear Force & Bending Moment (Level 0)
7 pages
Idea Statica PL Steel 2018 A4 en
No ratings yet
Idea Statica PL Steel 2018 A4 en
16 pages
Online Genealogy Research Resources
80% (5)
Online Genealogy Research Resources
77 pages
Textile Mill Docx Assignment 2
No ratings yet
Textile Mill Docx Assignment 2
8 pages
Schedule of Rates
100% (3)
Schedule of Rates
90 pages
La Tech: Physics 201: Practice Exam 1
No ratings yet
La Tech: Physics 201: Practice Exam 1
8 pages
Domestication Vs Foriengization
No ratings yet
Domestication Vs Foriengization
6 pages
AER 101 Introduction To Aeronautics
No ratings yet
AER 101 Introduction To Aeronautics
74 pages
Geo 3 Rock Slope Stability
No ratings yet
Geo 3 Rock Slope Stability
30 pages
Django PDF
No ratings yet
Django PDF
7 pages
Nanoparticles
100% (1)
Nanoparticles
31 pages
70 Awesome Coaching Questions Using The GROW Model
100% (1)
70 Awesome Coaching Questions Using The GROW Model
3 pages
Design Thinking PPT Mba 1st Year 1698663212
100% (1)
Design Thinking PPT Mba 1st Year 1698663212
61 pages
History Curriculum - 11 & 12
No ratings yet
History Curriculum - 11 & 12
265 pages
Installation Manual
No ratings yet
Installation Manual
51 pages
Higher Lessons in English A Work On English Grammar and Composition by Kellogg, Brainerd
100% (2)
Higher Lessons in English A Work On English Grammar and Composition by Kellogg, Brainerd
293 pages

Chapter 8 Variation Partitioning - Workshop 10 - Advanced Multivariate Analyses in R

Uploaded by

Chapter 8 Variation Partitioning - Workshop 10 - Advanced Multivariate Analyses in R

Uploaded by

Chapter 8 Variation partitioning

Figure 8.1: The basic structure of variation partitioning.

Fraction [a + b + c] is the explained variance by X1 and* X2 together, calculated using

8.1 Variation partitioning in R

# Partition the variation in fish community composition

spe.part.all$part # access results!

## [a] = X1|X2 7 0.24135 TRUE

## Use function 'rda' to test significance of fractions of interest

# plot the variation partitioning Venn diagram

Xnames = c("Chem", "Topo"), # name the partitions

Be careful when reporting results of variation partitioning! The shared fraction

separately. In other words, the variation partitioning cannot disentangle the

The output from the varpart() function reports the adjusted R

X1 [a+b]: Chemistry without controlling for topography

# [a+b] Chemistry without controlling for topography

## Permutation test for rda under reduced model

## Model 7 0.30442 4.6102 0.001 ***

X2 [b+c] Topography without controlling for chemistry

# [b+c] Topography without controlling for chemistry

X1 | X2 [a] Chemistry alone

# [a] Chemistry alone

## Permutation test for rda under reduced model

Recognize this? It’s a partial RDA!

X2 | X1 [c] Topography alone

# [c] Topography alone

What proportion of the variation is explained by substrate variables? By space?

Load the spatial variables:

Recall some useful functions:

Step 1: Forward selection of significant spatial variables.

# Step 1: Forward selection!

# Write full RDA model with all variables

# Forward selection of spatial variables

direction = "forward", trace = FALSE)

Step 2: Group variables of interest.

# Step 2: Group variables of interest.

# Subset environmental data to retain only substrate

# Subset to keep only selected spatial variables

Step 3: Partition the variation in species abundances.

## Df R.squared Adj.R.squared Testable

## [c] 0 NA 0.24765221 FALSE

What proportion of the variation is explained by substrate variables? 5.9%

Step 4: Which individual fractions are significant?

[a]: Substrate only

# Step 4: Significance testing [a]: Substrate only

[c]: Space only

# [c]: Space only

Step 5: Plot the variation partitioning results.

Xnames = c("Subs", "Space"), # label the fractions

Hint: Why is the model showing such an important effect of space?

You might also like