Keywords
archiDART, plant root systems, topology, persistent homology, Fitter indices, Data Analysis of Root Tracings (DART), Root System Markup Language (RSML)
This article is included in the RPackage gateway.
This article is included in the Agriculture, Food and Nutrition gateway.
This article is included in the Plant Science gateway.
This article is included in the Interactive Figures collection.
archiDART, plant root systems, topology, persistent homology, Fitter indices, Data Analysis of Root Tracings (DART), Root System Markup Language (RSML)
The quantification of plant root systems is central in many research areas, ranging from developmental studies1 to crop phenotyping2. Root systems are often characterised using geometric descriptors, such as the total root length3, total root surface4, or number of root tips5. However, such descriptors often fail to describe the full complexity of root systems. Additional descriptors able to describe the topology of root systems are often missing, despite the fact that they can help understanding the feedback between plant morphology and root system functions6,7.
The topology of a root system is an important component of its architecture and refers to how individual roots are connected to each other through branching8. Studying the topology of branching structures as complex as root systems is challenging and requires quantitative methods allowing the description and comparison of plant morphologies7. In the 1980’s, A.H. Fitter introduced a method to describe branching structures and classify root systems into topologically distinct networks9. His method relies on the calculation of three indices to describe the topology of a root system, namely the magnitude, altitude, and external path length. A detailed description of this method can be found in 9,10. More recently, the use of persistent homology to quantify plant morphologies was introduced in the plant sciences community. This method was successfully used to quantify leaf shapes, leaf serrations, and root system architectures11. Persistent homology is a mathematical framework allowing the quantification of plant morphologies at different scales (from organs to organisms). Because plant roots can be represented as a succession of nodes connected by straight lines in a tree graph, they are referred to as zero-order homology groups (H0, path-connected component) in mathematics. The goal of a persistent homology analysis applied to a root system is to study how H0 features persist across the scales of a continuous mathematical function. A common mathematical function used to capture the topology of branching structures, such as plant shoots and root systems, is the geodesic distance (i.e., the distance measured along the roots between the root system base and any point of the root system). A nice explanation of how persistent homology can be applied to capture plant topologies is provided in 12. The main output of a persistent homology analysis is a persistence barcode recording the birth (apparition of a new connected component) and death (fusion of two connected components) of each H0 branch when a distance function traverses the branching structure (Figure 1). The degree of similarity between different root system topologies can be assessed by computing a pairwise distance matrix using a bottleneck distance method to compare persistence barcodes. Multivariate statistical tools, such as multidimensional scaling, can then be used to visualize topological differences between root systems12.
In the past decade, many tools were developed to analyse root systems from digital images (for an extensive list, see the plant-image-analysis.org website13) or model root system architectures14,15. Several of these tools are able to extract the full root system architecture from the images, including the topology16–19. A common format for the storage of root architecture data, the Root System Markup Language (RSML)20, was also created to facilitate the exchange of information between researchers. Building on this new format, several tools were created to analyse root architecture data20,21. Among these tools, the R package archiDART offers a wide range of functionalities to analyse root system architectures in a free, open-source, and popular data analysis environment21.
In this paper, we present a new version (v.3.0) of the R package archiDART. In comparison with the version described earlier21, this version now includes several topological analysis methods, including, but not limited to, persistent homology. Our main objective is to demonstrate how the functions of the archiDART package can be used to analyse and compare the topology of plant root systems using persistent homology. In addition, we also aim to show that the topological analysis of plant root systems is highly complementary to the more classical approach that uses a set of geometric descriptors to compare root systems.
archiDART is an R package developed for the automated analysis of plant root system architectures using Data Analysis of Root Tracings (DART)17 and Root System Markup Language files (RSML)20. The version 3.0 of archiDART can be downloaded from the CRAN repository. An overview of the functions available in the package is presented in Table 1. Among the 10 functions developed for the package, 5 were already presented elsewhere21 and will not be further discussed in this paper.
In comparison with the version presented earlier, the version 3.0 of the package supports the analysis of 3D root systems. In addition, time series data in RSML files can be analysed if the root system age is stored as a continuous function along the root segments. Finally, we developed a set of 5 new functions and updated the architect function to allow the topological analysis of plant root systems. The architect function is now able to calculate the topological indices introduced by Fitter10, and the 5 new functions presented in this paper are devoted to the topological analysis of root systems using persistent homology12.
All functions of archiDART were coded using the R programming language. The package is compatible with Windows, Mac OS X, and major Linux operating systems. A detailed documentation file listing the package dependencies and describing all the functions listed in Table 1 can be downloaded from the CRAN package area. The bottleneckdist function of archiDART relies on the bottleneck function of the TDA package22 to compute the bottleneck distance between two persistence diagrams.
The root system library used in this paper has already been presented elsewhere23. Briefly, this library contains a total of 10,464 simulated root systems created using the root architecture model ArchiSimple14. The result of each simulation was stored as an RSML file. The library consists of two categories of root systems: tap-rooted (5212) and fibrous (5252). For the use cases presented in this paper, 50 tap-rooted and 50 fibrous root systems were selected from the RSML library. All root systems used in this paper had a total root length comprised between 17 and 23 m (20 m ± 15%). Summary statistics describing the root system library used in this study are presented in Table S1.
In order to demonstrate and illustrate the capabilities of archiDART, we developed a web application (archiShiny) using the Shiny library24. This application is freely available here: https://plantmodelling.shinyapps.io/archidart. We developed archiShiny with the following aims in mind: (1) demonstrating how multivariate statistical tools (such as principal component analysis) can be used on the aggregated metrics computed by the architect function to differentiate root systems; (2) showing how root systems can be plotted using the advanced graphical functions of the ggplot2 library25; and (3) comparing the topology of root systems using persistent homology. The web application uses a library of 70 RSML files created using the root architecture model ArchiSimple14. Based on the initial values of the parameters of the model, the root systems were classified into seven genotypes (mock, dense, sparse, steep, shallow, slow, and fast). Each genotype was represented by 10 simulations. The different genotypes were based on a standard parameter set (mock) and had one parameter changed: growth rate (slow vs. fast), inter-lateral distance (dense vs sparse) or gravitropism (steep vs shallow).
After package installation, the topological analysis of plant root systems (RSML files) using persistent homology comprises four main steps: (1) creating an rsmlToTable object, (2) computing persistence barcodes, (3) computing a pairwise bottleneck distance matrix, and (4) visualizing topological differences between root systems using non-metric multidimensional scaling (NMDS). The main steps of the analysis performed in this section of the paper are summarized in Figure 2. Although we only present the analysis pipeline developed for RSML files, root systems vectorized with DART can be analysed using exactly the same approach (see Figure 2).
The first step of the analysis is to import the RSML files into R with the rsmlToTable function. If root systems were vectorized with DART, the dartToTable function should be used instead. The rsmlToTable function creates a data frame (table) containing at least 23 columns (spatial coordinates, length, diameter, surface, volume, growth rate, orientation, geodesic distance, etc.) and as many lines as root segments. Here, a root segment is defined as the straight line between two nodes in the data file. The table is an rsmlToTable object that can directly be used as an input to compute the persistence barcodes using the perhomology function. It is worth noting that rsmlToTable objects can also be used as an input for the architect function of this new version of the package to compute a set of aggregated metrics describing the global architecture of plant root systems.
The perhomology function computes the persistence barcode of each root system stored in an rsmlToTable or a dartToTable object. Each persistence barcode is computed using a geodesic distance function (Figure 3). For each root system, the results are stored as a barcode object in a list that contains as many elements as root systems. A barcode object is a matrix with 3 columns (dimension, birth, and death) and has as many lines as zero-order homology bars in the persistence barcode. An S3 method (plot.barcode) was developed for plotting persistence barcodes. A code example to compute and plot persistence barcodes from RSML files is provided below.
path <- "PATH_TO_FOLDER_WITH_RSML_FILES"
table <- rsmlToTable(path, fitter=TRUE)
ph <- perhomology(table)
plot(ph$RSML_NAME)
To compare persistence barcodes against each other, a pairwise distance matrix is needed and the bottleneck distance is one possible option. The bottleneck distance is considered as a robust dissimilarity metric between two persistence barcodes, and its interpretation is quite straightforward: the greater the distance between two persistence barcodes, the greater will be the dissimilarity between them12. Such pairwise bottleneck distance matrix can be calculated with the bottleneckdist function of the package. This function only requires a perhomology object as an input. It has to be noted that the computation time required to compute a bottleneck distance matrix is highly dependent on the number and complexity of root systems being compared. A code example to compute a bottleneck distance matrix from persistence barcodes is provided below.
path <- "PATH_TO_FOLDER_WITH_RSML_FILES"
table <- rsmlToTable(path, fitter=TRUE)
ph <- perhomology(table)
dist <- bottleneckdist(ph)
A large variety of morphological, architectural, and topological traits can be measured on plant root systems (e.g., total root length, diameter, number of lateral roots per branching order, lateral root density, Fitter indices, etc.). When working with root architecture models and image analysis tools supporting the RSML format, such traits can be easily extracted from RSML files using the architect function of the archiDART package21 or the ImageJ plugin RSML Reader20. Using multivariate statistical tools, such as principal component analysis (PCA), one can then determine the key traits differentiating the root systems being compared23. In the next section, we would like to show that the information gained with this approach can be nicely complemented by a topological analysis of root systems using persistent homology.
After selecting 100 root systems from a large RSML library, we first used the architect function to compute a set of 20 traits for each root system (Table S1). Then, we performed a PCA to visualize differences between root systems and find the most interesting morphological, architectural, and topological variables to differentiate them. On the score plot constructed with the two first principal components, a good separation between fibrous and taproot root systems can be observed on the first principal component (Figure 4A). On average, fibrous root systems were characterized by a greater number/length of first-order roots, while taproot systems had a greater lateral root length and a greater secondary root density (Figure 4B, Table S2). Interestingly, two topological indices (altitude and external path length) were on average greater for taproot systems. On the second principal component, however, a separation between dicotyledonous root systems can be observed (Figure 4A). Negative PC2 scores were mainly associated with taproot systems having a greater number/length of tertiary roots and a greater magnitude, while root systems with positive PC2 scores had on average greater root diameters, surface, and volume (Figure 4B, Table S2). Although this approach is very useful to assess root system diversity and derive a functional classification of root systems27, it poorly takes into account topological differences that might exist between root systems sharing similar trait values.
To illustrate this, we plotted four representative root systems from the RSML library used in this study (Figure 5). Although the global architecture and topology of these root systems clearly differ, fibrous 1 and fibrous 2, as well as taproot 1 and taproot 2, were poorly separated by the PCA (Figure 4A). Therefore, we performed a topological analysis of the root systems in our library using the persistent homology analysis pipeline described previously. Non-metric multidimensional scaling (NMDS) was used to visualize dissimilarities between persistence barcodes (Figure 4C and D). Results showed that (1) fibrous and taproot root systems can be clearly separated using persistent homology, and (2) strong topological differences exist between fibrous 1 and fibrous 2, as well as between taproot 1 and taproot 2, despite the fact that these root systems were not separated by the PCA. Altogether, these results showed that persistent homology is highly complementary to the more traditional approach consisting at using a set of aggregated metrics to compare root systems.
In this paper, we presented a new analysis pipeline implemented in the R package archiDART to perform topological analysis of plant root systems using root architectural data (DART or RSML files). Using root architecture models, we showed that persistent homology is an efficient tool to capture and compare the topology of a large diversity of root systems. In addition, our results showed that the use of both geometric and topological descriptors are necessary to capture the natural complexity of plant root systems. Because topology is independent of transformation and deformation, the analysis pipeline described in this paper is highly flexible and can be used on data describing the architecture of 3D (e.g., root architecture models) and 2D (e.g., excavated root systems) root systems. Altogether, we believe that this great flexibility in root architecture data, the ease of use of the functions developed for the analysis pipeline presented in this paper, as well as the open-source nature of archiDART, make topological analysis of root systems widely accessible to the scientific community.
The data and R codes used for the use cases presented in this manuscript are available: https://doi.org/10.5281/zenodo.111783630
The latest stable version of archiDART is available on the CRAN repository: http://cran.r-project.org/package=archiDART
Source code available from: https://github.com/archidart/archidart
Archived source code as at time of publication: https://dx.doi.org/10.5281/zenodo.111786431
Software license: GNU GPL v2.0
More information about archiDART can be found on this website: https://archidart.github.io/
The web application illustrating the capabilities of archiDART is accessible at: https://plantmodelling.shinyapps.io/archidart
The data and codes used to make the web application are available: https://github.com/archidart/archishiny
Archived source code as at time of publication: https://doi.org/10.5281/zenodo.113343532
License for web application: GNU GPL v3.0
This research was funded by the Chair of Ecosystem Functioning and Services, Leuphana University, Lüneburg, Germany.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
We would like to thank Magdalena Julkowska (KAUST), Pierre Delaplace (Liège University, Gembloux Agro-Bio Tech, Belgium), and Loïc Pagès (INRA, France) for their insightful comments on the web application.
Table S1. Descriptive statistics of the root system library used in this study. The RSML library consisted of 50 fibrous and 50 taproot root systems created using the ArchiSimple model14.
Click here to access the data.
Table S2. Principal component analysis: Correlation between each root system variable and the two first principal components. Correlation coefficients written in bold contributed significantly to a principal component (PC). We considered that a variable contributed significantly to a PC if its contribution (in %) was greater than the contribution that would have been expected if all variables contributed equally to a PC (threshold value equal to 5%). The topological indices used in the PCA were calculated following9,10.
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Is the rationale for developing the new software tool clearly explained?
Yes
Is the description of the software tool technically sound?
Yes
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?
Yes
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?
Yes
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?
Partly
Competing Interests: No competing interests were disclosed.
Is the rationale for developing the new software tool clearly explained?
Yes
Is the description of the software tool technically sound?
Yes
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?
Partly
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?
Partly
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?
Yes
References
1. Biasotti S, Attali D, Boissonnat J, Edelsbrunner H, et al.: Skeletal Structures. 2008. 145-183 Publisher Full TextCompeting Interests: No competing interests were disclosed.
Is the rationale for developing the new software tool clearly explained?
Yes
Is the description of the software tool technically sound?
Yes
Are sufficient details of the code, methods and analysis (if applicable) provided to allow replication of the software development and its use by others?
Yes
Is sufficient information provided to allow interpretation of the expected output datasets and any results generated using the tool?
Partly
Are the conclusions about the tool and its performance adequately supported by the findings presented in the article?
Partly
Competing Interests: No competing interests were disclosed.
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | |||
---|---|---|---|
1 | 2 | 3 | |
Version 1 08 Jan 18 |
read | read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)