Gardner 等 - 2005 - A benchmark of multiple sequence alignment programs upon structural RNAs
Gardner 等 - 2005 - A benchmark of multiple sequence alignment programs upon structural RNAs
*To whom correspondence should be addressed. Tel: +45 3532 1338; Fax: +45 3532 1300; Email: [email protected]
ª The Author 2005. Published by Oxford University Press. All rights reserved.
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access
version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press
are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but
only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact [email protected]
2434 Nucleic Acids Research, 2005, Vol. 33, No. 8
Algorithm options are summarized in the Supplementary homology groups. An additional tRNA dataset was generated
Material Tables 2 and 3. with just two sequences to each alignment (hereafter referred
to as dataset 2). This was used to contrast pairwise structural
alignment methods and sequence alignment methods.
Test datasets
We generated four diverse structural RNA datasets of Group II Caveat: tools improve
introns, 5S rRNA, tRNA and U5 spliceosomal RNA. The
sequences and the reference alignments for calculating the We note here that our data reflect the state of the art in early
SPS were obtained from the Rfam v5.0 database. SRP from 2005. Most of the tools tested are relatively recent, and many
the SRPDB database was included in the original dataset but are still under development. Hence, not all the observations
was later discarded due to poor comparability between below will remain reproducible. In fact, we hope this study
predicted and structural alignments contained in this dataset helps to obtain better results in the future.
(the results suggest that a fraction of the SRP reference
Table 1. The mean SCI score and mean SPSs computed for dataset 1 (see text for details)
Algorithm High homology (75% < seq. id) Medium homology (75% < seq. id < 55%) Low homology (seq. id < 55%)
SCI SPS Rank SCI SPS Rank SCI SPS Rank
Dataset 1 has been divided into three homology groups: the high-homology group (75% < seq. id), the medium-homology group (75% < seq. id < 55%) and the
low-homology group (seq. id < 55%). Rankings are computed from the product of SCI and SPS. The top 10 ranks are highlighted in boldface. Abbreviations of the
parameter switches used to produce these results are shown in parentheses. Further details of these can be found in Supplementary Material.
2436 Nucleic Acids Research, 2005, Vol. 33, No. 8
with a variety of algorithm parameters. The results using Dataset 2: comparison of structural and sequence
default and any parameter combinations that increased algo- methods
rithm performance are summarized in Figure 2. In order to
Now, we contrast the relative performance of the comparat-
measure relative algorithm performances, a ranking for each
ively good sequence-based methods identified in the previous
algorithm (and parameter setting) was calculated within each
section with structural alignment methods using a smaller
of the three homology ranges. The rank is based upon the
tRNA dataset. The structure-based methods are generally
product of mean SCI score and mean SPSs.
computationally more intensive than the sequence-based
For the high-similarity datasets (sequence identity >75%),
methods—hence the small size (in terms of the number of
there is little difference in accuracy across most of the
sequences and the sequence length) of this dataset.
algorithms considered here (see Table 1). Align-m and Handel
We use dataset 2 to compare the relative performances of
rank well on this dataset, yet the relative performance of
structure-based methods (e.g. Dynalign, Foldalign, PMcomp
both these methods dropped rapidly with decreasing sequence
and Stemloc) to the ‘better’ sequence-based methods identified
homology. Interestingly, for Align-m, this is the opposite to
in the previous section (e.g. ClustalW, MUSCLE, PCMA,
Figure 2. Both measures of structural RNA alignment correctness, SCI (A) and SPS (B), are plotted as functions of the mean pairwise sequence identity
(calculated using the reference alignments). The curves are fit to dataset 1 (see text for details) using lowess (local weighted regression) smoothing. At most,
two curves are plotted for each alignment package—one corresponding to the default parameters, the other corresponds to the best parameter combination we
could identify.
Nucleic Acids Research, 2005, Vol. 33, No. 8 2437
disappointing. We hypothesize that perhaps these are due to datasets (32). Another surprise was that the supposedly
an overemphasis upon the sequence-based component of the outdated, yet still widely used method ClustalW, performed
algorithm. consistently well across all homology datasets. This is possibly
All of the above datasets are freely available from a consequence of the fact that more recent algorithms are
http://www.binf.ku.dk/users/pgardner/bralibase/. As novel heavily optimized for protein alignment. The relatively
and updated algorithms become available, updated results new methods ProAlign, POA (gp) and MUSCLE also per-
will also be made available from the web page. formed consistently well. ProAlign, in particular, produced
(comparatively) reliable alignments and ranked in the top 5
across all homology ranges. This is possibly due to the fact
that ProAlign is one of the few algorithms to use a scoring
DISCUSSION
scheme derived from reliable nucleic acid sequence align-
An aim of this work is to determine the boundaries between ments. The performance of POA (gp) is also remarkable,
when pure sequence alignment methods perform well and not only because it employs a very fast method [said to
when augmentation of the alignment with structure is neces- accurately align 5000 EST sequences in 4 h on a Pentium II
sary. We wish to highlight that the benchmarks based purely (44)] but also because it performed consistently well over all
upon structural protein alignments do not adequately test all test sets.
the uses of sequence alignment. In addition, we are pleased to Another conclusion of this work is that the ‘twilight zone’ of
note that our two independent measures of alignment fitness ncRNA alignment—the homology range where little to no
(SCI and SPS) produce similar results. structural information of predicted alignments (using the cur-
In some cases, we found that altering algorithm para- rent state of the art algorithms) for structurally homologous
meters produced a dramatic improvement over the defaults sequences is retained—is in the 50–60% sequence-identity
(e.g. T-Coffee performance improves using Clustal to generate range. This is dramatically higher than that of the protein
a library of pairwise alignments and POA performance sequences which is 10–20% (29). Much of this difference
improves dramatically using a combination of the global is, of course, due to the different alphabet sizes and the gen-
and the progressive modes). erally limited models and the score matrices for nucleotide
We find that the conclusions of previous studies based upon alignment.
structural protein alignments do not necessarily hold for the It is interesting to note that three of the structural methods
alignment of structural ncRNA. For example, DIALIGN, iden- (Dynalign, Foldalign and PMcomp), for a short homology
tified as a method which performed well for low-homology range (40–60% sequence identity), have higher SCI scores
protein alignment did not generally improve (relative to the than the reference alignment and that in the same regions
alternative methods) on low-homology datasets (32). Another there is a dip in the performance when Dynalign, Foldalign
surprising discovery was that T-Coffee, touted as an excellent and PMcomp performance is measured using SPS. This
method for high-homology datasets, did not perform well suggests that the reference alignments themselves may
(again, relative to the alternative methods) on the ncRNA be improved upon in this homology range.
2438 Nucleic Acids Research, 2005, Vol. 33, No. 8
Based upon these results the Foldalign score routines seem (analogous to the index creation of the BLAST procedure)
to have optimized the delicate balance between the sequence and the profile alignment method is likely to produce a
and the structure-based scores. This implementation of superior homology detection tool. This application and the
Sankoff’s algorithm employs a light-weight energy model extension of this method to multiple alignments is an area
(13,41,45,46) in concert with the substitution matrices similar of active research.
to those of RIBOSUM (47) and BLOSUM (48), which seem to Other potentially fruitful research areas to explore are as
produce excellent predictions. However, the computational follows: (i) The implementation of light-weight Sankoff-like
complexity of this algorithm is still an issue, global alignment algorithms, which produce reasonable alignments in a short
is restricted to sequences of 200 nt or less, in practice. time-frame and use score routines combining energy and
Further optimization may increase this bound, however. sequence scores similar to those of Foldalign. (ii) In analogy
The profile-based approach of Hofacker et al. [pmcomp to the improvement of structure prediction accuracy by
- -fast (15,49)], holds promise for producing fast and reason- including stacking parameters (nearest-neighbour model),
ably accurate alignments in satisfactory time across all perhaps the alignment of RNA sequences over a dinucleotide
the Open Access publication charges for this article was 25. Mathews,D. (2004) Using an RNA secondary structure partition function
provided by a Carlsberg Foundation Grant (21-00-0680). to determine confidence in base pairs predicted by free energy
minimization. RNA, 10, 1178–1190.
Conflict of interest statement. None declared. 26. Gardner,P. and Giegerich,R. (2004) A comprehensive comparison of
comparative RNA structure prediction approaches. BMC Bioinformatics,
5, 140.
27. Zuker,M. and Sankoff,D. (1984) RNA secondary structures and their
REFERENCES prediction. Bull. Math. Biol., 46, 591–621.
28. Hofacker,I.L., Fontana,W., Bonhoeffer,S. and Stadler,P.F. (1994) Fast
1. Chiu,D.K. and Kolodziejczak,T. (1991) Inferring consensus structure folding and comparison of RNA secondary structures. Monatshefte f€ ur
from nucleic acid sequences. Comput. Appl. Biosci., 7, 347–352. Chemie, 125, 167–188.
2. Gutell,R.R., Power,A., Hertz,G.Z., Putz,E.J. and Stormo,G.D. (1992) 29. Thompson,J., Plewniak,F. and Poch,O. (1999) A comprehensive
Identifying constraints on the higher-order structure of RNA: continued comparison of multiple sequence alignment programs. Nucleic
development and application of comparative sequence analysis Acids Res., 27.
methods. Nucleic Acids Res., 20, 5785–5795. 30. Thompson,J., Plewniak,F. and Poch,O. (1999) BAliBASE: a benchmark
3. Gorodkin,J., Heyer,L., Brunak,S. and Stormo,G. (1997) Displaying the alignment database for the evaluation of multiple alignment programs.