Abstract
Inferring an accurate evolutionary tree of life requires high-quality alignments of molecular sequence data sets from large numbers of species. However, this task is often difficult, slow, and idiosyncratic, especially when the sequences are highly diverged or include high rates of insertions and deletions (collectively known as indels). We present SATé (simultaneous alignment and tree estimation), an automated method to quickly and accurately estimate both DNA alignments and trees with the maximum likelihood criterion. In our study, it improved tree and alignment accuracy compared to the best two-phase methods currently available for data sets of up to 1000 sequences, showing that coestimation can be both rapid and accurate in phylogenetic studies.
Original language | English (US) |
---|---|
Pages (from-to) | 1561-1564 |
Number of pages | 4 |
Journal | Science |
Volume | 324 |
Issue number | 5934 |
DOIs | |
State | Published - Jun 19 2009 |
Externally published | Yes |
ASJC Scopus subject areas
- General
Fingerprint
Dive into the research topics of 'Rapid and accurate large-scale coestimation of sequence alignments and phylogenetic trees'. Together they form a unique fingerprint.Datasets
-
Data for Rapid and Accurate Large-Scale Coestimation of Sequence Alignments and Phylogenetic Trees
Liu, K. (Creator), Raghavan, S. (Creator), Nelesen, S. (Creator), Linder, C. R. (Creator) & Warnow, T. (Creator), University of Illinois Urbana-Champaign, Jun 19 2009
DOI: 10.13012/B2IDB-5139418_V1
Dataset
-
5000-het: Dataset of Nucleotide Sequences with a Form of Evolutionary Sequence Length Heterogeneity
Liu, B. (Creator), Shen, C. (Creator) & Warnow, T. (Creator), University of Illinois Urbana-Champaign, Aug 5 2022
DOI: 10.13012/B2IDB-3974819_V1
Dataset