Inferring an accurate evolutionary tree of life requires high-quality alignments of molecular sequence data sets from large numbers of species. However, this task is often difficult, slow, and idiosyncratic, especially when the sequences are highly diverged or include high rates of insertions and deletions (collectively known as indels). We present SATé (simultaneous alignment and tree estimation), an automated method to quickly and accurately estimate both DNA alignments and trees with the maximum likelihood criterion. In our study, it improved tree and alignment accuracy compared to the best two-phase methods currently available for data sets of up to 1000 sequences, showing that coestimation can be both rapid and accurate in phylogenetic studies.
|Original language||English (US)|
|Number of pages||4|
|State||Published - Jun 19 2009|
ASJC Scopus subject areas
FingerprintDive into the research topics of 'Rapid and accurate large-scale coestimation of sequence alignments and phylogenetic trees'. Together they form a unique fingerprint.
5000-het: Dataset of Nucleotide Sequences with a Form of Evolutionary Sequence Length Heterogeneity
Liu, B. (Creator), Shen, C. (Creator) & Warnow, T. (Creator), University of Illinois Urbana-Champaign, Aug 5 2022