A phylogenetic tree, also called an "evolutionary tree," is a leaf-labeled tree which represents the evolutionary history for a set of species, and the construction of such trees is a fundamental problem in biology. Here we address the issue of how many sequence sites are required in order to recover the tree with high probability when the sites evolve under standard Markov-style i.i.d. mutation models. We provide analytic upper and lower bounds for the required sequence length, by developing a new polynomial time algorithm. In particular, we show when the mutation probabilities are bounded the required sequence length can grow surprisingly slowly (a power of log n) in the number n of sequences, for almost all trees.
|Original language||English (US)|
|Number of pages||32|
|Journal||Random Structures and Algorithms|
|State||Published - Mar 1999|
ASJC Scopus subject areas
- Computer Graphics and Computer-Aided Design
- Applied Mathematics