Abstract

Motivation: With the rapid growth rate of newly sequenced genomes, species tree inference from multiple genes has become a basic bioinformatics task in comparative and evolutionary biology. However, accurate species tree estimation is difficult in the presence of gene tree discordance, which is often due to incomplete lineage sorting (ILS), modelled by the multi-species coalescent. Several highly accurate coalescent-based species tree estimation methods have been developed over the last decade, including MP-EST. However, the running time for MP-EST increases rapidly as the number of species grows. Results: We present divide-and-conquer techniques that improve the scalability of MP-EST so that it can run efficiently on large datasets. Surprisingly, this technique also improves the accuracy of species trees estimated by MP-EST, as our study shows on a collection of simulated and biological datasets.

Original languageEnglish (US)
Article numberS7
JournalBMC genomics
Volume15
Issue number6
DOIs
StatePublished - Oct 17 2014

Keywords

  • Disk covering methods
  • Divide-and-conquer
  • Incomplete lineage sorting
  • MP-EST
  • Multi-species coalescent process

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Fingerprint Dive into the research topics of 'Disk covering methods improve phylogenomic analyses'. Together they form a unique fingerprint.

  • Cite this