TY - GEN
T1 - A simulation study comparing supertree and combined analysis methods using SMIDGen
AU - Swenson, M. Shel
AU - Barbançon, François
AU - Linder, C. Randal
AU - Warnow, Tandy
PY - 2009
Y1 - 2009
N2 - Supertree methods comprise one approach to reconstructing large molecular phylogenies given estimated source trees for overlapping subsets of the entire set of taxa. These source trees are combined into a single supertree on the full set of taxa using various algorithmic techniques, with the most common being matrix representation with parsimony (MRP). When the data allow, the competing approach is a combined analysis (also known as a "supermatrix" or "total evidence" approach) whereby the different sequence data matrices for each of the different subsets of taxa are concatenated into a single supermatrix, and a tree is estimated on that supermatrix. In this paper, we report an extensive simulation study comparing the supertree methods MRP and weighted MRP against combined analysis methods on large model trees, using a novel simulation methodology (Super-Method Input Data Generator, or SMIDGen), which better reflects biological processes and the practices of systematists. This study shows that combined analysis based upon maximum likelihood outperforms all the other methods, giving especially big improvements when the largest subtree does not contain most of the taxa.
AB - Supertree methods comprise one approach to reconstructing large molecular phylogenies given estimated source trees for overlapping subsets of the entire set of taxa. These source trees are combined into a single supertree on the full set of taxa using various algorithmic techniques, with the most common being matrix representation with parsimony (MRP). When the data allow, the competing approach is a combined analysis (also known as a "supermatrix" or "total evidence" approach) whereby the different sequence data matrices for each of the different subsets of taxa are concatenated into a single supermatrix, and a tree is estimated on that supermatrix. In this paper, we report an extensive simulation study comparing the supertree methods MRP and weighted MRP against combined analysis methods on large model trees, using a novel simulation methodology (Super-Method Input Data Generator, or SMIDGen), which better reflects biological processes and the practices of systematists. This study shows that combined analysis based upon maximum likelihood outperforms all the other methods, giving especially big improvements when the largest subtree does not contain most of the taxa.
UR - http://www.scopus.com/inward/record.url?scp=70350349003&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70350349003&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-04241-6_28
DO - 10.1007/978-3-642-04241-6_28
M3 - Conference contribution
AN - SCOPUS:70350349003
SN - 3642042406
SN - 9783642042409
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 333
EP - 344
BT - Algorithms in Bioinformatics - 9th International Workshop, WABI 2009, Proceedings
T2 - 9th International Workshop on Algorithms in Bioinformatics, WABI 2009
Y2 - 12 September 2009 through 13 September 2009
ER -