TY - JOUR
T1 - Testing Empirical Support for Evolutionary Models that Root the Tree of Life
AU - Caetano-Anollés, Derek
AU - Nasir, Arshan
AU - Kim, Kyung Mo
AU - Caetano-Anollés, Gustavo
N1 - Publisher Copyright:
© 2019, The Author(s).
PY - 2019/4/15
Y1 - 2019/4/15
N2 - Trees of life (ToLs) can only be rooted with direct methods that seek optimization of character state information in ingroup taxa. This involves optimizing phylogenetic tree, model and data in an exercise of reciprocal illumination. Rooted ToLs have been built from a census of protein structural domains in proteomes using two kinds of models. Fully-reversible models use standard-ordered (additive) characters and Wagner parsimony to generate unrooted trees of proteomes that are then rooted with Weston’s generality criterion. Non-reversible models directly build rooted trees with unordered characters and asymmetric stepmatrices of transformation costs that penalize gain over loss of domains. Here, we test the empirical support for the evolutionary models with character state reconstruction methods using two published proteomic datasets. We show that the reversible models match reconstructed frequencies of character change and are faithful to the distribution of serial homologies in trees. In contrast, the non-reversible models go counter to trends in the data they must explain, attracting organisms with large proteomes to the base of the rooted trees while violating the triangle inequality of distances. This can lead to serious reconstruction inconsistencies that show model inadequacy. Our study highlights the aprioristic perils of disposing of countering evidence in natural history reconstruction.
AB - Trees of life (ToLs) can only be rooted with direct methods that seek optimization of character state information in ingroup taxa. This involves optimizing phylogenetic tree, model and data in an exercise of reciprocal illumination. Rooted ToLs have been built from a census of protein structural domains in proteomes using two kinds of models. Fully-reversible models use standard-ordered (additive) characters and Wagner parsimony to generate unrooted trees of proteomes that are then rooted with Weston’s generality criterion. Non-reversible models directly build rooted trees with unordered characters and asymmetric stepmatrices of transformation costs that penalize gain over loss of domains. Here, we test the empirical support for the evolutionary models with character state reconstruction methods using two published proteomic datasets. We show that the reversible models match reconstructed frequencies of character change and are faithful to the distribution of serial homologies in trees. In contrast, the non-reversible models go counter to trends in the data they must explain, attracting organisms with large proteomes to the base of the rooted trees while violating the triangle inequality of distances. This can lead to serious reconstruction inconsistencies that show model inadequacy. Our study highlights the aprioristic perils of disposing of countering evidence in natural history reconstruction.
KW - Characters
KW - Evolution
KW - Fold superfamily
KW - Phylogenetic analysis
KW - Superkingdoms
KW - Tree of life
UR - http://www.scopus.com/inward/record.url?scp=85063213560&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85063213560&partnerID=8YFLogxK
U2 - 10.1007/s00239-019-09891-7
DO - 10.1007/s00239-019-09891-7
M3 - Article
C2 - 30887086
AN - SCOPUS:85063213560
SN - 0022-2844
VL - 87
SP - 131
EP - 142
JO - Journal of Molecular Evolution
JF - Journal of Molecular Evolution
IS - 2-3
ER -