TY - GEN
T1 - Inferring Temporally Consistent Migration Histories
AU - Roddur, Mrinmoy Saha
AU - Snir, Sagi
AU - El-Kebir, Mohammed
N1 - Publisher Copyright:
© Mrinmoy Saha Roddur, Sagi Snir, and Mohammed El-Kebir; licensed under Creative Commons License CC-BY 4.0.
PY - 2023/8
Y1 - 2023/8
N2 - Not only do many biological populations undergo evolution, but population members may also migrate from one location to another. For example, tumor cells may migrate from the primary tumor and seed a new metastasis, and pathogens may migrate from one host to another. One may represent a population's migration history by labeling the vertices of a given phylogeny T with locations such that an edge incident to vertices with distinct locations represents a migration. Additionally, in some biological populations, taxa from distinct lineages may comigrate from one location to another in a single event, a phenomenon known as a comigration. Here, we show that a previous problem statement for inferring migration histories that are parsimonious in terms of migrations and comigrations may lead to temporally inconsistent solutions. To remedy this deficiency, we introduce precise definitions of temporal consistency of comigrations in a phylogeny, leading to three successive problems. First, we formulate the Temporally Consistent Comigrations (TCC) problem to check if a set of comigrations is temporally consistent and provide a linear time algorithm for solving this problem. Second, we formulate the Parsimonious Consistent Comigration (PCC) problem, which aims to find comigrations given a location labeling of a phylogeny. We show that PCC is NP-hard. Third, we formulate the Parsimonious Consistent Comigration History (PCCH) problem, which infers the migration history given a phylogeny and locations of its extant vertices only. We show that PCCH is NP-hard as well. On the positive side, we propose integer linear programming models to solve the PCC and PCCH problems. We apply our approach to real and simulated data.
AB - Not only do many biological populations undergo evolution, but population members may also migrate from one location to another. For example, tumor cells may migrate from the primary tumor and seed a new metastasis, and pathogens may migrate from one host to another. One may represent a population's migration history by labeling the vertices of a given phylogeny T with locations such that an edge incident to vertices with distinct locations represents a migration. Additionally, in some biological populations, taxa from distinct lineages may comigrate from one location to another in a single event, a phenomenon known as a comigration. Here, we show that a previous problem statement for inferring migration histories that are parsimonious in terms of migrations and comigrations may lead to temporally inconsistent solutions. To remedy this deficiency, we introduce precise definitions of temporal consistency of comigrations in a phylogeny, leading to three successive problems. First, we formulate the Temporally Consistent Comigrations (TCC) problem to check if a set of comigrations is temporally consistent and provide a linear time algorithm for solving this problem. Second, we formulate the Parsimonious Consistent Comigration (PCC) problem, which aims to find comigrations given a location labeling of a phylogeny. We show that PCC is NP-hard. Third, we formulate the Parsimonious Consistent Comigration History (PCCH) problem, which infers the migration history given a phylogeny and locations of its extant vertices only. We show that PCCH is NP-hard as well. On the positive side, we propose integer linear programming models to solve the PCC and PCCH problems. We apply our approach to real and simulated data.
KW - Integer Linear Programming
KW - Maximum parsimony
KW - Metastasis
KW - Migration
UR - http://www.scopus.com/inward/record.url?scp=85172084234&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85172084234&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.WABI.2023.9
DO - 10.4230/LIPIcs.WABI.2023.9
M3 - Conference contribution
AN - SCOPUS:85172084234
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 23rd International Workshop on Algorithms in Bioinformatics, WABI 2023
A2 - Belazzougui, Djamal
A2 - Ouangraoua, A�da
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 23rd International Workshop on Algorithms in Bioinformatics, WABI 2023
Y2 - 4 September 2023 through 6 September 2023
ER -