TY - JOUR
T1 - Diverse Origins of Near-Identical Antifreeze Proteins in Unrelated Fish Lineages Provide Insights into Evolutionary Mechanisms of New Gene Birth and Protein Sequence Convergence
AU - Rives, Nathan
AU - Lamba, Vinita
AU - Cheng, C. H.Christina
AU - Zhuang, Xuan
N1 - We thank Kevin Bilyk for his assistance in acquiring the winter flounder and grubby sculpin specimens. We gratefully acknowledge the support of the Arkansas High Performance Computing Center for providing resources essential to this work. This research was supported in part by the National Institute of General Medical Sciences of the National Institutes of Health under awards P20GM139768, R15GM152956, the Arkansas Biosciences Institute\u2014the primary research component of the Arkansas Tobacco Settlement Proceeds Act of 2000, and the University of Illinois Foundation Research Fund 339045.
PY - 2024/9/1
Y1 - 2024/9/1
N2 - Determining the origins of novel genes and the mechanisms driving the emergence of new functions is challenging yet crucial for understanding evolutionary innovations. Recently evolved fish antifreeze proteins (AFPs) offer a unique opportunity to explore these processes, particularly the near-identical type I AFP (AFPI) found in four phylogenetically divergent fish taxa. This study tested the hypothesis of protein sequence convergence beyond functional convergence in three unrelated AFPI-bearing fish lineages. Through comprehensive comparative analyses of newly sequenced genomes of winter flounder and grubby sculpin, along with available high-quality genomes of cunner and 14 other related species, the study revealed that near-identical AFPI proteins originated from distinct genetic precursors in each lineage. Each lineage independently evolved a de novo coding region for the novel ice-binding protein while repurposing fragments from their respective ancestors into potential regulatory regions, representing partial de novo origination - a process that bridges de novo gene formation and the neofunctionalization of duplicated genes. The study supports existing models of new gene origination and introduces new ones: the innovation-amplification-divergence model, where novel changes precede gene duplication; the newly proposed duplication-degeneration-divergence model, which describes new functions arising from degenerated pseudogenes; and the duplication-degeneration-divergence gene fission model, where each new sibling gene differentially degenerates and renovates distinct functional domains from their parental gene. These findings highlight the diverse evolutionary pathways through which a novel functional gene with convergent sequences at the protein level can evolve across divergent species, advancing our understanding of the mechanistic intricacies in new gene formation.
AB - Determining the origins of novel genes and the mechanisms driving the emergence of new functions is challenging yet crucial for understanding evolutionary innovations. Recently evolved fish antifreeze proteins (AFPs) offer a unique opportunity to explore these processes, particularly the near-identical type I AFP (AFPI) found in four phylogenetically divergent fish taxa. This study tested the hypothesis of protein sequence convergence beyond functional convergence in three unrelated AFPI-bearing fish lineages. Through comprehensive comparative analyses of newly sequenced genomes of winter flounder and grubby sculpin, along with available high-quality genomes of cunner and 14 other related species, the study revealed that near-identical AFPI proteins originated from distinct genetic precursors in each lineage. Each lineage independently evolved a de novo coding region for the novel ice-binding protein while repurposing fragments from their respective ancestors into potential regulatory regions, representing partial de novo origination - a process that bridges de novo gene formation and the neofunctionalization of duplicated genes. The study supports existing models of new gene origination and introduces new ones: the innovation-amplification-divergence model, where novel changes precede gene duplication; the newly proposed duplication-degeneration-divergence model, which describes new functions arising from degenerated pseudogenes; and the duplication-degeneration-divergence gene fission model, where each new sibling gene differentially degenerates and renovates distinct functional domains from their parental gene. These findings highlight the diverse evolutionary pathways through which a novel functional gene with convergent sequences at the protein level can evolve across divergent species, advancing our understanding of the mechanistic intricacies in new gene formation.
KW - duplication-degeneration-divergence (DDD) model
KW - gene fission
KW - innovation-amplification-divergence (IAD) model
KW - new gene origination
KW - partial de novo evolution
KW - protein sequence convergent evolution
UR - http://www.scopus.com/inward/record.url?scp=85204260234&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85204260234&partnerID=8YFLogxK
U2 - 10.1093/molbev/msae182
DO - 10.1093/molbev/msae182
M3 - Article
C2 - 39213383
AN - SCOPUS:85204260234
SN - 0737-4038
VL - 41
JO - Molecular biology and evolution
JF - Molecular biology and evolution
IS - 9
M1 - msae182
ER -