Ancestrality and mosaicism of giant viruses supporting the definition of the fourth TRUC of microbes

Philippe Colson, Anthony Levasseur, Bernard La Scola, Vikas Sharma, Arshan Nasir, Pierre Pontarotti, Gustavo Caetano-Anollés, Didier Raoult

Research output: Contribution to journalArticle

Abstract

Giant viruses of amoebae were discovered in 2003. Since then, their diversity has greatly expanded. They were suggested to form a fourth branch of life, collectively named ‘TRUC’ (for “Things Resisting Uncompleted Classifications”) alongside Bacteria, Archaea, and Eukarya. Their origin and ancestrality remain controversial. Here, we specify the evolution and definition of giant viruses. Phylogenetic and phenetic analyses of informational gene repertoires of giant viruses and selected bacteria, archaea and eukaryota were performed, including structural phylogenomics based on protein structural domains grouped into 289 universal fold superfamilies (FSFs). Hierarchical clustering analysis was performed based on a binary presence/absence matrix constructed using 727 informational COGs from cellular organisms. The presence/absence of ‘universal’ FSF domains was used to generate an unrooted maximum parsimony phylogenomic tree. Comparison of the gene content of a giant virus with those of a bacterium, an archaeon, and a eukaryote with small genomes was also performed. Overall, both cladistic analyses based on gene sequences of very central and ancient proteins and on highly conserved protein fold structures as well as phenetic analyses were congruent regarding the delineation of a fourth branch of microbes comprised by giant viruses. Giant viruses appeared as a basal group in the tree of all proteomes. A pangenome and core genome determined for Rickettsia bellii (bacteria), Methanomassiliicoccus luminyensis (archaeon), Encephalitozoon intestinalis (eukaryote), and Tupanvirus (giant virus) showed a substantial proportion of Tupanvirus genes that overlap with those of the cellular microbes. In addition, a substantial genome mosaicism was observed, with 51, 11, 8, and 0.2% of Tupanvirus genes best matching with viruses, eukaryota, bacteria, and archaea, respectively. Finally, we found that genes themselves may be subject to lateral sequence transfers. In summary, our data highlight the quantum leap between classical and giant viruses. Phylogenetic and phyletic analyses and the study of protein fold superfamilies confirm previous evidence of the existence of a fourth TRUC of life that includes giant viruses, and highlight its ancestrality and mosaicism. They also point out that best evolutionary representations for giant viruses and cellular microorganisms are rhizomes, and that sequence transfers rather than gene transfers have to be considered.

Original languageEnglish (US)
Article number2668
JournalFrontiers in Microbiology
Volume9
Issue numberNOV
DOIs
StatePublished - Nov 27 2018

Fingerprint

Mosaicism
Archaea
Eukaryota
Bacteria
Genes
Genome
Encephalitozoon
Giant Viruses
Rickettsia
Rhizome
Proteins
Amoeba
Proteome
Cluster Analysis
Viruses

Keywords

  • Giant virus
  • Informational genes
  • Megavirales
  • Mimivirus
  • Protein structural domains
  • TRUC

ASJC Scopus subject areas

  • Microbiology
  • Microbiology (medical)

Cite this

Colson, P., Levasseur, A., Scola, B. L., Sharma, V., Nasir, A., Pontarotti, P., ... Raoult, D. (2018). Ancestrality and mosaicism of giant viruses supporting the definition of the fourth TRUC of microbes. Frontiers in Microbiology, 9(NOV), [2668]. https://doi.org/10.3389/fmicb.2018.02668

Ancestrality and mosaicism of giant viruses supporting the definition of the fourth TRUC of microbes. / Colson, Philippe; Levasseur, Anthony; Scola, Bernard La; Sharma, Vikas; Nasir, Arshan; Pontarotti, Pierre; Caetano-Anollés, Gustavo; Raoult, Didier.

In: Frontiers in Microbiology, Vol. 9, No. NOV, 2668, 27.11.2018.

Research output: Contribution to journalArticle

Colson, Philippe ; Levasseur, Anthony ; Scola, Bernard La ; Sharma, Vikas ; Nasir, Arshan ; Pontarotti, Pierre ; Caetano-Anollés, Gustavo ; Raoult, Didier. / Ancestrality and mosaicism of giant viruses supporting the definition of the fourth TRUC of microbes. In: Frontiers in Microbiology. 2018 ; Vol. 9, No. NOV.
@article{369e31fac9464bddb033e4e9715272de,
title = "Ancestrality and mosaicism of giant viruses supporting the definition of the fourth TRUC of microbes",
abstract = "Giant viruses of amoebae were discovered in 2003. Since then, their diversity has greatly expanded. They were suggested to form a fourth branch of life, collectively named ‘TRUC’ (for “Things Resisting Uncompleted Classifications”) alongside Bacteria, Archaea, and Eukarya. Their origin and ancestrality remain controversial. Here, we specify the evolution and definition of giant viruses. Phylogenetic and phenetic analyses of informational gene repertoires of giant viruses and selected bacteria, archaea and eukaryota were performed, including structural phylogenomics based on protein structural domains grouped into 289 universal fold superfamilies (FSFs). Hierarchical clustering analysis was performed based on a binary presence/absence matrix constructed using 727 informational COGs from cellular organisms. The presence/absence of ‘universal’ FSF domains was used to generate an unrooted maximum parsimony phylogenomic tree. Comparison of the gene content of a giant virus with those of a bacterium, an archaeon, and a eukaryote with small genomes was also performed. Overall, both cladistic analyses based on gene sequences of very central and ancient proteins and on highly conserved protein fold structures as well as phenetic analyses were congruent regarding the delineation of a fourth branch of microbes comprised by giant viruses. Giant viruses appeared as a basal group in the tree of all proteomes. A pangenome and core genome determined for Rickettsia bellii (bacteria), Methanomassiliicoccus luminyensis (archaeon), Encephalitozoon intestinalis (eukaryote), and Tupanvirus (giant virus) showed a substantial proportion of Tupanvirus genes that overlap with those of the cellular microbes. In addition, a substantial genome mosaicism was observed, with 51, 11, 8, and 0.2{\%} of Tupanvirus genes best matching with viruses, eukaryota, bacteria, and archaea, respectively. Finally, we found that genes themselves may be subject to lateral sequence transfers. In summary, our data highlight the quantum leap between classical and giant viruses. Phylogenetic and phyletic analyses and the study of protein fold superfamilies confirm previous evidence of the existence of a fourth TRUC of life that includes giant viruses, and highlight its ancestrality and mosaicism. They also point out that best evolutionary representations for giant viruses and cellular microorganisms are rhizomes, and that sequence transfers rather than gene transfers have to be considered.",
keywords = "Giant virus, Informational genes, Megavirales, Mimivirus, Protein structural domains, TRUC",
author = "Philippe Colson and Anthony Levasseur and Scola, {Bernard La} and Vikas Sharma and Arshan Nasir and Pierre Pontarotti and Gustavo Caetano-Anoll{\'e}s and Didier Raoult",
year = "2018",
month = "11",
day = "27",
doi = "10.3389/fmicb.2018.02668",
language = "English (US)",
volume = "9",
journal = "Frontiers in Microbiology",
issn = "1664-302X",
publisher = "Frontiers Media S. A.",
number = "NOV",

}

TY - JOUR

T1 - Ancestrality and mosaicism of giant viruses supporting the definition of the fourth TRUC of microbes

AU - Colson, Philippe

AU - Levasseur, Anthony

AU - Scola, Bernard La

AU - Sharma, Vikas

AU - Nasir, Arshan

AU - Pontarotti, Pierre

AU - Caetano-Anollés, Gustavo

AU - Raoult, Didier

PY - 2018/11/27

Y1 - 2018/11/27

N2 - Giant viruses of amoebae were discovered in 2003. Since then, their diversity has greatly expanded. They were suggested to form a fourth branch of life, collectively named ‘TRUC’ (for “Things Resisting Uncompleted Classifications”) alongside Bacteria, Archaea, and Eukarya. Their origin and ancestrality remain controversial. Here, we specify the evolution and definition of giant viruses. Phylogenetic and phenetic analyses of informational gene repertoires of giant viruses and selected bacteria, archaea and eukaryota were performed, including structural phylogenomics based on protein structural domains grouped into 289 universal fold superfamilies (FSFs). Hierarchical clustering analysis was performed based on a binary presence/absence matrix constructed using 727 informational COGs from cellular organisms. The presence/absence of ‘universal’ FSF domains was used to generate an unrooted maximum parsimony phylogenomic tree. Comparison of the gene content of a giant virus with those of a bacterium, an archaeon, and a eukaryote with small genomes was also performed. Overall, both cladistic analyses based on gene sequences of very central and ancient proteins and on highly conserved protein fold structures as well as phenetic analyses were congruent regarding the delineation of a fourth branch of microbes comprised by giant viruses. Giant viruses appeared as a basal group in the tree of all proteomes. A pangenome and core genome determined for Rickettsia bellii (bacteria), Methanomassiliicoccus luminyensis (archaeon), Encephalitozoon intestinalis (eukaryote), and Tupanvirus (giant virus) showed a substantial proportion of Tupanvirus genes that overlap with those of the cellular microbes. In addition, a substantial genome mosaicism was observed, with 51, 11, 8, and 0.2% of Tupanvirus genes best matching with viruses, eukaryota, bacteria, and archaea, respectively. Finally, we found that genes themselves may be subject to lateral sequence transfers. In summary, our data highlight the quantum leap between classical and giant viruses. Phylogenetic and phyletic analyses and the study of protein fold superfamilies confirm previous evidence of the existence of a fourth TRUC of life that includes giant viruses, and highlight its ancestrality and mosaicism. They also point out that best evolutionary representations for giant viruses and cellular microorganisms are rhizomes, and that sequence transfers rather than gene transfers have to be considered.

AB - Giant viruses of amoebae were discovered in 2003. Since then, their diversity has greatly expanded. They were suggested to form a fourth branch of life, collectively named ‘TRUC’ (for “Things Resisting Uncompleted Classifications”) alongside Bacteria, Archaea, and Eukarya. Their origin and ancestrality remain controversial. Here, we specify the evolution and definition of giant viruses. Phylogenetic and phenetic analyses of informational gene repertoires of giant viruses and selected bacteria, archaea and eukaryota were performed, including structural phylogenomics based on protein structural domains grouped into 289 universal fold superfamilies (FSFs). Hierarchical clustering analysis was performed based on a binary presence/absence matrix constructed using 727 informational COGs from cellular organisms. The presence/absence of ‘universal’ FSF domains was used to generate an unrooted maximum parsimony phylogenomic tree. Comparison of the gene content of a giant virus with those of a bacterium, an archaeon, and a eukaryote with small genomes was also performed. Overall, both cladistic analyses based on gene sequences of very central and ancient proteins and on highly conserved protein fold structures as well as phenetic analyses were congruent regarding the delineation of a fourth branch of microbes comprised by giant viruses. Giant viruses appeared as a basal group in the tree of all proteomes. A pangenome and core genome determined for Rickettsia bellii (bacteria), Methanomassiliicoccus luminyensis (archaeon), Encephalitozoon intestinalis (eukaryote), and Tupanvirus (giant virus) showed a substantial proportion of Tupanvirus genes that overlap with those of the cellular microbes. In addition, a substantial genome mosaicism was observed, with 51, 11, 8, and 0.2% of Tupanvirus genes best matching with viruses, eukaryota, bacteria, and archaea, respectively. Finally, we found that genes themselves may be subject to lateral sequence transfers. In summary, our data highlight the quantum leap between classical and giant viruses. Phylogenetic and phyletic analyses and the study of protein fold superfamilies confirm previous evidence of the existence of a fourth TRUC of life that includes giant viruses, and highlight its ancestrality and mosaicism. They also point out that best evolutionary representations for giant viruses and cellular microorganisms are rhizomes, and that sequence transfers rather than gene transfers have to be considered.

KW - Giant virus

KW - Informational genes

KW - Megavirales

KW - Mimivirus

KW - Protein structural domains

KW - TRUC

UR - http://www.scopus.com/inward/record.url?scp=85057610569&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85057610569&partnerID=8YFLogxK

U2 - 10.3389/fmicb.2018.02668

DO - 10.3389/fmicb.2018.02668

M3 - Article

AN - SCOPUS:85057610569

VL - 9

JO - Frontiers in Microbiology

JF - Frontiers in Microbiology

SN - 1664-302X

IS - NOV

M1 - 2668

ER -