A Tree of Cellular Life Inferred from a Genomic Census of Molecular Functions

Kyung Mo Kim, Arshan Nasir, Kyuin Hwang, Gustavo Caetano-Anollés

Research output: Contribution to journal › Article

Abstract

Phylogenomics aims to describe evolutionary relatedness between organisms by analyzing genomic data. The common practice is to produce phylogenomic trees from molecular information in the sequence, order, and content of genes in genomes. These phylogenies describe the evolution of life and become valuable tools for taxonomy. The recent availability of structural and functional data for hundreds of genomes now offers the opportunity to study evolution using more deep, conserved, and reliable sets of molecular features. Here, we reconstruct trees of life from the functions of proteins. We start by inferring rooted phylogenomic trees and networks of organisms directly from Gene Ontology annotations. Phylogenies and networks yield novel insights into the emergence and evolution of cellular life. The ancestor of Archaea originated earlier than the ancestors of Bacteria and Eukarya and was thermophilic. In contrast, basal bacterial lineages were non-thermophilic. A close relationship between Plants and Metazoa was also identified that disagrees with the traditional Fungi-Metazoa grouping. While measures of evolutionary reticulation were minimum in Eukarya and maximum in Bacteria, the massive role of horizontal gene transfer in microbes did not materialize in phylogenomic networks. Phylogenies and networks also showed that the best reconstructions were recovered when problematic taxa (i.e., parasitic/symbiotic organisms) and horizontally transferred characters were excluded from analysis. Our results indicate that functionomic data represent a useful addition to the set of molecular characters used for tree reconstruction and that trees of cellular life carry in deep branches considerable predictive power to explain the evolution of living organisms.
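
As a rough illustration of what a "genomic census of molecular functions" could look like as data, the sketch below counts how often each Gene Ontology term is annotated in each genome and arranges the counts as a genome-by-term matrix. It is a minimal sketch under stated assumptions, not the authors' pipeline: the genome names, the `annotations` dictionary, and the `go_census` helper are hypothetical, and the GO identifiers serve only as placeholder labels.

```python
from collections import Counter

# Hypothetical toy input: genome name -> GO term IDs annotated on its proteins.
# These values are invented for illustration and are not data from the paper.
annotations = {
    "genome_A": ["GO:0003677", "GO:0003677", "GO:0005524"],
    "genome_B": ["GO:0003677", "GO:0016787"],
    "genome_C": ["GO:0005524", "GO:0005524", "GO:0016787", "GO:0016787"],
}


def go_census(annotations):
    """Count GO term occurrences per genome.

    Returns the sorted list of all GO terms seen and a dict mapping each
    genome to its row of counts, in the same order as that term list.
    """
    terms = sorted({term for ids in annotations.values() for term in ids})
    matrix = {}
    for genome, ids in annotations.items():
        counts = Counter(ids)
        # Counter returns 0 for terms that a genome lacks.
        matrix[genome] = [counts[term] for term in terms]
    return terms, matrix


terms, matrix = go_census(annotations)
for genome, row in matrix.items():
    print(genome, row)
```

Each row could then serve as a vector of characters for one organism. How such counts are coded and turned into rooted trees or networks is not specified in the abstract, so that step is left out here.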

Original language: English (US)
Pages (from-to): 240-262
Number of pages: 23
Journal: Journal of Molecular Evolution
Volume: 79
Issue number: 5-6
DOI: 10.1007/s00239-014-9637-9
State: Published - Nov 29 2014

Fingerprint

Censuses
census
genomics
Phylogeny
Animalia
phylogeny
organisms
Eukaryota
ancestry
genome
Genome
Bacteria
Molecular Sequence Annotation
Horizontal Gene Transfer
Gene Ontology
bacterium
Gene Order
gene
gene transfer
Archaea

Keywords

  • Evolution
  • Gene ontology
  • Phylogenomics
  • Tree of life

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Molecular Biology
  • Genetics

Cite this

A Tree of Cellular Life Inferred from a Genomic Census of Molecular Functions. / Kim, Kyung Mo; Nasir, Arshan; Hwang, Kyuin; Caetano-Anollés, Gustavo.

In: Journal of Molecular Evolution, Vol. 79, No. 5-6, 29.11.2014, p. 240-262.

Kim, Kyung Mo ; Nasir, Arshan ; Hwang, Kyuin ; Caetano-Anollés, Gustavo. / A Tree of Cellular Life Inferred from a Genomic Census of Molecular Functions. In: Journal of Molecular Evolution. 2014 ; Vol. 79, No. 5-6. pp. 240-262.
@article{8bd7af1a800941e8bd576739f508b096,
title = "A Tree of Cellular Life Inferred from a Genomic Census of Molecular Functions",
abstract = "Phylogenomics aims to describe evolutionary relatedness between organisms by analyzing genomic data. The common practice is to produce phylogenomic trees from molecular information in the sequence, order, and content of genes in genomes. These phylogenies describe the evolution of life and become valuable tools for taxonomy. The recent availability of structural and functional data for hundreds of genomes now offers the opportunity to study evolution using more deep, conserved, and reliable sets of molecular features. Here, we reconstruct trees of life from the functions of proteins. We start by inferring rooted phylogenomic trees and networks of organisms directly from Gene Ontology annotations. Phylogenies and networks yield novel insights into the emergence and evolution of cellular life. The ancestor of Archaea originated earlier than the ancestors of Bacteria and Eukarya and was thermophilic. In contrast, basal bacterial lineages were non-thermophilic. A close relationship between Plants and Metazoa was also identified that disagrees with the traditional Fungi-Metazoa grouping. While measures of evolutionary reticulation were minimum in Eukarya and maximum in Bacteria, the massive role of horizontal gene transfer in microbes did not materialize in phylogenomic networks. Phylogenies and networks also showed that the best reconstructions were recovered when problematic taxa (i.e., parasitic/symbiotic organisms) and horizontally transferred characters were excluded from analysis. Our results indicate that functionomic data represent a useful addition to the set of molecular characters used for tree reconstruction and that trees of cellular life carry in deep branches considerable predictive power to explain the evolution of living organisms.",
keywords = "Evolution, Gene ontology, Phylogenomics, Tree of life",
author = "Kim, {Kyung Mo} and Arshan Nasir and Kyuin Hwang and Gustavo Caetano-Anoll{\'e}s",
year = "2014",
month = "11",
day = "29",
doi = "10.1007/s00239-014-9637-9",
language = "English (US)",
volume = "79",
pages = "240--262",
journal = "Journal of Molecular Evolution",
issn = "0022-2844",
publisher = "Springer New York",
number = "5-6",
}

TY - JOUR
T1 - A Tree of Cellular Life Inferred from a Genomic Census of Molecular Functions
AU - Kim, Kyung Mo
AU - Nasir, Arshan
AU - Hwang, Kyuin
AU - Caetano-Anollés, Gustavo
PY - 2014/11/29
Y1 - 2014/11/29

N2 - Phylogenomics aims to describe evolutionary relatedness between organisms by analyzing genomic data. The common practice is to produce phylogenomic trees from molecular information in the sequence, order, and content of genes in genomes. These phylogenies describe the evolution of life and become valuable tools for taxonomy. The recent availability of structural and functional data for hundreds of genomes now offers the opportunity to study evolution using more deep, conserved, and reliable sets of molecular features. Here, we reconstruct trees of life from the functions of proteins. We start by inferring rooted phylogenomic trees and networks of organisms directly from Gene Ontology annotations. Phylogenies and networks yield novel insights into the emergence and evolution of cellular life. The ancestor of Archaea originated earlier than the ancestors of Bacteria and Eukarya and was thermophilic. In contrast, basal bacterial lineages were non-thermophilic. A close relationship between Plants and Metazoa was also identified that disagrees with the traditional Fungi-Metazoa grouping. While measures of evolutionary reticulation were minimum in Eukarya and maximum in Bacteria, the massive role of horizontal gene transfer in microbes did not materialize in phylogenomic networks. Phylogenies and networks also showed that the best reconstructions were recovered when problematic taxa (i.e., parasitic/symbiotic organisms) and horizontally transferred characters were excluded from analysis. Our results indicate that functionomic data represent a useful addition to the set of molecular characters used for tree reconstruction and that trees of cellular life carry in deep branches considerable predictive power to explain the evolution of living organisms.

KW - Evolution
KW - Gene ontology
KW - Phylogenomics
KW - Tree of life
UR - http://www.scopus.com/inward/record.url?scp=84914673022&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84914673022&partnerID=8YFLogxK
U2 - 10.1007/s00239-014-9637-9
DO - 10.1007/s00239-014-9637-9
M3 - Article
C2 - 25128982
AN - SCOPUS:84914673022
VL - 79
SP - 240
EP - 262
JO - Journal of Molecular Evolution
JF - Journal of Molecular Evolution
SN - 0022-2844
IS - 5-6
ER -