Abstract
Comparisons of complete genome sequences allow the most objective and comprehensive descriptions possible of a lineage's evolution. This communication uses the completed genomes from four major euryarchaeal taxa to define a genomic signature for the Euryarchaeota and, by extension, the Archaea as a whole. The signature is defined in terms of the set of protein- encoding genes found in at least two diverse members of the euryarchaeal taxa that function uniquely within the Archaea; most signature proteins have no recognizable bacterial or eukaryal homologs. By this definition, 351 clusters of signature proteins have been identified. Functions of most proteins in this signature set are currently unknown. At least 70% of the clusters that contain proteins from all the euryarchaeal genomes also have crenarchaeal homologs. This conservative set, which appears refractory to horizontal gene transfer to the Bacteria or the Eukarya, would seem to reflect the significant innovations that were unique and fundamental to the archaeal 'design fabric.' Genomic protein signature analysis methods may be extended to characterize the evolution of any phylogenetically defined lineage. The complete set of protein clusters for the archaeal genomic signature is presented as supplementary material (see the PNAS web site, www.pnas.org).
Original language | English (US) |
---|---|
Pages (from-to) | 3304-3308 |
Number of pages | 5 |
Journal | Proceedings of the National Academy of Sciences of the United States of America |
Volume | 97 |
Issue number | 7 |
DOIs | |
State | Published - Mar 28 2000 |
Keywords
- Archaea
- Biological classification
- Protein clusters
- Taxonomic domains
ASJC Scopus subject areas
- General