Transcription factor IID in the Archaea: Sequences in the Thermococcus celer genome would encode a product closely related to the TATA-binding protein of eukaryotes

Terry L. Marsh, Claudia I. Reich, Robert B. Whitelock, Gary J Olsen

Research output: Contribution to journalArticle

Abstract

The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archaeal and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archaeal protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archaeal protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.

Original languageEnglish (US)
Pages (from-to)4180-4184
Number of pages5
JournalProceedings of the National Academy of Sciences of the United States of America
Volume91
Issue number10
DOIs
StatePublished - May 10 1994

Fingerprint

Archaeal Proteins
Thermococcus
Transcription Factor TFIID
TATA-Box Binding Protein
Archaea
Eukaryota
Genome
Pyrococcus
Transcription Factor TFIIB
Genes
Tandem Repeat Sequences
Protein Subunits
DNA-Directed RNA Polymerases
Proteins

Keywords

  • gene duplication
  • least-squares distance
  • maximum likelihood and parsimony
  • molecular evolution
  • transcription initiation

ASJC Scopus subject areas

  • General

Cite this

@article{0cd0a9c65a364c94b1474313d13ed348,
title = "Transcription factor IID in the Archaea: Sequences in the Thermococcus celer genome would encode a product closely related to the TATA-binding protein of eukaryotes",
abstract = "The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archaeal and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archaeal protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archaeal protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.",
keywords = "gene duplication, least-squares distance, maximum likelihood and parsimony, molecular evolution, transcription initiation",
author = "Marsh, {Terry L.} and Reich, {Claudia I.} and Whitelock, {Robert B.} and Olsen, {Gary J}",
year = "1994",
month = "5",
day = "10",
doi = "10.1073/pnas.91.10.4180",
language = "English (US)",
volume = "91",
pages = "4180--4184",
journal = "Proceedings of the National Academy of Sciences of the United States of America",
issn = "0027-8424",
publisher = "National Academy of Sciences",
number = "10",

}

TY - JOUR

T1 - Transcription factor IID in the Archaea

T2 - Sequences in the Thermococcus celer genome would encode a product closely related to the TATA-binding protein of eukaryotes

AU - Marsh, Terry L.

AU - Reich, Claudia I.

AU - Whitelock, Robert B.

AU - Olsen, Gary J

PY - 1994/5/10

Y1 - 1994/5/10

N2 - The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archaeal and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archaeal protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archaeal protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.

AB - The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archaeal and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archaeal protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archaeal protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.

KW - gene duplication

KW - least-squares distance

KW - maximum likelihood and parsimony

KW - molecular evolution

KW - transcription initiation

UR - http://www.scopus.com/inward/record.url?scp=0028176170&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0028176170&partnerID=8YFLogxK

U2 - 10.1073/pnas.91.10.4180

DO - 10.1073/pnas.91.10.4180

M3 - Article

C2 - 8183889

AN - SCOPUS:0028176170

VL - 91

SP - 4180

EP - 4184

JO - Proceedings of the National Academy of Sciences of the United States of America

JF - Proceedings of the National Academy of Sciences of the United States of America

SN - 0027-8424

IS - 10

ER -