TY - JOUR
T1 - Transcription factor IID in the Archaea
T2 - Sequences in the Thermococcus celer genome would encode a product closely related to the TATA-binding protein of eukaryotes
AU - Marsh, Terry L.
AU - Reich, Claudia I.
AU - Whitelock, Robert B.
AU - Olsen, Gary J.
N1 - Copyright 2017 Elsevier B.V., All rights reserved.
PY - 1994/5/10
Y1 - 1994/5/10
N2 - The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archaeal and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archaeal protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archaeal protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.
AB - The first step in transcription initiation in eukaryotes is mediated by the TATA-binding protein, a subunit of the transcription factor IID complex. We have cloned and sequenced the gene for a presumptive homolog of this eukaryotic protein from Thermococcus celer, a member of the Archaea (formerly archaebacteria). The protein encoded by the archaeal gene is a tandem repeat of a conserved domain, corresponding to the repeated domain in its eukaryotic counterparts. Molecular phylogenetic analyses of the two halves of the repeat are consistent with the duplication occurring before the divergence of the archaeal and eukaryotic domains. In conjunction with previous observations of similarity in RNA polymerase subunit composition and sequences and the finding of a transcription factor IIB-like sequence in Pyrococcus woesei (a relative of T. celer) it appears that major features of the eukaryotic transcription apparatus were well-established before the origin of eukaryotic cellular organization. The divergence between the two halves of the archaeal protein is less than that between the halves of the individual eukaryotic sequences, indicating that the average rate of sequence change in the archaeal protein has been less than in its eukaryotic counterparts. To the extent that this lower rate applies to the genome as a whole, a clearer picture of the early genes (and gene families) that gave rise to present-day genomes is more apt to emerge from the study of sequences from the Archaea than from the corresponding sequences from eukaryotes.
KW - gene duplication
KW - least-squares distance
KW - maximum likelihood and parsimony
KW - molecular evolution
KW - transcription initiation
UR - http://www.scopus.com/inward/record.url?scp=0028176170&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0028176170&partnerID=8YFLogxK
U2 - 10.1073/pnas.91.10.4180
DO - 10.1073/pnas.91.10.4180
M3 - Article
C2 - 8183889
AN - SCOPUS:0028176170
SN - 0027-8424
VL - 91
SP - 4180
EP - 4184
JO - Proceedings of the National Academy of Sciences of the United States of America
JF - Proceedings of the National Academy of Sciences of the United States of America
IS - 10
ER -