A comprehensive catalog of human KRAB-associated zinc finger genes: Insights into the evolutionary history of a large family of transcriptional repressors

Stuart Huntley, Daniel M. Baggott, Aaron T. Hamilton, Mary Tran-Gyamfi, Shan Yang, Joomyeong Kim, Laurie Gordon, Elbert Branscomb, Lisa Stubbs

Research output: Contribution to journal › Article

Abstract

Krüppel-type zinc finger (ZNF) motifs are prevalent components of transcription factor proteins in all eukaryotes. KRAB-ZNF proteins, in which a potent repressor domain is attached to a tandem array of DNA-binding zinc-finger motifs, are specific to tetrapod vertebrates and represent the largest class of ZNF proteins in mammals. To define the full repertoire of human KRAB-ZNF proteins, we searched the genome sequence for key motifs and then constructed and manually curated gene models incorporating those sequences. The resulting gene catalog contains 423 KRAB-ZNF protein-coding loci, yielding alternative transcripts that altogether predict at least 742 structurally distinct proteins. Active rounds of segmental duplication, involving single genes or larger regions and including both tandem and distributed duplication events, have driven the expansion of this mammalian gene family. Comparisons between the human genes and ZNF loci mined from the draft mouse, dog, and chimpanzee genomes not only identified 103 KRAB-ZNF genes that are conserved in mammals but also highlighted a substantial level of lineage-specific change; at least 136 KRAB-ZNF coding genes are primate specific, including many recent duplicates. KRAB-ZNF genes are widely expressed and clustered genes are typically not coregulated, indicating that paralogs have evolved to fill roles in many different biological processes. To facilitate further study, we have developed a Web-based public resource with access to gene models, sequences, and other data, including visualization tools to provide genomic context and interaction with other public data sets.

Original language: English (US)
Pages (from-to): 669-677
Number of pages: 9
Journal: Genome Research
Volume: 16
Issue number: 5
DOI: 10.1101/gr.4842106
State: Published - May 1 2006
Externally published: Yes

Fingerprint

Zinc Fingers
History
Genes
Proteins
Mammals
Genomic Segmental Duplications
Genome
Biological Phenomena
Pan troglodytes
Oligonucleotide Array Sequence Analysis
Eukaryota
Primates
Vertebrates
Transcription Factors
Dogs

ASJC Scopus subject areas

  • Genetics
  • Genetics (clinical)

Cite this

A comprehensive catalog of human KRAB-associated zinc finger genes: Insights into the evolutionary history of a large family of transcriptional repressors. / Huntley, Stuart; Baggott, Daniel M.; Hamilton, Aaron T.; Tran-Gyamfi, Mary; Yang, Shan; Kim, Joomyeong; Gordon, Laurie; Branscomb, Elbert; Stubbs, Lisa.

In: Genome Research, Vol. 16, No. 5, 01.05.2006, p. 669-677.

Huntley, Stuart; Baggott, Daniel M.; Hamilton, Aaron T.; Tran-Gyamfi, Mary; Yang, Shan; Kim, Joomyeong; Gordon, Laurie; Branscomb, Elbert; Stubbs, Lisa. / A comprehensive catalog of human KRAB-associated zinc finger genes: Insights into the evolutionary history of a large family of transcriptional repressors. In: Genome Research. 2006; Vol. 16, No. 5. pp. 669-677.
@article{c03746540cb44f2e99c5c6327452a904,
title = "A comprehensive catalog of human KRAB-associated zinc finger genes: Insights into the evolutionary history of a large family of transcriptional repressors",
abstract = "Kr{\"u}ppel-type zinc finger (ZNF) motifs are prevalent components of transcription factor proteins in all eukaryotes. KRAB-ZNF proteins, in which a potent repressor domain is attached to a tandem array of DNA-binding zinc-finger motifs, are specific to tetrapod vertebrates and represent the largest class of ZNF proteins in mammals. To define the full repertoire of human KRAB-ZNF proteins, we searched the genome sequence for key motifs and then constructed and manually curated gene models incorporating those sequences. The resulting gene catalog contains 423 KRAB-ZNF protein-coding loci, yielding alternative transcripts that altogether predict at least 742 structurally distinct proteins. Active rounds of segmental duplication, involving single genes or larger regions and including both tandem and distributed duplication events, have driven the expansion of this mammalian gene family. Comparisons between the human genes and ZNF loci mined from the draft mouse, dog, and chimpanzee genomes not only identified 103 KRAB-ZNF genes that are conserved in mammals but also highlighted a substantial level of lineage-specific change; at least 136 KRAB-ZNF coding genes are primate specific, including many recent duplicates. KRAB-ZNF genes are widely expressed and clustered genes are typically not coregulated, indicating that paralogs have evolved to fill roles in many different biological processes. To facilitate further study, we have developed a Web-based public resource with access to gene models, sequences, and other data, including visualization tools to provide genomic context and interaction with other public data sets.",
author = "Stuart Huntley and Baggott, {Daniel M.} and Hamilton, {Aaron T.} and Mary Tran-Gyamfi and Shan Yang and Joomyeong Kim and Laurie Gordon and Elbert Branscomb and Lisa Stubbs",
year = "2006",
month = "5",
day = "1",
doi = "10.1101/gr.4842106",
language = "English (US)",
volume = "16",
pages = "669--677",
journal = "Genome Research",
issn = "1088-9051",
publisher = "Cold Spring Harbor Laboratory Press",
number = "5",
}

TY - JOUR
T1 - A comprehensive catalog of human KRAB-associated zinc finger genes
T2 - Insights into the evolutionary history of a large family of transcriptional repressors
AU - Huntley, Stuart
AU - Baggott, Daniel M.
AU - Hamilton, Aaron T.
AU - Tran-Gyamfi, Mary
AU - Yang, Shan
AU - Kim, Joomyeong
AU - Gordon, Laurie
AU - Branscomb, Elbert
AU - Stubbs, Lisa
PY - 2006/5/1
Y1 - 2006/5/1
AB - Krüppel-type zinc finger (ZNF) motifs are prevalent components of transcription factor proteins in all eukaryotes. KRAB-ZNF proteins, in which a potent repressor domain is attached to a tandem array of DNA-binding zinc-finger motifs, are specific to tetrapod vertebrates and represent the largest class of ZNF proteins in mammals. To define the full repertoire of human KRAB-ZNF proteins, we searched the genome sequence for key motifs and then constructed and manually curated gene models incorporating those sequences. The resulting gene catalog contains 423 KRAB-ZNF protein-coding loci, yielding alternative transcripts that altogether predict at least 742 structurally distinct proteins. Active rounds of segmental duplication, involving single genes or larger regions and including both tandem and distributed duplication events, have driven the expansion of this mammalian gene family. Comparisons between the human genes and ZNF loci mined from the draft mouse, dog, and chimpanzee genomes not only identified 103 KRAB-ZNF genes that are conserved in mammals but also highlighted a substantial level of lineage-specific change; at least 136 KRAB-ZNF coding genes are primate specific, including many recent duplicates. KRAB-ZNF genes are widely expressed and clustered genes are typically not coregulated, indicating that paralogs have evolved to fill roles in many different biological processes. To facilitate further study, we have developed a Web-based public resource with access to gene models, sequences, and other data, including visualization tools to provide genomic context and interaction with other public data sets.

UR - http://www.scopus.com/inward/record.url?scp=33745179207&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745179207&partnerID=8YFLogxK
U2 - 10.1101/gr.4842106
DO - 10.1101/gr.4842106
M3 - Article
C2 - 16606702
AN - SCOPUS:33745179207
VL - 16
SP - 669
EP - 677
JO - Genome Research
JF - Genome Research
SN - 1088-9051
IS - 5
ER -