TY - JOUR
T1 - Nineteen additional unpredicted transcripts from human chromosome 21
AU - Reymond, Alexandre
AU - Camargo, Anamaria A.
AU - Deutsch, Samuel
AU - Stevenson, Brian J.
AU - Parmigiani, Raphael B.
AU - Ucla, Catherine
AU - Bettoni, Fabiana
AU - Rossier, Colette
AU - Lyle, Robert
AU - Guipponi, Michel
AU - De Souza, Sandro
AU - Iseli, Christian
AU - Jongeneel, C. Victor
AU - Bucher, Philipp
AU - Simpson, Andrew J.G.
AU - Antonarakis, Stylianos E.
N1 - Funding Information:
We thank Amos Bairoch, Laurent Cimasoni, Emmanouil Dermitzakis, Anne Estreicher, Marc Friedli, Olivier Menzel, Léonore Rougemont, and Marie Wattenhofer (University of Geneva Medical School) for suggestions and/or critical reading of the manuscript; and Marie-Pierre Papasavvas and Nathalie Scamuffa (University of Geneva Medical School) for core assistance. We gratefully acknowledge the support of the Genomics Program of the Ludwig Institute for Cancer Research. This work was supported by grants from the Jérôme Lejeune Foundation to R.L., A.R., and M.G.; from the Swiss FNRS 31.57149.99, the Swiss FNRS NPR38, and the European Union/OFES and ChildCare foundation to S.E.A.; from the Ludwig Institute for Cancer Research and from the Fundação de Amparo a Pesquisa do Estado de São Paulo (FAPESP) to A.A.C., S.D.S., and A.J.G.S.
PY - 2002
Y1 - 2002
N2 - The identification of all human chromosome 21 (HC21) genes is a necessary step in understanding the molecular pathogenesis of trisomy 21 (Down syndrome). The first analysis of the sequence of 21q included 127 previously characterized genes and predicted an additional 98 novel anonymous genes. Recently we evaluated the quality of this annotation by characterizing a set of HC21 open reading frames (C21orfs) identified by mapping spliced expressed sequence tags (ESTs) and predicted genes (PREDs), identified only in silico. This study underscored the limitations of in silico-only gene prediction, as many PREDs were incorrectly predicted. To refine the HC21 annotation, we have developed a reliable algorithm to extract and stringently map sequences that contain bona fide 3′ transcript ends to the genome. We then created a specific 21q graphical display allowing an integrated view of the data that incorporates new ESTs as well as features such as CpG islands, repeats, and gene predictions. Using these tools we identified 27 new putative genes. To validate these, we sequenced previously cloned cDNAs and carried out RT-PCR, 5′- and 3′-RACE procedures, and comparative mapping. These approaches substantiated 19 new transcripts, thus increasing the HC21 gene count by 9.5%. These transcripts were likely not previously identified because they are small and encode small proteins. We also identified four transcriptional units that are spliced but contain no obvious open reading frame. The HC21 data presented here further emphasize that current gene prediction algorithms miss a substantial number of transcripts that nevertheless can be identified using a combination of experimental approaches and multiple refined algorithms.
AB - The identification of all human chromosome 21 (HC21) genes is a necessary step in understanding the molecular pathogenesis of trisomy 21 (Down syndrome). The first analysis of the sequence of 21q included 127 previously characterized genes and predicted an additional 98 novel anonymous genes. Recently we evaluated the quality of this annotation by characterizing a set of HC21 open reading frames (C21orfs) identified by mapping spliced expressed sequence tags (ESTs) and predicted genes (PREDs), identified only in silico. This study underscored the limitations of in silico-only gene prediction, as many PREDs were incorrectly predicted. To refine the HC21 annotation, we have developed a reliable algorithm to extract and stringently map sequences that contain bona fide 3′ transcript ends to the genome. We then created a specific 21q graphical display allowing an integrated view of the data that incorporates new ESTs as well as features such as CpG islands, repeats, and gene predictions. Using these tools we identified 27 new putative genes. To validate these, we sequenced previously cloned cDNAs and carried out RT-PCR, 5′- and 3′-RACE procedures, and comparative mapping. These approaches substantiated 19 new transcripts, thus increasing the HC21 gene count by 9.5%. These transcripts were likely not previously identified because they are small and encode small proteins. We also identified four transcriptional units that are spliced but contain no obvious open reading frame. The HC21 data presented here further emphasize that current gene prediction algorithms miss a substantial number of transcripts that nevertheless can be identified using a combination of experimental approaches and multiple refined algorithms.
KW - Acedb
KW - Down syndrome
KW - Expression pattern
KW - Gene prediction
KW - Genomic sequences annotation
KW - Human chromosome 21
KW - Transcription map
KW - Trisomy 21
UR - http://www.scopus.com/inward/record.url?scp=0036267653&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0036267653&partnerID=8YFLogxK
U2 - 10.1006/geno.2002.6781
DO - 10.1006/geno.2002.6781
M3 - Article
C2 - 12036297
AN - SCOPUS:0036267653
SN - 0888-7543
VL - 79
SP - 824
EP - 832
JO - Genomics
JF - Genomics
IS - 6
ER -