TY - JOUR
T1 - Finishing the euchromatic sequence of the human genome
AU - International Human Genome Sequencing Consortium
AU - Abdellah, Zahra
AU - Ahmadi, Alireza
AU - Ahmed, Shahana
AU - Aimable, Matthew
AU - Ainscough, Rachael
AU - Almeida, Jeff
AU - Almond, Claire
AU - Ambler, Andrew
AU - Ambrose, Karen
AU - Ambrose, Kerrie
AU - Andrew, Robert
AU - Andrews, Daniel
AU - Andrews, Neil
AU - Andrews, Dan
AU - Apweiler, Eva
AU - Arbery, Hazel
AU - Archer, Beth
AU - Ash, Gareth
AU - Ashcroft, Kevin
AU - Ashurst, Jennifer
AU - Ashwell, Robert
AU - Atkin, Deborah
AU - Atkinson, Andrea
AU - Atkinson, Barry
AU - Attwood, John
AU - Aubin, Keith
AU - Auger, Katherine
AU - Avis, Terry
AU - Babbage, Anne
AU - Babbage, Sarah
AU - Bacon, Joanne
AU - Bagguley, Claire
AU - Bailey, Jonathan
AU - Baker, Andrew
AU - Banerjee, Ruby
AU - Bardill, Simon
AU - Barker, Darren
AU - Barker, Gary
AU - Barker, Daniel
AU - Barlow, Karen
AU - Baron, Laurent
AU - Barrett, Anika
AU - Bartlett, Rebecca
AU - Basham, David
AU - Basham, Victoria
AU - Bateman, Alex
AU - Bates, Karen
AU - Baynes, Caroline
AU - Beard, Lisa
AU - Stubbs, Lisa
N1 - Funding Information:
Acknowledgements WethankD.Lejaforgraphicdesignandproductionofthefigures.Wewould also like to thank the many dedicated support staff at the sequencing centres and funding agencies. In addition to finished sequence produced by the IHGSC between the time of publication of the draft human genome and April 2003, Build 35 contains some significant published finished sequence from other centres as listed in Table 1—for this we would like to acknowledge M. Adams, B. Roe and G. Evans. Build 35 also contains a small number of individual finished deposited accessions from a variety of other groups, which may or may not have been published; we would like to acknowledge all of this work. We acknowledge G. Sisk for help in preparing the manuscript. This work was supported by The Wellcome Trust; The US National Institutes of Health; The US Department of Energy; The Ministry of Education, Culture, Sports, Science and Technology, Japan; The Federal German Ministry of Education, Research, and Technology; Projektträger Biologie, Energie, Umwelt des BMBF und BMWT; the Max-Planck-Society; Deutsche Forschungsgemeinschaft; Thüringer Ministerium für Wissenschaft, Forschung, und Kunst; The Medical Research Council (UK); European Commission, Directorate Science, Research and Development; Chinese Academy of Sciences, Ministry of Science and Technology, National Natural Science Foundation of China.
PY - 2004/10/21
Y1 - 2004/10/21
N2 - The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.
AB - The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human enome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead.
UR - http://www.scopus.com/inward/record.url?scp=7244245762&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=7244245762&partnerID=8YFLogxK
U2 - 10.1038/nature03001
DO - 10.1038/nature03001
M3 - Article
C2 - 15496913
AN - SCOPUS:7244245762
SN - 0028-0836
VL - 431
SP - 931
EP - 945
JO - Nature
JF - Nature
IS - 7011
ER -