TY - CHAP
T1 - Population genomic analysis of model and nonmodel organisms using sequenced RAD tags
AU - Hohenlohe, Paul A.
AU - Catchen, Julian
AU - Cresko, William A.
N1 - Copyright 2015 Elsevier B.V., All rights reserved.
PY - 2012
Y1 - 2012
N2 - The evolutionary processes of mutation, migration, genetic drift, and natural selection shape patterns of genetic variation among individuals, populations, and species, and they can do so differentially across genomes. The field of population genomics provides a comprehensive genome-scale view of these processes, even beyond traditional model organisms. Until recently, genome-wide studies of genetic variation have been prohibitively expensive. However, next-generation sequencing (NGS) technologies are revolutionizing the field of population genomics, allowing for genetic analysis at scales not previously possible even in organisms for which few genomic resources presently exist. To speed this revolution in evolutionary genetics, we and colleagues developed Restriction site Associated DNA (RAD) sequencing, a method that uses Illumina NGS to simultaneously type and score tens to hundreds of thousands of single nucleotide polymorphism (SNP) markers in hundreds of individuals for minimal investment of resources. The core molecular protocol is described elsewhere in this volume, which can be modified to suit a diversity of evolutionary genetic questions. In this chapter, we outline the conceptual framework of population genomics, relate genomic patterns of variation to evolutionary processes, and discuss how RAD sequencing can be used to study population genomics. In addition, we discuss bioinformatic considerations that arise from unique aspects of NGS data as compared to traditional marker based approaches, and we outline some general analytical approaches for RAD-seq and similar data, including a computational pipeline that we developed called Stacks. This software can be used for the analysis of RAD-seq data in organisms with and without a reference genome. Nonetheless, the development of analytical tools remains in its infancy, and further work is needed to fully quantify sampling variance and biases in these data types. As data-gathering technology continues to advance, our ability to understand genomic evolution in natural populations will be limited more by conceptual and analytical weaknesses than by the amount of molecular data.
AB - The evolutionary processes of mutation, migration, genetic drift, and natural selection shape patterns of genetic variation among individuals, populations, and species, and they can do so differentially across genomes. The field of population genomics provides a comprehensive genome-scale view of these processes, even beyond traditional model organisms. Until recently, genome-wide studies of genetic variation have been prohibitively expensive. However, next-generation sequencing (NGS) technologies are revolutionizing the field of population genomics, allowing for genetic analysis at scales not previously possible even in organisms for which few genomic resources presently exist. To speed this revolution in evolutionary genetics, we and colleagues developed Restriction site Associated DNA (RAD) sequencing, a method that uses Illumina NGS to simultaneously type and score tens to hundreds of thousands of single nucleotide polymorphism (SNP) markers in hundreds of individuals for minimal investment of resources. The core molecular protocol is described elsewhere in this volume, which can be modified to suit a diversity of evolutionary genetic questions. In this chapter, we outline the conceptual framework of population genomics, relate genomic patterns of variation to evolutionary processes, and discuss how RAD sequencing can be used to study population genomics. In addition, we discuss bioinformatic considerations that arise from unique aspects of NGS data as compared to traditional marker based approaches, and we outline some general analytical approaches for RAD-seq and similar data, including a computational pipeline that we developed called Stacks. This software can be used for the analysis of RAD-seq data in organisms with and without a reference genome. Nonetheless, the development of analytical tools remains in its infancy, and further work is needed to fully quantify sampling variance and biases in these data types. As data-gathering technology continues to advance, our ability to understand genomic evolution in natural populations will be limited more by conceptual and analytical weaknesses than by the amount of molecular data.
KW - Evolution
KW - Genetic mapping
KW - Genomics
KW - Genotyping
KW - Next-generation sequencing
KW - Population genetics
KW - RAD-seq
KW - Single nucleotide polymorphisms
UR - http://www.scopus.com/inward/record.url?scp=84866754226&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84866754226&partnerID=8YFLogxK
U2 - 10.1007/978-1-61779-870-2_14
DO - 10.1007/978-1-61779-870-2_14
M3 - Chapter
C2 - 22665285
AN - SCOPUS:84866754226
SN - 9781617798696
T3 - Methods in Molecular Biology
SP - 235
EP - 260
BT - Data Production and Analysis in Population Genomics
PB - Humana Press Inc.
ER -