Low or uneven read depth is a common limitation of genotyping-by-sequencing (GBS) and restriction site-associated DNA sequencing (RAD-seq), resulting in high missing data rates, heterozygotes miscalled as homozygotes, and uncertainty of allele copy number in heterozygous polyploids. Bayesian genotype calling can mitigate these issues, but previously has only been implemented in software that requires a reference genome or uses priors that may be inappropriate for the population. Here we present several novel Bayesian algorithms that estimate genotype posterior probabilities, all of which are implemented in a new R package, polyRAD. Appropriate priors can be specified for mapping populations, populations in Hardy-Weinberg equilibrium, or structured populations, and in each case can be informed by genotypes at linked markers. The polyRAD software imports read depth from several existing pipelines, and outputs continuous or discrete numerical genotypes suitable for analyses such as genome-wide association and genomic prediction.
|Original language||English (US)|
|Number of pages||11|
|Journal||G3: Genes, Genomes, Genetics|
|State||Published - Mar 1 2019|
ASJC Scopus subject areas
- Molecular Biology
FingerprintDive into the research topics of 'polyRAD: Genotype calling with uncertainty from sequencing data in polyploids and diploids'. Together they form a unique fingerprint.
Scripts for testing the error rate of polyRAD
Clark, L. V. (Creator), Lipka, A. E. (Creator) & Sacks, E. J. (Creator), University of Illinois Urbana-Champaign, Nov 21 2018