TY - JOUR
T1 - Marker density and read depth for genotyping populations using genotyping-by-sequencing
AU - Beissinger, Timothy M.
AU - Hirsch, Candice N.
AU - Sekhon, Rajandeep S.
AU - Foerster, Jillian M.
AU - Johnson, James M.
AU - Muttoni, German
AU - Vaillancourt, Brieanne
AU - Robin Buell, C.
AU - Kaeppler, Shawn M.
AU - de Leon, Natalia
PY - 2013
Y1 - 2013
N2 - Genotyping-by-sequencing (GBS) approaches provide low-cost, high-density genotype information. However, GBS has unique technical considerations, including a substantial amount of missing data and a nonuniform distribution of sequence reads. The goal of this study was to characterize technical variation using this method and to develop methods to optimize read depth to obtain desired marker coverage. To empirically assess the distribution of fragments produced using GBS, ~8.69 Gb of GBS data were generated on the Zea mays reference inbred B73, utilizing ApeKI for genome reduction and single-end reads between 75 and 81 bp in length. We observed wide variation in sequence coverage across sites. Approximately 76% of potentially observable cut siteadjacent sequence fragments had no sequencing reads whereas a portion had substantially greater read depth than expected, up to 2369 times the expected mean. The methods described in this article facilitate determination of sequencing depth in the context of empirically defined read depth to achieve desired marker density for genetic mapping studies RAD sequencing of a non-model organism.
AB - Genotyping-by-sequencing (GBS) approaches provide low-cost, high-density genotype information. However, GBS has unique technical considerations, including a substantial amount of missing data and a nonuniform distribution of sequence reads. The goal of this study was to characterize technical variation using this method and to develop methods to optimize read depth to obtain desired marker coverage. To empirically assess the distribution of fragments produced using GBS, ~8.69 Gb of GBS data were generated on the Zea mays reference inbred B73, utilizing ApeKI for genome reduction and single-end reads between 75 and 81 bp in length. We observed wide variation in sequence coverage across sites. Approximately 76% of potentially observable cut siteadjacent sequence fragments had no sequencing reads whereas a portion had substantially greater read depth than expected, up to 2369 times the expected mean. The methods described in this article facilitate determination of sequencing depth in the context of empirically defined read depth to achieve desired marker density for genetic mapping studies RAD sequencing of a non-model organism.
UR - http://www.scopus.com/inward/record.url?scp=84876367781&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84876367781&partnerID=8YFLogxK
U2 - 10.1534/genetics.112.147710
DO - 10.1534/genetics.112.147710
M3 - Article
C2 - 23410831
AN - SCOPUS:84876367781
SN - 0016-6731
VL - 193
SP - 1073
EP - 1081
JO - Genetics
JF - Genetics
IS - 4
ER -