TY - JOUR
T1 - Design of DNA pooling to allow incorporation of covariates in rare variants analysis
AU - Guan, Weihua
AU - Li, Chun
N1 - Publisher Copyright:
© 2014 Guan, Li.
PY - 2014/12/8
Y1 - 2014/12/8
N2 - Background: Rapid advances in next-generation sequencing technologies facilitate genetic association studies of an increasingly wide array of rare variants. To capture the rare or less common variants, a large number of individuals will be needed. However, the cost of a large scale study using whole genome or exome sequencing is still high. DNA pooling can serve as a cost-effective approach, but with a potential limitation that the identity of individual genomes would be lost and therefore individual characteristics and environmental factors could not be adjusted in association analysis, which may result in power loss and a biased estimate of genetic effect. Methods: For case-control studies, we propose a design strategy for pool creation and an analysis strategy that allows covariate adjustment, using multiple imputation technique. Results: Simulations show that our approach can obtain reasonable estimate for genotypic effect with only slight loss of power compared to the much more expensive approach of sequencing individual genomes. Conclusion: Our design and analysis strategies enable more powerful and costeffective sequencing studies of complex diseases, while allowing incorporation of covariate adjustment.
AB - Background: Rapid advances in next-generation sequencing technologies facilitate genetic association studies of an increasingly wide array of rare variants. To capture the rare or less common variants, a large number of individuals will be needed. However, the cost of a large scale study using whole genome or exome sequencing is still high. DNA pooling can serve as a cost-effective approach, but with a potential limitation that the identity of individual genomes would be lost and therefore individual characteristics and environmental factors could not be adjusted in association analysis, which may result in power loss and a biased estimate of genetic effect. Methods: For case-control studies, we propose a design strategy for pool creation and an analysis strategy that allows covariate adjustment, using multiple imputation technique. Results: Simulations show that our approach can obtain reasonable estimate for genotypic effect with only slight loss of power compared to the much more expensive approach of sequencing individual genomes. Conclusion: Our design and analysis strategies enable more powerful and costeffective sequencing studies of complex diseases, while allowing incorporation of covariate adjustment.
UR - http://www.scopus.com/inward/record.url?scp=84915747047&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84915747047&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0114523
DO - 10.1371/journal.pone.0114523
M3 - Article
C2 - 25485788
AN - SCOPUS:84915747047
SN - 1932-6203
VL - 9
JO - PloS one
JF - PloS one
IS - 12
M1 - e114523
ER -