TY - JOUR
T1 - MatrixEpistasis
T2 - Ultrafast, exhaustive epistasis scan for quantitative traits with covariate adjustment
AU - Zhu, Shijia
AU - Fang, Gang
N1 - Publisher Copyright:
© The Author(s) 2018.
PY - 2018/7/15
Y1 - 2018/7/15
N2 - Motivation: For many traits, causal loci uncovered by genetic mapping studies explain only a minority of the heritable contribution to trait variation. Multiple explanations for this 'missing heritability' have been proposed. Single nucleotide polymorphism (SNP)-SNP interaction (epistasis), as one of the compelling models, has been widely studied. However, the genome-wide scan of epistasis, especially for quantitative traits, poses huge computational challenges. Moreover, covariate adjustment is largely ignored in epistasis analysis due to the massive extra computational undertaking. Results: In the current study, we found striking differences among epistasis models using both simulation data and real biological data, suggesting that not only can covariate adjustment remove confounding bias, it can also improve power. Furthermore, we derived mathematical formulas, which enable the exhaustive epistasis scan together with full covariate adjustment to be expressed in terms of large matrix operation, therefore substantially improving the computational efficiency (104× faster than existing methods). We call the new method MatrixEpistasis. With MatrixEpistasis, we re-analyze a large real yeast dataset comprising 11 623 SNPs, 1008 segregants and 46 quantitative traits with covariates fully adjusted and detect thousands of novel putative epistasis with P-values<1.48e-10.
AB - Motivation: For many traits, causal loci uncovered by genetic mapping studies explain only a minority of the heritable contribution to trait variation. Multiple explanations for this 'missing heritability' have been proposed. Single nucleotide polymorphism (SNP)-SNP interaction (epistasis), as one of the compelling models, has been widely studied. However, the genome-wide scan of epistasis, especially for quantitative traits, poses huge computational challenges. Moreover, covariate adjustment is largely ignored in epistasis analysis due to the massive extra computational undertaking. Results: In the current study, we found striking differences among epistasis models using both simulation data and real biological data, suggesting that not only can covariate adjustment remove confounding bias, it can also improve power. Furthermore, we derived mathematical formulas, which enable the exhaustive epistasis scan together with full covariate adjustment to be expressed in terms of large matrix operation, therefore substantially improving the computational efficiency (104× faster than existing methods). We call the new method MatrixEpistasis. With MatrixEpistasis, we re-analyze a large real yeast dataset comprising 11 623 SNPs, 1008 segregants and 46 quantitative traits with covariates fully adjusted and detect thousands of novel putative epistasis with P-values<1.48e-10.
UR - http://www.scopus.com/inward/record.url?scp=85055029599&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85055029599&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/bty094
DO - 10.1093/bioinformatics/bty094
M3 - Article
C2 - 29509873
AN - SCOPUS:85055029599
SN - 1367-4803
VL - 34
SP - 2341
EP - 2348
JO - Bioinformatics
JF - Bioinformatics
IS - 14
ER -