TY - JOUR
T1 - A unified framework for detecting genetic association with multiple SNPs in a candidate gene or region
T2 - Contrasting genotype scores and ld patterns between cases and controls
AU - Pan, Wei
PY - 2009/10
Y1 - 2009/10
N2 - It is critical to develop and apply powerful statistical tests for genetic association studies due to typically weak associations with complex human diseases or phenotypes. For population-based case-control studies with unphased multilocus genotype data, most of the existing methods are based on comparing genotype scores, e.g. allele frequencies, between the case and control groups. Another class of approaches are motivated to contrast linkage disequilibrium (LD) patterns between the two groups. It is expected that no single test can be uniformly most powerful across all situations, and different tests may perform better under different scenarios. A recent effort has been devoted to combining the above two classes of approaches, which however has some potential drawbacks. Here we propose a general and simple framework to unify the above two classes of approaches: it is based on the simple idea to incorporate LD measurements, in addition to genotype scores, as covariates in a logistic regression model, from which various tests can be constructed by taking advantage of the nice properties of the score statistics for the logistic model. It also has an advantage in easily accommodating covariates and other study designs. We use simulated data to show that our proposed tests performed well across several scenarios. In particular, in contrast to either of the two classes of the tests that is only powerful in detecting only one, but not both, of the two types of the distributional differences between cases and controls, our proposed tests are sensitive to both.
AB - It is critical to develop and apply powerful statistical tests for genetic association studies due to typically weak associations with complex human diseases or phenotypes. For population-based case-control studies with unphased multilocus genotype data, most of the existing methods are based on comparing genotype scores, e.g. allele frequencies, between the case and control groups. Another class of approaches are motivated to contrast linkage disequilibrium (LD) patterns between the two groups. It is expected that no single test can be uniformly most powerful across all situations, and different tests may perform better under different scenarios. A recent effort has been devoted to combining the above two classes of approaches, which however has some potential drawbacks. Here we propose a general and simple framework to unify the above two classes of approaches: it is based on the simple idea to incorporate LD measurements, in addition to genotype scores, as covariates in a logistic regression model, from which various tests can be constructed by taking advantage of the nice properties of the score statistics for the logistic model. It also has an advantage in easily accommodating covariates and other study designs. We use simulated data to show that our proposed tests performed well across several scenarios. In particular, in contrast to either of the two classes of the tests that is only powerful in detecting only one, but not both, of the two types of the distributional differences between cases and controls, our proposed tests are sensitive to both.
KW - Genome-wide association study
KW - Linkage disequilibrium
KW - Linkage disequilibrium contrast test
KW - Logistic regression Multilocus analysis
KW - SNP
KW - Score test
KW - Sum of squared score tests
UR - http://www.scopus.com/inward/record.url?scp=70349608366&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349608366&partnerID=8YFLogxK
U2 - 10.1159/000243149
DO - 10.1159/000243149
M3 - Article
C2 - 19797904
AN - SCOPUS:70349608366
SN - 0001-5652
VL - 69
SP - 1
EP - 13
JO - Human heredity
JF - Human heredity
IS - 1
ER -