TY - JOUR
T1 - Impact of small group size on neighbourhood influences in multilevel models
AU - Theall, Katherine P.
AU - Scribner, Richard
AU - Broyles, Stephanie
AU - Yu, Qingzhao
AU - Chotalia, Jigar
AU - Simonsen, Neal
AU - Schonlau, Matthias
AU - Carlin, Bradley P.
PY - 2011/8
Y1 - 2011/8
N2 - Background Given the growing availability of multilevel data from national surveys, researchers interested in contextual effects may find themselves with a small number of individuals per group. Although there is a growing body of literature on sample size in multilevel modelling, few have explored the impact of group sizes of less than five. Methods In a simulated analysis of real data, the impact of a group size of less than five was examined on both a continuous and dichotomous outcome in a simple twolevel multilevel model. Models with group sizes one to five were compared with models with complete data. Four different linear and logistic models were examined: empty models; models with a group-level covariate; models with an individual-level covariate and models with an aggregated group-level covariate. The study evaluated further whether the impact of small group size differed depending on the total number of groups. Results When the number of groups was large (N=459), neither fixed nor random components were affected by small group size, even when 90% of tracts had only one individual per tract and even when an aggregated group-level covariate was examined. As the number of groups decreased, the SE estimates of both fixed and random effects were inflated. Furthermore, group-level variance estimates were more affected than were fixed components. Conclusions Datasets in which there is a small to moderate number of groups, with the majority of very small group size (n<5), size may fail to find or even consider a group-level effect when one may exist and also may be underpowered to detect fixed effects.
AB - Background Given the growing availability of multilevel data from national surveys, researchers interested in contextual effects may find themselves with a small number of individuals per group. Although there is a growing body of literature on sample size in multilevel modelling, few have explored the impact of group sizes of less than five. Methods In a simulated analysis of real data, the impact of a group size of less than five was examined on both a continuous and dichotomous outcome in a simple twolevel multilevel model. Models with group sizes one to five were compared with models with complete data. Four different linear and logistic models were examined: empty models; models with a group-level covariate; models with an individual-level covariate and models with an aggregated group-level covariate. The study evaluated further whether the impact of small group size differed depending on the total number of groups. Results When the number of groups was large (N=459), neither fixed nor random components were affected by small group size, even when 90% of tracts had only one individual per tract and even when an aggregated group-level covariate was examined. As the number of groups decreased, the SE estimates of both fixed and random effects were inflated. Furthermore, group-level variance estimates were more affected than were fixed components. Conclusions Datasets in which there is a small to moderate number of groups, with the majority of very small group size (n<5), size may fail to find or even consider a group-level effect when one may exist and also may be underpowered to detect fixed effects.
UR - http://www.scopus.com/inward/record.url?scp=79960269705&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79960269705&partnerID=8YFLogxK
U2 - 10.1136/jech.2009.097956
DO - 10.1136/jech.2009.097956
M3 - Article
C2 - 20508007
AN - SCOPUS:79960269705
SN - 0143-005X
VL - 65
SP - 688
EP - 695
JO - Journal of Epidemiology and Community Health
JF - Journal of Epidemiology and Community Health
IS - 8
ER -