TY - JOUR

T1 - Linear regression with an independent variable subject to a detection limit

AU - Nie, Lei

AU - Chu, Haitao

AU - Liu, Chenglong

AU - Cole, Stephen R.

AU - Vexler, Albert

AU - Schisterman, Enrique F.

PY - 2010/7

Y1 - 2010/7

N2 - Background: Linear regression with a left-censored independent variable X due to limit of detection (LOD) was recently considered by 2 groups of researchers: Richardson and Ciampi (Am J Epidemiol. 2003;157:355-363), and Schisterman et al (Am J Epidemiol. 2006;163:374-383). Methods: Both groups obtained consistent estimators for the regression slopes by replacing left-censored X with a constant, that is, the expectation of X given X below LOD E(X|X<LOD) in the former group and the sample mean of X given X above LOD in the latter. Results: Schisterman et al argued that their approach would be a better choice because the sample mean of X given X above LOD is available, whereas E(X|X<LOD) is unknown. Other substitution methods, such as replacing the left-censored values with LOD, or LOD/2,have been extensively used in the literature. Simulations were conducted to compare the performance under 2 scenarios in which the independent variable is normally and not normally distributed. Conclusion: Recommendations are given based on theoretical and simulation results. These recommendations are illustrated with one case study.

AB - Background: Linear regression with a left-censored independent variable X due to limit of detection (LOD) was recently considered by 2 groups of researchers: Richardson and Ciampi (Am J Epidemiol. 2003;157:355-363), and Schisterman et al (Am J Epidemiol. 2006;163:374-383). Methods: Both groups obtained consistent estimators for the regression slopes by replacing left-censored X with a constant, that is, the expectation of X given X below LOD E(X|X<LOD) in the former group and the sample mean of X given X above LOD in the latter. Results: Schisterman et al argued that their approach would be a better choice because the sample mean of X given X above LOD is available, whereas E(X|X<LOD) is unknown. Other substitution methods, such as replacing the left-censored values with LOD, or LOD/2,have been extensively used in the literature. Simulations were conducted to compare the performance under 2 scenarios in which the independent variable is normally and not normally distributed. Conclusion: Recommendations are given based on theoretical and simulation results. These recommendations are illustrated with one case study.

UR - http://www.scopus.com/inward/record.url?scp=77953878539&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77953878539&partnerID=8YFLogxK

U2 - 10.1097/EDE.0b013e3181ce97d8

DO - 10.1097/EDE.0b013e3181ce97d8

M3 - Article

C2 - 21422965

AN - SCOPUS:77953878539

SN - 1044-3983

VL - 21

SP - S17-S24

JO - Epidemiology

JF - Epidemiology

IS - SUPPL. 4

ER -