TY - JOUR
T1 - Quantitative structure-activity relationship (QSAR) modeling of human blood:air partitioning with proper statistical methods and validation
AU - Basak, Subhash C.
AU - Mills, Denise
AU - Hawkins, Douglas M.
AU - Kraker, Jessica J.
PY - 2009
Y1 - 2009
AB - The blood:air partition coefficient (BApc) is important in assessing the toxicokinetics of chemicals. Because very few experimental data are available, quantitative structure-activity relationship (QSAR) models based on calculated molecular descriptors can be useful in estimating BApc. Because all descriptors used in the analysis are computed strictly from structure, they can be applied to any chemical, real or hypothetical. In this article, we report models for BApc estimation using four methods, including stepwise ordinary least-squares regression, which is commonly used in QSAR studies but often results in an inflated 'naïve' q², over-representing the predictive ability of the model. The models developed using proper statistical techniques had q² values of 0.825 and 0.841 and may be used to reliably predict BApc values for new compounds that are structurally similar to those on which the models are based. The models developed using improper techniques had associated q² values, as computed using naïve methods, of 0.920 and 0.934, severely overstating their actual quality.
UR - http://www.scopus.com/inward/record.url?scp=65649131952&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=65649131952&partnerID=8YFLogxK
U2 - 10.1002/cbdv.200800111
DO - 10.1002/cbdv.200800111
M3 - Article
C2 - 19353545
AN - SCOPUS:65649131952
SN - 1612-1872
VL - 6
SP - 487
EP - 502
JO - Chemistry and Biodiversity
JF - Chemistry and Biodiversity
IS - 4
ER -