Three regression methods, namely ridge regression (RR), partial least squares (PLS), and principal components regression (PCR), were used to develop models for the prediction of rat blood:air partition coefficient for increasingly diverse data sets. Initially, modeling was performed for a set of 13 chlorocarbons. To this set, 10 additional hydrophobic compounds were added, including aromatic and non-aromatic hydrocarbons. A set of 16 hydrophilic compounds was also modeled separately. Finally, all 39 compounds were combined into one data set for which comprehensive models were developed. A large set of diverse, theoretical molecular descriptors was calculated for use in the current study. The topostructural (TS), topochemical (TC), and geometrical or 3-dimensional (3D) indices were used hierarchically in model development. In addition, single-class models were developed using the TS, TC, and 3D descriptors. In most cases, RR outperformed PLS and PCR, and the models developed using TC indices were superior to those developed using other classes of descriptors.
- Physiologically based pharmacokinetic models
- Quantitative structure-activity relationships
- Ridge regression
- Risk assessment
- Volatile organic chemicals (VOCs)