Three classes of arbitrary quantitative molecular similarity analysis (QMSA) methods have been computed using atom pairs (APs), topological indices (TIs), and principal components (PCs) derived from topological indices. Tailored QMSA models have been developed from TIs selected through ridge regression. K-nearest neighbor (kNN) based estimation has been applied to all of the methods to estimate normal vapor pressure (pvap) and water solubility (sol) for a set of 194 chemicals. Results show that the tailored QMSA methods are superior to arbitrary similarity methods in estimating both of these properties for the given set of chemicals.
- Atom pairs
- Quantitative molecular similarity analysis (QMSA)
- Tailored QMSA
- Topological indices