TY - JOUR
T1 - A similarity-based data-fusion approach to the visual characterization and comparison of compound databases
AU - Medina-Franco, José L.
AU - Maggiora, Gerald M.
AU - Giulianotti, Marc A.
AU - Pinilla, Clemencia
AU - Houghten, Richard A.
PY - 2007/11
Y1 - 2007/11
N2 - A low-dimensional method, based on the use of multiple fusion-based similarity measures, is described for graphically depicting and characterizing relationships among molecules in compound databases. The measures are used to construct multi-fusion similarity maps that characterize the relationship of a set of 'test' molecules to a set of 'reference' molecules. The reference set is very general and can be made of molecules from, for example, the set of test molecules itself (the self-referencing case), from a small library or large compound collection, or from actives in a given assay or group of assays. The test set is any collection of compounds to be analyzed with respect to the specified reference set. Multiple fusion similarity measures tend to provide more information than single fusion-based measures, including information on the nature of the chemical-space neighborhoods surrounding reference-set molecules. A general discussion is presented on how to interpret multi-fusion similarity maps, and several examples are given that illustrate how these maps can be used to compare compound libraries or collections, to select compounds for screening or acquisition, and to identify new active molecules using ligand-based virtual screening.
AB - A low-dimensional method, based on the use of multiple fusion-based similarity measures, is described for graphically depicting and characterizing relationships among molecules in compound databases. The measures are used to construct multi-fusion similarity maps that characterize the relationship of a set of 'test' molecules to a set of 'reference' molecules. The reference set is very general and can be made of molecules from, for example, the set of test molecules itself (the self-referencing case), from a small library or large compound collection, or from actives in a given assay or group of assays. The test set is any collection of compounds to be analyzed with respect to the specified reference set. Multiple fusion similarity measures tend to provide more information than single fusion-based measures, including information on the nature of the chemical-space neighborhoods surrounding reference-set molecules. A general discussion is presented on how to interpret multi-fusion similarity maps, and several examples are given that illustrate how these maps can be used to compare compound libraries or collections, to select compounds for screening or acquisition, and to identify new active molecules using ligand-based virtual screening.
KW - Combinatorial libraries
KW - Compound acquisition
KW - Compound selection
KW - Data visualization
KW - Diversity analysis
KW - Fusion-based similarity
KW - Ligand-based virtual screening
KW - Multi-fusion similarity maps
UR - http://www.scopus.com/inward/record.url?scp=35348970306&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=35348970306&partnerID=8YFLogxK
U2 - 10.1111/j.1747-0285.2007.00579.x
DO - 10.1111/j.1747-0285.2007.00579.x
M3 - Article
C2 - 17927720
AN - SCOPUS:35348970306
SN - 1747-0277
VL - 70
SP - 393
EP - 412
JO - Chemical Biology and Drug Design
JF - Chemical Biology and Drug Design
IS - 5
ER -