Abstract
Partitioning objects into closely related groups that have different states allows to understand the underlying structure in the data set treated. Different kinds of similarity measure with clustering algorithms are commonly used to find an optimal clustering or closely akin to original clustering. Using shrinkage-based and rank-based correlation coefficients, which are known to be robust, the recovery level of six chosen clustering algorithms is evaluated using Rand's C values. The recovery levels using weighted likelihood estimate of correlation coefficient are obtained and compared to the results from using those correlation coefficients in applying agglomerative clustering algorithms.
Original language | English (US) |
---|---|
Pages (from-to) | 715-727 |
Number of pages | 13 |
Journal | Statistical Papers |
Volume | 49 |
Issue number | 4 |
DOIs | |
State | Published - Oct 2008 |
Bibliographical note
Funding Information:This work was supported by RIC(R) grants from Traditional and Bio-Medical Research Center, Daejeon University (RRC04713, 2005) by ITEP in Republic of Korea.
Keywords
- Agglomerative clustering algorithm
- Rand's C statistic
- Weighted likelihood estimate