TY - JOUR
T1 - Combining nearest neighbor classifiers versus cross-validation selection
AU - Paik, Minhui
AU - Yang, Yuhong
PY - 2004
Y1 - 2004
N2 - Various discriminant methods have been applied for classification of tumors based on gene expression profiles, among which the nearest neighbor (NN) method has been reported to perform relatively well. Usually cross-validation (CV) is used to select the neighbor size as well as the number of variables for the NN method. However, CV can perform poorly when there is considerable uncertainty in choosing the best candidate classifier. As an alternative to selecting a single "winner," we propose a weighting method to combine the multiple NN rules. Four gene expression data sets are used to compare its performance with CV methods. The results show that when the CV selection is unstable, the combined classifier performs much better.
AB - Various discriminant methods have been applied for classification of tumors based on gene expression profiles, among which the nearest neighbor (NN) method has been reported to perform relatively well. Usually cross-validation (CV) is used to select the neighbor size as well as the number of variables for the NN method. However, CV can perform poorly when there is considerable uncertainty in choosing the best candidate classifier. As an alternative to selecting a single "winner," we propose a weighting method to combine the multiple NN rules. Four gene expression data sets are used to compare its performance with CV methods. The results show that when the CV selection is unstable, the combined classifier performs much better.
KW - Combining classifiers
KW - Cross-validation
KW - Nearest neighbor method
UR - http://www.scopus.com/inward/record.url?scp=14644433818&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=14644433818&partnerID=8YFLogxK
U2 - 10.2202/1544-6115.1054
DO - 10.2202/1544-6115.1054
M3 - Article
AN - SCOPUS:14644433818
SN - 1544-6115
VL - 3
JO - Statistical Applications in Genetics and Molecular Biology
JF - Statistical Applications in Genetics and Molecular Biology
IS - 1
M1 - 12
ER -