Combining nearest neighbor classifiers versus cross-validation selection

Minhui Paik, Yuhong Yang

Research output: Contribution to journalArticlepeer-review

13 Scopus citations

Abstract

Various discriminant methods have been applied for classification of tumors based on gene expression profiles, among which the nearest neighbor (NN) method has been reported to perform relatively well. Usually cross-validation (CV) is used to select the neighbor size as well as the number of variables for the NN method. However, CV can perform poorly when there is considerable uncertainty in choosing the best candidate classifier. As an alternative to selecting a single "winner," we propose a weighting method to combine the multiple NN rules. Four gene expression data sets are used to compare its performance with CV methods. The results show that when the CV selection is unstable, the combined classifier performs much better.

Original languageEnglish (US)
Article number12
JournalStatistical Applications in Genetics and Molecular Biology
Volume3
Issue number1
DOIs
StatePublished - 2004

Keywords

  • Combining classifiers
  • Cross-validation
  • Nearest neighbor method

Fingerprint

Dive into the research topics of 'Combining nearest neighbor classifiers versus cross-validation selection'. Together they form a unique fingerprint.

Cite this