Statistical redundancy testing for improved gene selection in cancer classification using microarray data

Simin Hu, J. Sunil Rao

Research output: Contribution to journalArticlepeer-review

10 Scopus citations

Abstract

In gene selection for cancer classification using microarray data, we define an eigenvalue-ratio statistic to measure a gene's contribution to the joint discriminability when this gene is included into a set of genes. Based on this eigenvalue-ratio statistic, we define a novel hypothesis testing for gene statistical redundancy and propose two gene selection methods. Simulation studies illustrate the agreement between statistical redundancy testing and gene selection methods. Real data examples show the proposed gene selection methods can select a compact gene subset which can not only be used to build high quality cancer classifiers but also show biological relevance.

Original languageEnglish (US)
Pages (from-to)29-41
Number of pages13
JournalCancer Informatics
Volume3
DOIs
StatePublished - 2007
Externally publishedYes

Bibliographical note

Funding Information:
J. Sunil Rao was partially supported by NIH grant K25-CA89868.

Keywords

  • Cancer classification
  • Gene selection
  • Microarray
  • Statistical redundancy

Fingerprint

Dive into the research topics of 'Statistical redundancy testing for improved gene selection in cancer classification using microarray data'. Together they form a unique fingerprint.

Cite this