Abstract
In gene selection for cancer classification using microarray data, we define an eigenvalue-ratio statistic to measure a gene's contribution to the joint discriminability when this gene is included into a set of genes. Based on this eigenvalue-ratio statistic, we define a novel hypothesis testing for gene statistical redundancy and propose two gene selection methods. Simulation studies illustrate the agreement between statistical redundancy testing and gene selection methods. Real data examples show the proposed gene selection methods can select a compact gene subset which can not only be used to build high quality cancer classifiers but also show biological relevance.
Original language | English (US) |
---|---|
Pages (from-to) | 29-41 |
Number of pages | 13 |
Journal | Cancer Informatics |
Volume | 3 |
DOIs | |
State | Published - 2007 |
Externally published | Yes |
Bibliographical note
Funding Information:J. Sunil Rao was partially supported by NIH grant K25-CA89868.
Keywords
- Cancer classification
- Gene selection
- Microarray
- Statistical redundancy