TY - GEN
T1 - Biomarker identification by knowledge-driven multi-scale independent component analysis
AU - Chen, Li
AU - Xuan, Jianhua
AU - Clarke, Robert
AU - Wang, Yue
PY - 2007
Y1 - 2007
N2 - Many statistical methods have been proposed to identify biomarkers from gene expression profiles. However, from expression data alone, statistical methods often fail to identify biologically meaningful biomarkers related to a specific biological process or disease under study. In this paper, we develop a novel strategy, namely knowledge-driven multi-scale independent component analysis (ICA), to infer regulatory signals and identify biologically relevant biomarkers from microarray data. Specifically, based on partial prior knowledge and clustering results, we apply ICA to find the most knowledge relevant linear regulatory mode in each subset of genes and then extract associated biomarkers according to their weighted loading factors. We have applied our method to a yeast cell cycle microarray dataset to find cell cycle regulated biomarkers. The experimental results indicate that our knowledge-driven multi-scale ICA method outperforms both baseline ICA method and correlation method significantly.
AB - Many statistical methods have been proposed to identify biomarkers from gene expression profiles. However, from expression data alone, statistical methods often fail to identify biologically meaningful biomarkers related to a specific biological process or disease under study. In this paper, we develop a novel strategy, namely knowledge-driven multi-scale independent component analysis (ICA), to infer regulatory signals and identify biologically relevant biomarkers from microarray data. Specifically, based on partial prior knowledge and clustering results, we apply ICA to find the most knowledge relevant linear regulatory mode in each subset of genes and then extract associated biomarkers according to their weighted loading factors. We have applied our method to a yeast cell cycle microarray dataset to find cell cycle regulated biomarkers. The experimental results indicate that our knowledge-driven multi-scale ICA method outperforms both baseline ICA method and correlation method significantly.
UR - http://www.scopus.com/inward/record.url?scp=50949099313&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50949099313&partnerID=8YFLogxK
U2 - 10.1109/LSSA.2007.4400934
DO - 10.1109/LSSA.2007.4400934
M3 - Conference contribution
AN - SCOPUS:50949099313
SN - 9781424418138
T3 - 2007 IEEE/NIH Life Science Systems and Applications Workshop, LISA
SP - 261
EP - 264
BT - 2007 IEEE/NIH Life Science Systems and Applications Workshop, LISA
PB - IEEE Computer Society
T2 - 2007 IEEE/NIH Life Science Systems and Applications Workshop, LISA
Y2 - 8 November 2007 through 9 November 2007
ER -