TY - JOUR
T1 - Penalized mixtures of factor analyzers with application to clustering high-dimensional microarray data.
AU - Xie, Benhuai
AU - Pan, Wei
AU - Shen, Xiaotong
PY - 2010
Y1 - 2010
N2 - Motivation: Model-based clustering has been widely used, e.g. in microarray data analysis. Since for high-dimensional data variable selection is necessary, several penalized model-based clustering methods have been proposed tørealize simultaneous variable selection and clustering. However, the existing methods all assume that the variables are independent with the use of diagonal covariance matrices. Results: To model non-independence of variables (e.g. correlated gene expressions) while alleviating the problem with the large number of unknown parameters associated with a general non-diagonal covariance matrix, we generalize the mixture of factor analyzers to that with penalization, which, among others, can effectively realize variable selection. We use simulated data and real microarray data to illustrate the utility and advantages of the proposed method over several existing ones. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
AB - Motivation: Model-based clustering has been widely used, e.g. in microarray data analysis. Since for high-dimensional data variable selection is necessary, several penalized model-based clustering methods have been proposed tørealize simultaneous variable selection and clustering. However, the existing methods all assume that the variables are independent with the use of diagonal covariance matrices. Results: To model non-independence of variables (e.g. correlated gene expressions) while alleviating the problem with the large number of unknown parameters associated with a general non-diagonal covariance matrix, we generalize the mixture of factor analyzers to that with penalization, which, among others, can effectively realize variable selection. We use simulated data and real microarray data to illustrate the utility and advantages of the proposed method over several existing ones. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics online.
UR - http://www.scopus.com/inward/record.url?scp=77952308097&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77952308097&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btp707
DO - 10.1093/bioinformatics/btp707
M3 - Article
C2 - 20031967
AN - SCOPUS:77949520571
SN - 1367-4811
SN - 1460-2059
VL - 26
SP - 501
EP - 508
JO - Bioinformatics (Oxford, England)
JF - Bioinformatics (Oxford, England)
IS - 4
ER -