Model-based cluster analysis of microarray gene-expression data

Wei Pan, Jizhen Lin, Chap T. Le

Research output: Contribution to journalArticlepeer-review

94 Scopus citations

Abstract

Background: Microarray technologies are emerging as a promising tool for genomic studies. The challenge now is how to analyze the resulting large amounts of data. Clustering techniques have been widely applied in analyzing microarray gene-expression data. However, normal mixture model-based cluster analysis has not been widely used for such data, although it has a solid probabilistic foundation. Here, we introduce and illustrate its use in detecting differentially expressed genes. In particular, we do not cluster gene-expression patterns but a summary statistic, the t-statistic. Results: The method is applied to a data set containing expression levels of 1,176 genes of rats with and without pneumococcal middle-ear infection. Three clusters were found, two of which contain more than 95% genes with almost no altered gene-expression levels, whereas the third one has 30 genes with more or less differential gene-expression levels. Conclusions: Our results indicate that model-based clustering of t-statistics (and possibly other summary statistics) can be a useful statistical tool to exploit differential gene expression for microarray data.

Original languageEnglish (US)
Article numberresearch0009.1
JournalGenome biology
Volume3
Issue number2
DOIs
StatePublished - 2002

Bibliographical note

Publisher Copyright:
© 2002, Pan et al., licensee BioMed Central Ltd.

Keywords

  • Akaike Information Criterion
  • Bayesian Information Criterion
  • Normal Mixture
  • Otitis Medium
  • Posterior Probability

Fingerprint

Dive into the research topics of 'Model-based cluster analysis of microarray gene-expression data'. Together they form a unique fingerprint.

Cite this