TY - JOUR
T1 - Improved detection of differentially expressed genes through incorporation of gene locations
AU - Xiao, Guanghua
AU - Reilly, Cavan
AU - Khodursky, Arkady B.
N1 - Copyright:
Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2009/9
Y1 - 2009/9
N2 - In determining differential expression in cDNA microarray experiments, the expression level of an individual gene is usually assumed to be independent of the expression levels of other genes, but many recent studies have shown that a gene's expression level tends to be similar to that of its neighbors on a chromosome, and differentially expressed (DE) genes are likely to form clusters of similar transcriptional activity along the chromosome. When modeled as a one-dimensional spatial series, the expression level of genes on the same chromosome frequently exhibit significant spatial correlation, reflecting spatial patterns in transcription. By modeling these spatial correlations, we can obtain improved estimates of transcript levels. Here, we demonstrate the existence of spatial correlations in transcriptional activity in the Escherichia coli (E. coli) chromosome across more than 50 experimental conditions. Based on this finding, we propose a hierarchical Bayesian model that borrows information from neighboring genes to improve the estimation of the expression level of a given gene and hence the detection of DE genes. Furthermore, we extend the model to account for the circular structure of E. coli chromosome and the intergenetic distance between gene neighbors. The simulation studies and analysis of real data examples in E. coli and yeast Saccharomyces cerevisiae show that the proposed method outperforms the commonly used significant analysis of microarray (SAM) t-statistic in detecting DE genes.
AB - In determining differential expression in cDNA microarray experiments, the expression level of an individual gene is usually assumed to be independent of the expression levels of other genes, but many recent studies have shown that a gene's expression level tends to be similar to that of its neighbors on a chromosome, and differentially expressed (DE) genes are likely to form clusters of similar transcriptional activity along the chromosome. When modeled as a one-dimensional spatial series, the expression level of genes on the same chromosome frequently exhibit significant spatial correlation, reflecting spatial patterns in transcription. By modeling these spatial correlations, we can obtain improved estimates of transcript levels. Here, we demonstrate the existence of spatial correlations in transcriptional activity in the Escherichia coli (E. coli) chromosome across more than 50 experimental conditions. Based on this finding, we propose a hierarchical Bayesian model that borrows information from neighboring genes to improve the estimation of the expression level of a given gene and hence the detection of DE genes. Furthermore, we extend the model to account for the circular structure of E. coli chromosome and the intergenetic distance between gene neighbors. The simulation studies and analysis of real data examples in E. coli and yeast Saccharomyces cerevisiae show that the proposed method outperforms the commonly used significant analysis of microarray (SAM) t-statistic in detecting DE genes.
KW - Autocorrelation function
KW - Gene expression
KW - Spatial smoothing
UR - http://www.scopus.com/inward/record.url?scp=70349254460&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349254460&partnerID=8YFLogxK
U2 - 10.1111/j.1541-0420.2008.01161.x
DO - 10.1111/j.1541-0420.2008.01161.x
M3 - Article
C2 - 19173705
AN - SCOPUS:70349254460
SN - 0006-341X
VL - 65
SP - 805
EP - 814
JO - Biometrics
JF - Biometrics
IS - 3
ER -