TY - JOUR
T1 - Differential gene expression detection using penalized linear regression models
T2 - The improved SAM statistics
AU - Wu, Baolin
N1 - Funding Information:
This research was supported by a startup fund from the Division of Biostatistics, University of Minnesota. The author would like to
PY - 2005/4/15
Y1 - 2005/4/15
N2 - Summary: Differential gene expression detection using microarrays has received lots of research interests recently. Many methods have been proposed, including variants of F-statistics, non-parametric approaches and empirical Bayesian methods etc. The SAM statistics has been shown to have good performance in empirical studies. SAM is more like an ad hoc shrinkage method. The idea is that for small sample microarray data, it is often useful to pool information across genes to improve efficiency. Under Bayesian framework Smyth formally derived the test statistics with shrinkage using the hierarchical models. In this paper we cast differential gene expression detection in the familiar framework of linear regression model. Commonly used test statistics correspond to using least squares to estimate the regression parameters. Based on the vast literature of research on linear models, we can naturally consider other alternatives. Here we explore the penalized linear regression. We propose the penalized t-/ F-statistics for two-class microarray data based on L1 penalty. We will show that the penalized test statistics intuitively makes sense and through applications we illustrate its good performance.
AB - Summary: Differential gene expression detection using microarrays has received lots of research interests recently. Many methods have been proposed, including variants of F-statistics, non-parametric approaches and empirical Bayesian methods etc. The SAM statistics has been shown to have good performance in empirical studies. SAM is more like an ad hoc shrinkage method. The idea is that for small sample microarray data, it is often useful to pool information across genes to improve efficiency. Under Bayesian framework Smyth formally derived the test statistics with shrinkage using the hierarchical models. In this paper we cast differential gene expression detection in the familiar framework of linear regression model. Commonly used test statistics correspond to using least squares to estimate the regression parameters. Based on the vast literature of research on linear models, we can naturally consider other alternatives. Here we explore the penalized linear regression. We propose the penalized t-/ F-statistics for two-class microarray data based on L1 penalty. We will show that the penalized test statistics intuitively makes sense and through applications we illustrate its good performance.
UR - http://www.scopus.com/inward/record.url?scp=17444374677&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=17444374677&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/bti217
DO - 10.1093/bioinformatics/bti217
M3 - Article
C2 - 15598833
AN - SCOPUS:17444374677
VL - 21
SP - 1565
EP - 1571
JO - Bioinformatics
JF - Bioinformatics
SN - 1367-4803
IS - 8
ER -