TY - JOUR
T1 - Pattern discovery in expression profiling data
AU - Katagiri, Fumiaki
AU - Glazebrook, Jane
PY - 2009/2/13
Y1 - 2009/2/13
N2 - In expression profiling studies, it is often necessary to identify groups of genes with similar expression profiles in a variety of samples, and/or groups of samples with similar expression profiles. Each profile can be expressed as a single data point in a space with the same number of dimensions as there are parameters in the profiles. In this way, pattern discovery among expression profiles is translated into pattern discovery in the spatial distribution of data points: the similarity between profiles is defined by the distance between the corresponding data points. Various multivariate analysis methods, such as clustering and dimensionality reduction methods, are used to summarize the data point distribution to help the investigator recognize major trends. As different methods may identify different features of the distribution, it is important to analyze a particular data set with multiple methods.
AB - In expression profiling studies, it is often necessary to identify groups of genes with similar expression profiles in a variety of samples, and/or groups of samples with similar expression profiles. Each profile can be expressed as a single data point in a space with the same number of dimensions as there are parameters in the profiles. In this way, pattern discovery among expression profiles is translated into pattern discovery in the spatial distribution of data points: the similarity between profiles is defined by the distance between the corresponding data points. Various multivariate analysis methods, such as clustering and dimensionality reduction methods, are used to summarize the data point distribution to help the investigator recognize major trends. As different methods may identify different features of the distribution, it is important to analyze a particular data set with multiple methods.
KW - Dimensionality reduction
KW - Hierarchical clustering
KW - K-means
KW - Multivariate analysis
KW - Pearson correlation coefficient
KW - Principal component analysis
KW - Self-organizing map
UR - http://www.scopus.com/inward/record.url?scp=59749086689&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=59749086689&partnerID=8YFLogxK
U2 - 10.1002/0471142727.mb2205s85
DO - 10.1002/0471142727.mb2205s85
M3 - Review article
C2 - 19170028
AN - SCOPUS:59749086689
SN - 1934-3639
SP - 22.5.1-22.5.15
JO - Current Protocols in Molecular Biology
JF - Current Protocols in Molecular Biology
IS - SUPPL. 85
ER -