TY - GEN
T1 - Reconstructing transcriptional regulatory networks by probabilistic network component analysis
AU - Gu, Jinghua
AU - Xuan, Jianhua
AU - Wang, Xiao
AU - Shajahan, Ayesha N.
AU - Hilakivi-Clarke, Leena
AU - Clarke, Robert
PY - 2013
Y1 - 2013
N2 - Despite encouraging progress made by integrating multiplatform data for regulatory network reconstruction, identification of transcriptional regulatory networks remains challenging due to imperfection in current biotechnology and complexity of biological systems. It is important to develop new computational approaches for reliable regulatory network reconstruction, especially those of robustness against noise in gene expression data and 'structural error' (i.e., false connections) in binding data. We propose a new method, namely probabilistic network component analysis (pNCA), to estimate the posterior binding matrix given observed gene expression and binding data. The elements in the binding matrix, instead of taking deterministic binary values, are modeled as unknown Bernoulli random variables that represent the probability of regulation. A novel two-stage Gibbs sampling framework is employed to iteratively estimate both hidden transcription factor activities and the posterior distribution of binding matrix. Numerical simulation on synthetic data has demonstrated improved performance of the proposed method over several existing methods for regulatory network identification. Notably, the robustness of pNCA against 'structural error' in initial binding data is fortified with high tolerance of false negative connections in addition to that of false positive connections. The proposed method has been applied to breast cancer cell line data to reconstruct biologically meaningful regulatory networks, revealing condition-specific regulatory rewiring and important cooperative regulation associated with estrogen signaling and action in breast cancer cells.
AB - Despite encouraging progress made by integrating multiplatform data for regulatory network reconstruction, identification of transcriptional regulatory networks remains challenging due to imperfection in current biotechnology and complexity of biological systems. It is important to develop new computational approaches for reliable regulatory network reconstruction, especially those of robustness against noise in gene expression data and 'structural error' (i.e., false connections) in binding data. We propose a new method, namely probabilistic network component analysis (pNCA), to estimate the posterior binding matrix given observed gene expression and binding data. The elements in the binding matrix, instead of taking deterministic binary values, are modeled as unknown Bernoulli random variables that represent the probability of regulation. A novel two-stage Gibbs sampling framework is employed to iteratively estimate both hidden transcription factor activities and the posterior distribution of binding matrix. Numerical simulation on synthetic data has demonstrated improved performance of the proposed method over several existing methods for regulatory network identification. Notably, the robustness of pNCA against 'structural error' in initial binding data is fortified with high tolerance of false negative connections in addition to that of false positive connections. The proposed method has been applied to breast cancer cell line data to reconstruct biologically meaningful regulatory networks, revealing condition-specific regulatory rewiring and important cooperative regulation associated with estrogen signaling and action in breast cancer cells.
KW - Constrained optimization
KW - Gene regulatory networks
KW - Gibbs sampling
KW - Multivariate linear regression
KW - Network component analysis
UR - http://www.scopus.com/inward/record.url?scp=84888194934&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84888194934&partnerID=8YFLogxK
U2 - 10.1145/2506583.2506599
DO - 10.1145/2506583.2506599
M3 - Conference contribution
AN - SCOPUS:84888194934
SN - 9781450324342
T3 - 2013 ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics, ACM-BCB 2013
SP - 96
EP - 105
BT - 2013 ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics, ACM-BCB 2013
T2 - 2013 4th ACM Conference on Bioinformatics, Computational Biology and Biomedical Informatics, ACM-BCB 2013
Y2 - 22 September 2013 through 25 September 2013
ER -