Integrating multi-source biological data for transcriptional regulatory module discovery

Habtom W. Ressom, Yuji Zhang, Jianhua Xuan, Yue Wang, Robert Clarke

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The design principles of gene transcriptional regulation networks in cells have been puzzles due to their unknown dynamic and nonlinear mechanisms. Although high-throughput biotechnologies have generated unprecedented amounts of data, the integration of multi-source data to better understand the process of gene regulation has been a challenge in post genomics era. Gene expression data are limited in providing information about the underlying causal relationships among genes. Prior biological knowledge such as protein binding data and gene ontology annotation, albeit limited in quantity, reflects physical processes of gene regulation. In this paper, we introduce a computational framework for utilizing time course gene expression patterns, protein binding data, and gene ontology information to infer transcriptional regulatory modules. The proposed method mainly consists of three parts: (1) a fuzzy c-means clustering approach that exploits gene functional category information to define gene clusters; (2) a network motif detection tool that classifies the transcription factors into different kinds of regulatory modules based on protein binding data; and (3) a recurrent neural network model for each transcription factor that mimics the architecture of the predicted regulatory module. A hybrid of genetic algorithm and particle swarm optimization method is applied to search for gene cluster that may be regulated by the transcription factor and to determine the parameters of the recurrent neural network. The proposed method is tested on yeast cell cycle process. The inferred gene transcriptional regulatory networks are compared with previously reported results in the literature.

Original languageEnglish (US)
Title of host publication2007 IEEE/NIH Life Science Systems and Applications Workshop, LISA
PublisherIEEE Computer Society
Pages184-187
Number of pages4
ISBN (Print)9781424418138
DOIs
StatePublished - 2007
Externally publishedYes
Event2007 IEEE/NIH Life Science Systems and Applications Workshop, LISA - Bethesda, MD, United States
Duration: Nov 8 2007Nov 9 2007

Publication series

Name2007 IEEE/NIH Life Science Systems and Applications Workshop, LISA

Other

Other2007 IEEE/NIH Life Science Systems and Applications Workshop, LISA
CountryUnited States
CityBethesda, MD
Period11/8/0711/9/07

Fingerprint Dive into the research topics of 'Integrating multi-source biological data for transcriptional regulatory module discovery'. Together they form a unique fingerprint.

Cite this