TY - GEN
T1 - Modeling regulatory sites with higher order position-dependent weight matrices
AU - Zare, Hossein
AU - Kaveh, Mostafa
AU - Khodursky, Arkady B.
PY - 2008
Y1 - 2008
N2 - Identification of regulatory signals in DNA depends on the nature and quality of the patterns of representative sequences. These patterns are constructed from training sets of sequences by means of probabilistic models that either assume independence between positions or that suffer from considerable computational complexity. We have developed and tested higher order mod-els that account for significant dependent position pairs or triads, thereby capturing position-dependent information hidden in DNA binding sites. We have evaluated our algorithm on several data sets, including eukaryotic and bacterial transcription factor binding sites and shown that the scores from the higher order representation of binding sites have significant positive correlation to the binding affinity scores.
AB - Identification of regulatory signals in DNA depends on the nature and quality of the patterns of representative sequences. These patterns are constructed from training sets of sequences by means of probabilistic models that either assume independence between positions or that suffer from considerable computational complexity. We have developed and tested higher order mod-els that account for significant dependent position pairs or triads, thereby capturing position-dependent information hidden in DNA binding sites. We have evaluated our algorithm on several data sets, including eukaryotic and bacterial transcription factor binding sites and shown that the scores from the higher order representation of binding sites have significant positive correlation to the binding affinity scores.
KW - DNA binding sites
KW - Position weight matrix
KW - Regulatory signal
KW - Transcription factor
UR - http://www.scopus.com/inward/record.url?scp=51449111657&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51449111657&partnerID=8YFLogxK
U2 - 10.1109/ICASSP.2008.4517688
DO - 10.1109/ICASSP.2008.4517688
M3 - Conference contribution
AN - SCOPUS:51449111657
SN - 1424414849
SN - 9781424414840
T3 - ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
SP - 629
EP - 632
BT - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
T2 - 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP
Y2 - 31 March 2008 through 4 April 2008
ER -