TY - JOUR
T1 - Binding profiles for 961 Drosophila and C. elegans transcription factors reveal tissue-specific regulatory relationships
AU - Kudron, Michelle
AU - Gevirtzman, Louis
AU - Victorsen, Alec
AU - Lear, Bridget C.
AU - Gao, Jiahao
AU - Xu, Jinrui
AU - Samanta, Swapna
AU - Frink, Emily
AU - Tran-Pearson, Adri
AU - Huynh, Chau
AU - Vafeados, Dionne
AU - Hammonds, Ann
AU - Fisher, William
AU - Wall, Martha
AU - Wesseling, Greg
AU - Hernandez, Vanessa
AU - Lin, Zhichun
AU - Kasparian, Mary
AU - White, Kevin
AU - Allada, Ravi
AU - Gerstein, Mark
AU - Hillier, La Deana
AU - Celniker, Susan E.
AU - Reinke, Valerie
AU - Waterston, Robert H.
N1 - Publisher Copyright:
© 2024 Kudron et al.
PY - 2024/12
Y1 - 2024/12
N2 - A catalog of transcription factor (TF) binding sites in the genome is critical for deciphering regulatory relationships. Here, we present the culmination of the efforts of the modENCODE (model organism Encyclopedia of DNA Elements) and modERN (model organism Encyclopedia of Regulatory Networks) consortia to systematically assay TF binding events in vivo in two major model organisms, Drosophila melanogaster (fly) and Caenorhabditis elegans (worm). These data sets comprise 605 TFs identifying 3.6 M sites in the fly and 356 TFs identifying 0.9 M sites in the worm, and represent the majority of the regulatory space in each genome. We demonstrate that TFs associate with chromatin in clusters termed “metapeaks,” that larger metapeaks have characteristics of high-occupancy target (HOT) regions, and that the importance of consensus sequence motifs bound by TFs depends on metapeak size and complexity. Combining ChIP-seq data with single-cell RNA-seq data in a machine-learning model identifies TFs with a prominent role in promoting target gene expression in specific cell types, even differentiating between parent–daughter cells during embryogenesis. These data are a rich resource for the community that should fuel and guide future investigations into TF function. To facilitate data accessibility and utility, all strains expressing green fluorescent protein (GFP)-tagged TFs are available at the stock centers for each organism. The chromatin immunoprecipitation sequencing data are available through the ENCODE Data Coordinating Center, GEO, and through a direct interface that provides rapid access to processed data sets and summary analyses, as well as widgets to probe the cell-type-specific TF–target relationships.
AB - A catalog of transcription factor (TF) binding sites in the genome is critical for deciphering regulatory relationships. Here, we present the culmination of the efforts of the modENCODE (model organism Encyclopedia of DNA Elements) and modERN (model organism Encyclopedia of Regulatory Networks) consortia to systematically assay TF binding events in vivo in two major model organisms, Drosophila melanogaster (fly) and Caenorhabditis elegans (worm). These data sets comprise 605 TFs identifying 3.6 M sites in the fly and 356 TFs identifying 0.9 M sites in the worm, and represent the majority of the regulatory space in each genome. We demonstrate that TFs associate with chromatin in clusters termed “metapeaks,” that larger metapeaks have characteristics of high-occupancy target (HOT) regions, and that the importance of consensus sequence motifs bound by TFs depends on metapeak size and complexity. Combining ChIP-seq data with single-cell RNA-seq data in a machine-learning model identifies TFs with a prominent role in promoting target gene expression in specific cell types, even differentiating between parent–daughter cells during embryogenesis. These data are a rich resource for the community that should fuel and guide future investigations into TF function. To facilitate data accessibility and utility, all strains expressing green fluorescent protein (GFP)-tagged TFs are available at the stock centers for each organism. The chromatin immunoprecipitation sequencing data are available through the ENCODE Data Coordinating Center, GEO, and through a direct interface that provides rapid access to processed data sets and summary analyses, as well as widgets to probe the cell-type-specific TF–target relationships.
UR - https://www.scopus.com/pages/publications/85213206504
UR - https://www.scopus.com/pages/publications/85213206504#tab=citedBy
U2 - 10.1101/gr.279037.124
DO - 10.1101/gr.279037.124
M3 - Article
C2 - 39438113
AN - SCOPUS:85213206504
SN - 1088-9051
VL - 34
SP - 2319
EP - 2334
JO - Genome Research
JF - Genome Research
IS - 12
ER -