Architecture of the human regulatory network derived from ENCODE data
Nature. 2012 Sep 6;489(7414):91-100.
doi: 10.1038/nature11245.
Mark B Gerstein, Manoj Hariharan, Stephen G Landt, Koon-Kiu Yan, Chao Cheng, Xinmeng Jasmine Mu, Ekta Khurana, Joel Rozowsky, Roger Alexander, Renqiang Min, Pedro Alves, Alexej Abyzov, Nick Addleman, Nitin Bhardwaj, Alan P Boyle, Philip Cayting, Alexandra Charos, David Z Chen, Yong Cheng, Declan Clarke, Catharine Eastman, Ghia Euskirchen, Seth Frietze, Yao Fu, Jason Gertz, Fabian Grubert, Arif Harmanci, Preti Jain, Maya Kasowski, Phil Lacroute, Jing Jane Leng, Jin Lian, Hannah Monahan, Henriette O'Geen, Zhengqing Ouyang, E Christopher Partridge, Dorrelyn Patacsil, Florencia Pauli, Debasish Raha, Lucia Ramirez, Timothy E Reddy, Brian Reed, Minyi Shi, Teri Slifer, Jing Wang, Linfeng Wu, Xinqiong Yang, Kevin Y Yip, Gili Zilberman-Schapira, Serafim Batzoglou, Arend Sidow, Peggy J Farnham, Richard M Myers, Sherman M Weissman, Michael Snyder
- PMID: 22955619
- PMCID: PMC4154057
- DOI: 10.1038/nature11245
Mark B Gerstein et al. Nature. 2012.
Abstract
Transcription factors bind in a combinatorial fashion to specify the on-and-off states of genes; the ensemble of these binding events forms a regulatory network, constituting the wiring diagram for a cell. To examine the principles of the human transcriptional regulatory network, we determined the genomic binding information of 119 transcription-related factors in over 450 distinct experiments. We found the combinatorial, co-association of transcription factors to be highly context specific: distinct combinations of factors bind at specific genomic locations. In particular, there are significant differences in the binding proximal and distal to genes. We organized all the transcription factor binding into a hierarchy and integrated it with other genomic information (for example, microRNA regulation), forming a dense meta-network. Factors at different levels have different properties; for instance, top-level transcription factors more strongly influence expression and middle-level ones co-regulate targets to mitigate information-flow bottlenecks. Moreover, these co-regulations give rise to many enriched network motifs (for example, noise-buffering feed-forward loops). Finally, more connected network components are under stronger selection and exhibit a greater degree of allele-specific activity (that is, differential binding to the two parental alleles). The regulatory information obtained in this study will be crucial for interpreting personal genome sequences and understanding basic principles of human biology and disease.
Figures

(a) The co-binding map for the GATA1 focus-factor context in K562 shows the binding intensity of peaks of all TFs in K562 (rows) that overlap each GATA1 peak (columns). The colored rectangles represent 8 key clusters consisting of different combinations of co-associating partner-factors. (b) The GATA1 context-specific relative importance scores (RI) of all partner-factors (top) and the matrix of co-association scores (CS) between all pairs of TFs (bottom). Primary and local partners of GATA have high RI scores. The co-association score matrix captures the 8 clusters observed in (a). (c) Different partner-factors are preferentially enriched at gene-distal (positive differential RI) and proximal (negative differential RI) GATA1 peaks. (d) The aggregate factor importance matrix, obtained by stacking the RI of all partner-factors (columns) from all focus-factor contexts (rows) in K562, shows 9 functionally distinct clusters (C1 to C9) of contexts that can be broadly grouped as distal, proximal, mixed, and repressive. The blue rectangles highlight representative partner-factors with high RI in the clusters. The arrow from (b) to (d) indicates that the GATA1 context-specific RI scores form one row in this matrix. (e) Co-association variability map of partners (columns) of GATA1 (left panel) and FOS (right panel) over all K562 focus-factor contexts (rows). TAL1 and GATA2 show consistently high CS with GATA1 over most focus-factor contexts, but JUND shows context-specific co-association. FOS shows dramatic changes in CS of partner-factors over different contexts (e.g. FOS-JUND in distal contexts and FOS-SP2 in proximal ones). (More details in Fig. S2c, S2f-1, S2d, S2l-2.)
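The co-binding map in panel (a) can be pictured as a matrix built from peak overlaps. The following is only a minimal sketch under stated assumptions: the `(start, end, signal)` peak format, the function names, and the choice of the strongest overlapping signal per cell are all hypothetical, and the toy numbers are not data from the study. It does not reproduce the RI or CS scores described in the caption.

```python
from typing import Dict, List, Tuple

# Hypothetical peak format: (start, end, signal) on a single chromosome.
Peak = Tuple[int, int, float]


def overlaps(a: Peak, b: Peak) -> bool:
    """Return True if two half-open intervals [start, end) share a base."""
    return a[0] < b[1] and b[0] < a[1]


def co_binding_matrix(
    focus_peaks: List[Peak],
    partner_peaks: Dict[str, List[Peak]],
) -> Dict[str, List[float]]:
    """Build a partner-by-focus-peak matrix.

    Rows are partner factors, columns are focus-factor peaks. Each cell
    holds the strongest partner signal overlapping that focus peak, or
    0.0 when no partner peak overlaps (an assumed convention).
    """
    matrix: Dict[str, List[float]] = {}
    for name, peaks in partner_peaks.items():
        row = []
        for focus in focus_peaks:
            hits = [p[2] for p in peaks if overlaps(focus, p)]
            row.append(max(hits) if hits else 0.0)
        matrix[name] = row
    return matrix


# Toy input with made-up coordinates and signals.
focus = [(100, 200, 5.0), (500, 600, 3.0)]
partners = {
    "partner_1": [(150, 250, 4.0), (900, 950, 1.0)],
    "partner_2": [(550, 580, 2.5)],
}
cobinding = co_binding_matrix(focus, partners)
```

Clustering the columns of such a matrix is one plausible way to arrive at groups of focus peaks that share partner combinations, though the clustering step itself is not shown here.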

(a) Close-up of the TF hierarchy. The nodes depict the TFs: TFSSs are triangles, and non-TFSSs are circles. At the left we show the proximal-edge hierarchy with downward-pointing edges colored in green, and upward-pointing ones colored in red. The nodes are shaded according to their out-degree in the full network (as described in Table 1). The right part shows the TFs placed in the same proximal hierarchy but now with edges corresponding to distal regulation colored green and red, and nodes recolored according to out-degree in the distal network. We see that the distal edges do not follow the proximal-edge hierarchy. (b) Close-up of TF-miRNA regulation. The outer circle contains the 119 TFs, while the inner circle contains miRNAs. Red edges correspond to miRNAs regulating TFs; green ones, TFs regulating miRNAs. TFs and miRNAs each are arranged by their out-degree, beginning at 12 o'clock and decreasing in order clockwise. Node sizes are proportional to out-degree. For TFs, the out-degree is as described in Table 1; for miRNAs, it is according to the out-degree in this network. Red nodes are enriched for miRNA-TF edges and green nodes are enriched for TF-miRNA edges. Gray nodes have a balanced number of edges (within ±1). (c) Average values of various properties (topological, dynamic, expression-related, and selection-related, ordered consistently with Table 1) for each level are shown for the proximal-edge hierarchy. The top, middle, and bottom rows correspond to the top, middle, and bottom of the hierarchy, respectively. The sizing of the grey circles indicates the relative ordering of the values for the three levels. Significantly different values (P<0.05) using the Wilcoxon rank-sum test are indicated by black brackets. The proximal-edge hierarchy depicted on the right shows non-synonymous SNP density, where the shading corresponds to the density for the associated TF. (More details in Fig. S4.)
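The node coloring in panel (b) can be illustrated with a simple count comparison. This is a sketch under assumptions: the edge-tuple format, the function name, and the node labels are hypothetical, and "enriched" is approximated here by a raw difference in edge counts, with the ±1 tolerance taken from the caption. The study may have used a different criterion.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

# Hypothetical edge format: (source, target, kind), where kind is
# "miRNA-TF" (a miRNA regulating a TF) or "TF-miRNA" (a TF regulating a miRNA).
Edge = Tuple[str, str, str]


def color_nodes(edges: Iterable[Edge], tolerance: int = 1) -> Dict[str, str]:
    """Color each node by which edge type dominates among its incident edges.

    A node is "gray" when the two counts differ by at most `tolerance`,
    "red" when miRNA-TF edges dominate, and "green" when TF-miRNA edges do.
    """
    counts: Dict[str, Counter] = {}
    for source, target, kind in edges:
        for node in (source, target):
            counts.setdefault(node, Counter())[kind] += 1

    colors: Dict[str, str] = {}
    for node, c in counts.items():
        diff = c["miRNA-TF"] - c["TF-miRNA"]
        if abs(diff) <= tolerance:
            colors[node] = "gray"
        elif diff > 0:
            colors[node] = "red"
        else:
            colors[node] = "green"
    return colors


# Toy network with placeholder names.
toy_edges = [
    ("miR_a", "TF_1", "miRNA-TF"),
    ("miR_b", "TF_1", "miRNA-TF"),
    ("miR_c", "TF_1", "miRNA-TF"),
    ("TF_1", "miR_a", "TF-miRNA"),
    ("TF_2", "miR_a", "TF-miRNA"),
    ("TF_2", "miR_b", "TF-miRNA"),
    ("TF_2", "miR_c", "TF-miRNA"),
]
node_colors = color_nodes(toy_edges)
```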

(a) Enrichment of collaborating TF pairs from different levels (T,M,B). The TFs are represented by two nodes below each bar graph. The dashed orange line indicates the expected level of collaboration. Significant enrichment above or depletion below that level are marked by asterisks (P<0.05). (More details in SOM/G.1,2.) (b) Enrichment of proximal and distal co-regulatory pairs in the network hierarchy. Co-regulatory pairs from different levels are shown by the two nodes below each bar.

Motifs are accompanied by the occurrence frequency, N. Enriched motifs are highlighted in green, and depleted ones, in red. An occurrence frequency with a star means that the corresponding enrichment/depletion is statistically significant (P=1e-5). The motifs are sorted such that those at the ends have more significant p-values. (More details in Fig. S9h.) (a) Systematic search of 3-TF motifs. The most enriched motif is the FFL. A particular example formed by STAT1, STAT3 and RUNX1 is highlighted. Here, the “+” sign on an edge indicates that the correlation between the gene expression of the source and the target across tissues is positive. Other motifs containing a toggle-switch regulation on top of the basic FFL design are also indicated. (b) Proximal-Distal-PPI MIMs. Here we searched all motifs involving the co-regulation of two TFs (which could be either proximal or distal) with (or without) a protein-protein interaction between them. We found the motifs containing the protein-protein interaction tended to be enriched. (c) miRNA-SIMs. This figure shows the 2 enriched motifs resulting from enumerating all motifs in which a miRNA targets two TFs that are connected in various ways. These 2 motifs contain a protein complex of 2 TFs and a cooperative pair of promoter and distal regulatory TFs. (d) The auto-regulator motif is enriched in the TF-TF network: 28 of all TFs are auto-regulators. Moreover, auto-regulators are more likely to be repressors (-) relative to non-auto regulators, and they tend to have more ncRNAs as their targets.
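To make the motif search in panels (a) and (d) concrete, the sketch below counts feed-forward loops (X regulates Y, Y regulates Z, and X also regulates Z) and auto-regulators (self-loops) in a small directed network. The function names and the toy edge list are hypothetical. The brute-force enumeration is a stand-in chosen for readability, not the search procedure used in the study, and it counts every occurrence of the three edges even when extra edges exist among the same nodes. Judging enrichment or depletion would additionally require comparing the counts against suitably randomized networks, which is not shown.

```python
from itertools import permutations
from typing import Iterable, List, Set, Tuple

Edge = Tuple[str, str]  # (regulator, target)


def find_ffls(edges: Iterable[Edge]) -> List[Tuple[str, str, str]]:
    """Return all (X, Y, Z) triples with edges X->Y, Y->Z and X->Z."""
    edge_set: Set[Edge] = set(edges)
    nodes = sorted({node for edge in edge_set for node in edge})
    return [
        (x, y, z)
        for x, y, z in permutations(nodes, 3)
        if (x, y) in edge_set and (y, z) in edge_set and (x, z) in edge_set
    ]


def find_auto_regulators(edges: Iterable[Edge]) -> List[str]:
    """Return the sorted nodes that regulate themselves (self-loops)."""
    return sorted({source for source, target in edges if source == target})


# Toy network with placeholder names.
toy_network = [
    ("TF_A", "TF_B"),
    ("TF_B", "TF_C"),
    ("TF_A", "TF_C"),
    ("TF_C", "TF_C"),
    ("TF_C", "TF_D"),
]
ffls = find_ffls(toy_network)
auto_regulators = find_auto_regulators(toy_network)
```

Enumerating all ordered triples scales cubically with the number of nodes, which is acceptable for a network of roughly a hundred factors but would need a neighbor-based search for larger graphs.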

(a) An “allelic effects network” depicting the increasing coordination between ASB and ASE as the number of TFs regulating a target increases. Central white nodes denote TFs, and peripheral nodes denote targets, which are blue (red) if they are expressed from the paternal (maternal) allele. Blue (red) edges denote ASB to the paternal (maternal) allele. This network represents the strongest differences between the paternal- and maternal-specific regulatory networks. As one goes around the larger circle counterclockwise (clockwise), each of the small circular clusters represents targets with progressively more paternal (maternal) regulation, indicated by the small blue (red) numbers to the side of the clusters. Moreover, within each of the clusters the fraction of predominantly paternally (maternally) expressed targets increases as one goes around the larger circle. As an illustration, this fraction is explicitly indicated by the ratios within three of the larger clusters at bottom right. (b) Relationship between TF allelicity and selection. The bar height is the ratio of the degree of selection (as measured by SNP density or average DAF) in those TF-binding peaks showing allelic behavior to the degree of selection in all other TF-binding peaks. Asterisks represent significant differences (P<0.05, Wilcoxon-rank-sum test). (More details in SOM/I.2 and Fig S10b,c.)
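The bar height in panel (b) is described as a ratio between two groups of peaks. A minimal sketch of that ratio follows, assuming per-peak selection measurements (for example SNP density) and a flag marking peaks with allelic behavior. The function name, the use of the arithmetic mean to summarize each group, and the toy values are assumptions, not details taken from the study.

```python
from statistics import mean
from typing import Sequence


def selection_ratio(values: Sequence[float], is_allelic: Sequence[bool]) -> float:
    """Ratio of mean selection measure in allelic peaks to that in all other peaks."""
    if len(values) != len(is_allelic):
        raise ValueError("values and is_allelic must have the same length")
    allelic = [v for v, flag in zip(values, is_allelic) if flag]
    others = [v for v, flag in zip(values, is_allelic) if not flag]
    if not allelic or not others:
        raise ValueError("both groups need at least one peak")
    return mean(allelic) / mean(others)


# Toy per-peak measurements (made-up numbers).
snp_density = [2.0, 4.0, 1.0, 1.0]
allelic_flags = [True, True, False, False]
ratio = selection_ratio(snp_density, allelic_flags)
```

A ratio above 1 in this sketch means the allelic peaks carry a higher value of the chosen measure than the remaining peaks. Testing whether the difference is significant would call for a two-sample test such as the Wilcoxon rank-sum test named in the caption.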