Genome Dashboards: Framework and Examples - PubMed
- ️Wed Jan 01 2020
Genome Dashboards: Framework and Examples
Zilong Li et al. Biophys J. 2020.
Abstract
Genomics is a sequence-based informatics science and a three-dimensional-structure-based material science. However, in practice, most genomics researchers utilize sequence-based informatics approaches or three-dimensional-structure-based material science techniques, not both. This division is, at least in part, the result of historical developments rather than a fundamental necessity. The underlying computational tools, experimental techniques, and theoretical models were developed independently. The primary result presented here is a framework for the unification of informatics- and physics-based data associated with DNA, nucleosomes, and chromatin. The framework is based on the mathematical representation of geometrically exact rods and the generalization of DNA basepair step parameters. Data unification enables researchers to integrate computational, experimental, and theoretical approaches for the study of chromatin biology. The framework can be implemented using model-view-controller design principles, existing genome browsers, and existing molecular visualization tools. We developed a minimal, web-based genome dashboard, G-Dash-min, and applied it to two simple examples to demonstrate the usefulness of data unification and proof of concept. Genome dashboards developed using the framework and design principles presented here are extensible and customizable and are therefore more broadly applicable than the examples presented. We expect a number of purpose-specific genome dashboards to emerge as a novel means of investigating structure-function relationships for genomes that range from basepairs to entire chromosomes and for generating, validating, and testing mechanistic hypotheses.
Copyright © 2020 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Figures
![Figure 1](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5504/7203004/f9838655cc67/gr1.gif)
Unification is the process of merging data from different sources. Physical structures and informatics data are unified by mathematical representations of an oriented space curve in laboratory [r→(s), D(s)] and material [Γ→(s),Ω→(s)] reference frames. The conformation of a physical structure C(s) is associated with the laboratory frame, and informatics track data T(s) is associated with the material frame. Masks M(s) alter the material properties of DNA and may be expressed in either representation. Exchanging data between laboratory and material frames unifies the physical structure and informatics.
![Figure 2](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5504/7203004/8406a5b4cd6c/gr2.gif)
MVC design. Model: Laboratory frame [r→(s), D(s)] and material frame [Γ→(s),Ω→(s)] descriptions of DNA as the common thread, an inventory of masks M(si, si + ni), and procedures for converting between representations are given. View: an MV displays C(s), a genome browser (GB) displays T(s), and a CP provides a graphical interface to the controller. G-Dash-min uses JSmol and Biodalliance for the MV and GB components, respectively. An OTS approach enables a genome dashboard to use any desired MVs and GBs. Controller manages the exchange of data between model and views.
![Figure 3](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5504/7203004/58745e960375/gr3.gif)
Colored boxes: C(s) and T(s) representations of two allowed states indicated by red and blue boxes, respectively. Upper boxes are T(s) representations of nucleosome positions (blue bars) and an ERE (red bar). Lower boxes are C(s) representations (small beads represent five basepairs; large beads represent histone octamers). Colored ellipses are the corresponding all-atom structures with the estrogen receptor DNA-binding domain docked to the DNA as in PDB:
1HCQ. (a) The ERE is located within a nucleosome, with the major groove facing inward. The receptor is prohibited from binding. (b) The ERE is located in a nucleosome-free region. Docking PDB: 1HCQ indicates that the ERE is physically accessible.
![Figure 4](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5504/7203004/4f1889596ea3/gr4.gif)
(a) HOXC coarse-grained model of chromatin containing ∼55,000 basepairs of DNA and 284 nucleosomes. Uploading the HOXC model to G-Dash-min generates (b) a two-angle representation of the HOXC model, the color bar represents the index of nucleosomes, from red to blue, (c) a distance-distance matrix based on nucleosome centers of mass, the color bar represents the distance between nucleosomes, darker is closer, and black for the distance between nucleosomes less than 10 nm, and (d) structural informatics data. “Generalized Helical Parameter” (“Twist” and “Rise”) and nucleosome position (“Nucleosomes”) data are displayed alongside experimentally determined nucleosome positions (“Nuc-Pos”) and other informatics data (“Gencode”).
Similar articles
-
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).
Foffi G, Pastore A, Piazza F, Temussi PA. Foffi G, et al. Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2. Phys Biol. 2013. PMID: 23912807
-
Sargent L, Liu Y, Leung W, Mortimer NT, Lopatto D, Goecks J, Elgin SCR. Sargent L, et al. PLoS Comput Biol. 2020 Jun 4;16(6):e1007863. doi: 10.1371/journal.pcbi.1007863. eCollection 2020 Jun. PLoS Comput Biol. 2020. PMID: 32497138 Free PMC article.
-
Applications of the pipeline environment for visual informatics and genomics computations.
Dinov ID, Torri F, Macciardi F, Petrosyan P, Liu Z, Zamanyan A, Eggert P, Pierce J, Genco A, Knowles JA, Clark AP, Van Horn JD, Ames J, Kesselman C, Toga AW. Dinov ID, et al. BMC Bioinformatics. 2011 Jul 26;12:304. doi: 10.1186/1471-2105-12-304. BMC Bioinformatics. 2011. PMID: 21791102 Free PMC article.
-
Contributions of Sequence to the Higher-Order Structures of DNA.
Todolli S, Perez PJ, Clauvelin N, Olson WK. Todolli S, et al. Biophys J. 2017 Feb 7;112(3):416-426. doi: 10.1016/j.bpj.2016.11.017. Epub 2016 Dec 9. Biophys J. 2017. PMID: 27955889 Free PMC article. Review.
-
Challenges for visualizing three-dimensional data in genomic browsers.
Goodstadt M, Marti-Renom MA. Goodstadt M, et al. FEBS Lett. 2017 Sep;591(17):2505-2519. doi: 10.1002/1873-3468.12778. Epub 2017 Aug 24. FEBS Lett. 2017. PMID: 28771695 Free PMC article. Review.
Cited by
-
Multiscale Genome Organization: Dazzling Subject and Inventive Methods.
Schlick T. Schlick T. Biophys J. 2020 May 5;118(9):E1-E3. doi: 10.1016/j.bpj.2020.04.007. Epub 2020 Apr 16. Biophys J. 2020. PMID: 32305070 Free PMC article. No abstract available.
-
The Nucleome Data Bank: web-based resources to simulate and analyze the three-dimensional genome.
Contessoto VG, Cheng RR, Hajitaheri A, Dodero-Rojas E, Mello MF, Lieberman-Aiden E, Wolynes PG, Di Pierro M, Onuchic JN. Contessoto VG, et al. Nucleic Acids Res. 2021 Jan 8;49(D1):D172-D182. doi: 10.1093/nar/gkaa818. Nucleic Acids Res. 2021. PMID: 33021634 Free PMC article.
-
The nucleosome reference frame and standard geometries for octasomes.
Sun R, Bishop T. Sun R, et al. Biophys Rev. 2024 Jul 19;16(3):315-330. doi: 10.1007/s12551-024-01206-5. eCollection 2024 Jun. Biophys Rev. 2024. PMID: 39099844 Review.
References
-
- Fussner E., Ching R.W., Bazett-Jones D.P. Living without 30nm chromatin fibers. Trends Biochem. Sci. 2011;36:1–6. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources