research papers
A correction has been published for this article. To view the correction, click here.
High-performance macromolecular data delivery and visualization for the web
aCEITEC – Central European Institute of Technology, Masaryk University, Kamenice 753/5, 625 00 Brno, Czech Republic, bNational Centre for Biomolecular Research, Faculty of Science, Masaryk University, Kamenice 753/5, 625 00 Brno, Czech Republic, cProtein Data Bank in Europe (PDBe), European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL–EBI), Wellcome Genome Campus, Hinxton CB10 1SD, United Kingdom, dRegional Centre of Advanced Technologies and Materials, Department of Physical Chemistry, Faculty of Science, Palacký University Olomouc, Šlechtitelů 241/27, 779 00 Olomouc, Czech Republic, eResearch Collaboratory for Structural Bioinformatics (RCSB), San Diego Supercomputer Center, University of California San Diego, 9500 Gilman Drive, La Jolla, San Diego, CA 92093-0743, USA, fRCSB Protein Data Bank, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ 08854-8076, USA, gCancer Institute of New Jersey, Rutgers, The State University of New Jersey, 195 Little Albany Street, New Brunswick, NJ 08903-2681, USA, and hRCSB Protein Data Bank, San Diego Supercomputer Center and Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0654, USA
*Correspondence e-mail: david.sehnal@mail.muni.cz, radka.svobodova@ceitec.muni.cz
Biomacromolecular structural data make up a vital and crucial scientific resource that has grown not only in terms of its amount but also in its size and complexity. Furthermore, these data are accompanied by large and increasing amounts of experimental data. Additionally, the macromolecular data are enriched with value-added annotations describing their biological, physicochemical and structural properties. Today, the scientific community requires fast and fully interactive web visualization to exploit this complex structural information. This article provides a survey of the available cutting-edge web services that address this challenge. Specifically, it focuses on data-delivery problems, discusses the visualization of a single structure, including experimental data and annotations, and concludes with a focus on the results of molecular-dynamics simulations and the visualization of structural ensembles.
Keywords: data delivery; visualization; macromolecules; browser-based; web-based.
1. Introduction
Biomacromolecular structural data, originating from more than seven decades of intensive research, form a highly valuable and scientifically vital resource that provides a mechanistic understanding of biological systems. Currently, more than 166 000 experimentally determined three-dimensional structures of biological macromolecules are available from the open-access Protein Data Bank (PDB) that is jointly managed by the Worldwide Protein Data Bank consortium (wwPDB Consortium, 2019), with 200–300 new structures being added every week. Biomacromolecular structural data (atomic coordinates) obtained from macromolecular crystallography (MX) or (3DEM) are accompanied by experimental data: measured structure factors yielding electron-density maps or directly measured Coulomb maps. NMR entries in the PDB are represented by ensembles of 10–20 structures, reflecting the uncertainty in the structural model estimated from experimentally derived restraints.
The PDB archive continues to grow both in terms of the number of new entries (count of structures; see Fig. 1a) and also in the complexity and size of the structures deposited in the PDB (the size of individual structures; see Fig. 1b). The largest single entry is the HIV-1 capsid (PDB entry 3j3q), with 2.4 million atoms (Zhao et al., 2013). However, the entries in the PDB also serve as building blocks for much larger models of biological systems, such as the cellPACK (Johnson et al., 2013a,b, 2015) model of the HIV-1 capsid in blood serum (Johnson et al., 2014), with nearly 68 million atoms, as shown in Fig. 2(a). The PDB-Dev (Johnson et al., 2013a,b, 2014) prototype system collects gigantic structural models obtained using integrative/hybrid modelling; for example, the nuclear pore complex (PDB-Dev ID PDBDEV_00000012; Kim et al., 2018), which contains 243 000 residues (represented as coarse elements; see Fig. 2b).
Value-added annotations provide the necessary biological context for the macromolecular structure data. The increasing amount of value-added annotations include information about many biological properties (mutation positions and effects, functional and active sites, channels and pores etc.), physicochemical properties (charges, flexibility etc.) and structural properties (ligand binding, various quality criteria etc.). These data are stored in multiple databases, and many of them are collected in the PDBe-KB database (PDBe-KB Consortium, 2020) and the RCSB PDB database (Goodsell et al., 2020). Moreover, not surprisingly, the number of available annotation types is also growing.
Biological macromolecules are inherently dynamic in nature, and hence the research community is not only interested in the static data available in the PDB but also generates information on the dynamics of the structures using
and, increasingly, experimental techniques such as X-ray free-electron lasers and serial crystallography. These data are the basis of various complex analyses and examinations providing mechanistic insights into the function of biological macromolecules.Visualization of these scientific data is pivotal in the analysis of biomacromolecular structures by the broader scientific user base, with most non-structural biology users expecting access to these data via web-based visualization tools. Specifically, researchers require a fast, responsive and fully interactive web visualization of all of the data mentioned above: atomic coordinates, experimental data, annotations and molecular dynamics.
Here, we describe the currently available cutting-edge web services that address these challenges.
Firstly, we focus on data-delivery problems. We then discuss the visualization of a single structure, including experimental data and annotations, and conclude with a focus on dynamic data and the visualization of structural ensembles.
For alternative surveys of tools for molecular visualization, we direct the reader to Martinez et al. (2020) and Miao et al. (2019).
2. Methods
2.1. Data delivery: coordinates
Until now, the naïve approach of delivering the complete coordinate model worked owing to the modest size of the macromolecular structures studied using traditional structure-determination methods. This approach was used successfully even when only a small part was visualized, such as a binding site or a cartoon representation of a backbone without side chains. With rapid advances in structure-determination techniques, increasingly large macromolecular machines are within the reach of structure-determination studies (Zhao et al., 2013; Kim et al., 2018). The naïve approach of delivering the complete coordinate model is increasingly inadequate because of its low performance for large systems. To alleviate this issue, advanced approaches such as selective data delivery are required.
One approach is to precompute static subsets of data on the server for delivery, but this would require every possible combination to be precomputed, making this approach unfeasible. Another more practical approach is to create a system with a dynamic query language to compute the subsets of macromolecular structures on the fly (Sehnal, Pravda et al., 2015) for efficient data delivery. This second approach was further developed in the LiteMol suite (Sehnal et al., 2017), which includes a module called CoordinateServer. CoordinateServer performs an on-the-fly selection of several critical parts of the biomacromolecule; for example, backbone, side chains, heteroatoms, atoms necessary for cartoon model visualization, chains, residues and ligands including their surroundings etc. Based on the user request, a relevant subset of atoms is selected by CoordinateServer and delivered for further analysis or visualization. The same approach of selective delivery is also implemented in Mol* (Sehnal et al., 2018). CoordinateServer (and its Mol* successor ModelServer1) is available for use by any software that supports the standard mmCIF format for the storage and efficient delivery of macromolecular structure data.
Further improvements were made to the data-delivery process by introducing novel coordinate-file formats requiring markedly less space than standard PDBx/mmCIF and PDB files; namely, the MMTF format (Bradley et al., 2017) and the BinaryCIF format (originally introduced as part of the LiteMol suite), which was later integrated into Mol*. Fig. 3(a) shows a marked reduction in the size of the data delivered for the visualization of the HIV-1 capsid via the LiteMol suite.
In parallel, enhancements can be achieved by utilizing an efficient in-memory representation of the macromolecular structure. Fig. 3(b) illustrates the efficient memory usage in Mol* that is necessary to represent large viral assemblies.
2.2. Data delivery: volumetric data (electron-density and maps)
Volumetric data such as electron-density or 5ire) map file (EMDB entry EMD-8116; Sirohi et al., 2016) is 1.6 GB when stored using the CCP4 format. This amount of data is prohibitively large for display in a web browser. Therefore, a more efficient system for data delivery is even more crucial than for coordinates. The selective delivery of electron-density and maps was first introduced in the LiteMol suite. It includes DensityServer, which can adaptively downsample, slice and compress the volumetric data. For the Zika virus capsid data set, it can reduce the size to 1 MB while still maintaining visual fidelity (see Fig. 4).
maps are markedly larger than coordinate files. For example, the Zika virus capsid (PDB entryIn Mol*, DensityServer was superseded by VolumeServer (available to users at https://maps.rcsb.org/), which adds support for more data formats. Web tools that support DensityServer/VolumeServer are JSmol (Hanson et al., 2013), the LiteMol suite and Mol*.
2.3. Data delivery: annotations
Many tools and biological databases provide enriched annotations for macromolecular data. Some of them (for example, structure-validation data, as collected in wwPDB validation reports; Gore et al., 2017) are part of the Protein Data Bank FTP site. Many others are collected together in PDBe-KB, which is a community-driven resource for structural and functional annotations. Specifically, PDBe-KB currently contains more than 500 million manually curated or predicted residue-level annotations for PDB structures obtained from close to 20 partner resources: for example, residue depths, binding-site predictions, ligand interactions, interaction interfaces, backbone flexibility predictions, solvent accessibility, kinase-target predictions, molecular channels, functional site predictions, druggable pocket predictions, energetic consequences of mutations, curated regulatory sites, curated post-translational modification (PTM) sites, curated catalytic sites, mutations in the human proteome, curated metal-binding sites and short linear motifs. PDBe-KB data are integrated with the core PDBe data in a graph database. Weekly snapshots of the graph database index are made available on the PDBe FTP area (ftp://ftp.ebi.ac.uk/pub/databases/msd/graphdb/). To ensure consistent and robust access to all of the PDBe-KB data, PDBe-KB also maintains a REST API (implemented in the Flask framework for Python), which includes 50 public endpoints. This API provides a source of annotation for web visualization tools.
2.4. Visualization: coordinate and experimental data
NGL (Rose & Hildebrand, 2015), LiteMol and Mol* can provide interactive and highly responsive visualizations. Electron-density and maps can be displayed using JSmol, the LiteMol suite and Mol*. The LiteMol suite visualizes either experimental data for the whole structure (an overview, no details depicted) or for individual residues and their surroundings (only part of the data is shown). Mol* uses a similar approach to the LiteMol suite. JSmol, the LiteMol suite and Mol* all have support for DensityServer/VolumeServer. Mol* is the successor to the NGL and LiteMol viewers, combining the strengths of both viewers.
2.5. Visualization: annotations
Annotation data include structural properties, physicochemical properties, charges and biological properties. They can be based on the whole structure or only a subset, such as the ligands.
Because validation annotation data are directly supported in the PDB archive, they are also integrated into the LiteMol suite, i.e. residues are coloured according to a value that describes the cumulative number of validation problems that occur in the residue (see Fig. 5a). In this way, residue quality is also included in JSmol and NGL. Mol* offers even more detail: each residue can also be coloured according to several individual validation problems (atom clashes, Ramachandran outliers etc.). Moreover, the LiteMol suite and Mol* also integrate information about the quality of ligands (e.g. problems; see Fig. 5d). The LiteMol suite and Mol* also include nomenclature annotations, i.e. Carbohydrate Symbols (3D-SNFG; see Fig. 5f).
External tools and data resources deliver other types of annotation data, and their visualization is performed on the individual web pages. Specifically, these resources integrate and customize a web visualization tool and utilize it for displaying niche data specific to that particular resource. This integration and customization can be very straightforward, such as merely colouring residues or atoms according to some property (for example partial atomic charge; see Figs. 5b and 5f), or may require some code extensions, such as showing defined protein surfaces, which is necessary for channel visualization (see Fig. 5c). Other similar examples include Fragalysis (https://fragalysis.diamond.ac.uk), a web-based platform for fragment-based drug discovery, ProteinPlus (Fährrolfes et al., 2017), which serves as a structure-based modelling support server, and 3DBionotes-WS, which automatically annotates biochemical and biomedical information onto structural models (Segura et al., 2019).
2.6. Dynamic data and structure ensembles from molecular-dynamics simulations
Since 2000, there have been multiple proof-of-concept studies (Meyer et al., 2010) trying to enable the sharing of biomolecular simulation data. One of the surviving online databases of biomolecular trajectories from this era is MoDEL (Molecular Dynamics Extended Library; Meyer et al., 2010; https://mmb.pcb.ub.es/MoDEL/), which pioneered the usage of a download capability for trajectories together with their metadata description with automatic analyses of the system (for example r.m.s.d. or contacts) and visualization using Jmol (Hanson, 2010) and non-interactive video. Decades later, all aspects of simulation data delivery have been enhanced by current transfer speeds, but mainly by new graphics possibilities owing to the development of WebGL (Khronos Group Inc, 2020).
First of all, online data-storage capacity increased to allow the storage of gigabytes of data daily. Molecular-dynamics trajectories are thus shared in general-purpose scientific data-storage systems such as Zenodo (https://zenodo.org/), OSF (Center for Open Science; https://osf.io) and FigShare (https://figshare.com), on the webpages of individual institutions or within specialized journals such as Scientific Data (Hoffmann et al., 2020). This is the method that is used for the sharing of simulation data within the NMRlipids (Botan et al., 2015) project and the current BioExcel/MolSSI COVID-19 repository (Amaro et al., 2020; https://covid.molssi.org/simulations/). This approach allows data sharing, but it does not provide easy-to-use visualization, nor does it support any form of unified metadata describing the simulations.
Visualization can be provided based on the download and visualization of individual frames, such as with MDsrv (Tiemann et al., 2017), HTMol (Carrillo-Tripp et al., 2018), JSmol or even with generic 3D viewers such as Autodesk360 (https://a360.autodesk.com/). Autodesk360 was used for the visualization of cellPACK ensembles (Johnson et al., 2016). JSmol is used in the BiGNAsim repository for the visualization of molecular-dynamics simulations of (Hospital et al., 2016). However, JSmol is only capable of reading very few trajectory file formats and requires expert knowledge to use, and also has size limits. MDsrv together with NGLviewer (Rose & Hildebrand, 2015) was used in the novel GPCRmd platform (https://gpcrmd.org/), which offers insight into the 3D-GPCRome (analysis of the intrinsic flexibility of the structures of G-protein-coupled receptors; GPCRs), which are the primary targets for many drugs and endogenous signalling pathways (Rodríguez-Espigares et al., 2020). To date, GPCRmd contains more than 580 molecular-dynamics trajectories along with an interactive on-site analysis provided by MDtraj (McGibbon et al., 2015) and Caver 3.0 (Chovancova et al., 2012), and custom scripts for more than 200 GPCR systems. GPCRmd thus allows comparative analyses of the whole GPCR protein family to be performed in an open, collaborative and reproducible manner. Furthermore, this is currently a goal of many approaches in the field of molecular simulation (Abraham et al., 2019).
However, network capabilities limit making such approaches available on a general basis, as molecular-dynamics simulations can generate almost limitless amounts of data depending on the system size and sampling. Downloads of complete simulation data are thus slow owing to their size, which often can range from hundreds of megabytes to terabytes or more. Since the analysis of such data requires only a subset of all the generated data, it is probably the best option to transfer only the subset of data necessary for the analysis (i.e. visualization for insight and a list of critical molecular variables over time). A possible solution would be the utilization of selective data delivery that enables web visualization to speed up. MDSrv enables the selection and display of a specific frame and atomic selection. Mol* allows the animated visualization of the trajectory, and hence optimization of the data-delivery protocol via ModelServer allows the more efficient online visualization of molecular-dynamics trajectories.
3. Conclusion
Data delivery and responsive web visualization of biomacromolecular structures, including their experimental data, annotations and data on dynamics, is a large challenge in the utilization of macromolecular structure data, especially since the sizes and numbers of snapshots of structures are still growing. To solve this complex challenge, relevant algorithms and software solutions have been developed and are in the process of further improvement. Specifically, selective data delivery, providing only the coordinate data necessary for visualization, was introduced in the LiteMol suite (using CoordinateServer and later improved by ModelServer in Mol*) and adopted by NGL and JSmol. In parallel, the selective delivery of volumetric data was implemented first in the LiteMol suite (via DensityServer) and then also in NGL. This was followed by the development of VolumeServer, which is integrated with Mol* and JSmol. Annotation data are collected into PDBe-KB and are accessible using its REST API. The visualization tools LiteMol suite, NGL, JSmol and Mol* benefit from these data-delivery services. These tools are integrated into many tools focused on providing structure annotations for specific domains. Structural ensembles can be visualized using MDSrv with NGL or Mol* to perform analysis of structure ensembles such as MD trajectories, and also support selective data delivery for its effective visualization, even for large systems. The future thus might lead to the establishment of online environments for structural data analysis based on cloud solutions, effectively speeding up their investigation.
Footnotes
1Source codes for the Mol* ModelServer are available at https://github.com/molstar/molstar/tree/master/src/servers/model.
Acknowledgements
Open access funding enabled and organized by Projekt DEAL.
Funding information
Funding for this research was provided by: ELIXIR-CZ research infrastructure project including access to computing and storage facilities (grant No. LM2018131), and the European Regional Development Fund (grant No. CZ.02.1.01/0.0/0.0/16_013/0001777).
References
Abraham, M., Apostolov, R., Barnoud, J., Bauer, P., Blau, C., Bonvin, A. M. J. J., Chavent, M., Chodera, J., Čondić-Jurkić, K., Delemotte, L., Grubmüller, H., Howard, R. J., Jordan, E. J., Lindahl, E., Ollila, O. H. S., Selent, J., Smith, D. G. A., Stansfeld, P. J., Tiemann, J. K. S., Trellet, M., Woods, C. & Zhmurov, A. (2019). J. Chem. Inf. Model. 59, 4093–4099. CrossRef CAS PubMed Google Scholar
Amaro, R. E. & Mulholland, A. J. (2020). J. Chem. Inf. Model. 60, 2653–2656. CrossRef CAS PubMed Google Scholar
Botan, A., Favela-Rosales, F., Fuchs, P. F. J., Javanainen, M., Kanduč, M., Kulig, W., Lamberg, A., Loison, C., Lyubartsev, A., Miettinen, M. S., Monticelli, L., Määttä, J., Ollila, O. H. S., Retegan, M., Róg, T., Santuz, H. & Tynkkynen, J. (2015). J. Phys. Chem. B, 119, 15075–15088. CrossRef CAS PubMed Google Scholar
Bradley, A. R., Rose, A. S., Pavelka, A., Valasatava, Y., Duarte, J. M., Prlić, A. & Rose, P. W. (2017). PLoS Comput. Biol. 13, e1005575. CrossRef PubMed Google Scholar
Carrillo-Tripp, M., Alvarez-Rivera, L., Lara-Ramírez, O. I., Becerra-Toledo, F. J., Vega-Ramírez, A., Quijas-Valades, E., González-Zavala, E., González-Vázquez, J. C., García-Vieyra, J., Santoyo-Rivera, N. B., Chapa-Vergara, S. V. & Meneses-Viveros, A. (2018). J. Comput. Aided Mol. Des. 32, 869–876. CAS PubMed Google Scholar
Chovancova, E., Pavelka, A., Benes, P., Strnad, O., Brezovsky, J., Kozlikova, B., Gora, A., Sustr, V., Klvana, M., Medek, P., Biedermannova, L., Sochor, J. & Damborsky, J. (2012). PLoS Comput. Biol. 8, e1002708. Web of Science CrossRef PubMed Google Scholar
Fährrolfes, R., Bietz, S., Flachsenberg, F., Meyder, A., Nittinger, E., Otto, T., Volkamer, A. & Rarey, M. (2017). Nucleic Acids Res. 45, W337–W343. Web of Science PubMed Google Scholar
Ferrara, C., Grau, S., Jäger, C., Sondermann, P., Brünker, P., Waldhauer, I., Hennig, M., Ruf, A., Rufer, A. C., Stihle, M., Umaña, P. & Benz, J. (2011). Proc. Natl Acad. Sci. USA, 108, 12669–12674. CrossRef CAS PubMed Google Scholar
Goodsell, D. S., Zardecki, C., Di Costanzo, L., Duarte, J. M., Hudson, B. P., Persikova, I., Segura, J., Shao, C., Voigt, M., Westbrook, J. D., Young, J. Y. & Burley, S. K. (2020). Protein Sci. 29, 52–65. CrossRef CAS PubMed Google Scholar
Gore, S., Sanz García, E., Hendrickx, P. M. S., Gutmanas, A., Westbrook, J. D., Yang, H., Feng, Z., Baskaran, K., Berrisford, J. M., Hudson, B. P., Ikegawa, Y., Kobayashi, N., Lawson, C. L., Mading, S., Mak, L., Mukhopadhyay, A., Oldfield, T. J., Patwardhan, A., Peisach, E., Sahni, G., Sekharan, M. R., Sen, S., Shao, C., Smart, O. S., Ulrich, E. L., Yamashita, R., Quesada, M., Young, J. Y., Nakamura, H., Markley, J. L., Berman, H. M., Burley, S. K., Velankar, S. & Kleywegt, G. J. (2017). Structure, 25, 1916–1927. Web of Science CrossRef CAS PubMed Google Scholar
Hanson, R. M. (2010). J. Appl. Cryst. 43, 1250–1260. Web of Science CrossRef CAS IUCr Journals Google Scholar
Hanson, R. M., Prilusky, J., Renjian, Z., Nakane, T. & Sussman, J. L. (2013). Isr. J. Chem. 53, 207–216. Web of Science CrossRef CAS Google Scholar
Hoffmann, C., Centi, A., Menichetti, R. & Bereau, T. (2020). Sci Data, 7, 51. CrossRef PubMed Google Scholar
Hospital, A., Andrio, P., Cugnasco, C., Codo, L., Becerra, Y., Dans, P. D., Battistini, F., Torres, J., Goñi, R., Orozco, M. & Gelpí, J. L. (2016). Nucleic Acids Res. 44, D272–D278. CrossRef CAS PubMed Google Scholar
Johnson, G., Autin, L., Al-Alusi, M., Goodsell, D., Sanner, M. & Olson, A. (2013a). cellPACK. University of California San Francisco and The Scripps Research Institute, San Francisco, USA. Google Scholar
Johnson, G., Autin, L., Al-Alusi, M., Goodsell, D., Sanner, M. & Olson, A. (2013b). autoPACK. University of California San Francisco and The Scripps Research Institute, San Francisco, USA. Google Scholar
Johnson, G., Autin, L., Al-Alusi, M., Goodsell, D., Sanner, M. & Olson, A. (2016). Use – cellPACK, https://www.cellpack.org/use. Google Scholar
Johnson, G. T., Autin, L., Al-Alusi, M., Goodsell, D. S., Sanner, M. F. & Olson, A. J. (2015). Nat. Methods, 12, 85–91. CrossRef CAS PubMed Google Scholar
Johnson, G. T., Goodsell, D. S., Autin, L., Forli, S., Sanner, M. F. & Olson, A. J. (2014). Faraday Discuss. 169, 23–44. CrossRef CAS PubMed Google Scholar
Kim, S. J., Fernandez-Martinez, J., Nudelman, I., Shi, Y., Zhang, W., Raveh, B., Herricks, T., Slaughter, B. D., Hogan, J. A., Upla, P., Chemmama, I. E., Pellarin, R., Echeverria, I., Shivaraju, M., Chaudhury, A. S., Wang, J., Williams, R., Unruh, J. R., Greenberg, C. H., Jacobs, E. Y., Yu, Z., de la Cruz, M. J., Mironska, R., Stokes, D. L., Aitchison, J. D., Jarrold, M. F., Gerton, J. L., Ludtke, S. J., Akey, C. W., Chait, B. T., Sali, A. & Rout, M. P. (2018). Nature, 555, 475–482. Web of Science CrossRef CAS PubMed Google Scholar
Klose, T., Reteno, D. G., Benamar, S., Hollerbach, A., Colson, P., La Scola, B. & Rossmann, M. G. (2016). Proc. Natl Acad. Sci. USA, 113, 6206–6211. CrossRef CAS PubMed Google Scholar
Martinez, X., Chavent, M. & Baaden, M. (2020). Biochem. Soc. Trans. 48, 499–506. CrossRef CAS PubMed Google Scholar
McGibbon, R. T., Beauchamp, K. A., Harrigan, M. P., Klein, C., Swails, J. M., Hernández, C. X., Schwantes, C. R., Wang, L.-P., Lane, T. J. & Pande, V. S. (2015). Biophys. J. 109, 1528–1532. CrossRef CAS PubMed Google Scholar
Meyer, T., D'Abramo, M., Hospital, A., Rueda, M., Ferrer-Costa, C., Pérez, A., Carrillo, O., Camps, J., Fenollosa, C., Repchevsky, D., Gelpí, J. L. & Orozco, M. (2010). Structure, 18, 1399–1409. Web of Science CrossRef CAS PubMed Google Scholar
Miao, H., Klein, T., Kouřil, D., Mindek, P., Schatz, K., Gröller, M. E., Kozlíková, B., Isenberg, T. & Viola, I. (2019). J. Mol. Biol. 431, 1049–1070. CrossRef CAS PubMed Google Scholar
PDBe-KB Consortium (2020). Nucleic Acids Res. 48, D344–D353. CrossRef PubMed Google Scholar
Pravda, L., Sehnal, D., Svobodová Vařeková, R., Navrátilová, V., Toušek, D., Berka, K., Otyepka, M. & Koča, J. (2018). Nucleic Acids Res. 46, D399–D405. CrossRef CAS PubMed Google Scholar
Raček, T., Schindler, O., Toušek, D., Horský, V., Berka, K., Koča, J. & Svobodová, R. (2020). Nucleic Acids Res. 48, W591–W596. PubMed Google Scholar
Rodríguez-Espigares, I., Torrens-Fontanals, M., Tiemann, J. K. S., Aranda-García, D., Ramírez-Anguita, J. M., Stepniewski, T. M., Worp, N., Varela-Rial, A., Morales-Pastor, A., Medel-Lacruz, B., Pándy-Szekeres, G., Mayol, E., Giorgino, T., Carlsson, J., Deupi, X., Filipek, S., Filizola, M., Gómez-Tamayo, J. C., Gonzalez, A., Gutiérrez-de-Terán, H., Jiménez-Rosés, M., Jespers, W., Kapla, J., Khelashvili, G., Kolb, P., Latek, D., Marti-Solano, M., Matricon, P., Matsoukas, M.-T., Miszta, P., Olivella, M., Perez-Benito, L., Provasi, D., Ríos, S., Torrecillas, I. R., Sallander, J., Sztyler, A., Vasile, S., Weinstein, H., Zachariae, U., Hildebrand, P. W., De Fabritiis, G., Sanz, F., Gloriam, D. E., Cordomi, A., Guixà-González, R. & Selent, J. (2020). Nat. Methods, 17, 777–787. PubMed Google Scholar
Rose, A. S. & Hildebrand, P. W. (2015). Nucleic Acids Res. 43, W576–W579. Web of Science CrossRef CAS PubMed Google Scholar
Segura, J., Sanchez-Garcia, R., Sorzano, C. O. S. & Carazo, J. M. (2019). Bioinformatics, 35, 3512–3513. CrossRef CAS PubMed Google Scholar
Sehnal, D., Deshpande, M., Vařeková, R. S., Mir, S., Berka, K., Midlik, A., Pravda, L., Velankar, S. & Koča, J. (2017). Nat. Methods, 14, 1121–1122. Web of Science CrossRef CAS PubMed Google Scholar
Sehnal, D. & Grant, O. C. (2019). J. Proteome Res. 18, 770–774. CrossRef CAS PubMed Google Scholar
Sehnal, D., Pravda, L., Svobodová Vařeková, R., Ionescu, C.-M. & Koča, J. (2015). Nucleic Acids Res. 43, W383–W388. CrossRef CAS PubMed Google Scholar
Sehnal, D., Rose, A. S., Koča, J., Burley, S. K. & Velankar, S. (2018). MolVA'18: Proceedings of the Workshop on Molecular Graphics and Visual Analysis of Molecular Data, edited by J. Byška, M. Krone & B. Sommer, pp. 29–33. Goslar: Eurographics Association. Google Scholar
Sehnal, D., Svobodová Vařeková, R., Pravda, L., Ionescu, C.-M., Geidl, S., Horský, V., Jaiswal, D., Wimmerová, M. & Koča, J. (2015). Nucleic Acids Res. 43, D369–D375. Web of Science CrossRef CAS PubMed Google Scholar
Sirohi, D., Chen, Z., Sun, L., Klose, T., Pierson, T. C., Rossmann, M. G. & Kuhn, R. J. (2016). Science, 352, 467–470. Web of Science CrossRef CAS PubMed Google Scholar
Tiemann, J. K. S., Guixà-González, R., Hildebrand, P. W. & Rose, A. S. (2017). Nat. Methods, 14, 1123–1124. CrossRef CAS PubMed Google Scholar
Unwin, N. (2005). J. Mol. Biol. 346, 967–989. Web of Science CrossRef PubMed CAS Google Scholar
wwPDB Consortium (2019). Nucleic Acids Res. 47, D520–D528. Web of Science CrossRef PubMed Google Scholar
Xu, K., Rajashankar, K. R., Chan, Y.-P., Himanen, J. P., Broder, C. C. & Nikolov, D. B. (2008). Proc. Natl Acad. Sci. USA, 105, 9953–9958. Web of Science CrossRef PubMed CAS Google Scholar
Zhao, G., Perilla, J. R., Yufenyuy, E. L., Meng, X., Chen, B., Ning, J., Ahn, J., Gronenborn, A. M., Schulten, K., Aiken, C. & Zhang, P. (2013). Nature, 497, 643–646. CrossRef CAS PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.