scientific commentaries
virusMED: your travel guide to the virus world
aInfection Program, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia
*Correspondence e-mail: fasseli.coulibaly@monash.edu
Keywords: virus hotspots; viral protein structures; epitopes; antiviral drugs; DrugBank; viral metal proteins; virusMED database.
The virus world is wide and vast. And we have just started venturing further in our explorations through metagenomics studies and the characterization of the multitude of variants in viruses relevant to human health.
Fortunately, help is coming to explore the virosphere – another term used to describe the virus world – in the form of a new server called virusMED (Metal binding sites, antigenic Epitopes and Drug binding sites, https://virusmed.biocloud.top) developed by Wladek Minor, Heping Zheng and their collaborators (see Zhang et al., 2021, in this issue of IUCrJ).
To understand the scale of the task at hand, viruses infecting bacteria, called phages, alone represent the most abundant biological entity on earth, outnumbering cellular organisms by a factor of 10 (Dion et al., 2020). They have been less studied than their medically relevant counterparts but a flood of data has begun, owing in part to the renewed interest in phage therapy to combat antimicrobial resistance (Gordillo Altamirano & Barr, 2019) and the proposed role of phages in regulating the population dynamics of bacteria playing an important role in carbon capture (Suttle, 2005). Sampling of our oceans, soil and even our own gut has already expanded our view of phage diversity and generated masses of viral and virus-like sequences (Paez-Espino et al., 2016; Camarillo-Guerrero et al., 2021; Dion et al., 2020).
The search for viruses that infect eukaryotes is equally raging. For some discoveries, like mimiviruses and related giant viruses (La Scola et al., 2003), the biological significance is complex but they represent a wealth of new proteins, novel biological processes and almost infinite research questions. For others, the impact is immediately evident such as the characterization of the RNA virome in animals that represent known reservoirs for zoonotic viruses such as flaviviruses, haemorrhagic fever viruses, influenza viruses and coronaviruses.
As a tool for exploration, sequencing is extremely powerful in organizing viruses into families and setting up a robust classification of these organisms in the ever-growing Taxonomy of Viruses (https://talk.ictvonline.org). A complementary approach has focused on the determination and cataloging of three-dimensional structures produced by viruses. Indeed, of intact viruses has been at the forefront of developments in structural biology since its birth (Harrison, 2015; Rossmann, 2013). These structures are made freely available to all through the Protein Data Bank (Johnson & Olson, 2021), which celebrates 50 years of existence this month, as well as the virus particle explorer database (VIPERdb; https://viperdb.scripps.edu) (Montiel-Garcia et al., 2021). These invaluable resources are highly curated and constant efforts have been made to improve the quality of their content and make it more accessible to all despite exponential growth (Smart et al., 2018; Montiel-Garcia et al., 2021).
While these resources are free, analysis of each viral structure is onerous and requires expertise in several inter-connected fields such as biochemistry, chemistry, molecular virology and immunology. This is where virusMED comes in to save the day. Think of it as your GPS navigation app that will guide you quickly and safely to your destination. Like a navigation app, it taps into tools that are readily available elsewhere: primary databases with sequence information, 3D structures, functional sites, metal and drug binding sites, and (a)].
repositories. The power of the server is to help researchers combine information from these different sources and make sense of it for the specific goal of understanding and combating viruses [Fig. 1Taking SARS CoV-2 as an example, in the space of 21 months (February 2020–October 2021) over 117 000 publications were made available in PubMED and 1555 structures in the PDB. This represents an average of over 180 articles and 2 new structures every day. Organizing, curating and providing visualization tools for this large amount of data is essential to extract the most relevant information in a format that will help generate novel insights [Fig. 1(b)].
virusMED provides an integrated portal to navigate through 7041 structures across 75 viral families. One can browse the database using several pre-set entry points or design specific searches to rapidly gain an overview of where and how metals and small molecules bind to a specific target. Many of these `hotspots' will be important functionally and may represent targets for drug development. With drug repurposing in mind, the results can be filtered to focus only on known drugs found in Drugbank or those that are already FDA approved.
The same portal allows mapping of antigenic sites in viral proteins, providing a database of more than 5000 B- and T-cell epitopes for 329 individual proteins. These can be combined to determine the antigenic landscape of viral proteins, identify variants likely to escape vaccination or reveal targets for potent and broadly neutralizing epitopes.
As the database grows, future developments can be anticipated such as more complex visualization options and an integrated search engine allowing the comparison of new structures with known hotspots. Data on binding sites for other viral proteins and cellular factors accumulate rapidly (Goodacre et al., 2020; Gordon et al., 2020) and, while a huge task, would deserve a similar atlas. For now, there is little doubt that many in the structural virology community will adopt this tool to accelerate their research and facilitate the development of antiviral strategies.
References
Camarillo-Guerrero, L. F., Almeida, A., Rangel-Pineros, G., Finn, R. D. & Lawley, T. D. (2021). Cell, 184, 1098–1109.e9. CAS PubMed Google Scholar
Dion, M. B., Oechslin, F. & Moineau, S. (2020). Nat. Rev. Microbiol. 18, 125–138. CrossRef CAS PubMed Google Scholar
Goodacre, N., Devkota, P., Bae, E., Wuchty, S. & Uetz, P. (2020). Semin. Cell Dev. Biol. 99, 31–39. CrossRef CAS PubMed Google Scholar
Gordillo Altamirano, F. L. & Barr, J. J. (2019). Clin. Microbiol. Rev. 32(2). Google Scholar
Gordon, D. E., Jang, G. M., Bouhaddou, M., Xu, J., Obernier, K., White, K. M., O'Meara, M. J., Rezelj, V. V., Guo, J. Z., Swaney, D. L., Tummino, T. A., Hüttenhain, R., Kaake, R. M., Richards, A. L., Tutuncuoglu, B., Foussard, H., Batra, J., Haas, K., Modak, M., Kim, M., Haas, P., Polacco, B. J., Braberg, H., Fabius, J. M., Eckhardt, M., Soucheray, M., Bennett, M. J., Cakir, M., McGregor, M. J., Li, Q., Meyer, B., Roesch, F., Vallet, T., Mac Kain, A., Miorin, L., Moreno, E., Naing, Z. Z. C., Zhou, Y., Peng, S., Shi, Y., Zhang, Z., Shen, W., Kirby, I. T., Melnyk, J. E., Chorba, J. S., Lou, K., Dai, S. A., Barrio-Hernandez, I., Memon, D., Hernandez-Armenta, C., Lyu, J., Mathy, C. J. P., Perica, T., Pilla, K. B., Ganesan, S. J., Saltzberg, D. J., Rakesh, R., Liu, X., Rosenthal, S. B., Calviello, L., Venkataramanan, S., Liboy-Lugo, J., Lin, Y., Huang, X. P., Liu, Y., Wankowicz, S. A., Bohn, M., Safari, M., Ugur, F. S., Koh, C., Savar, N. S., Tran, Q. D., Shengjuler, D., Fletcher, S. J., O'Neal, M. C., Cai, Y., Chang, J. C. J., Broadhurst, D. J., Klippsten, S., Sharp, P. P., Wenzell, N. A., Kuzuoglu-Ozturk, D., Wang, H. Y., Trenker, R., Young, J. M., Cavero, D. A., Hiatt, J., Roth, T. L., Rathore, U., Subramanian, A., Noack, J., Hubert, M., Stroud, R. M., Frankel, A. D., Rosenberg, O. S., Verba, K. A., Agard, D. A., Ott, M., Emerman, M., Jura, N., von Zastrow, M., Verdin, E., Ashworth, A., Schwartz, O., d'Enfert, C., Mukherjee, S., Jacobson, M., Malik, H. S., Fujimori, D. G., Ideker, T., Craik, C. S., Floor, S. N., Fraser, J. S., Gross, J. D., Sali, A., Roth, B. L., Ruggero, D., Taunton, J., Kortemme, T., Beltrao, P., Vignuzzi, M., García-Sastre, A., Shokat, K. M., Shoichet, B. K. & Krogan, N. J. (2020). Nature, 583, 459–468. CrossRef CAS PubMed Google Scholar
Harrison, S. C. (2015). Annu. Rev. Biochem. 84, 37–60. CrossRef CAS PubMed Google Scholar
Johnson, J. E. & Olson, A. J. (2021). J. Biol. Chem. 296, 100554. CrossRef PubMed Google Scholar
La Scola, B., Audic, S., Robert, C., Jungang, L., de Lamballerie, X., Drancourt, M., Birtles, R., Claverie, J. M. & Raoult, D. (2003). Science, 299, 2033. Web of Science PubMed Google Scholar
Montiel-Garcia, D., Santoyo-Rivera, N., Ho, P., Carrillo-Tripp, M., Iii, C. L. B., Johnson, J. E. & Reddy, V. S. (2021). Nucleic Acids Res. 49, D809–D816. CAS PubMed Google Scholar
Paez-Espino, D., Eloe-Fadrosh, E. A., Pavlopoulos, G. A., Thomas, A. D., Huntemann, M., Mikhailova, N., Rubin, E., Ivanova, N. N. & Kyrpides, N. C. (2016). Nature, 536, 425–430. CAS PubMed Google Scholar
Rossmann, M. G. (2013). Q. Rev. Biophys. 46, 133–180. Web of Science CrossRef CAS PubMed Google Scholar
Smart, O. S., Horský, V., Gore, S., Svobodová Vařeková, R., Bendová, V., Kleywegt, G. J. & Velankar, S. (2018). Acta Cryst. D74, 237–244. CrossRef IUCr Journals Google Scholar
Suttle, C. A. (2005). Nature, 437, 356–361. CrossRef PubMed CAS Google Scholar
Zhang, H., Chen, P., Ma, H., Woińska, M., Liu, D., Cooper, D. R., Peng, G., Peng, Y., Deng, L., Minor, W. & Zheng, H. (2021). IUCrJ, 8, 931–942. CrossRef IUCr Journals Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.