topical reviews
The future of biomolecular simulation in the pharmaceutical industry: what we can learn from aerodynamics modelling and weather prediction. Part 1. understanding the physical and computational complexity of in silico drug design
aSchool of Molecular and Cellular Biology, University of Leeds, Leeds, United Kingdom, bAstbury Centre for Structural and Molecular Biology, University of Leeds, Leeds, United Kingdom, cVernalis (R&D) Ltd, Cambridge, United Kingdom, dSchool of Physics and Astronomy, University of Leeds, Leeds, United Kingdom, and eSchool of Pharmacy, University College London, London, United Kingdom
*Correspondence e-mail: s.a.harris@leeds.ac.uk
The predictive power of simulation has become embedded in the infrastructure of modern economies. Computer-aided design is ubiquitous throughout industry. In aeronautical engineering, built infrastructure and materials manufacturing, simulations are routinely used to compute the performance of potential designs before construction. The ability to predict the behaviour of products is a driver of innovation by reducing the cost barrier to new designs, but also because radically novel ideas can be piloted with relatively little risk. Accurate weather forecasting is essential to guide domestic and military flight paths, and therefore the underpinning simulations are critical enough to have implications for national security. However, in the pharmaceutical and biotechnological industries, the application of computer simulations remains limited by the capabilities of the technology with respect to the complexity of molecular biology and human physiology. Over the last 30 years, molecular-modelling tools have gradually gained a degree of acceptance in the pharmaceutical industry. Drug discovery has begun to benefit from physics-based simulations. While such simulations have great potential for improved
much scepticism remains about their value. The motivations for such reservations in industry and areas where simulations show promise for efficiency gains in preclinical research are discussed. In this, the first of two complementary papers, the scientific and technical progress that needs to be made to improve the predictive power of biomolecular simulations, and how this might be achieved, is firstly discussed (Part 1). In Part 2, the status of computer simulations in pharma is contrasted with aerodynamics modelling and weather forecasting, and comments are made on the cultural changes needed for equivalent computational technologies to become integrated into life-science industries.Keywords: biomolecular simulation; molecular docking; in silico drug design; pharmaceutical industry.
1. The physical properties of biomolecules and how this relates to their function
In biochemistry, the accepted paradigm is that structure underpins function. Structural biology provides the basis for our understanding of biological mechanisms, including diseases caused by mutations and infection, and the design of potential therapies. The concept of a specific binding pocket with the correct shape and chemical complementarity to accommodate a drug (embraced by the `lock-and-key hypothesis' of biomolecular interactions; Lemieux & Spohr, 1994) is central to rational The Protein Data Bank contains over 180 000 biomolecular structures (as of May 2021) determined by X-ray, NMR and electron cryo-microscopy (cryo-EM) to atomic resolution (https://www.rcsb.org; Burley et al., 2021). While this is invaluable for investigating biomolecular recognition within complexes, it is often not sufficient to translate the biochemical `structure–function' concept into a quantitative algorithm for predicting how strongly a putative drug will bind to its target protein and whether it would disrupt functional protein–protein interactions. Nevertheless, the observation that pharmaceutical companies run automated downloads of the PDB, as well as often performing in-house using X-ray crystallography and now increasingly cryo-EM, shows the value of this structural information to structure-based drug design.
The relationship between the static structures of interacting partners and their binding affinities is, however, obscured by the dynamic nature of biomolecules. Proteins are sufficiently deformable that they are classified as `soft matter'. As a consequence of this soft mechanics, thermal motion generates molecular conformations that can differ substantially from the average structure. This ensemble of conformations is very difficult to observe experimentally, as are the rearrangements of water networks that take place during molecular recognition. Measurements of biomolecular affinities performed using techniques such as isothermal titration ) or microscale thermophoresis (Jerabek-Willemsen et al., 2014) probe the average interaction between ∼1013 biomolecules over timescales of seconds to minutes under environmental conditions that are frequently different from those required for experimental structural studies. Solvent interactions, especially hydrophobic effects, are central to biomolecular recognition, but remain poorly understood at the structural level. The balance between structure and dynamics in biomolecular recognition is captured by the thermodynamic definition of the free energy (ΔG; see Fig. 1), which is directly related to the binding affinity. Additional understanding of the underlying molecular changes can be obtained by performing ITC at different temperatures and measuring heat-capacity changes (ΔCp). While ΔCp is broadly correlated with the degree of burial of apolar surfaces on complexation, other factors such as changes in protein flexibility during induced fit, or salt concentration, can make a significant contribution (Bergqvist et al., 2004).
(ITC; Huddler & Zartler, 2017The soft mechanics of biomolecules is vital to their function. It enables them to act as molecular switches and machines. The mixture of stiff, ordered secondary structure and flexible disordered loops and hinges in proteins generates a complex underlying free-energy landscape containing multiple energy minima separated by energy barriers. In biomolecular switches, the binding of a specific activator or repressor perturbs the shape of this free-energy landscape to favour a new conformation. Allosteric communication, signalling cascades, cell membrane transporters and molecular motors all perform their functions by undergoing large conformational changes in response to the binding of other proteins or metabolites. It is often difficult to characterize all of the important states using structural studies of fixed species, particularly for membrane proteins (see Table 1).
The solvent/lipid membrane environment has an enormous effect on protein kinetic timescales, as well as on protein structure and thermodynamics. The ratio of the inertial to viscous forces (the Reynolds number) for proteins in water is extremely low, meaning that protein motions are heavily overdamped. Overcoming multiple free-energy barriers in a highly viscous environment requires time. Consequently, proteins explore their conformational states very slowly, with implications for relevant simulation timescales.
2. Physics-based biomolecular simulation for pharmaceutical design: successes and limitations
Biomolecular simulation provides dynamic information to better connect static experimental structures to biological function. These `physics-based' simulations rely on two fundamental ingredients which both introduce approximations and caveats into the computer models. The first is the calculation of conformational energies via a set of empirical potentials known as the `force field' and the second is sampling the numerous configurations of a molecular system, including solute conformations and solvent motions. Not all sampling methods are suitable to compute physically sound quantities; currently, et al., 2011; Kmiecik et al., 2016) and to improve the efficiency of relative binding free-energy calculations (Cournia et al., 2017). MD simulations use Newtonian mechanics to evolve biomolecular conformations as a function of time, often in full atomic detail, thereby generating an ensemble of molecular structures that arises due to thermal fluctuations, as shown in Fig. 2. Water can be represented explicitly, so the fluctuating water networks that drive hydrophobic interactions are accounted for, and charged counterions can be included.
(MD) under defined temperature and pressure has emerged as the main strategy, although Monte Carlo-based probabilistic sampling has been used for coarse-grained modelling (OuldridgeMolecular-dynamics simulations have become almost routine, and the availability of fast and cheap GPU processing has made them accessible to researchers who do not have access to high-performance computing (HPC) facilities. An overview of the field has been published by the Collaborative Computational Project for Biomolecular Simulation (CCPBioSim; Huggins et al., 2019). A comprehensive collection of accessible reviews of current MD topics, such as advanced sampling, force-field development and the inclusion of experimental data (Bonomi & Camilloni, 2019), and expert articles describing state of the art in computational drug design (Wade & Salo-Ahen, 2019) are also available.
In principle, MD simulations have the potential to allow us to observe the binding equilibria between ligands and their biomolecular targets, to obtain on/off rates and binding affinities, and to predict large-scale protein conformational changes in response to external stimuli, such as effector or cofactor binding. In practice, however, the speed of the algorithms and the accuracy of the simulations are both still sufficiently limited that the calculations are not fully predictive, for the reasons discussed in detail below (Sections 2.1 and 2.2). The opportunities for improving the quantitative predictions of biomolecular simulation are summarized in Table 2.
|
2.1. Conformational sampling is limited by computational cost
An atomistic MD simulation of a protein is typically performed over microsecond timescales, which may take around one month of simulation time depending on the size of the protein and the computational resources available. The constraints of the shortest length scale in an atomistic simulation (usually covalent bonds to hydrogen) place a strict upper limit on the integration time step that can be used to evolve the dynamic trajectory. The timescale of MD simulations is restricted by the dual constraints of this short time step and the maximal speed achievable per step. As a time step of 2 fs is common, a typical simulation requires 109 MD cycles. Unfortunately, increasing the number of processors used to run the simulation can only improve the speed up to a hard limit. Eventually, the communication time required to convey information between processors starts to outweigh the advantage of adding more. For Keap1 (see Fig. 2), a molecular target for anti-inflammatory and antioxidant drug design (Cuadrado et al., 2019), it is possible to obtain ∼330 ns per day using the GPU version of AMBER18 on a standard RTX2060 Nvidia graphics card (MD simulations of Keap1 contain around 30 500 atoms when solvated). For comparison, dihydrofolate reductase (which contains around 23 000 atoms when solvated) runs at a speed of 85 µs per day on the specialized Anton 2 supercomputer architecture (Shaw et al., 2014). In 2020 the UK HECBioSim consortium performed a comprehensive benchmarking exercise for popular MD codes (for example NAMD, AMBER and GROMACS), including HPC architectures that have not yet been used extensively for MD, such as ARM (see https://www.hecbiosim.ac.uk/access-hpc/our-benchmark-results/dirac-arm-benchmarks and https://www.hecbiosim.ac.uk/access-hpc/our-benchmark-results/isambard-benchmarks) and IBM Power 9 (see https://www.hecbiosim.ac.uk/access-hpc/our-benchmark-results/bede-benchmarks). This exercise highlighted that GPU versions of GROMACS and AMBER performed particularly well (see https://www.hecbiosim.ac.uk/access-hpc/our-benchmark-results/jade2-benchmarks), showing how the optimization of codes for new computational architectures such as GPUs can be transformative in terms of computational speeds.
Biomolecular simulations are stochastic because atomic motion is driven by random thermal noise. Minor perturbations to the starting conditions, such as swapping around the atomic speeds at the beginning of the simulation, will result in subtly different simulation trajectories, and structures, being sampled from the same phase space. To account for this inherent randomness, practioners run `repeat' calculations from arbitrarily different starting conditions to generate a statistical ensemble. Comparing simulation trajectories, for example two simulations run with different force fields, is therefore challenging, because converged statistical averaging is needed to detect any discrepancies. However, replica calculations have the advantage that each runs concurrently, so achieving tenfold better sampling does not require waiting ten times longer for the simulation to finish, assuming that sufficient computational resources are available.
Pharmacological activity is sensitive to drug-binding kinetics as well as thermodynamics. Kinetics is relevant to factors such as clinical indication of the therapy and the duration of the therapeutic effect. The distinctive roles of thermodynamics and kinetics in drug discovery have been explained in a recent review (Tonge, 2018). In principle, on and off rates will be calculable using atomistic MD simulations when simulations are fast enough to observe multiple binding–unbinding events, simply by observing binding/unbinding kinetics within the MD trajectories. However, in practice advanced sampling methods need to be employed (Bruce et al., 2018). The binding kinetics of representative biomolecular interactions, for example the biotin–streptavidin interaction, the saquinavir–HIV1 protease interaction and the DOT1L–aminonucleoside inhibitor interaction, show that the on rates for ligand binding are relatively consistent (in the range 106–108 m−1 s−1) but the off rate varies with the dissociation constant of the interaction (koff, 10−6–102 s−1; Kd, 10−14–10−6 M). This implies that to observe unbinding events simulations of 0.01–100 000 s in length are required (Copeland, 2016), which are currently computationally unfeasible. Chemically activated conformational changes, for example in membrane-bound transporters, occur over millisecond timescales, and molecular motor timescales, which often additionally involve negotiating a complex, crowded cellular environment, can take minutes. The possible solutions are to either speed up the calculations or simplify the problem (see Table 2).
For the foreseeable future, multi-scale methods and enhanced sampling will be required, especially for larger systems such as protein–protein interactions. A robust comparison of current state-of-the-art methods for calculating protein–ligand association and disassociation rates against two well characterized benchmark systems (mutant T4 lysosyme–ligand and N-HSP90–inhibitor complexes) showed that simulations can already usefully predict relative dissociation rates, but emphasized that access to high-quality experimental data sets is essential for further methods development and validation (Nunes-Alves et al., 2020). Advanced sampling applied to G-protein coupled receptors has enabled the complex conformational landscape to be reconstructed, providing structures of previously unseen active intermediates and revealing state-dependent cholesterol hotspots that are potential allosteric regulatory sites (Lovera et al., 2019).
2.2. MD force-field parameterization is crucial for accuracy
The accuracy of an MD simulation depends critically on the accuracy of the underlying energy model (force field; Dauber-Osguthorpe & Hagler, 2019), because this is how the relative energies of each are calculated. Force-field development is highly challenging because the potential must be carefully refined by balancing numerous parameters for every chemical motif of interest. As yet, no systematic automated method has emerged which performs this task satisfactorily, placing severe limitations on the reliability of computational predictions for pharmaceutical especially for molecular-recognition events. The equilibria that govern binding and unbinding events in molecular recognition are exquisitely sensitive to small changes in the underlying free energy (Foloppe & Hubbard, 2006), since there is an exponential relation between the association binding constant and the corresponding binding free energy (Fig. 1). A 1 kcal mol−1 free-energy change results in an almost tenfold change in the corresponding binding constant (Foloppe & Hubbard, 2006). Consequently, very small errors in the calculated binding potential energies (and the accompanying free energies) result in exponentially magnified errors in the binding constants, i.e. unreliable predictions of ligand–target affinities. Thus, somewhat quantitative binding-affinity predictions would require force fields that are accurate to at least 0.5 kcal mol−1. Unfortunately, the complexity and diversity of intermolecular interactions has made such accuracy elusive. This issue has plagued binding-affinity calculations (Mikulskis et al., 2014), especially when confronted with the vast diversity of small molecules investigated for drug discovery. There is no fundamental obstacle to the derivation of a force field covering the vast array of chemistries encountered in pharmaceutical discovery, apart from the tremendous determination and effort required. This is being tackled by some research groups, with incremental but steady progress (Vanommeslaeghe & MacKerell, 2015; Harder et al., 2016; Hagler, 2019; Piana et al., 2020). Alongside improved configurational sampling, this provides a stronger foundation for molecular simulations to contribute to pharmaceutical research.
2.3. Current applications of biosimulation in pharma
Many small molecules are difficult, time-consuming or resource-intensive to make synthetically. Simulations capable of predicting biomolecular binding free energies reduce the number of compounds that need to be synthesized and tested in the laboratory, improving the efficiency of the drug-discovery pipeline. Free-energy perturbation (FEP) has begun to be used by pharma to predict the relative binding free energies of congeneric compounds (Jorgensen, 2009; Wang et al., 2015; Schindler et al., 2020). Most commonly, FEP calculations morph one ligand (or interacting residue in the binding site) into another using a series of small alchemical changes (Michel et al., 2010). This can be computationally expensive because the perturbation must be applied slowly to obtain adequate sampling. FEP is successful as a theory since it is based on a sound statistical-mechanical treatment, can be implemented computationally and is adapted to the medicinal chemistry practice of introducing stepwise modifications to lead compounds during optimization. FEP considers local perturbations resulting from small chemical changes to the ligand or its binding pocket. This reduces the computational complexity of the calculations, because it does not need to either predict the relative affinity of chemically diverse ligands or identify de novo binding modes/sites, or sample large-scale conformational rearrangements of the protein. Moreover, by focusing on a single chemical scaffold, researchers can tune their parameterization to achieve the accuracy necessary, without the need to provide a general solution for the whole of chemical space. In addition, FEP can be used to select mutations to engineer protein stability (Duan et al., 2020; Ford & Babaoglu, 2017); such stabilized proteins are sought for more robust assays, increased chances of crystallization or more stable therapeutic biologics.
Simulations in drug discovery go well beyond FEP calculations. MD simulations have been used to identify binding hotspots on protein surfaces via so-called co-solvent simulations (Ghanakota & Carlson, 2016), in which a protein is simulated in the presence of selected small solutes present at high concentrations in aqueous solution; it can identify protein surface patches with a propensity to bind organic fragments (in competition with water), or highlight the type of chemical group displacing water efficiently in a particular pocket. Since the surface of a protein is dynamic, some pockets open only transiently and may not be observed in an X-ray structure, and for this reason have been dubbed `cryptic pockets' (Vajda et al., 2018). Even a small cryptic pocket may be of interest if it can be reached from a nearby larger binding site, in particular when targeting shallow protein sites involved in protein–protein interactions. Cryptic pockets may be revealed by standard MD (Martinez-Rosell et al., 2020) or enhanced sampling methods (Oleinikovas et al., 2016). The same approaches may also reveal allosteric pockets, and allosteric modulation of proteins (through long-range changes in structure or dynamics; Motlagh et al., 2014), which have been a focus of the pharmaceutical industry in recent years (Durrant & McCammon, 2011). MD simulations in explicit solvent can also be used to observe the conformational flexibility of compounds in their unbound state (Foloppe & Chen, 2016) to approach the energetic and entropic contributions of compound conformational focusing upon binding to a biomolecule. The prediction of compound permeation across lipid membranes, which is a physicochemical property vital to both drug uptake, distribution and toxicity, is yet another promising application of simulations (Awoonor-Williams & Rowley, 2016).
The ability to examine the dynamic structure of a protein via plain MD should not be underestimated. Visualizing side-chain rearrangements in a targeted site or the dynamics of nearby loop conformations can provide mechanistic insight that is invaluable for drug development, for example in the analysis of antivirals against influenza (Amaro et al., 2009). Indeed, simulations have been likened to a `computational microscope' (Dror et al., 2012). While simulations have begun to contribute to in the pharmaceutical and biotechnology industries, other engineering fields, such as aerospace, have benefitted more rapidly from the adoption of computational tools. The developments needed to bring more computational tools into pharma are summarized in Table 3.
|
3. The future potential of biomolecular simulation for pharma
All molecular recognition in biology fundamentally involves chemical complementarity, molecular flexibility and the surrounding solvent environment. Physics-based simulations, such as MD, are uniquely able to capture the details of this physical chemistry because these models are built up from a physical understanding of molecular interactions and mechanics. They have the capability to capture both atomistic details and global conformational changes. Other (very useful) computational chemistry approaches include docking using empirically derived scoring functions, quantitative structure–activity relationship analyses and quantum-mechanical calculations. However, these approaches are unable to represent the full complexity of binding phenomena in aqueous solvent, because when used in isolation these methods are unable to account for dynamics or hydration of the compounds and the protein. Therefore, it is essential that drug-discovery teams learn how to harness the growing power of physics-based simulations. In addition to drug-design applications, this should provide much-needed theoretical insights into molecular recognition. An essential additional development must be the improvement of the potentials (`force fields') used to propagate the simulations. As argued above, much of the groundwork for this is being laid, and progress towards those objectives is achievable in the foreseeable future. However, cultural shifts will also be required alongside technical improvements.
The time taken to obtain results is an important criterion because of the fast pace of industrial drug-discovery projects. The rapid response of the biomolecular simulation community to the COVID-19 pandemic shows that MD is now fast enough to provide insightful results as the situation evolves. MD simulations combined with cryo-EM have identified a linoleic acid binding site in the SARS-CoV-2 spike glycoprotein, which offers a new target site for drug design (Toelzer et al., 2020); simulations of emerging mutations in the receptor-binding site of the spike protein have provided molecular-level insights into the associated changes in transmissibility (Luan et al., 2021) and MD studies of spike-protein glycoforms have shown how much of the surface is shielded by with implications for antibody recognition and design (Grant et al., 2020).
As biomedical interventions become more sophisticated, for example using antibody–drug conjugates, smart drug-delivery vehicles, theranostics or other biologics, fundamentally new types of computational models to optimize design will be needed. Computer models constructed to complement experimental studies help researchers to visualize the different components of their experimental procedures, which can assist in identifying variables that need to be controlled. The success of the AlphaFold neural network in predicting protein structures (Senior et al., 2020) has generated much interest in applying artificial intelligence to pharmaceutical design (Schneider et al., 2020); however, past experience also suggests that overenthusiasm for nascent tools can lead to disappointment (Jordan, 2018). Engineering capabilities have been enhanced by computer models throughout industry, and in Part 2 we will discuss how developments in computer hardware, software and methods for standardization and validation have enabled aerodynamics and weather modelling to become embedded in the research culture in these fields. Quantitative biomolecular simulations promise equivalent benefits and may not be far behind.
Supporting information
Supplementary Movie for Figure 1. DOI: https://doi.org/10.1107/S2059798321009712/qr5005sup1.mp4
Acknowledgements
We would like to thank Philippe Garang, Paul Selwood, Mohsen Lahooti, Jemma Shipton, Spencer Sherwin and Beth Wingate for helpful discussions on aerodynamics engineering and weather modelling.
References
Amaro, R. E., Cheng, X., Ivanov, I., Xu, D. & McCammon, J. A. (2009). J. Am. Chem. Soc. 131, 4702–4709. CrossRef PubMed CAS Google Scholar
Awoonor-Williams, E. & Rowley, C. N. (2016). Biochim. Biophys. Acta, 1858, 1672–1687. CAS PubMed Google Scholar
Beierlein, F. R., Michel, J. & Essex, J. W. (2011). J. Phys. Chem. B, 115, 4911–4926. CrossRef CAS PubMed Google Scholar
Bergqvist, S., Williams, M. A., O'Brien, R. & Ladbury, J. E. (2004). J. Mol. Biol. 336, 829–842. CrossRef PubMed CAS Google Scholar
Bhati, A. P., Wan, S., Hu, Y., Sherborne, B. & Coveney, P. V. (2018). J. Chem. Theory Comput. 14, 2867–2880. CrossRef CAS PubMed Google Scholar
Bonomi, M. & Camilloni, C. (2019). Editors. Biomolecular Simulations. New York: Humana Press. Google Scholar
Bruce, N. J., Ganotra, G. K., Kokh, D. B., Sadiq, S. K. & Wade, R. C. (2018). Curr. Opin. Struct. Biol. 49, 1–10. CrossRef CAS PubMed Google Scholar
Burley, S. K., Bhikadiya, C., Bi, C., Bittrich, S., Chen, L., Crichlow, G. V., Christie, C. H., Dalenberg, K., Di Costanzo, L., Duarte, J. M., Dutta, S., Feng, Z., Ganesan, S., Goodsell, D. S., Ghosh, S., Green, R. K., Guranović, V., Guzenko, D., Hudson, B. P., Lawson, C. L., Liang, Y., Lowe, R., Namkoong, H., Peisach, E., Persikova, I., Randle, C., Rose, A., Rose, Y., Sali, A., Segura, J., Sekharan, M., Shao, C., Tao, Y. P., Voigt, M., Westbrook, J. D., Young, J. Y., Zardecki, C. & Zhuravleva, M. (2021). Nucleic Acids Res. 49, D437–D451. CrossRef CAS PubMed Google Scholar
Copeland, R. A. (2016). Nat. Rev. Drug Discov. 15, 87–95. Web of Science CrossRef CAS PubMed Google Scholar
Cournia, Z., Allen, B. & Sherman, W. (2017). J. Chem. Inf. Model. 57, 2911–2937. CrossRef CAS PubMed Google Scholar
Cuadrado, A., Rojo, A. I., Wells, G., Hayes, J. D., Cousin, S. P., Rumsey, W. L., Attucks, O. C., Franklin, S., Levonen, A. L., Kensler, T. W. & Dinkova-Kostova, A. T. (2019). Nat. Rev. Drug Discov. 18, 295–317. CrossRef CAS PubMed Google Scholar
Dauber-Osguthorpe, P. & Hagler, A. T. (2019). J. Comput. Aided Mol. Des. 33, 133–203. CAS PubMed Google Scholar
De Vivo, M., Masetti, M., Bottegoni, G. & Cavalli, A. (2016). J. Med. Chem. 59, 4035–4061. CrossRef CAS PubMed Google Scholar
Dror, R. O., Dirks, R. M., Grossman, J. P., Xu, H. & Shaw, D. E. (2012). Annu. Rev. Biophys. 41, 429–452. Web of Science CrossRef CAS PubMed Google Scholar
Duan, J., Lupyan, D. & Wang, L. (2020). Biophys. J. 119, 115–127. CrossRef CAS PubMed Google Scholar
Durrant, J. D. & McCammon, J. A. (2011). BMC Biol. 9, 71. Google Scholar
Foloppe, N. & Chen, I. J. (2016). Bioorg. Med. Chem. 24, 2159–2189. CrossRef CAS PubMed Google Scholar
Foloppe, N. & Hubbard, R. (2006). Curr. Med. Chem. 13, 3583–3608. CrossRef PubMed CAS Google Scholar
Ford, M. C. & Babaoglu, K. (2017). J. Chem. Inf. Model. 57, 1276–1285. CrossRef CAS PubMed Google Scholar
Gaieb, Z., Parks, C. D., Chiu, M., Yang, H., Shao, C., Walters, W. P., Lambert, M. H., Nevins, N., Bembenek, S. D., Ameriks, M. K., Mirzadegan, T., Burley, S. K., Amaro, R. E. & Gilson, M. K. (2019). J. Comput. Aided Mol. Des. 33, 1–18. CrossRef CAS PubMed Google Scholar
Ghanakota, P. & Carlson, H. A. (2016). J. Med. Chem. 59, 10383–10399. CrossRef CAS PubMed Google Scholar
Grant, O. C., Montgomery, D., Ito, K. & Woods, R. J. (2020). Sci. Rep. 10, 14991. CrossRef PubMed Google Scholar
Grossfield, A., Patrone, P. N., Roe, D. R., Schultz, A. J., Siderius, D. W. & Zuckerman, D. M. (2018). Living J. Comput. Mol. Sci. 1, 5067. PubMed Google Scholar
Hafner, A. E., Krausser, J. & Šarić, A. (2019). Curr. Opin. Struct. Biol. 58, 43–52. CrossRef CAS PubMed Google Scholar
Hagler, A. T. (2019). J. Comput. Aided Mol. Des. 33, 205–264. CrossRef CAS PubMed Google Scholar
Harder, E., Damm, W., Maple, J., Wu, C., Reboul, M., Xiang, J. Y., Wang, L., Lupyan, D., Dahlgren, M. K., Knight, J. L., Kaus, J. W., Cerutti, D. S., Krilov, G., Jorgensen, W. L., Abel, R. & Friesner, R. A. (2016). J. Chem. Theory Comput. 12, 281–296. Web of Science CrossRef CAS PubMed Google Scholar
Huber, G. A. & McCammon, J. A. (2019). Trends Chem. 1, 727–738. CrossRef CAS PubMed Google Scholar
Huddler, D. & Zartler, E. R. (2017). Editors. Applied Biophysics for Drug Discovery. Chichester: John Wiley & Sons. Google Scholar
Huggins, D. J., Biggin, P. C., Dämgen, M. A., Essex, J. W., Harris, S. A., Henchman, R. H., Khalid, S., Kuzmanic, A., Laughton, C. A., Michel, J., Mulholland, A. J., Rosta, E., Sansom, M. S. P. & van der Kamp, M. W. (2019). WIREs Comput. Mol. Sci. 9, e1393. Google Scholar
Jagger, B. R., Lee, C. T. & Amaro, R. E. (2018). J. Phys. Chem. Lett. 9, 4941–4948. CrossRef CAS PubMed Google Scholar
Jerabek-Willemsen, M., André, T., Wanner, R., Roth, H. M., Duhr, S., Baaske, P. & Breitsprecher, D. (2014). J. Mol. Struct. 1077, 101–113. CAS Google Scholar
Jing, Z., Liu, C., Cheng, S. Y., Qi, R., Walker, B. D., Piquemal, J. P. & Ren, P. (2019). Annu. Rev. Biophys. 48, 371–394. CrossRef CAS PubMed Google Scholar
Jordan, A. M. (2018). ACS Med. Chem. Lett. 9, 1150–1152. CrossRef CAS PubMed Google Scholar
Jorgensen, W. L. (2009). Acc. Chem. Res. 42, 724–733. Web of Science CrossRef PubMed CAS Google Scholar
Kamerlin, S. C. L. & Warshel, A. (2011). WIREs Comput. Mol. Sci. 1, 30–45. CrossRef CAS Google Scholar
Kmiecik, S., Gront, D., Kolinski, M., Wieteska, L., Dawid, A. E. & Kolinski, A. (2016). Chem. Rev. 116, 7898–7936. CrossRef CAS PubMed Google Scholar
Koukos, P. I. & Bonvin, A. M. J. J. (2020). J. Mol. Biol. 432, 2861–2881. CrossRef CAS PubMed Google Scholar
Lazim, R., Suh, D. & Choi, S. (2020). Int. J. Mol. Sci. 21, 6339. CrossRef Google Scholar
Lemieux, R. U. & Spohr, U. (1994). Adv. Carbohydr. Chem. Biochem. 50, 1–20. CrossRef CAS PubMed Google Scholar
Lovera, S., Cuzzolin, A., Kelm, S., De Fabritiis, G. & Sands, Z. A. (2019). Sci. Rep. 9, 14199. CrossRef PubMed Google Scholar
Lowe, D. (2019). Chemistry World. https://www.chemistryworld.com/opinion/will-computers-ever-discover-drugs-from-scratch/4010483.article. Google Scholar
Luan, B., Wang, H. & Huynh, T. (2021). FEBS Lett. 595, 1454–1461. CrossRef CAS PubMed Google Scholar
Martinez-Rosell, G., Lovera, S., Sands, Z. A. & De Fabritiis, G. (2020). J. Chem. Inf. Model. 60, 2314–2324. CAS PubMed Google Scholar
McGuffee, S. R. & Elcock, A. H. (2010). PLoS Comput. Biol. 6, e1000694. CrossRef PubMed Google Scholar
McLeish, T. C., Cann, M. J. & Rodgers, T. L. (2015). Biophys. J. 109, 1240–1250. CrossRef CAS PubMed Google Scholar
Merz, K. M. Jr (2010). J. Chem. Theory Comput. 6, 1769–1776. CrossRef CAS Google Scholar
Michel, J., Foloppe, N. & Essex, J. W. (2010). Mol. Inf. 29, 570–578. CAS Google Scholar
Mikulskis, P., Genheden, S. & Ryde, U. (2014). J. Chem. Inf. Model. 54, 2794–2806. CrossRef CAS PubMed Google Scholar
Motlagh, H. N., Wrabl, J. O., Li, J. & Hilser, V. J. (2014). Nature, 508, 331–339. Web of Science CrossRef CAS PubMed Google Scholar
Muller, M. P., Jiang, T., Sun, C., Lihan, M., Pant, S., Mahinthichaichan, P., Trifan, A. & Tajkhorshid, E. (2019). Chem. Rev. 119, 6086–6161. CrossRef CAS PubMed Google Scholar
Nunes-Alves, A., Kokh, D. B. & Wade, R. C. (2020). Curr. Opin. Struct. Biol. 64, 126–133. CAS PubMed Google Scholar
Oleinikovas, V., Saladino, G., Cossins, B. P. & Gervasio, F. L. (2016). J. Am. Chem. Soc. 138, 14257–14263. CrossRef CAS PubMed Google Scholar
Ouldridge, T. E., Louis, A. A. & Doye, J. P. (2011). J. Chem. Phys. 134, 085101. CrossRef PubMed Google Scholar
Piana, S., Robustelli, P., Tan, D., Chen, S. & Shaw, D. E. (2020). J. Chem. Theory Comput. 16, 2494–2507. CrossRef CAS PubMed Google Scholar
Robinson, C. V., Sali, A. & Baumeister, W. (2007). Nature, 450, 973–982. CrossRef PubMed CAS Google Scholar
Schindler, C. E. M., Baumann, H., Blum, A., Böse, D., Buchstaller, H. P., Burgdorf, L., Cappel, D., Chekler, E., Czodrowski, P., Dorsch, D., Eguida, M. K. I., Follows, B., Fuchss, T., Grädler, U., Gunera, J., Johnson, T., Jorand Lebrun, C., Karra, S., Klein, M., Knehans, T., Koetzner, L., Krier, M., Leiendecker, M., Leuthner, B., Li, L., Mochalkin, I., Musil, D., Neagu, C., Rippmann, F., Schiemann, K., Schulz, R., Steinbrecher, T., Tanzer, E. M., Unzue Lopez, A., Viacava Follis, A., Wegener, A. & Kuhn, D. (2020). J. Chem. Inf. Model. 60, 5457–5474. CrossRef CAS PubMed Google Scholar
Schneider, P., Walters, W. P., Plowright, A. T., Sieroka, N., Listgarten, J., Goodnow, R. A. Jr, Fisher, J., Jansen, J. M., Duca, J. S., Rush, T. S., Zentgraf, M., Hill, J. E., Krutoholow, E., Kohler, M., Blaney, J., Funatsu, K., Luebkemann, C. & Schneider, G. (2020). Nat. Rev. Drug Discov. 19, 353–364. CrossRef CAS PubMed Google Scholar
Senior, A. W., Evans, R., Jumper, J., Kirkpatrick, J., Sifre, L., Green, T., Qin, C., Žídek, A., Nelson, A. W. R., Bridgland, A., Penedones, H., Petersen, S., Simonyan, K., Crossan, S., Kohli, P., Jones, D. T., Silver, D., Kavukcuoglu, K. & Hassabis, D. (2020). Nature, 577, 706–710. Web of Science CrossRef CAS PubMed Google Scholar
Shaw, D. E., Grossman, J. P., Bank, J. A., Batson, B., Butts, J. A., Chao, J. C., Deneroff, M. M., Dror, R. O., Even, A., Fenton, C. H., Forte, A., Gagliardo, J., Gill, G., Greskamp, B., Ho, C. R., Ierardi, D. J., Iserovich, L., Kuskin, J. S., Larson, R. H., Layman, T., Lee, L.-S., Lerer, A. K., Li, C., Killebrew, D., Mackenzie, K. M., Mok, S. Y.-H., Moraes, M. A., Mueller, R., Nociolo, L. J., Peticolas, J. L., Quan, T., Ramot, D., Salmon, J. K., Scarpazza, D. P., Schafer, U. B., Siddique, N., Snyder, C. W., Spengler, J., Tang, P. T. P., Theobald, M., Toma, H., Towles, B., Vitale, B., Wang, S. C. & Young, C. (2014). Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC14), pp. 41–53. Piscataway: IEEE. Google Scholar
Souza, P. C. T., Thallmair, S., Conflitti, P., Ramírez-Palacios, C., Alessandri, R., Raniolo, S., Limongelli, V. & Marrink, S. J. (2020). Nat. Commun. 11, 3714. CrossRef PubMed Google Scholar
Tobi, D. & Bahar, I. (2005). Proc. Natl Acad. Sci. USA, 102, 18908–18913. Web of Science CrossRef PubMed CAS Google Scholar
Toelzer, C., Gupta, K., Yadav, S. K. N., Borucu, U., Davidson, A. D., Kavanagh Williamson, M., Shoemark, D. K., Garzoni, F., Staufer, O., Milligan, R., Capin, J., Mulholland, A. J., Spatz, J., Fitzgerald, D., Berger, I. & Schaffitzel, C. (2020). Science, 370, 725–730. CrossRef CAS PubMed Google Scholar
Tonge, P. J. (2018). ACS Chem. Neurosci. 9, 29–39. CrossRef CAS PubMed Google Scholar
Vajda, S., Beglov, D., Wakefield, A. E., Egbert, M. & Whitty, A. (2018). Curr. Opin. Chem. Biol. 44, 1–8. CrossRef CAS PubMed Google Scholar
Vanommeslaeghe, K. & MacKerell, A. D. Jr (2015). Biochim. Biophys. Acta, 1850, 861–871. CrossRef CAS PubMed Google Scholar
Wade, R. C. & Salo-Ahen, O. M. H. (2019). Molecules, 24, 321. CrossRef Google Scholar
Wang, L., Wu, Y., Deng, Y., Kim, B., Pierce, L., Krilov, G., Lupyan, D., Robinson, S., Dahlgren, M. K., Greenwood, J., Romero, D. L., Masse, C., Knight, J. L., Steinbrecher, T., Beuming, T., Damm, W., Harder, E., Sherman, W., Brewer, M., Wester, R., Murcko, M., Frye, L., Farid, R., Lin, T., Mobley, D. L., Jorgensen, W. L., Berne, B. J., Friesner, R. A. & Abel, R. (2015). J. Am. Chem. Soc. 137, 2695–2703. CrossRef CAS PubMed Google Scholar
Willow, S. Y., Xie, B., Lawrence, J., Eisenberg, R. S. & Minh, D. D. L. (2020). Phys. Chem. Chem. Phys. 22, 12044–12057. CrossRef CAS PubMed Google Scholar
Yamashita, T., Peng, Y., Knight, C. & Voth, G. A. (2012). J. Chem. Theory Comput. 8, 4863–4875. CrossRef CAS PubMed Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.