research papers\(\def\hfill{\hskip 5em}\def\hfil{\hskip 3em}\def\eqno#1{\hfil {#1}}\)

Journal logoSTRUCTURAL
BIOLOGY
ISSN: 2059-7983

Polder maps: improving OMIT maps by excluding bulk solvent

CROSSMARK_Color_square_no_text.svg

aMolecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA 94720, USA, bBioscience Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA, and cDepartment of Bioengineering, University of California Berkeley, Berkeley, CA 94720, USA
*Correspondence e-mail: dcliebschner@lbl.gov

(Received 17 June 2016; accepted 14 November 2016; online 1 February 2017)

The crystallographic maps that are routinely used during the structure-solution workflow are almost always model-biased because model information is used for their calculation. As these maps are also used to validate the atomic models that result from model building and refinement, this constitutes an immediate problem: anything added to the model will manifest itself in the map and thus hinder the validation. OMIT maps are a common tool to verify the presence of atoms in the model. The simplest way to compute an OMIT map is to exclude the atoms in question from the structure, update the corresponding structure factors and compute a residual map. It is then expected that if these atoms are present in the crystal structure, the electron density for the omitted atoms will be seen as positive features in this map. This, however, is complicated by the flat bulk-solvent model which is almost universally used in modern crystallographic refinement programs. This model postulates constant electron density at any voxel of the unit-cell volume that is not occupied by the atomic model. Consequently, if the density arising from the omitted atoms is weak then the bulk-solvent model may obscure it further. A possible solution to this problem is to prevent bulk solvent from entering the selected OMIT regions, which may improve the interpretative power of residual maps. This approach is called a polder (OMIT) map. Polder OMIT maps can be particularly useful for displaying weak densities of ligands, solvent molecules, side chains, alternative conformations and residues both in terminal regions and in loops. The tools described in this manuscript have been implemented and are available in PHENIX.

1. Introduction

OMIT maps (Bhat & Cohen, 1984[Bhat, T. N. & Cohen, G. H. (1984). J. Appl. Cryst. 17, 244-248.]) are a widely used tool to verify whether a certain region of a model in a crystallographic map has sufficient density to justify its presence in the model. An OMIT map is calculated by excluding the atoms in question from the model and is especially useful to verify the presence of ligands, solvent molecules, alternative conformations and residues with weak electron density. Various kinds of OMIT maps have been proposed (Bhat, 1988[Bhat, T. N. (1988). J. Appl. Cryst. 21, 279-281.]; Hodel et al., 1992[Hodel, A., Kim, S.-H. & Brünger, A. T. (1992). Acta Cryst. A48, 851-858.]; Vellieux & Dijkstra, 1997[Vellieux, F. M. D. & Dijkstra, B. W. (1997). J. Appl. Cryst. 30, 396-399.]; Gunčar et al., 2000[Gunčar, G., Klemenčič, I., Turk, B., Turk, V., Karaoglanovic-Carmona, A., Juliano, L. & Turk, D. (2000). Structure, 8, 305-313.]; Terwilliger, Grosse-Kunstleve, Afonine, Moriarty, Adams et al., 2008[Terwilliger, T. C., Grosse-Kunstleve, R. W., Afonine, P. V., Moriarty, N. W., Adams, P. D., Read, R. J., Zwart, P. H. & Hung, L.-W. (2008). Acta Cryst. D64, 515-524.]; Pražnikar et al., 2009[Pražnikar, J., Afonine, P. V., Gunčar, G., Adams, P. D. & Turk, D. (2009). Acta Cryst. D65, 921-931.]). These maps have their advantages and disadvantages. In the case of ligands and alternative conformations, it is desirable to first build and correct the rest of the model before placing a ligand or building difficult-to-interpret residues (`discovery map'; Tronrud, 2008[Tronrud, D. E. (2008). CCP4bb. https://www.mail-archive.com/ccp4bb@jiscmail.ac.uk/msg04276.html.]).

The utility of OMIT maps may be limited by the flat bulk-solvent model that is commonly used in macromolecular crystallographic packages such as CNS (Brünger et al., 1998[Brünger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J.-S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T. & Warren, G. L. (1998). Acta Cryst. D54, 905-921.]), REFMAC (Murshudov et al., 2011[Murshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355-367.]) and PHENIX (Adams et al., 2010[Adams, P. D. et al. (2010). Acta Cryst. D66, 213-221.]). This model assumes a constant solvent density (typically 0.2–0.6 e Å−3) anywhere in the unit-cell volume that is not occupied by the current atomic model (Jiang & Brünger, 1994[Jiang, J. & Brünger, A. (1994). J. Mol. Biol. 243, 100-115.]). The areas of the unit cell that are not interpreted in terms of an atomic model are filled with the bulk solvent by constructing a binary mask set to 0 inside the atomic model and 1 elsewhere (Phillips, 1980[Phillips, S. E. (1980). J. Mol. Biol. 142, 531-554.]; Fokine & Urzhumtsev, 2002[Fokine, A. & Urzhumtsev, A. (2002). Acta Cryst. D58, 1387-1392.]; Afonine et al., 2005[Afonine, P. V., Grosse-Kunstleve, R. W. & Adams, P. D. (2005). Acta Cryst. D61, 850-855.]; Weichenberger et al., 2015[Weichenberger, C. X., Afonine, P. V., Kantardjieff, K. & Rupp, B. (2015). Acta Cryst. D71, 1023-1038.]). If the ligand is removed from the model, the region where it was modeled will be filled with bulk solvent, as shown in Figs. 1[link](a) and 1[link](b). This may diminish regions of weak density, complicating their interpretation.

[Figure 1]
Figure 1
Illustration of how the bulk-solvent mask changes when a ligand (MES 88, PDB entry 1aba) is included (a) or excluded (b) in its construction. Exclusion of the ligand results in the bulk solvent filling the area previously occupied by the MES molecule.

Two typical ways of computing an OMIT map are worth mentioning. One is to remove the atoms in question from the model. In this case, the region containing atoms before their omission will be considered as part of the solvent region. Such a map would not be a true OMIT map since the flat bulk-solvent model will be placed into this region. This would obscure the signal from any structured atoms in this region, making it relative in comparison with the bulk-solvent density. Alternatively, one may restrict filling this region with the solvent model. Typically, this is performed by keeping the atoms in question in the model and setting their occupancies to zero so that they do not contribute to the scattering and also demarcate the solvent mask in this region as nonsolvent (see, for example, Fig. S1b in Choudhary et al., 2014[Choudhary, O. P., Paz, A., Adelman, J. L., Colletier, J.-P., Abramson, J. & Grabe, M. (2014). Nature Struct. Mol. Biol. 21, 626-632.]). The problem with this approach is that the residual map calculated using such a model is likely to show positive density mimicking the shape of the excluded region. This density may arise from the pure bulk solvent or structured atoms or a mixture of these, and owing to the way that this map is calculated it is not possible to discern the source of this density. To illustrate this, a ligand was placed into an arbitrary location in the bulk-solvent region, its occupancy was set to 0 and a residual mFobs − DFmodel map was calculated (Fig. 2[link]a). Rather strong positive density clearly follows the molecule. Obviously, this density reveals the bulk solvent (Fig. 2[link]b) and not the ligand. In the following, we refer to this kind of map as a biased residual OMIT map.

[Figure 2]
Figure 2
(a) A ligand molecule (GRG, PDB entry 4opi) is moved to an arbitrary location in the bulk-solvent region devoid of any electron-density peaks that could justify its position. The volume around the ligand is excluded from mask calculation and its contribution to the structure factor is ignored. The mFobsDFmodel map contoured at 3σ shows strong positive density that follows the shape of the molecule. (b) Example of the mask when ligand is taken into account for mask computation but not for structure-factor calculation (biased map). The protein region is marked 0 and the bulk-solvent mask is marked 1. The mask employed in (b) was used to compute the difference map in (a).

A possible solution to this problem is to realise that the model of the crystal content consists of two major contributions: atomic model and non-atomic model (bulk solvent). Therefore, both the atoms and the bulk solvent should be excluded from the OMIT map calculation and not just the atomic model. Several options have been proposed to deal with the bulk solvent in calculating OMIT maps, such as truncating the low-resolution data and not using the bulk-solvent model at all, or using alternative bulk-solvent models that do not employ a priori modeling (masking), such as the exponential scaling (Babinet) model (Moews & Kretsinger, 1975[Moews, P. C. & Kretsinger, R. H. (1975). J. Mol. Biol. 91, 201-225.]; Tronrud, 1997[Tronrud, D. E. (1997). Methods Enzymol. 277, 306-319.]). Both options are problematic. Truncating the low-resolution data will degrade the quality of the map (Lunin, 1988[Lunin, V. Y. (1988). Acta Cryst. A44, 144-150.]; Urzhumtsev et al., 1989[Urzhumtsev, A. G., Lunin, V. Y. & Luzyanina, T. B. (1989). Acta Cryst. A45, 34-39.]; Urzhumtseva & Urzhumtsev, 2011[Urzhumtseva, L. & Urzhumtsev, A. (2011). J. Appl. Cryst. 44, 865-872.]; Cowtan, 1996[Cowtan, K. D. (1996). Proceedings of the CCP4 Study Weekend. Macromolecular Refinement, edited by E. Dodson, M. Moore, A. Ralph & S. Bailey, pp. 23-28. Warrington: Daresbury Laboratory.]), while the Babinet bulk-solvent model is only valid at resolutions below 8–10 Å (Podjarny & Urzhumtsev, 1997[Podjarny, A. D. & Urzhumtsev, A. G. (1997). Methods Enzymol. 276, 641-658.]). Yet another alternative is to define the OMIT region as larger than the molecule that one wants to identify and then fill the region with a regular grid of small scatterers (electrons or fractions thereof) and refine occupancies, B factors and coordinates restrained to their initial positions (Urzhumtsev, 1997[Urzhumtsev, A. G. (1997). Acta Cryst. D53, 540-543.]). The program BUSTER (Bricogne et al., 2016[Bricogne, G., Blanc, E., Brandl, M., Flensburg, C., Keller, P., Paciorek, W., Roversi, P., Sharff, A., Smart, O. S., Vonrhein, C. & Womack, T. O. (2016). BUSTER. Global Phasing Ltd, Cambridge, England.]) allows the exclusion of regions from bulk solvent by processing an additional file which describes the binding site (without resembling the putative ligand). Furthermore, statistical treatment of non-uniformity of bulk solvent or as yet unmodeled regions has been discussed (Blanc et al., 2004[Blanc, E., Roversi, P., Vonrhein, C., Flensburg, C., Lea, S. M. & Bricogne, G. (2004). Acta Cryst. D60, 2210-2221.]; Perrakis et al., 1999[Perrakis, A., Morris, R. & Lamzin, V. S. (1999). Nature Struct. Biol. 6, 458-463.]; Roversi et al., 2000[Roversi, P., Blanc, E., Vonrhein, C., Evans, G. & Bricogne, G. (2000). Acta Cryst. D56, 1316-1323.]).

In this manuscript, we describe a new approach implemented in the PHENIX software suite. The tool is called phenix.polder and the corresponding maps are referred to as polder maps. The term `polder' was chosen as an analogy to the Dutch term for lowland reclaimed from the sea: polder is land gained by keeping water from penetrating the area. A polder map helps to enhance weak features in electron-density maps by keeping bulk solvent out of the area. Better features are achieved because the polder OMIT density is essentially raised by a constant value equal to the bulk-solvent electron density, compared with an OMIT density where the OMIT region is filled with bulk solvent. In a polder map, the density in the OMIT region is therefore not biased by the bulk solvent.

2. Methods

The calculation of polder OMIT maps consists of several stages (Fig. 3[link]). Firstly, the OMIT region of the unit-cell volume is identified by selecting a group of atoms in the input model that are located in the OMIT region. An intersection of spheres of radius 5 Å around each selected atom is used to mark this region. The choice of 5 Å for the sphere radius is rather arbitrary and is based on two requirements. One is that the OMIT region needs to be large enough to avoid biasing the map by the shape of the masking atoms. The other is that the OMIT region should not remove too much scattering from the model because otherwise it will be damaging to the map. This is especially important for weak features in the map as they may be particularly susceptible to model deterioration. The results from calculations testing different radii for bulk-solvent mask exclusion are presented in the Supporting Information.

[Figure 3]
Figure 3
Working chart for phenix.polder. See text for details.

In the second step, a solvent mask is calculated from the atomic model that does not contain the selected atoms; this mask is then modified to exclude the solvent in the OMIT region defined above.

Finally, structure factors are calculated from the modified mask (`polder mask') and from the atomic model with the selected atoms omitted. These structure factors are then added together, scaled to Fobs as described in Afonine et al. (2013[Afonine, P. V., Grosse-Kunstleve, R. W., Adams, P. D. & Urzhumtsev, A. (2013). Acta Cryst. D69, 625-634.]) and used for calculation of an mFobsDFmodel map. phenix.polder produces a reflection file with two sets of Fourier map coefficients. One set corresponds to the polder OMIT map, and the other to the OMIT map where bulk solvent is allowed to penetrate the omitted region (the latter set of Fourier map coefficients is present for comparison). Biased residual OMIT maps were computed by including the OMIT atoms in calculating the bulk-solvent mask and calculating structure factors for the atomic model without the OMIT atoms. An example of the mask from this procedure (`biased mask') is shown in Fig. 2[link](b). To validate polder maps we have designed a numerical test, which is described in §[link]5.

3. Results

In this section, several examples of the utility of polder maps are presented.

3.1. Ligand density

3.1.1. Ligand GRG 502 in PDB entry 4opi

Figs. 4[link](a) and 4[link](b) show the OMIT map and the polder map for ligand GRG 502 from PDB entry 4opi (Kung et al., 2014[Kung, Y., McAndrew, R. P., Xie, X., Liu, C. C., Pereira, J. H., Adams, P. D. & Keasling, J. D. (2014). Structure, 22, 1028-1036.]), respectively. In the OMIT map, only one phosphate group (left) and the tail located at the right side have positive density at a contour level of 3σ, whereas the center part of the molecule does not have density. At a similar contour level, the polder map shows density for the entire molecule (except for the O and C atoms next to the phosphate group). In order to obtain a similar shape of electron density as the polder map shows at a 3σ contour level, the contour of the OMIT map would have to be decreased to 1.5σ, which is much lower than what is usually accepted as a significant difference density peak (Fig. 4[link]c). The local correlation coefficient between the ligand model map and residual OMIT map (CC) is 0.70 and 0.75 for the OMIT map and the polder map, respectively, suggesting that the polder map is locally of better quality than the OMIT map. The minimum, maximum and mean values of the OMIT map and the polder map at the atomic centres of the ligand are summarized in Table 1[link]. They are systematically larger in the polder map than in the OMIT map. For example, the average value is 4.14 e Å−3 in the former and 2.32 e Å−3 in the latter (reflection F000 was not accounted for here and everywhere else where absolute map values are reported). These higher map values are consistent with the results of visual comparison of the two maps, and with the expectation that a dominant effect of the polder map is to raise the level of the density by the bulk-solvent electron density in the OMIT region.

Table 1
Minimum, maximum and mean values of the electron density in e Å−3 at atomic centers in polder maps and OMIT maps

PDB code Map Minimum Maximum Mean
4opi Polder 1.415 6.071 4.143
OMIT −0.262 4.544 2.321
1aba Polder 1.847 13.440 4.591
OMIT 0.252 12.734 3.454
1c2k Polder 2.137 5.299 4.009
OMIT 1.419 4.396 3.106
1f8t Polder 3.200 12.564 7.425
OMIT 1.504 11.737 6.219
[Figure 4]
Figure 4
(a) OMIT map and (b) polder map for ligand GRG 502 in PDB entry 4opi. The positive and negative mFobsDFmodel OMIT difference density contoured at 3σ is displayed in green and red, respectively. (c) OMIT map contoured at ±1.5σ, at which the ligand density has a similar shape to the polder map.
3.1.2. Ligand MES 88 in PDB entry 1aba

Solvent molecules from the crystallization solution or the soaking or purification steps may be present in the crystal. As for ligands, strong evidence is needed to justify their presence in as yet un­modeled density. Figs. 5[link](a) and 5[link](b) show the OMIT map and the polder map for the solvent molecule MES 88 of PDB entry 1aba (Eklund et al., 1992[Eklund, H., Ingelman, M., Söderberg, B. O., Uhlin, T., Nordlund, P., Nikkola, M., Sonnerstam, U., Joelson, T. & Petratos, K. (1992). J. Mol. Biol. 228, 596-618.]). In the OMIT map, there is only some density for the O, N and S atoms. Placement of the ligand appears difficult to justify looking at the electron density alone. In the polder map, there is difference electron density for the entire molecule; even the ring is correctly resolved. Additional positive electron-density peaks (on top of the ring) can be seen in the polder map. These peaks are likely to correspond to bulk solvent, which has been excluded around the MES molecule. In order to obtain a similar shape of difference density, the contour of the OMIT map has to be decreased to about 2σ (Fig. 5[link]c). The CC in the region of the MES molecule is 0.76 and 0.80 for the OMIT map and the polder map, respectively. The visual improvement in the density is therefore also supported by an increase in the local CC for the polder map. Furthermore, the map values are systematically larger for the polder map than for the OMIT map (Table 1[link]).

[Figure 5]
Figure 5
OMIT maps for ligand MES 88 in PDB entry 1aba. The positive and negative mFobsDFmodel OMIT difference density is displayed in green and red, respectively. (a) OMIT map contoured at ±3σ. (b) Polder map contoured at ±3σ. (c) OMIT map contoured at ±2σ. (d) OMIT map using a Babinet solvent model (±3σ). (e) OMIT map not using any bulk-solvent model and truncating the data at 5 Å resolution (±3σ). (f) OMIT map using a Babinet model (±2σ). (g) OMIT map not using a solvent model and truncating at 5 Å resolution (±2σ).
3.1.3. Ligand ABI 246 in PDB entry 1c2k

Figs. 6[link](a) and 6[link](b) show the OMIT map and the polder map of the ligand molecule 5-amidinobenzimidazole (ABI 246) from PDB entry 1c2k (Katz et al., 1998[Katz, B. A., Clark, J. M., Finer-Moore, J. S., Jenkins, T. E., Johnson, C. R., Ross, M. J., Luong, C., Moore, W. R. & Stroud, R. M. (1998). Nature (London), 391, 608-612.]). In the OMIT map, the density of the central six-membered ring is weak or missing, while the indole and the CN2 group are poorly resolved. In the polder map, the entire molecule is covered by density; the elongated shape of the residual peak around the CN2 group suggests that it slightly rotates around the C—C bond. In order to obtain a similar shape of difference density, the contour of the OMIT map had to be decreased to 2.1σ (Fig. 7[link]c). The map–model CC in the region of the ABI molecule is 0.64 and 0.69 for the OMIT map and the polder map, respectively. The mean map values at the atomic centers are 3.11 and 4.01 e Å−3 in the OMIT map and the polder map, respectively (Table 1[link]), which is in agreement with the visual inspection, with more features visible in the polder map.

[Figure 6]
Figure 6
(a) OMIT map and (b) polder map for ligand ABI 246 in PDB entry 1c2k. The positive and negative mFobsDFmodel OMIT difference density contoured at 3σ is displayed in green and red, respectively. (c) OMIT map contoured at ±2σ, at which the ligand density has a similar shape as the polder map. The gray sphere represents a Zn ion.
[Figure 7]
Figure 7
(a) Original 2mFobsDFmodel (blue, 1σ contour) and mFobsDFmodel maps, (b) OMIT map and (c) polder map for residue GlnH105 in structure 1f8t. The positive and negative mFobsDFmodel difference density contoured at 3σ is displayed in green and red, respectively. In (c), the Gln side chain was real-space refined in Coot.

3.2. Side chains protruding into the solvent region

The side-chain orientations of residues on the surface of the protein are often difficult to model. This is because they are highly mobile and are typically represented by an ensemble of conformations in crystal structures, and thus may have very weak or no density.

Fig. 7[link] shows the original 2mFobsDFmodel and mFobsDFmodel maps, the OMIT and the polder map for residue GlnH105 of PDB entry 1f8t (Fokine et al., 2000[Fokine, A. V., Afonin, P. V., Mikhailova, I. Y., Tsygannik, I. N., Mareeva, T. Y., Nesmeianov, V. A., Pangborn, W., Lee, N., Duax, W., Siszak, E. & Pletnev, V. Z. (2000). Bioorg. Khim. 26, 571-578.]). There is no density for the Gln side chain in the 2mFobsDFmodel map, and the residual mFobsDFmodel map has a negative peak at the side chain position, suggesting that the modeled orientation is incorrect. In the OMIT map (Fig. 7[link]b) there is no density to indicate the orientation of the glutamine side chain either. In contrast, the polder map, while noisy, shows continuous V-shaped density suggesting a different orientation for the side chain. After real-space refinement in Coot (Emsley et al., 2010[Emsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486-501.]), the side chain indeed fits well into the difference density (Fig. 7[link]c). The local CC improves from 0.77 in the OMIT map to 0.83 in the polder map. After real-space refinement based on the OMIT maps, these correlations increase to 0.80 and 0.87, respectively. This suggests that the new orientation describes the experimental data better. It should be noted that the map values at the atomic centers are systematically larger in the polder map.

4. Comparison of several methods to decrease the influence of bulk solvent in OMIT regions

Several approaches have been proposed to decrease the influence of flat bulk solvent in OMIT maps: not using a bulk-solvent model at all and truncating the data at ∼5–6 Å resolution or employing a solvent model which does not use a mask, such as the Babinet model. Fig. 5[link] shows these maps for ligand MES 88 of PDB entry 1aba. For the map computed without using any bulk-solvent model, the resolution was truncated at 5 Å. This resolution cutoff was obtained by comparing the curves of the R factor versus resolution calculated using the flat bulk-solvent model and without using any bulk-solvent model (Fig. 8[link]). The curves are similar up to 4 Å resolution and begin to diverge at about 5 Å.

[Figure 8]
Figure 8
R factor versus resolution for PDB entry 1aba computed using the flat bulk-solvent model (red squares), Babinet solvent model (green circles) and no solvent model at all (blue triangles).

Both maps, calculated using low-resolution truncated data and using the Babinet-based solvent model, show no improvement in the density for the MES molecule (Figs. 5[link]d and 5[link]e) at a contour level of 3σ. At a lower contour level (2σ; Figs. 5[link]f and 5[link]g) the map computed using the Babinet model (Fig. 5[link]f) is slightly superior to the truncated map (Fig. 5[link]g), but both are inferior to the OMIT map computed at a 2σ contour level (Fig. 5[link]c) and especially to the polder map (Fig. 5[link]a). Thus, the approach of truncating the low-resolution data to avoid bulk-solvent mask artifacts in residual maps is the least useful, which emphasizes the importance of low-resolution reflections for map quality. The Babinet model and the method ignoring low-resolution reflections are therefore not appropriate in cases of weak density and do not show an improvement compared with OMIT maps.

5. Validation of polder maps

If the omitted region is surrounded by other atomic features of the model, such as a ligand in a compact binding pocket, the residual density revealed by the polder map might show behavior similar to biased OMIT maps: such density may correspond to either bulk solvent or ordered atoms. The reason is that in tightly packed environments bulk solvent or ligands are likely to mimic the shape of the pocket.

The susceptibility of the polder approach to bias owing to the shape of the ligand-binding pocket was tested by computing three polder maps. Two maps were computed using synthetic data (Fobs = |Fmodel|), one assuming that the omitted atoms are present (m1) and the other assuming that the onitted atoms are not present (m2). The third map is a polder map using actual experimental data (m3). Local correlation coefficients (CC) and peak correlation coefficients (CCpeak; Urzhumtsev et al., 2014[Urzhumtsev, A., Afonine, P. V., Lunin, V. Y., Terwilliger, T. C. & Adams, P. D. (2014). Acta Cryst. D70, 2593-2606.]) were calculated between all three maps using map values from the OMIT region only. By the construction of the test, map m1 is expected to show omitted atoms and m2 is expected to show omitted bulk-solvent density. If the polder map m3 shows the omitted atoms it is expected to correlate best with map m1. If m3 shows the bulk solvent then it is expected to correlate best with map m2. However, if the omitted atoms are highly mobile (as manifested, for example, by large B factors) and/or the resolution is low, map m1 may be rather smeared and could resemble the bulk-solvent map, yielding high correlation with map m2. Also, Urzhumtsev et al. (2014[Urzhumtsev, A., Afonine, P. V., Lunin, V. Y., Terwilliger, T. C. & Adams, P. D. (2014). Acta Cryst. D70, 2593-2606.]) have demonstrated that CCpeak is more adequate for the comparison of three-dimensional functions. Considering pairwise comparisons of all three correlation coefficients, one can assess the confidence of interpreting the polder map in terms of an atomic model. Table 2[link] shows these correlation coefficients for all examples considered in this manuscript. Also, phenix.polder reports all of the abovementioned correlation coefficients.

Table 2
Local correlation coefficients (CC) and CCpeak between three polder maps: m1, m2 and m3

See §[link]5 for details. Map 1 (m1), calculated Fobs assuming that the omitted atoms are present. Map 2 (m2), calculated Fobs assuming that the omitted atoms are not present. Map 3 (m3), polder map using experimental data.

    m1–m2 m1–m3 m2–m3
PDB code   CC CCpeak CC CCpeak CC CCpeak
4opi Fictitious ligand in solvent area 0.30 0.35 0.28 0.35 0.51 0.50
4opi Ligand GRG 502 0.66 0.69 0.77 0.74 0.65 0.66
1aba Ligand MES 88 0.59 0.69 0.80 0.72 0.48 0.54
1c2k Ligand ABI 246 0.63 0.69 0.72 0.69 0.64 0.63
1f8t Residue Gln105 0.17 0.30 0.88 0.77 0.09 0.16
1us0 Ligand LDT 320 0.24 0.53 0.99 0.85 0.24 0.47

5.1. Ligand LDT 320 in PDB entry 1us0

As the examples in this manuscript consider weak OMIT densities, validation was also carried out for a ligand with very clear density in a high-resolution structure. The acetic acid molecule LDT 320 in PDB entry 1us0 (Howard et al., 2004[Howard, E. I., Sanishvili, R., Cachau, R. E., Mitschler, A., Chevrier, B., Barth, P., Lamour, V., Van Zandt, M., Sibley, E., Bon, C., Moras, D., Schneider, T. R., Joachimiak, A. & Podjarny, A. (2004). Proteins, 55, 792-804.]) is modeled with full occupancy (except for the Br atom, which has an occupancy of 0.94) and has very low disorder, with an average isotropic B factor of 4.2 Å2. For LDT 320, the pairs of correlation coefficients behave as expected. CCm1m2 and CCm2m3 are low (both have values of 0.24), while CCm1m3 is very high (0.99). The peak correlation coefficients follow the same trend, although the difference between CCm1m2/CCm2m3 and CCm1m3 is smaller (Table 2[link]).

5.2. Ligand GRG 502 in PDB entry 4opi

For the fictitious ligand placed in the bulk-solvent area in PDB entry 4opi, the correlation coefficient between m2 and m3 is the largest (CCm2m3 = 0.51), i.e. the map from calculated data without ligand correlates best with the experimental polder map (which, by the construction of this example, only contains bulk solvent). At the same time, the CC between m1 and m3 is very poor (CCm1m3 = 0.28), which also suggests that no ligand is present at this location.

For ligand GRG 502 in PDB entry 4opi, the CC between m1 and m3 is the largest (CCm1m3 = 0.77), while the other CCs are smaller (CCm1m2 = 0.66 and CCm2m3 = 0.65), thus suggesting that the binding cavity contains ligand and not bulk solvent.

5.3. Ligand MES 88 in PDB entry 1aba

The CC between m1 and m3 for MES 88 is 0.80, while CCm1m2 (0.59) and CCm2m3 (0.48) are much lower. The CCs therefore strongly suggest that the ligand is present.

5.4. Ligand ABI 246 in PDB entry 1c2k

For ligand ABI 246, CCm1m3 is larger (0.72) than CCm1m2 and CCm2m3 (0.63 and 0.64, respectively), thus favoring the ligand. However, the values of CCpeak are similar for CCpeak-m1m2 (0.69) and CCpeak-m1m3 (0.69) and only marginally differ from CCpeak-m2m3 (0.63). Therefore, the electron density of this ligand should be interpreted with care. While numerical measures are not sufficient to decide whether the density belongs to ABI or bulk solvent, several considerations justify the placement of the ABI molecule. The polder electron density has the same shape as the ABI molecule; for example, it shows an `opening' in the six-membered benzene ring of the indole group (at the 3σ contour level), and the two strongest peaks are at the N3 atom of the pyrrole ring and at the N1 atom of the CN2 group. Furthermore, the orientation of the molecule is such that it forms numerous hydrogen bonds to protein residues (for example N2—HH22⋯OAsp189), water molecules (for example N4—HN4⋯OHOH260) and a Zn ion (N3⋯ZN258). As the molecule displays rather strong disorder (Biso of ∼50 Å2, occupancy 0.69), it is likely that its density resembles bulk solvent and therefore yielded high correlation to the bulk-solvent density in the binding region. Fig. 9[link] shows the three validation maps (m1, m2 and m3) for the ABI molecule. Map m1, which is based on calculated structure factors, clearly follows the shape of the molecule. Map m2, which represents bulk-solvent density, has a similar shape, although the density peak does not cover the CN2 group and also occupies the region further away from the indole group. This explains why m1 and m2 yield a relatively high correlation coefficient CCpeak. The experimental map m3 shows greater resemblance to the calculated map m1 than to the bulk solvent in map m2, which is reflected by the CCpeak values, which are 0.69 and 0.63, respectively. It can be further noted that the ABI molecule was modeled with an occupancy of 0.69. Therefore, in 31% of the instances the binding pocket is filled with bulk solvent, which may explain the lack of a clear distinction between ligand and bulk solvent in this case.

[Figure 9]
Figure 9
Trial polder maps of ligand molecule ABI 246 in PDB entry 1c2k used for the computation of correlation coefficients as described in §[link]5: m1 (a), m2 (b) and m3 (c). All maps are contoured at 2.5σ.

5.5. Residue GlnH105 in PDB entry 1f8t

The last example, residue GlnH105, shows a rather strong correlation between m1 and m3 (CCm1m3 = 0.88), whereas it is rather weak for the other maps (CCm1m2 = 0.17 and CCm2m3 = 0.09). The correlation coefficients between maps from synthetic data and from experimental data are therefore a good measure of the reliability of polder maps.

6. Comparison of simulated-annealing OMIT and polder maps

It is often suspected that omitting atoms does not entirely remove model bias (Hodel et al., 1992[Hodel, A., Kim, S.-H. & Brünger, A. T. (1992). Acta Cryst. A48, 851-858.]). It is therefore common to carry out several rounds of refinement, optionally adding simulated annealing (SA) to remove the `memory' of the atoms to be omitted (Rupp, 2009[Rupp, B. (2009). Biomolecular Crystallography: Principles, Practice, and Application to Structural Biology. New York: Garland Science.]; Terwilliger, Grosse-Kunstleve, Afonine, Moriarty, Zwart et al., 2008[Terwilliger, T. C., Grosse-Kunstleve, R. W., Afonine, P. V., Moriarty, N. W., Zwart, P. H., Hung, L.-W., Read, R. J. & Adams, P. D. (2008). Acta Cryst. D64, 61-69.]; Brünger et al., 1998[Brünger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J.-S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T. & Warren, G. L. (1998). Acta Cryst. D54, 905-921.]). To compare the result of the polder procedure with a standard SA map, SA refinement was performed with the simulated_annealing=True option for the first macrocycle in phenix.refine (Afonine et al., 2012[Afonine, P. V., Grosse-Kunstleve, R. W., Echols, N., Headd, J. J., Moriarty, N. W., Mustyakimov, M., Terwilliger, T. C., Urzhumtsev, A., Zwart, P. H. & Adams, P. D. (2012). Acta Cryst. D68, 352-367.]) for model 4opi (without ligand GRG 502). The OMIT map and the polder map for ligand GRG 502 are displayed in Fig. 10[link]. Similar to the results discussed in §[link]3.1.1, the SA OMIT map (Fig. 10[link]a) has much less clear ligand density than the SA polder map (Fig. 10[link]b).

[Figure 10]
Figure 10
(a) Simulated-annealing OMIT map and (b) polder map for ligand GRG 502 in PDB entry 4opi. The positive and negative mFobsDFmodel OMIT difference density contoured at 3σ is displayed in green and red, respectively.

However, it is not recommended to carry out SA refinement routinely for polder maps. SA refinement may be appropriate for reducing model bias, but it has a limited scope of application. Firstly, during SA refinement the quality of the model may deteriorate if performed with regions of the model omitted. Generally, SA refinement is most appropriate at the initial stages of refinement (Adams et al., 1999[Adams, P. D., Pannu, N. S., Read, R. J. & Brunger, A. T. (1999). Acta Cryst. D55, 181-190.]) as opposed to the final stages, when the polder map is likely to be needed. Since the aim of a polder map is to amplify weak features in electron-density maps, any potential worsening of the model is counterproductive. Finally, SA requires consideration of the refinement strategy, which is specific to the model, data and model-to-data fit qualities, and is not well suited to routine map calculation.

7. Conclusions

The flat bulk-solvent model affects OMIT maps. To avoid its influence, a new tool, phenix.polder, has been developed as part of the PHENIX software suite. The tool calculates OMIT maps by not only excluding the selected atoms but also preventing the bulk-solvent mask from penetrating the region in question. As shown by several examples, phenix.polder is useful in cases where the density of the selected atoms is weak and possibly obscured by the bulk solvent. phenix.polder produces less biased maps than procedures in which the atoms are simply removed from the model or where the atom-selection occupancy is set to zero and included in the solvent-mask calculation. In the latter case, the resulting difference density can have a similar shape as the selected atoms. In the polder procedure, a larger volume from the bulk solvent is excluded and therefore prevents the misinterpretation of bulk-solvent density as OMIT density, making it a map-improvement technique that is suitable for parts of the structure with weak density. The program is available as from the command line as well as in the PHENIX GUI.

Supporting information


Acknowledgements

This work was supported by the NIH (Project 1P01 GM063210), the PHENIX Industrial Consortium and, in part, by the US Department of Energy under Contract No. DE-AC02-05CH11231.

References

First citationAdams, P. D. et al. (2010). Acta Cryst. D66, 213–221.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationAdams, P. D., Pannu, N. S., Read, R. J. & Brunger, A. T. (1999). Acta Cryst. D55, 181–190.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationAfonine, P. V., Grosse-Kunstleve, R. W. & Adams, P. D. (2005). Acta Cryst. D61, 850–855.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationAfonine, P. V., Grosse-Kunstleve, R. W., Adams, P. D. & Urzhumtsev, A. (2013). Acta Cryst. D69, 625–634.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationAfonine, P. V., Grosse-Kunstleve, R. W., Echols, N., Headd, J. J., Moriarty, N. W., Mustyakimov, M., Terwilliger, T. C., Urzhumtsev, A., Zwart, P. H. & Adams, P. D. (2012). Acta Cryst. D68, 352–367.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationBhat, T. N. (1988). J. Appl. Cryst. 21, 279–281.  CrossRef Web of Science IUCr Journals Google Scholar
First citationBhat, T. N. & Cohen, G. H. (1984). J. Appl. Cryst. 17, 244–248.  CrossRef CAS Web of Science IUCr Journals Google Scholar
First citationBlanc, E., Roversi, P., Vonrhein, C., Flensburg, C., Lea, S. M. & Bricogne, G. (2004). Acta Cryst. D60, 2210–2221.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationBricogne, G., Blanc, E., Brandl, M., Flensburg, C., Keller, P., Paciorek, W., Roversi, P., Sharff, A., Smart, O. S., Vonrhein, C. & Womack, T. O. (2016). BUSTER. Global Phasing Ltd, Cambridge, England.  Google Scholar
First citationBrünger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J.-S., Kuszewski, J., Nilges, M., Pannu, N. S., Read, R. J., Rice, L. M., Simonson, T. & Warren, G. L. (1998). Acta Cryst. D54, 905–921.  Web of Science CrossRef IUCr Journals Google Scholar
First citationChoudhary, O. P., Paz, A., Adelman, J. L., Colletier, J.-P., Abramson, J. & Grabe, M. (2014). Nature Struct. Mol. Biol. 21, 626–632.  CrossRef CAS Google Scholar
First citationCowtan, K. D. (1996). Proceedings of the CCP4 Study Weekend. Macromolecular Refinement, edited by E. Dodson, M. Moore, A. Ralph & S. Bailey, pp. 23–28. Warrington: Daresbury Laboratory.  Google Scholar
First citationEklund, H., Ingelman, M., Söderberg, B. O., Uhlin, T., Nordlund, P., Nikkola, M., Sonnerstam, U., Joelson, T. & Petratos, K. (1992). J. Mol. Biol. 228, 596–618.  CrossRef PubMed CAS Web of Science Google Scholar
First citationEmsley, P., Lohkamp, B., Scott, W. G. & Cowtan, K. (2010). Acta Cryst. D66, 486–501.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationFokine, A. V., Afonin, P. V., Mikhailova, I. Y., Tsygannik, I. N., Mareeva, T. Y., Nesmeianov, V. A., Pangborn, W., Lee, N., Duax, W., Siszak, E. & Pletnev, V. Z. (2000). Bioorg. Khim. 26, 571–578.  Google Scholar
First citationFokine, A. & Urzhumtsev, A. (2002). Acta Cryst. D58, 1387–1392.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationGunčar, G., Klemenčič, I., Turk, B., Turk, V., Karaoglanovic-Carmona, A., Juliano, L. & Turk, D. (2000). Structure, 8, 305–313.  Web of Science PubMed Google Scholar
First citationHodel, A., Kim, S.-H. & Brünger, A. T. (1992). Acta Cryst. A48, 851–858.  CrossRef CAS Web of Science IUCr Journals Google Scholar
First citationHoward, E. I., Sanishvili, R., Cachau, R. E., Mitschler, A., Chevrier, B., Barth, P., Lamour, V., Van Zandt, M., Sibley, E., Bon, C., Moras, D., Schneider, T. R., Joachimiak, A. & Podjarny, A. (2004). Proteins, 55, 792–804.  Web of Science CrossRef PubMed CAS Google Scholar
First citationJiang, J. & Brünger, A. (1994). J. Mol. Biol. 243, 100–115.  CrossRef CAS PubMed Web of Science Google Scholar
First citationKatz, B. A., Clark, J. M., Finer-Moore, J. S., Jenkins, T. E., Johnson, C. R., Ross, M. J., Luong, C., Moore, W. R. & Stroud, R. M. (1998). Nature (London), 391, 608–612.  Web of Science CrossRef CAS PubMed Google Scholar
First citationKung, Y., McAndrew, R. P., Xie, X., Liu, C. C., Pereira, J. H., Adams, P. D. & Keasling, J. D. (2014). Structure, 22, 1028–1036.  CrossRef CAS Google Scholar
First citationLunin, V. Y. (1988). Acta Cryst. A44, 144–150.  CrossRef CAS Web of Science IUCr Journals Google Scholar
First citationMoews, P. C. & Kretsinger, R. H. (1975). J. Mol. Biol. 91, 201–225.  CrossRef PubMed CAS Web of Science Google Scholar
First citationMurshudov, G. N., Skubák, P., Lebedev, A. A., Pannu, N. S., Steiner, R. A., Nicholls, R. A., Winn, M. D., Long, F. & Vagin, A. A. (2011). Acta Cryst. D67, 355–367.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationPerrakis, A., Morris, R. & Lamzin, V. S. (1999). Nature Struct. Biol. 6, 458–463.  Web of Science CrossRef PubMed CAS Google Scholar
First citationPhillips, S. E. (1980). J. Mol. Biol. 142, 531–554.  CrossRef CAS PubMed Web of Science Google Scholar
First citationPodjarny, A. D. & Urzhumtsev, A. G. (1997). Methods Enzymol. 276, 641–658.  CrossRef CAS Web of Science Google Scholar
First citationPražnikar, J., Afonine, P. V., Gunčar, G., Adams, P. D. & Turk, D. (2009). Acta Cryst. D65, 921–931.  Web of Science CrossRef IUCr Journals Google Scholar
First citationRoversi, P., Blanc, E., Vonrhein, C., Evans, G. & Bricogne, G. (2000). Acta Cryst. D56, 1316–1323.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationRupp, B. (2009). Biomolecular Crystallography: Principles, Practice, and Application to Structural Biology. New York: Garland Science.  Google Scholar
First citationTerwilliger, T. C., Grosse-Kunstleve, R. W., Afonine, P. V., Moriarty, N. W., Adams, P. D., Read, R. J., Zwart, P. H. & Hung, L.-W. (2008). Acta Cryst. D64, 515–524.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationTerwilliger, T. C., Grosse-Kunstleve, R. W., Afonine, P. V., Moriarty, N. W., Zwart, P. H., Hung, L.-W., Read, R. J. & Adams, P. D. (2008). Acta Cryst. D64, 61–69.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationTronrud, D. E. (1997). Methods Enzymol. 277, 306–319.  CrossRef CAS PubMed Web of Science Google Scholar
First citationTronrud, D. E. (2008). CCP4bb. https://www.mail-archive.com/ccp4bb@jiscmail.ac.uk/msg04276.htmlGoogle Scholar
First citationUrzhumtsev, A. G. (1997). Acta Cryst. D53, 540–543.  CrossRef CAS Web of Science IUCr Journals Google Scholar
First citationUrzhumtsev, A., Afonine, P. V., Lunin, V. Y., Terwilliger, T. C. & Adams, P. D. (2014). Acta Cryst. D70, 2593–2606.  Web of Science CrossRef IUCr Journals Google Scholar
First citationUrzhumtsev, A. G., Lunin, V. Y. & Luzyanina, T. B. (1989). Acta Cryst. A45, 34–39.  CrossRef CAS Web of Science IUCr Journals Google Scholar
First citationUrzhumtseva, L. & Urzhumtsev, A. (2011). J. Appl. Cryst. 44, 865–872.  Web of Science CrossRef CAS IUCr Journals Google Scholar
First citationVellieux, F. M. D. & Dijkstra, B. W. (1997). J. Appl. Cryst. 30, 396–399.  CrossRef CAS Web of Science IUCr Journals Google Scholar
First citationWeichenberger, C. X., Afonine, P. V., Kantardjieff, K. & Rupp, B. (2015). Acta Cryst. D71, 1023–1038.  Web of Science CrossRef IUCr Journals Google Scholar

This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.

Journal logoSTRUCTURAL
BIOLOGY
ISSN: 2059-7983
Follow Acta Cryst. D
Sign up for e-alerts
Follow Acta Cryst. on Twitter
Follow us on facebook
Sign up for RSS feeds