research papers
Averaged kick maps: less noise, more signal…and probably less bias
aJožef Stefan Institute, Slovenia, bLawrence Berkeley National Laboratory, Berkeley, USA, and cDepartment of Bioengineering, University of California, Berkeley, USA
*Correspondence e-mail: dusan.turk@ijs.si
Use of reliable density maps is crucial for rapid and successful σA) weighted. Analysis shows that they are comparable and correspond better to the final model than σA and simulated-annealing maps. The AK maps were challenged by a difficult structure-validation case, in which they were able to clarify the problematic region in the density without the need for model rebuilding. The conclusion is that AK maps can be useful throughout the entire progress of determination, offering the possibility of improved map interpretation.
determination. Here, the averaged kick (AK) map approach is investigated, its application is generalized and it is compared with other map-calculation methods. AK maps are the sum of a series of kick maps, where each kick map is calculated from atomic coordinates modified by random shifts. As such, they are a numerical analogue of maps. AK maps can be unweighted or (Keywords: kick maps; OMIT maps; density-map calculation; model bias; maximum likelihood.
1. Introduction
After crystallographic phases have been obtained, an iterative procedure is used to cycle through density-map calculation, molecular model building, rebuilding and ) the model is the sole source of phasing information which, by transformation into density maps, guides model building and rebuilding (Watenpaugh et al., 1973). Here, we focus on density-map calculation where a prior molecular model is used as the sole source of phasing, although the proposed procedure can also be applied to de novo The density maps have the potential to reveal more information than is provided by the current working model. Simultaneously, they are the source of misleading information: it is typically the case that the molecular-replacement models used for phasing may be partially incorrect and thus bias the resulting maps. Sometimes a thin line separates the correct interpretation of a density map from an incorrect interpretation. Therefore, it is important to derive maps which assure that the model modifications suggested by map interpretation indeed converge towards the true structure.
until the consistency of the model with the experimentally measured structure factors is maximized. When experimental phases are available, they provide a source of phasing information that is independent of the model. However, in the molecular-replacement case (Rossmann, 1972Throughout the history of ; Woolfson, 1956; Sim, 1959; Raman, 1959; Ramachandran & Raman, 1959; Srinivasan, 1961; Ramachandran & Srinivasan, 1961, 1970; Main, 1979; Vijayan, 1980). Read (1986) generalized these approaches and proposed that the least biased maps be computed as (2m|Fobs| − D|Fmodel|)exp(iφmodel) for noncentric data and |Fobs|exp(iφmodel) for centric data; these maps should correspond maximally with the true map. A further improvement towards reducing map bias was the computation of weighting terms (m and D) using a subset of reflections set aside for cross-validation (`test' reflections; Brünger, 1992) and used for estimation of phase errors (Lunin & Skovoroda, 1995; Pannu & Read, 1996; Urzhumtsev et al., 1996). More sophisticated approaches, such as the use of multi-start simulated annealing (Hodel et al., 1992; Rice et al., 1998), also provide improved estimates for the error model and map weights. As low-resolution data are important for map quality, a series of bulk-solvent correction models have been developed in order to be able to include all reflections in the map calculation rather than applying a low-resolution truncation (Moews & Kretsinger, 1975; Phillips, 1980; Jiang & Brünger, 1994; Tronrud, 1997; Badger, 1997; Urzhumtsev, 2000; Fokine & Urzhumtsev, 2002). Crystal anisotropy can severely distort the map appearance so an anisotropic correction is typically used (Sheriff & Hendrickson, 1987; Murshudov et al., 1998; Usón et al., 1999). All these advances have been combined in efficient and robust model structure-factor (Fmodel) calculation protocols such as those of Afonine et al. (2005b) and Brunger (2007). Averaging of either electron-density maps or structure factors has a long tradition in crystallography, starting with averaging as first described by Rossmann & Blow (1963) and progressing to averaging of maps and structure factors generated from several (Perrakis et al., 1997) or multiple models (Hodel et al., 1992; Terwilliger et al., 2007). The averaged kick (AK) map method that we describe here is a complementary approach to enhancing the signal and reducing the noise in maps calculated from molecular models with errors.
determination, a significant amount of effort has been directed into the development of density-map calculations with the aim of enhancing the signal and reducing errors and noise (Luzzati, 1953More than a decade ago, during the final stages of the determination of the et al., 1998), the question of the directionality of the binding of the eight-residue-long propeptide attached to the body of the enzyme by a disulfide bond, termed the mini-chain, prompted the development of new methods to maximize map quality. Visual interpretation of the map suggested that the mini-chain has to be positioned in the reverse direction compared with the propeptide as expected from previous proenzyme structures such as cathepsin B (Turk et al., 1996). Neither the inspection of the unweighted difference map combined with of the mini-chain traced in two alternative directions nor the use of simulated annealing in were able to deliver maps which unambiguously revealed the directionality of the mini-chain.
of porcine cathepsin H (GunčarWe hypothesized that if atoms were displaced by independent random shifts then the correlations of atomic positions imposed through via structure factors and chemistry terms would be lost. As a result, random shifts of coordinates, termed kicking, was introduced into the program MAIN (Turk, 1997, 2007; Gunčar et al., 1998) and applied to map calculations and The question was asked: `What would happen when a series of kick maps were averaged?' (T. Terwilliger, personal communication). The resulting procedure appears to be an analogue of the (ML) approach, as pointed out by an anonymous referee: the crystallographic ML theory supposes that the current model can be corrected by introducing random errors and suggests structure-factor correction after `theoretical averaging' of such models with random shifts. It has been demonstrated that these maps are an improvement over single kick maps (Gunčar et al., 2000; Than et al., 2002, 2005). Additionally, the concept of a second-generation averaged kick map was introduced (discussed later). The latter approach was found to reveal the problem areas of a structure, allowing rebuilding and subsequent (Fernandez-Catalan et al., 1998). In this work, we have analyzed the kick-map method and its potential and have optimized and generalized the procedure for its use. Comparison with other maps such as unweighted (UN), weighted (ML) and simulated annealing (SA) maps show that the averaged kick (AK) map has the potential to reduce or eliminate model bias and can be a useful alternative scheme for map calculations at any stage of structure solution.
2. Methods and models
2.1. Software applications
All map calculations and their comparisons were performed with the crystallographic program MAIN (Turk, 1992), with the exception of the simulated-annealing runs, which were performed using the phenix.refine program (Afonine et al., 2005a) of the PHENIX package (Adams et al., 2002).
2.2. Electron-density maps
We used the following maps for all tests discussed below.
|
The kicked maps are based on structural models with randomly displaced atoms. Each kick modifies the atomic coordinates by a random shift within a given interval for which each point within a cube around the atomic position has an equal probability for the new position. Structure factors are calculated from the `kicked' atomic model and used for map calculation, mostly of 2Fobs − Fmodel-type maps. After each map calculation the model coordinates are restored to their original values. The final AK map is the average of the whole series of kick maps, where Fmodel for each map is calculated from the kicked atomic positions generated with a different random number seed. Each time the model is kicked, the starting seed is changed to produce a novel series of random numbers that is unique for each consecutive model modification. In this scheme, the |Fmodel| are scaled to |Fobs| using the unweighted or weighting schemes. For ML AK maps, the m and D values computed from the original (undisturbed) model are used in all subsequent individual kick-map calculations. (The coefficients derived from the distorted models result in values which can produce substantially worse maps.) Analogously, the bulk solvent ksol and Bsol parameters and bulk-solvent structure factors as well as the anisotropic scale matrix should be retained from the original model and used when computing Fmodel for kicked structures.
There are two methods for AK map calculation: one can average the Fmodel structure factors and calculate the map using the averaged Fmodel or average the individual kick maps,
The principal difference between the two is the following: in the first case scaling of |Fmodel| to |Fobs| is performed once for the set of averaged Fmodel and the 2Fobs − Fmodel map is computed (1), whereas in the second case each individual |Fmodel| from the series is scaled to |Fobs| and an individual 2Fobs − Fmodel map is computed, added to the sum of the maps and averaged at the end. The latter approach has been found to be superior and is used throughout this work (data not shown). To speed up the process and since the Fourier transformation (FT) is a linear operation, the averaging of maps in real space can be replaced by the averaging of corresponding Fourier coefficients followed by a single map calculation.
2.3. Map comparison
To compare the maps, we used overall and local map correlation coefficients (CCs) as commonly used (Lunin & Woolfson, 1993; Lunin & Skovoroda, 1995) and the density at positions of the atomic model. For the overall CC the whole was used, whereas for the CC of the local map the region around part of the atomic model was selected. A CC of greater than 0.8 is generally described as a strong correlation, whereas a CC of less than 0.5 is a weak correlation (Lunin & Woolfson, 1993; Lunin & Skovoroda, 1995).
The density at the positions of the atoms was obtained by linear interpolation of the density values from the eight surrounding grid points.
2.4. The second-generation maps
After the electron-density map of 2Fobs − Fmodel type has been calculated from the working model (the first-generation map), the parts of the model that are inconsistent with the map are omitted from the next map calculation. Practically, the occupancy of atoms is assigned in accordance with the first-generation map. The procedure in MAIN, termed `AUTO_WEIGHT', checks all atoms of every residue in the model starting from the root atom onwards to the end of the side chain. As long as the atoms lie in density above the threshold value (the default is 1.0σ) their occupancy is set to 1.0, otherwise the search is terminated and the nonchecked atoms receive zero occupancy. The resulting map is termed the second-generation map. The second-generation map can be calculated using any kind of 2Fobs − Fmodel map.
2.5. Crystal structures
As study cases, we have used the molecular-replacement solutions of stefin B (PDB code 2oct ; Jenko et al., 2007), cathepsin H (PDB code 8pch ; Gunčar et al., 1998), ammodytin L (PDB code 3dih ; D. Turk, G. Gunčar & I. Krizaj, unpublished work) and three additional cases from the PDB, namely PDB codes 2ahn (Y. Dall'Antonia, T. Pavkov, H. Fuchs, H. Breiteneder & W. Keller, unpublished work), 2fy2 (Kim et al., 2006) and 1twl (Southeast Collaboratory for Structural Genomics, unpublished work). The statistics for the diffraction data and models are provided in Table 1. Molecular-replacement solutions were either taken from the original works or found using EPMR (Kissinger et al., 1999).
|
3. Results
To validate AK maps, we considered addressing the following questions.
|
3.1. How to calculate the best AK map
3.1.1. How many kick maps should be averaged to achieve convergence?
To find out how many kick maps are required for convergence (where the average map does not change upon the addition of one more kick), we compared three series of AK maps using two different cases (Table 1). Each series of maps was generated with a starting random seed separated sufficiently to exclude the repetition of any kick map (Fig. 1). The increases in the CC of the partial sums of the three series overlap so closely that they are indistinguishable, whereas the CC of each map in the series, when compared with the final AK map, fluctuates around the starting pairwise correlations between the three series of around 0.4–0.5. The CCs of the partial sums increase rapidly towards 0.8. After approximately ten averaged maps the CCs are already at 0.9 and converge after 40 averaged maps to a CC of approximately 0.97–0.98. Thus, at least 40 individual kick maps, the similarity of the CCs of which to the final map of the series fluctuates within the ±0.05 interval, are required to result in a convergent and reproducible AK map. An equivalent comparison has also shown that the series of AK maps converges towards the correct map. (The plots are not shown since they appear equivalent to those in Fig. 1, only with a lower final CC.)
3.1.2. Which kick size to use
Clearly, a small kick size produces structure factors that are similar to those derived from the unperturbed model and a very large kick size produces meaningless structure factors. The optimal kick size may be dependent on the resolution range of the diffraction data used and the quality of the structural model itself. We inspected this dependence using models saved in the course of determination of the
of cathepsin H.AK maps were calculated from four models saved at various steps of R factors range from 0.43 to 0.25 and the r.m.s. deviations of their Cα atoms from the final model range from 0.91 to 0.27 Å (Table 2), reflecting the increasing similarity of the models to the refined structure. The smallest kick size applied was 0.1 Å and the largest was 1.2 Å (Fig. 2). Fig. 2(a) shows that with an increase in the kick size the AK maps start approaching the final Fmodel map. After reaching a rather broad peak of optimal kick size their similarity to the final Fmodel map begins to decrease. The map improvement is larger (0.03) for earlier models than with model 4 (0.01). This behaviour is shared by both UN AK and ML AK maps. The ML AK maps start higher and show a smaller increase than the UN AK maps. An equivalent picture is revealed by the density at positions of the final model (Fig. 2b). The highest increase in average density at positions of the final model in the AK maps is increased by 0.06, 0.06, 0.04 and 0.01 σ levels, respectively. As expected, concomitant with the progress of and the increase of the upper limit of diffraction data resolution, the optimal kick size decreases. For the lowest resolution model 1 (3.0 Å resolution) it is about 1.0 and for models 2, 3 and 4 it is 0.8, 0.6 and 0.4 Å, respectively. The optima are rather broad. The optimal kick size depends on the model quality and only marginally depends on the resolution range of the data applied (data not shown), which is consistent with the expectations from the ML error estimates (Table 3).
corresponding to four different high-resolution limits in the range 3.0–2.2 Å. Their
|
|
This brief analysis suggests that the optimal kick size should be adjusted for each particular case. Interestingly, the optimal kick sizes are not far from the ML-based coordinate error estimates, thus confirming the analogy between the AK map and ML approaches. To avoid the need for a decision on the kick size, a series of AK maps using kick sizes in the range from 0.1 to 1.0 was calculated and averaged. The resulting maps deliver CCs (0.58, 0.69, 0.77, 0.84) which are close to the best AK map from the series (0.60, 0.70, 0.77, 0.84) and are approximately the same for the UN AK and ML AK maps. This generalizes the AK map-calculation setup and makes it more broadly applicable.
3.2. Comparison with other types of maps
The
of molecules using can be described by three basic phases: initial, intermediate and final.The quality of the AK maps was explored and compared with the final Fmodel map in parallel with the other available map types: UN, ML and SA. Comparisons were performed on the global level, taking into account all density points in the unit cell, as well as locally around selected regions of the molecular models.
3.2.1. Maps from starting models
In the initial phase, the search model positioned by 8pch ; Gunčar et al., 1998), ammodytin L (PDB code 3dih ; D. Turk, G. Gunčar & I. Krizaj, unpublished work), stefin B (PDB code 2oct ; Jenko et al., 2007) and three additional cases from the PDB, namely 2ahn (Y. Dall'Antonia, T. Pavkov, H. Fuchs, H. Breiteneder & W. Keller, unpublished work, Table 1), 2fy2 (Kim et al., 2006) and 1twl (Southeast Collaboratory for Structural Genomics, unpublished work), using actinidin (Baker, 1980), Crotalus atrox phospholipase A2 (Keith et al., 1981), 1thv (Ko et al., 1994), 1q6x (Cai et al., 2004) and 1ude (Liu et al., 2004) as search models. The maps were calculated from the search models using all protein atoms.
is used to calculate the electron-density map. To demonstrate the use of AK maps, we have calculated molecular-replacement solutions and maps for cathepsin H (PDB codeMaps were compared with the final Fmodel map. The obtained CCs are plotted in Fig. 3 using the CC coefficient of the ML map to denote the aimed value. The graphs reveal that UN AK and ML AK maps can deliver maps that are closer to the final Fmodel map than the ML map. The map improvements in the series correspond to an increase in the CC of up to 0.04. The average map of series of AK maps from 0.1 to 1.0 Å kick size delivers a somewhat lower increase in CC (0.02). In cases where omission of atoms inconsistent with the map was particularly successful, an increase in the CC of the second-generation maps was also observed (0.02 for 2ahn , 0.02 for 1twl , 0.02 for ammodytin L) despite the reduced scattering power of the remaining parts of the molecular model.
Another insight into the properties of the AK maps is provided by the match of the final model to the initial maps equivalent to Fig. 2(b) (data not shown). Density values were calculated at the positions of all protein atoms of the final model for each map and averaged. When compared with the density delivered by the ML map, the averaged series of AK maps for all six cases show improved positioning of the final model in the map, with the highest average increase of 0.09σ in the case of ammodytin L. Clearly, this average gain in the maps indicates that local map improvements were even more significant.
These cases indicate that AK maps can produce maps that are closer to the final solution and may have the potential to reveal map features that are otherwise inaccessible by map calculation alone, as manifested in the case below. However, the map improvements are not uniformly distributed: they are position-dependent and case-dependent.
3.2.2. Maps from the intermediate phase
In the intermediate phase, the models are partially refined and more or less complete. They still contain regions with errors and lack flexible loops and ligands. The positions of the residues still need to be examined and adjusted to best fit the electron-density maps.
To recreate a typical situation from the middle of R values of around 0.30. None of the models contained solvent or ligands and none were entirely correct. However, the models contained no sequence frame shifts. For the map comparison, we picked one of the models and used it as an input for UN, ML and AK map calculation, whereas all 30 models were used as input for SA map calculation. The following map comparisons with the final Fmodel maps are based on the whole as well as along the chain of residues. While the whole unit-cell comparisons provide a measure of the overall quality of the map, the local comparisons provide insight into individual features.
multiple SA runs were performed, resulting in 30 models with crystallographicMaps from the intermediate phase: global comparison for the whole unit cell. Table 3 shows that the AK, ML and SA maps are rather similar to the final Fmodel map, with CCs in the ranges 0.84–0.85 and 0.80–0.81 for the ammodytin L and cathepsin H cases, respectively. The CCs of the whole unit-cell UN maps compared with the Fmodel maps are about 0.02–0.03 lower (Table 3). The differences between the UN, ML and AK maps are approximately halved when compared with the differences at the beginning of as presented in the previous section (Figs. 1 and 2). This is in agreement with the general expectation that with the progress of the differences between the UN maps and those based on the error estimate function will decrease.
Maps from the intermediate phase: local comparison along the chain. The comparison of maps locally at individual residues along the polypeptide chain revealed that ML and AK maps interchange in their ranking of similarity to the final Fmodel map. Since these maps were generated from the same structures from the intermediate phase of the fit of the residues along the chain fluctuates and so do the CCs, which differ from map to map. For illustration, the ammodytin L case was chosen. Fig. 4 shows the residue-based CC of the maps plotted along the whole chain. Visual inspection has confirmed that correlations below 0.5, such as in the regions around residues 19 and 78, do indeed indicate poor similarity of local maps of any kind to the final Fmodel map. To illustrate the differences between the AK and ML maps, we have chosen two regions in which the plots of the ML and AK maps are higher than 0.5, not overlapping and ranked differently. In the first region, Ile9–Thr13 (Figs. 5a, 5b and 5c), the ML map provides a more clear representation of the model, whereas in the second region, Ile94–Glu98 (Figs. 5d, 5e and 5f), the AK map is closer to the final map. In the first region the ML map closes the density gap (Fig. 5b), whereas in the second region the AK corresponds best to the final position of the Phe95, resolving the side-chain ambiguity (Fig. 5f). These comparisons illustrate that the use of AK maps in combination with ML maps can be useful during model-building procedures and are consistent with past experience.
3.2.3. Maps from the final phase
In the final phase, remaining weak density and dubious map features require interpretation. They are commonly occupied by ligands attached to the macromolecular structure or flexible likely surface-located regions and residue side chains, and exhibit a larger degree of disorder when compared with the core of the structure. We have addressed this issue by re-examining the cathepsin H mini-chain case that initiated kick-map development. The molecular model for this case was generated by using the SA approach with the exclusion of solvent, carbohydrate and mini-chain atoms and yielded an R factor of 0.30 at 2.1 Å resolution. We have assumed that this model has lost any memory of the excluded parts and thus represents the structure in the state prior to building the mini-chain residues. The maps resulting from this model are OMIT maps with erased memory and are termed erased-memory maps. To find out which map calculation is most suitable for revealing the correct solution and simultaneously exposing the model contributions and its bias, we attached the eight mini-chain residues to the SA model. The mini-chain residues were built into the same density region in the correct and reverse directions. The two models were refined using an initial 0.3 Å random shift (kick) of each atom coordinate followed by two cycles of positional and B-value until convergence. Using these three models (SA, memory erased, correctly and reversely built mini-chain), we have calculated UN, ML and AK maps with the mini-chain residues erased, included and omitted from the map calculations. The direct effects were monitored by comparisons of the maps in the local region in the vicinity of the mini-chain with the final Fmodel map (Table 4).
|
The erased-memory map of the mini-chain region from the UN map had the lowest CC (0.65), followed by the CCs of the ML (0.68), AK (0.69) and SA (0.69) maps. The same map regions of the correctly built mini-chain have CCs for included as well as omitted maps that were equal or higher than the regions from the erased model, whereas the reversely built mini-chain results in lower or equivalent CCs. The CCs of maps with the correctly built mini-chain are higher than those of the reversely built mini-chain for all the map calculation method used; the best signal is from the AK map. The signal of the OMIT maps is less pronounced; nevertheless, the maps from the reversely built mini-chain are still ranked lower. For comparison, the Fmodel maps of the mini-chain region of the correct and reverse models have a CC of 0.66, indicating that there is substantial similarity between them.
None of the OMIT map calculations was capable of eliminating the contribution of the correctly and reversely built mini-chain atoms refined into the structure. However, the comparisons showed that in this case the AK maps provided the strongest signal for the correctly built model and can therefore be a useful alternative to the ML maps. Additionally, this exercise indicates that comparison of the maps calculated from the same model alone may not provide sufficient support for selecting the correct model when multiple interpretations seem possible. In such cases, alternative models should be built, refined and ranked before the correct one can be chosen.
3.3. Application to a problematic case
Previous work on the removal of model bias (Terwilliger et al., 2008) used PDB entry 1zen (Cooper et al., 1996) determined at 2.5 Å resolution as an example. Residues 6–16 of this structure were in disagreement with the very closely related structure 1b57 (Hall et al., 1999) determined at 2.0 Å resolution with two copies of the same molecule in the The Fobs − Fcalc map exposed a series of other mismatches along the chain (Terwilliger et al., 2008) using the EDS server (Kleywegt et al., 2004). It was shown (Terwilliger et al., 2008) that the impact of model bias on the map can be removed using an iterative-build OMIT-map approach, whereas other approaches such as ML and SA ML maps of the 2Fobs − Fcalc type alone failed to resolve the side-chain ambiguity. Visual inspection of both entries together with their maps and their superimpositions revealed that the structures are misaligned in several regions (1–15, 110–114, 194–198, 225–235, 265–271 and 352–356), comprising a total of 46 residues out of 338 (346 in 1b57 ). Furthermore, the maps also indicate that a solvent molecule with a B factor of 2 Å2 should have been identified as a metal ion, which was likely to be Zn coordinated by two histidines and two solvent molecules. Since the region 288–340 did not superimpose well owing to real differences between the crystal structures, it was excluded from model superimposition. The remaining 238 matching Cα-atoms pairs yielded an r.m.s.d. of 0.53 Å, whereas also taking into account the nonmatching Cα-atom pairs (284) yielded an r.m.s.d. of 2.0 Å. Visual inspection of the maps revealed that Asp15 was built into density belonging to the main chain of the helix N-terminus, thereby causing the one-residue shift. In addition to the density, the chemical environment of the side-chain residues in the region also points to a likely mistake in sequence register. For example, the Thr12 and Asp5 residues were positioned in a hydrophobic environment instead of Ile13 and Ile3. The same kind of error, which cuts the helix one residue too short at its N-terminus, was repeated at position 113 and caused the residue shift in the 110–114 region. To summarize, the 1zen model is partially incorrect (over 10% of residues appear to be displaced from the positions observed in the 1b57 structure).
The question here was, can the AK map approach correctly assign the density cloud of the Phe4 side-chain moiety using the model as deposited without any rebuilding? The procedure, if it is to be successful, must deal not only with the direct and indirect bias of the loop itself but also with the indirect bias of the misplaced atoms spread out through the remainder of the structure. For this purpose, we generated a variety of regional first- and second-generation OMIT and non-OMIT maps of the UN, ML, UN AK and ML AK 2Fobs − Fcalc types. Since no correct model of the 1zen deposition is available, we could not compare the maps with the final Fmodel map. Instead, we show the maps around the omitted region of interest (residues 3–15) on the background of the 1zen and 1b57 models (Fig. 6). It turned out that of these maps, only the second-generation UN AK and ML AK maps with kicks between 0.7 and 1.0 Å and with the region 1–15 omitted could resolve the map ambiguity, thus assigning the density cloud to the correct position of the side chain of Phe4 (Figs. 6g and 6h) and not to the side chain of Lys8 as present in the 1zen model (Figs. 6a and 6b). In addition, the map resulting from the averaging of all ten second-generation AK maps provided the correct answer although with a less clear map. (The CC between the second-generation UN AK and ML AK maps was 0.97 and that between the average of the AK maps with a kick size between 0.1 and 1.0 Å and UN AK map using a 0.8 Å kick was 0.94.)
This case thus demonstrates that AK maps have the potential to remove substantial model bias and can also be used as a valuable structure-validation tool.
4. Discussion
If atoms were kicked and their ensembles were then averaged, it would be expected that the averaged structure would essentially look the same as the original structure. Indeed, when the Fmodel structure factors used in map calculation are the sum of the contributions of a series of randomly shifted structures, the resulting AK map reveals no significant improvement when compared with the map calculated from the starting model. This indicates that the map improvement as seen in the AK maps does not solely arise from the averaging of structures, but also contains other error-correction mechanism(s). As indicated above, the averaging of structure factors prior to their scaling to Fobs eliminates the map improvement. This shows that the change in phases must be coupled with individual scaling of Fmodel to Fobs in order to achieve the desired effect. The change of the phases is similar to the SA concept, which calculates maps from an ensemble of structures, while application of modified scaling coefficients exhibits similarity to the ML weighting scheme (Read, 1986; Pannu & Read, 1996). The lower noise of the AK maps also makes them more similar to the ML maps. Comparison of optimal kick size and ML estimates of coordinate error (Table 2) further confirms the analogy between the two approaches. The ML error estimate of the coordinates does not necessary coincide with the best model bias-removal value, although the values in Table 3 indicate that they become rather close. Although averaging of ten AK maps generated with different kick sizes in principle successfully eliminates the need for the kick-size estimate, in extreme cases when the map is seriously biased by the model the application of a larger kick size such as 0.8–0.9 Å may be crucial. However, there are two differences between the AK and SA approaches. Firstly, during the SA procedure parts of the structure may drift away by several angstroms (multi-start SA has the potential to model multiple alternative conformations), whereas kicking keeps atoms within the specified frame. Secondly, the violations of chemical terms are severe in kicked structures (the r.m.s. deviation of bond lengths from their targets is usually only slightly lower than the kick specified), whereas SA molecular models remain chemically reasonable during and after the procedure, which preserves a higher degree of direct and indirect atomic interactions and consequent coupling of model errors. Because of the shifting of atoms around the starting point with a predefined maximal shift size, the concept of kicking is more similar to the ML approach since they both address the random model errors, whereas SA has the potential to shift parts of the model over larger distances and also has the potential to fix systematic errors. Interestingly, however, the phase errors of AK maps are slightly higher than those of the input model, indicating that the lower noise of the maps cannot be directly accounted for by the phase improvement (data not shown). However, looking at the R values obtained by Fourier transformation of each AK map (Fig. 7) it appears that all R values of UN AK as well as ML AK maps start lower than the R value of the initial models (Table 3) and then increase in a kick-size-dependent manner. (Shallow minima are observed in the UN AK map series.) The average R factor of a series of AK maps is much lower than the sum of the series (individual kick maps have significantly higher R factors), indicating the importance of the weighting-scheme contribution to the success of the AK approach.
As shown above with the 1zen case, second-generation AK OMIT maps were able to clarify a problematic region in the density without the need for model rebuilding. Omitting the problematic region of the structure appeared to be essential for reducing the direct model bias, while kicking in combination with omission of the parts inconsistent with the first-generation AK maps resulted in sufficiently reduced indirect model bias. With this, the AK maps approach exhibited a potential similar to the achievements of the iterative-build OMIT-maps approach (Terwilliger et al., 2008). The latter maps are, in comparison with the AK map approach, rather complex and computer-time-demanding procedures. Hence, the AK map approach is yet another contribution to the series of map calculations dealing with model-bias removal such as OMIT, SA OMIT maps, composite OMIT maps (Bhat & Cohen, 1984; Bhat, 1988; Hodel et al., 1992; Brünger et al., 1998) and the prime-and-switch phasing approach (Terwilliger, 2004). As such, AK maps can also be used in de novo determination. The potential revealed here suggests that AK maps are a fast and simple approach that may offer considerable help during macromolecular determination.
Acknowledgements
The Slovenian Research Agency is gratefully acknowledged for providing a young researcher scholarship and financial support of the Structural Biology program P1-0048. PDA would like to thank NIH/NIGMS for generous support of the PHENIX project (1P01 GM063210). This work was supported in part by the US Department of Energy under Contract No. DE-AC02-05CH11231.
References
Adams, P. D., Grosse-Kunstleve, R. W., Hung, L.-W., Ioerger, T. R., McCoy, A. J., Moriarty, N. W., Read, R. J., Sacchettini, J. C., Sauter, N. K. & Terwilliger, T. C. (2002). Acta Cryst. D58, 1948–1954. Web of Science CrossRef CAS IUCr Journals Google Scholar
Afonine, P. V., Grosse-Kunstleve, R. W. & Adams, P. D. (2005a). CCP4 Newsl. 42, contribution 8. Google Scholar
Afonine, P. V., Grosse-Kunstleve, R. W. & Adams, P. D. (2005b). Acta Cryst. D61, 850–855. Web of Science CrossRef CAS IUCr Journals Google Scholar
Badger, J. (1997). Methods Enzymol. 277, 344–352. CrossRef PubMed CAS Web of Science Google Scholar
Baker, E. N. (1980). J. Mol. Biol. 141, 441–484. CrossRef CAS PubMed Web of Science Google Scholar
Bhat, T. N. (1988). J. Appl. Cryst. 21, 279–281. CrossRef Web of Science IUCr Journals Google Scholar
Bhat, T. N. & Cohen, G. H. (1984). J. Appl. Cryst. 17, 244–248. CrossRef CAS Web of Science IUCr Journals Google Scholar
Brünger, A. T. (1992). Nature (London), 355, 472–474. PubMed Web of Science Google Scholar
Brunger, A. T. (2007). Nature Protoc. 2, 2728–2733. Web of Science CrossRef CAS Google Scholar
Brünger, A. T., Adams, P. D. & Rice, L. M. (1998). Direct Methods for Solving Macromolecular Structures, edited by S. Fortier, pp. 143–157. Dordrecht: Kluwer Academic Publishers. Google Scholar
Cai, Y., Cronin, C. N., Engel, A. G., Ohno, K., Hersh, L. B. & Rodgers, D. W. (2004). EMBO J. 23, 2047–2058. Web of Science CrossRef PubMed CAS Google Scholar
Cooper, S. J., Leonard, G. A., McSweeney, S. M., Thompson, A. W., Naismith, J. H., Qamar, S., Plater, A., Berry, A. & Hunter, W. N. (1996). Structure, 4, 1303–1315. CrossRef CAS PubMed Web of Science Google Scholar
Fernandez-Catalan, C., Bode, W., Huber, R., Turk, D., Calvete, J. J., Lichte, A., Tschesche, H. & Maskos, K. (1998). EMBO J. 17, 5238–5248. Web of Science CrossRef CAS PubMed Google Scholar
Fokine, A. & Urzhumtsev, A. (2002). Acta Cryst. D58, 1387–1392. Web of Science CrossRef CAS IUCr Journals Google Scholar
Gunčar, G., Podobnik, M., Pungerčar, J., Štrukelj, B., Turk, V. & Turk, D. (1998). Structure, 6, 51–61. Web of Science CAS PubMed Google Scholar
Gunčar, G., Klemenčič, I., Turk, B., Turk, V., Karaoglanovic Carmona, A., Juliano, L. & Turk, D. (2000). Structure, 8, 305–313. Web of Science CrossRef PubMed CAS Google Scholar
Hall, D. R., Leonard, G. A., Reed, C. D., Watt, C. I., Berry, A. & Hunter, W. N. (1999). J. Mol. Biol. 287, 383–394. Web of Science CrossRef PubMed CAS Google Scholar
Hodel, A., Kim, S.-H. & Brünger, A. T. (1992). Acta Cryst. A48, 851–858. CrossRef CAS Web of Science IUCr Journals Google Scholar
Jenko, K. S., Gunčar, G., Štern, I., Morgan, G., Rabzelj, S., Kenig, M., Staniforth, R. A., Waltho, J. P., Žerovnik, E. & Turk, D. (2007). J. Mol. Biol. 366, 1569–1579. PubMed Google Scholar
Jiang, J.-S. & Brünger, A. T. (1994). J. Mol. Biol. 243, 100–115. CrossRef CAS PubMed Web of Science Google Scholar
Keith, C., Feldman, D. S., Deganello, S., Glick, J., Ward, K. B., Jones, E. O. & Sigler, P. B. (1981). J. Biol. Chem. 256, 8602–8607. CAS PubMed Web of Science Google Scholar
Kim, A. R., Rylett, R. J. & Shilton, B. H. (2006). Biochemistry, 45, 14621–14631. Web of Science CrossRef PubMed CAS Google Scholar
Kissinger, C. R., Gehlhaar, D. K. & Fogel, D. B. (1999). Acta Cryst. D55, 484–491. Web of Science CrossRef CAS IUCr Journals Google Scholar
Kleywegt, G. J., Harris, M. R., Zou, J., Taylor, T. C., Wählby, A. & Jones, T. A. (2004). Acta Cryst. D60, 2240–2249. Web of Science CrossRef CAS IUCr Journals Google Scholar
Ko, T.-P., Day, J., Greenwood, A. & McPherson, A. (1994). Acta Cryst. D50, 813–825. CrossRef CAS Web of Science IUCr Journals Google Scholar
Liu, B., Bartlam, M., Gao, R., Zhou, W., Pang, H., Liu, Y., Feng, Y. & Rao, Z. (2004). Biophys. J. 86, 420–427. Web of Science CrossRef PubMed CAS Google Scholar
Lunin, V. Yu. & Woolfson, M. M. (1993). Acta Cryst. D49, 530–533. CrossRef CAS Web of Science IUCr Journals Google Scholar
Lunin, V. Yu. & Skovoroda, T. P. (1995). Acta Cryst. A51, 880–887. CrossRef CAS Web of Science IUCr Journals Google Scholar
Luzzati, V. (1953). Acta Cryst. 6, 142–152. CrossRef CAS IUCr Journals Web of Science Google Scholar
Main, P. (1979). Acta Cryst. A35, 779–785. CrossRef IUCr Journals Web of Science Google Scholar
Moews, P. C. & Kretsinger, R. H. (1975). J. Mol. Biol. 91, 201–228. CrossRef PubMed CAS Web of Science Google Scholar
Murshudov, G. N., Davies, G. J., Isupov, M., Krzywda, S. & Dodson, E. J. (1998). CCP4 Newsl. 35, 37–43. Google Scholar
Murshudov, G. N., Vagin, A. A. & Dodson, E. J. (1997). Acta Cryst. D53, 240–255. CrossRef CAS Web of Science IUCr Journals Google Scholar
Pannu, N. S. & Read, R. J. (1996). Acta Cryst. A52, 659–668. CrossRef CAS Web of Science IUCr Journals Google Scholar
Perrakis, A., Sixma, T. K., Wilson, K. S. & Lamzin, V. S. (1997). Acta Cryst. D53, 448–455. CrossRef CAS Web of Science IUCr Journals Google Scholar
Phillips, S. E. V. (1980). J. Mol. Biol. 142, 531–554. CrossRef CAS PubMed Web of Science Google Scholar
Ramachandran, G. N. & Raman, S. (1959). Acta Cryst. 12, 957–964. CrossRef CAS IUCr Journals Web of Science Google Scholar
Ramachandran, G. N. & Srinivasan, R. (1961). Nature (London), 190, 159–161. CrossRef CAS Web of Science Google Scholar
Ramachandran, G. N. & Srinivasan, R. (1970). Fourier Methods in Crystallography. New York: Wiley. Google Scholar
Raman, S. (1959). Acta Cryst. 12, 964–975. CrossRef CAS IUCr Journals Web of Science Google Scholar
Read, R. J. (1986). Acta Cryst. A42, 140–149. CrossRef CAS Web of Science IUCr Journals Google Scholar
Rice, L. M., Shamoo, Y. & Brünger, A. T. (1998). J. Appl. Cryst. 31, 798–805. Web of Science CrossRef CAS IUCr Journals Google Scholar
Rossmann, M. G. (1972). The Molecular Replacement Method. New York: Gordon & Breach. Google Scholar
Rossmann, M. G. & Blow, D. M. (1963). Acta Cryst. 16, 39–45. CrossRef CAS IUCr Journals Google Scholar
Sheriff, S. & Hendrickson, W. A. (1987). Acta Cryst. A43, 118–121. CrossRef CAS Web of Science IUCr Journals Google Scholar
Sim, G. A. (1959). Acta Cryst. 12, 813–815. CrossRef IUCr Journals Web of Science Google Scholar
Srinivasan, R. (1961). Proc. Indian Acad. Sci. A, 53, 252–264. Google Scholar
Terwilliger, T. C. (2004). Acta Cryst. D60, 2144–2149. Web of Science CrossRef CAS IUCr Journals Google Scholar
Terwilliger, T. C., Grosse-Kunstleve, R. W., Afonine, P. V., Adams, P. D., Moriarty, N. W., Zwart, P., Read, R. J., Turk, D. & Hung, L.-W. (2007). Acta Cryst. D63, 597–610. Web of Science CrossRef CAS IUCr Journals Google Scholar
Terwilliger, T. C., Grosse-Kunstleve, R. W., Afonine, P. V., Moriarty, N. W., Adams, P. D., Read, R. J., Zwart, P. H. & Hung, L.-W. (2008). Acta Cryst. D64, 515–524. Web of Science CrossRef CAS IUCr Journals Google Scholar
Than, M. E., Bourenkov, G. P., Henrich, S., Mann, K. & Bode, W. (2005). Biol. Chem. 386, 759–766. Web of Science CrossRef PubMed CAS Google Scholar
Than, M. E., Henrich, S., Huber, R., Ries, A., Mann, K., Kühn, K., Timpl, R., Bourenkov, G. P., Bartunik, H. D. & Bode, W. (2002). Proc. Natl Acad. Sci. USA, 99, 6607–6612. Web of Science CrossRef PubMed CAS Google Scholar
Tronrud, D. E. (1997). Methods Enzymol. 277, 306–319. CrossRef CAS PubMed Web of Science Google Scholar
Turk, D. (1992). PhD thesis. Technische Universität, München, Germany. Google Scholar
Turk, D. (1997). ACA Meeting Abstracts, 24, 76. Google Scholar
Turk, D. (2007). Evolving Methods for Macromolecular Crystallography, edited by R. J. Read & J. L. Sussman, pp. 111–122. New York: Springer. Google Scholar
Turk, D., Podobnik, M., Kuhelj, R., Dolinar, M. & Turk, V. (1996). FEBS Lett. 384, 211–214. CrossRef CAS PubMed Web of Science Google Scholar
Urzhumtsev, A. G. (2000). CCP4 Newsl. 38, 38–49. Google Scholar
Urzhumtsev, A. G., Skovoroda, T. P. & Lunin, V. Y. (1996). J. Appl. Cryst. 29, 741–744. CrossRef CAS Web of Science IUCr Journals Google Scholar
Usón, I., Pohl, E., Schneider, T. R., Dauter, Z., Schmidt, A., Fritz, H. J. & Sheldrick, G. M. (1999). Acta Cryst. D55, 1158–1167. Web of Science CrossRef IUCr Journals Google Scholar
Vijayan, M. (1980). Acta Cryst. A36, 295–298. CrossRef CAS IUCr Journals Web of Science Google Scholar
Watenpaugh, K. D., Sieker, L. C., Herriott, J. R. & Jensen, L. H. (1973). Acta Cryst. B29, 943–956. CrossRef CAS IUCr Journals Web of Science Google Scholar
Woolfson, M. M. (1956). Acta Cryst. 9, 804–810. CrossRef CAS IUCr Journals Web of Science Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.