- 1. Introduction
- 2. Problems with low-resolution data
- 3. Artificially extending the data
- 4. Weighting the extended data
- 5. Magnitudes for the extended data
- 6. Testing the estimation methods
- 7. Enhancements in density modification
- 8. Testing the new ACORN
- 9. Discussion
- Supporting Information
- References
- 1. Introduction
- 2. Problems with low-resolution data
- 3. Artificially extending the data
- 4. Weighting the extended data
- 5. Magnitudes for the extended data
- 6. Testing the estimation methods
- 7. Enhancements in density modification
- 8. Testing the new ACORN
- 9. Discussion
- Supporting Information
- References
research papers
A modified ACORN to solve protein structures at resolutions of 1.7 Å or better
aYork Structural Biology Laboratory, Department of Chemistry, University of York, Heslington, York YO10 5YW, England, and bDepartment of Physics, University of York, Heslington, York YO10 5YW, England
*Correspondence e-mail: yao@ysbl.york.ac.uk
ACORN has previously been shown to provide an efficient density-modification procedure for the solution of protein structures using diffraction data to better than 1.3 Å. The initial phase set could be obtained from a variety of sources such as the position of a heavy atom, a set of scatterers such as S that had been positioned from measurements, a fragment or a very low homology model placed from a molecular-replacement search. Several structures solved using the early version of ACORN have been reported in the literature. Here, the effect of applying the original ACORN procedures at lower resolution is reported and new procedures that yield good-quality maps with data sets of resolution down to 1.7 Å are described. These new procedures involve the artificial extension of data to atomic resolution and new density-modification processes that develop density at atomic positions that was previously suppressed. The test calculations were aimed firstly towards a proof of principle using a small fragment of a known structure to demonstrate that the procedure could generate correct density and a derived model in initially empty regions of the cell. Further tests addressed the use of more realistic starting models.
1. Introduction
ACORN (Foadi et al., 2000; Yao et al., 2002) was developed as a procedure that can refine a poor starting phase set without intervention by the user when the resolution of the data is 1.3 Å or better (atomic resolution). With good-quality data, the procedure can reduce the phase error from an initial phase error of up to 80° to below 20° and thus provide a clear and complete image of the structure. It has been applied successfully to the solution of several new structures when the initial fragment is a poor or incomplete molecular-replacement solution, a fragment of known conformation, such as an α-helix (Yao, 2002), a set of heavier atoms positioned using anomalous or isomorphous differences (Gordon et al., 2001) or, for metalloproteins, the heavy-atom structure alone (McAuley et al., 2001).
The phase-improvement tool starts with the phases from the fragment and uses a density-modification procedure, DDM (dynamic density modification), to develop the complete structure. We give here a short description of the basic DDM process as the starting point for the improvements we have made.
Firstly, a map is calculated with appropriate Fourier coefficients and phases derived from the previously modified map (or from the fragment in the first W(h)Eo(h) for the stronger Eo(h). The limit on Eo(h) is chosen to give sufficient reflections for the procedure: about ten times the number of atoms in the The weight W(h) is akin to a Sim weight (Sim, 1960) and is given by
cycle). The initial coefficients arewhere X = |Eo(h)||Ec(h)| and |Ec(h)| is the amplitude of the Fourier coefficient of the previous map scaled to the values of |Eo(h)| in shells in reciprocal space.
Secondly, the standard deviation σ of the map density is calculated. The map density is then modified by
where k is a constant given by the user (default value 3) and n is the minimum of the cycle number and 5.
The basis of DDM is that even when the mean phase error is high, say 75°, the expected value of the map density at an atomic centre is several standard deviations σ above the average density, which will be close to zero (see supplementary material1). If the density at a point in the map is 1σ, then this is well within the bounds of random fluctuations and the probability that an atom is centred at that point is quite low. In contrast, if the density at a point is 4σ, then this represents a rare excursion from the mean by a random fluctuation so that the probability that an atom is situated there is quite high. The DDM process modifies density in such a way that lower density is proportionately reduced, thus increasing the probability that the residual density models true atomic positions. The high density that arises in the positions of fragment atoms is savagely truncated to reduce the initial phase bias.
To follow the process of s, is calculated between the magnitudes of the Fourier coefficients of the modified map and the magnitudes of the reflections not included in calculating the map. For a well behaved with high-resolution data the value of CCs rises steadily as the mean phase error reduces and saturation of the value of CCs may be taken to indicate the end of the process (Foadi et al., 2000).
a CCIt has been found beneficial to introduce one or two steps of what is called Sayre-equation et al., 2000). Paradoxically, although SER usually makes both the value of CCs smaller and the mean phase error larger, it conditions the phase set so that new peaks appear in the resultant map and the subsequent by DDM progresses further than it would have done without the SER intervention.
(SER) before commencing DDM and occasionally within the procedure, especially if the appears to stagnate. This procedure modifies phases in such a way that the is better satisfied for both large and small structure factors. It involves the calculation of six Fourier transforms per step (Foadi2. Problems with low-resolution data
The analysis given in the supplementary material1 indicates that the expectation density at an atomic centre in units of the standard deviation and assuming an equal atom distribution falls dramatically with a reduction in the resolution limit of the data. For data with resolution 1.7 Å and perfect phases the density at an atomic centre is about 5σ, which means that with perfect phases the atoms will show up well. The corresponding atomic centre density for perfect phases at 1.0 Å is about 11σ. Approximately, for a mean phase error of say 75°, the expectation values at atomic centres must be multiplied by cos75° (0.259), giving 2.8σ and 1.3σ for data resolutions of 1.0 and 1.7 Å, respectively. This gives a sufficiently good starting point for at the higher resolution but not at the lower. A lower initial mean phase error, in the region 55–60°, might enable a structure to be refined at 1.7 Å, but such a starting point may be difficult to achieve.
However, even if a suitable starting point for 1 also shows that the expectation density at atomic centres favours heavier atoms disproportionately. In trials with known structures with data sets in the resolution range 1.5–1.7 Å, the application of DDM gives final mean phase errors of the order of 45–50° and the final maps tend to show all S atoms and many O and N atoms but comparatively few C atoms, although C=O often appears as a single peak. For atoms bonded to three other atoms the density is often weak or missing because for 1.6 ± 0.1 Å data they lie in the diffraction ripple minimum of the neighbouring atoms. This category includes the Cα atoms, so the main chain appears broken and automated chain-tracing methods fail.
could be found, the difficulties with low-resolution data do not end there. The supplementary materialThe atomic resolution required for a successful solution has severely limited the proportion of structures to which ACORN could be applied, so we have endeavoured to extend the resolution range of applicability, the results of which we now report.
3. Artificially extending the data
A key strategy that has been found to improve the performance of ACORN at low resolution greatly is to extend the data to 1 Å artificially and to include these non-observed reflections in the phasing process. (Indeed, there is also considerable benefit in filling in any missing data in the observed resolution range, especially at low resolution.) If reasonable phase estimates can be made for these artificially introduced reflections then they add information to the resultant map, the peak shape becomes more `atomic' and the DDM procedure more effective; it is well known that if phases are correct the resultant map will show the expected features, almost regardless of the magnitudes. The question then arises of what magnitudes to give these introduced reflections and how to weight their contributions to a map.
4. Weighting the extended data
The default weighting scheme used for the observed data when calculating maps is given in (1). For the extended reflections there is no problem in finding |Ec(h)| since it can be calculated from the original fragment to any resolution and can also be obtained from successive maps. The problem is to find an estimate of |Eo(h)| for the extended reflections, an estimate we represent by |Eext(h)|. If precise estimates could be made for these reflections then the position would be equivalent to that of having high-resolution data and weighting scheme (1) could be used as it is. However, only crude estimates can be made of these normalized structure amplitudes, so that some down-weighting of their contribution to the map is desirable. This down-weighting should also take account of the number of extended reflections relative to the number of observed, otherwise these flawed reflections would overwhelm the effect of the observed reflections. An ad hoc weighting scheme that we have found to be effective for the extended reflections is
where
and nobs is the number of observed reflections, next is the number of extended reflections and |Ec(h)| is the from the fragment or map scaled in shells in to make 〈|Ec|2〉 = 1. This has the effect that the greater the percentage of extended reflections used, the smaller their weights. More sophisticated weighting schemes based on estimates of the phase error from correlation coefficients were also tried, but to date none of those led to consistent results for the current set of test structures.
5. Magnitudes for the extended data
It is necessary to provide some estimate of the normalized structure amplitudes for the artificially extended data. Various ideas were explored.
5.1. Extension (i): using the average value of |E|2
By definition, the average value of |E|2 is 1, so taking |Eext(h)| = 1 for all the extended reflections provides a very simple estimate, considerably better than effectively setting these values to 0.0.
5.2. Extension (ii): an from |Efrag(h)|
Although the normalized contribution of the fragment itself, Efrag(h), is found to be a poor indicator of the magnitude of the extended normalized structure amplitudes, they should still have some influence on the Intuitively, it is evident that the greater the contribution from the fragment then the greater the of the normalized For a structure with N equal atoms in the and a fragment of m atoms, the root-mean-square of the normalized of an extended reflection is
This gives some departure from the m/N = 0.05 the is 0.975 for |Efrag(h)| = 0 and 1.183 for |Efrag(h)| = 3.
of unity as given by extension (i). For example, with5.3. Extension (iii): using calculated |E| values from the fragment
An important diagnostic tool in the correct location of a fragment is the
between the structure amplitudes calculated from the fragment and those of the observed reflections. For a small fragment this is weak (of the order 0.01–0.02, but positive), so we tested the idea of using the fragment contributions as estimates of the measured observations.6. Testing the estimation methods
Tests were made with the structure of penicillopepsin (PDB code 1bxo ; Ding et al., 1998). The is C2, with observed data to 0.9 Å and 2977 independent non-H atoms in the For the tests the data were truncated to 1.5 Å so that comparisons with actual observations could be made in the 1.5–1.0 Å range. A starting fragment was selected containing 400 atoms (13.4% of the structure). Using data to 1.0 Å resolution, the structure can be developed by ACORN to a final mean phase error of 13.1°, yielding a final map that clearly shows individual atoms and that can be interpreted by automatic procedures (Foadi et al., 2000). (A much smaller fragment can be used to give a successful solution with 1.0 Å data, but since 400 atoms are required with 1.5 Å data we use that size of fragment for comparison purposes.)
With the 400-atom fragment using 1.0 Å data the variation of the unweighted mean phase error for reflections with |E| > Estrong (1.2 in this case) with cycles of DDM is shown in Table 1.
|
At the truncated 1.5 Å resolution it was necessary to keep all observations with |E| > 0.8 in order to have sufficient reflections for the strong set. The initial R factor and the between calculated structure amplitudes from a 400-atom fragment and all observed structure amplitudes are more or less uniform over the whole resolution range and are equal to 55% and 0.15, respectively. Various phase errors between phases calculated from the fragment and phases calculated from the refined structure were calculated in the form
for wh = 1, W(h), |Eo(h)| and W(h)|Eo(h)|. These mean phase errors, calculated in resolution shells, are shown in Fig. 1. The lowest mean phase errors are for weights W(h)|Eo(h)|, which are the amplitudes of the coefficients used for the calculation of maps. Various criteria were used to estimate the effectiveness of the extension methods (i) to (iv) with data truncated to 1.5 Å. Fig. 2 illustrates the R factor between the different estimates of |Eext| and the true values in resolution shells for the resolution range 1.5–1.0 Å. It can be seen that the R factors for the pairs {(i), (ii)} and {(iii), (iv)} are similar and are much smaller for the first pair: approximately 0.41 compared with 0.55. From this it seems obvious that if we are looking for an effective way to extend magnitudes in the extended range we should just look at extension methods (i) and (ii); further testing confirms this conclusion.
Applying extension method (i) for the extended reflections gave the variations of mean phase error reported in Table 2 for reflections to 1.5 Å resolution with cycle number, which are compared with the results from the inferior magnitude-extension method (iii).
|
Applying DDM phase
with extension method (ii) was slightly worse than for method (i), but not significantly so. It can be concluded that as a default procedure the simple method (i) should be used since it is independent of the size and accuracy of the fragment.7. Enhancements in density modification
An application of the original ACORN to the structure of 1bxo using only the truncated data at 1.5 Å gives a final mean phase error that differs very little from the starting error of ∼60°: the introduction of the extension procedure has thus made a dramatic improvement. Nevertheless, for the reasons given in §2, the maps are still difficult to interpret, especially since many Cα atoms have no density. It is known that density using several different procedures is often more effective than any one of the procedures alone (Abrahams & Leslie, 1996; Cowtan & Main, 1996; Refaat et al., 1996), so we have introduced some new density-modification processes to enhance weaker density and thus to reveal lighter atoms.
Some other types of map coefficients are used in these steps. The first maps used for DDM used coefficients W(h)Eo(h). As the electron density shows more of the model, maps with other kinds of coefficients also become useful. We have investigated the use of coefficients W(h)[|2Eo(h)| − |Ec(h)|] and Wext(h)[2|Eext(h)| − |Ec(h)|] for the observed and extended data, respectively, or σA-weighted maps with map coefficients 2m(h)|Eo(h)| − σA|Ec(h)| and 2m(h)|Eext(h)| − σA|Ec(h)|. Density-modification procedures starting from these maps are designated as DDM1 and DDM2 in the following discussion and the original procedure designated as DDM0. The maps used for DDM1 and DDM2 have some similarity to those calculated with coefficient amplitudes 2|Eo| − |Ec| traditionally used to bring up the density for missing atoms.
A description of the quantity σA was first given by Srinivasan (1966) and developed for macromolecular applications by Read (1986). The value of σA reflects the current mean phase error [in ACORN estimated from the correlation coefficients between |Eo| and |Ec|] and m is a figure of merit calculated for each reflection based on the agreement of Eo(h) and Ec(h) and the current value of σA. When the phase error is large, σA is small and DDM2 gives more or less the same results as DDM0.
There are four steps in an enhancement cycle.
The density-enhancement procedure is introduced for one cycle whenever the process of phase s indicate that no further improvement is possible. We are investigating many more test cases to try to select the best protocol.
slows down. After an enhancement step, further cycles (of DDM0, DDM1 or DDM2) will continue, as specified by user-given keywords. The default procedure is to use DDM0 only. The is stopped when the values of the CC8. Testing the new ACORN
The new ACORN was tested on a series of known structures selected to establish its performance with data in the 1.5–1.7 Å range. Half of the tests described here were initiated from a fragment of the refined model. This is of course unrealistic, but allows us to analyse the final maps for features in regions outside the fragment. Details are summarized in Table 3.
|
The tested structures were the following.
|
Taken together, these five examples provide an excellent validation of the usefulness of the new ACORN.
8.1. Penicillopepsin (PDB code 1bxo )
The data were restricted to 1.5 Å for the purposes of the test and extended to 1.0 Å by extension method (i). The starting model of 400 atoms was taken from the final model and gave an unweighted mean phase error of 60.7° for observed reflections with |E| > 0.8 (a limit chosen to give sufficient reflections which is about ten times the number of atoms in the asymmetric unit). After 58 cycles of DDM0 (without SER insertions) the value of CCs was 0.084 and the mean phase error was reduced to 39.6°. The map was 0.6860 for the whole structure, 0.7056 for the main chain and 0.6641 for side chains.
With the inclusion of the density-enhancement procedures, the value of CCs became 0.119, the mean phase error was reduced to 34.5° and the map was 0.7754 for the whole structure, 0.7959 for the main chain and 0.7488 for side chains. The improvement in the mean phase error and map is substantial. The map now showed atoms that were previously missing. The R factor and CC found in resolution shells are illustrated in Fig. 3(a), in which the sharp changes beyond the imposed artificial resolution limit are clearly seen. The final mean phase errors, also in resolution shells, are given in Fig. 3(b) for the same weighting schemes used in Fig. 1. It is interesting that although no observed data are used beyond 1.5 Å resolution, the phases for the extended reflections are significantly better than random.
8.2. C. jejuni dUTPase (PDB code 1w2y )
For this structure, with P212121, the experimental data reach 1.65 Å resolution and there are 430 residues in the (Moroz et al., 2004). A low-homology model, Trypanosoma cruzi dUTPase (PDB code 1ogk ), was used to solve the structure by with data to 4 Å.
As a proof of principle, ACORN was first applied with phases calculated from a starting fragment made up of 400 atoms (13% of the model) taken from the refined structure. The initial phase error was 62.2° and fell to 46.4° after 35 cycles of DDM0. The enhanced density-modification procedures were then applied. The sequence of application of the various components of density enhancement is given in Fig. 4, where ENH represents the enhancement cycle and the cycles using DDM1 and DDM2 are also indicated. The final mean phase error was reduced to 35.7° with a of 0.137. Using these phases, the map correlation was 0.7521 for the total structure, 0.8142 for the main chain and 0.7238 for side chains. While this is clearly an unrealistic starting fragment, it demonstrated the ability of ACORN to introduce correctly placed atoms in regions missing from the starting fragment and allowed us to analyse maps for new features in detail.
This structure is used to illustrate the improvements described in this paper. Fig. 5(a) shows the final ACORN map at 1.65 Å without the use of data extension or density enhancement; in fact, by use of the original ACORN. Adding data extension to assist the application of DDM0 but still calculating the map using 1.65 Å data only gives Fig. 5(b), which shows the advantage from the data extension very clearly. Adding the extended terms with their associated weights gives the further improvement seen in Fig. 5(c). Using both data extension and density enhancement to improve phases, the 1.65 Å map is shown in Fig. 5(d) and it is seen that the density enhancement introduces density previously missing; as an example, in the left centre of Fig. 5(d) the main chain is indicated by an unresolved bar of density. Finally, with data extension and density enhancement a 1.0 Å map, shown in Fig. 5(e), breaks up the unresolved density in Fig. 5(d) into resolved features representing the individual atoms.
Once a final ACORN map has been produced it can be interpreted in terms of atomic positions, usually by an automatic procedure such as ARP/wARP (Perrakis et al., 1999). Table 4 gives an analysis of the maps illustrated in Fig. 5 in terms of the peaks they contain and the proportion corresponding to true atomic positions. For this purpose a `good' peak is defined as being within 0.4 Å of a refined atomic position.
The most important peaks for automatic interpretation of the map are those in the main chain and the number of these in the maps is given in Table 5, which also gives the number of heavier atoms (sulfur and magnesium) found. The amount of new information, corresponding to atoms outside the fragment, is significantly higher for the map illustrated in Fig. 5(e) than for any of the other maps.
|
We subsequently used a realistic starting model, namely the T. cruzi structure positioned by Compared with the final structure, this model gave an r.m.s. deviation of 2.1 Å for the core 116 residues, an initial phase error of 78.3° for 26 859 E > 0.8 and CCs = 0.02206. 841 cycles of ACORN, using DDM0 and DDM1 with enhancement procedures, reduced the phase error to 43.4° with CCs = 0.10683. This was a challenging problem for ACORN as there are substantial changes in the relative orientations of the two domains between the T. cruzi and the C. jejuni enzymes. The final map correlation of 0.66 was nevertheless a considerable improvement on the starting model, giving a density map that was much improved compared with that from the MR model.
A third approach involved rigid-body ACORN. The r.m.s. deviation was 1.38 Å for 165 residues and the initial phase error was 52.6° with a more even resolution distribution. ACORN now reduced the phase error to 36.7° after 27 cycles of DDM0. Further using the enhanced density-modification procedures reduced it to 33.4° after a total of 140 cycles. The resultant maps, with a of 0.77, could be easily built using automated procedures.
of the initial MR model before applying8.3. Pseudoazurin (PDB code 1paz )
This is a known structure, P65, with data to 1.55 Å resolution containing 123 residues and one Cu atom in the (Petratos et al., 1988). The new ACORN program used a random search procedure with native data only to locate the Cu atom, a position that gave an initial mean phase error of 73.8° for reflections with |E| > 0.8. The reflections were extended to 1.0 Å by method (i), yielding 55 367 extended reflections compared with 19 625 observed.
In the s began to fall again and the mean phase error had fallen slightly to 71.4°. Two cycles of SER were then carried out, which had the effect of making the CCs lower and the mean phase error higher, although the latter information would not be available for an unknown structure. After the SER cycles DDM0 was recommenced until changes in CCs were sluggish, when SER cycles were again introduced. By cycle 192 there had been ten interventions of SER and the mean phase error had fallen to 26.4°. The same general procedure was then continued, but with DDM1 replacing DDM0, until cycle 256, at which stage the mean phase error had reduced to 25.5°. The process was then continued with the addition of density-enhancement with DDM1 from cycle 257 to 295, reducing the phase error further to 22.0°. Finally, from cycle 296 to 301 DDM2 was used to give a final mean phase error of 21.6°. The final weighted phase errors in resolution shells are illustrated in Fig. 6. The whole process took less than 2 h on a laptop (Toshiba Tecra M1). DDM0 was replaced by DDM1 or DDM2 when the CCs > 0.15, but further tests will be needed to select the best protocol.
procedure, DDM0 was carried out for 25 cycles, at which stage CC8.4. Gliding protein (PDB code 1j3w )
ACORN was applied to the gliding protein-MGLB from Thermus thermophilus (Lokanath et al., 2004). The structure belongs to C2221 with four molecules in the and the observed data reach 1.5 Å. It has 535 residues with 4665 atoms.
The first 150 main-chain atoms in the PDB file (3.2% of the structure) were taken as a starting fragment, giving an initial mean phase error of 71.3°. The number of extended reflections to 1.0 Å was 244 485, giving a ratio nobs/(nobs + next) of 0.30. After 85 cycles of DDM0 the phase error was reduced to 29.8° and the enhanced density-modification steps were then carried out and followed by DDM0. After 136 cycles, with five interventions of enhanced density modification, the value of CCs was 0.211 and the mean phase error was 24.0°. Density-enhancement procedures followed by DDM1 were then carried out to cycle 162, at which stage the mean phase error was 19.7°. Finally, DDM2 was used from cycles 163 to 168, reducing the phase error to 19.6°. The ACORN map was 0.8838 for the complete structure, 0.9243 for the main chain and 0.8556 for side chains. As for 1bxo , the phases for extended reflections were by no means random and the quality of maps can be improved by adding the extended reflections with estimated magnitudes, with appropriate weights and with phases coming from the Fig. 7 shows constant density surfaces for an F map with observed data to 1.5 Å and an E map which includes the extended reflections to 1.0 Å. The F map was easily interpreted or traced automatically by computer and the E map showed most individual atoms clearly.
8.5. Exo-mannosidase (PDB code 1uuq )
This is a known structure, P21212, solved in York using data to 1.5 Å resolution. It has 440 residues with 16 methionines (three in two conformations) in the (Dias et al., 2004). All data were collected from a single crystal. Measurements at the Se peak wavelength were available to 1.6 Å and for the remote wavelength to 1.5 Å resolution. The peak data were used to locate the selenium positions and estimate initial experimental phases at a range of resolutions using SHELXD and SHELXE. Starting from these phases, the new ACORN program was applied in two different modes.
The first used the experimental phases determined to 2.5, 2.1 and 1.6 Å by SHELXE (Sheldrick, 2002) as a starting point with phase errors of 49.3° (16 105 reflections), 41.6° (27 284 reflections) and 28.7° (74 907 reflections), respectively. (For comparison, using SHELXE to extend the phases to 1.5 Å using the native and the 1.6 Å anomalous set the mean phase error was 29° for all 74 907 reflections). The reflections were extended to 1.0 Å by method (i), yielding 176 780 extended reflections compared with 74 865 observed. The standard procedure rapidly improved and extended the phase set in all three cases, requiring only a few cycles of DDM0. The initial and final phase errors for all reflections are illustrated in Fig. 8. The phase error for the strong reflections |E| > 0.8 (38 372 reflections to 1.5 Å resolution) reduced to 29.6° after 38 cycles of DDM0, 24.9° after 18 cycles of DDM0 and 20.4° after 11 cycles of DDM0. The resultant map was of excellent quality. This is an extremely cost-effective use of ACORN: the SAD phasing set could be carefully measured to a limited resolution, conventional density-modification procedures used to refine the phases and excellent high-resolution phases generated very quickly from ACORN.
The second approach used the Se positions alone as a starting fragment. These gave an initial mean phase error of 74.1° for the strong reflections |E| > 0.8. The procedure used cycles of DDM0, with single passes of SER carried out whenever CCs stopped increasing and slowly reduced the phase error. The improvement was extremely sluggish, but these refinements were carried out in a fully automatic mode. The procedure continued until cycle 2712, when the CCs was 0.17 and the mean phase error was 30°. The same general scheme was continued, but with DDM1 replacing DDM0 from cycle 2712 to 2756 until a final CCs of 0.25 and mean phase error of 21.0° were obtained. The whole process took about 9 h on a laptop (Toshiba Tecra M1) but with no human intervention. An F map with observed data to 1.5 Å and an E map with the extended reflections to 1.0 Å are shown in Fig. 9.
For these data, starting from the experimental SAD phases is much faster than from phases calculated from the positions of the Se atoms as an initial fragment. This contrasts with previous successful applications of ACORN with measured data to atomic resolution, for example positioning the S atoms using 2.5 Å resolution sulfur-SAD data, and using these as a starting fragment with the 1.15 Å resolution data from the same crystal (Gordon et al., 2001).
9. Discussion
The effectiveness of ACORN has previously been established for structures with atomic resolution data and several previously unknown structures have been solved by its use. However, we realise that comparatively few protein data sets are collected to atomic resolution, but that if the resolution limit for ACORN could be raised to 1.5 Å or thereabouts, the application would extend to a much greater proportion of structures. For this reason, we set ourselves the objective of lowering the resolution limit of application. What has been demonstrated here is that if a suitable fragment can be found consisting typically of 10% of the structure, then ACORN can still give considerable phase improvement with lower resolution data. Rajakannan et al. (2004) tested ACORN with 1.9 Å resolution data in solving a 39.5 kDa structure; starting with a fragment consisting of 14% of the structure, they obtained a map with a around 0.5 that was of too low a quality to interpret directly. However, it gave a successful starting point for ARP/wARP.
Our new improved ACORN with the enhanced density-modification techniques offers a viable approach using data of resolution 1.7 Å or better. In particular, it should be very effective for structures containing heavier atoms whose positions can usually be readily determined. If heavier atoms are not present in the structure, then the barrier to using the improved ACORN is that of finding a suitable starting fragment: this will be the basis of future work.
During the preparation of this paper, but before its submission, a contribution by Caliandro et al. (2005) appeared which dealt with phasing beyond the resolution limit, a component of the present work. This approach was different to our own and confirmed the effectiveness of the procedure. They estimate the extended E amplitudes based on values calculated from the current map which depend on the phase error. This corresponds to procedure (iii) discussed in §5. In contrast, we found that the best approach was to set the extended amplitudes to their mean of 1, which is independent of the phase error. In the Caliandro paper, the number of extended amplitudes is not allowed to exceed 75% of the measured number, whereas for some of our applications, e.g. the case of 1w2y , the extended/measured ratio is as high as 3.5. However, the major difference in our studies lies in the quality of the starting model. Caliandro and coworkers typically started from phases with errors around 30–60° and gain a modest improvement, whilst we are able to start from a much poorer phase set from an earlier stage of the solution process.
A key feature of ACORN remains the significance of the CC estimate as a correct indicator of a successful application. It is straightforward to see when ACORN has produced a meaningful increase in CC and perhaps more importantly to see when it has failed to do so.
Acknowledgements
We are grateful for the support BBSRC has given to this project through grant 87/B15857. We also express our thanks to Olga Moroz for generously providing the data and molecular-replacement model for our 1w2y trial prior to publication of the structure.
References
Abrahams, J. P. & Leslie, A. G. W. (1996). Acta Cryst. D52, 30–42. CrossRef CAS Web of Science IUCr Journals Google Scholar
Caliandro, R., Carrozzini, B., Cascarano, G. L., De Caro, L., Giacovazzo, C. & Siliqi, D. (2005). Acta Cryst. D61, 556–565. Web of Science CrossRef CAS IUCr Journals Google Scholar
Cowtan, K. & Main, P. (1996). Acta Cryst. D52, 43–48. CrossRef CAS Web of Science IUCr Journals Google Scholar
Dias, F. M. V., Vincent, F., Pell, G., Prates, J. A. M., Centeno, M. S. J., Tailford, L. E., Ferreira, L. M. A., Fontes, C. M. G. A., Davies, G. J. & Gilbert, H. J. (2004). J. Biol. Chem. 279, 25517–25526. Web of Science CrossRef PubMed CAS Google Scholar
Ding, J., Frasere, M. E., Meyer, J. H. & Bartlett, P. A. (1998). J. Am. Chem. Soc. 120, 4610–4621. Web of Science CrossRef CAS Google Scholar
Foadi, J., Woolfson, M. M., Dodson, E. J., Wilson, K. S., Yao, J.-X. & Zheng, C.-D. (2000). Acta Cryst. D56, 1137–1147. Web of Science CrossRef CAS IUCr Journals Google Scholar
Gordon, E. J., Leonard, G. A., McSweeney, S. & Zagalsky, P. F. (2001). Acta Cryst. D57, 1230–1237. Web of Science CrossRef CAS IUCr Journals Google Scholar
Lokanath, N. K., Shiromizu, I., Ohshima, N., Nodake, Y., Sugahara, M., Yokoyama, S., Kuramitsu, S., Miyano, M. & Kunishima, N. (2004). Acta Cryst. D60, 1816–1823. Web of Science CrossRef CAS IUCr Journals Google Scholar
McAuley, K. E., Yao, J.-X., Dodson, E. J., Lehmbeck, J., Ostergaard, P. R. & Wilson, K. S. (2001). Acta Cryst. D57, 1571–1578. Web of Science CrossRef CAS IUCr Journals Google Scholar
Moroz, O. V., Harkiolaki, M., Galperin, M. Y., Vagin, A. A., González-Pacanowska, D. & Wilson, K. S. (2004). J. Mol. Biol. 342, 1583–1597. Web of Science CrossRef PubMed CAS Google Scholar
Perrakis, A., Morris, R. & Lamzin, V. S. (1999). Nature Struct. Biol. 6, 458–463. Web of Science CrossRef PubMed CAS Google Scholar
Petratos, K., Dauter, Z. & Wilson, K. S. (1988). Acta Cryst. B44, 628–636. CrossRef CAS Web of Science IUCr Journals Google Scholar
Rajakannan, V., Selvanayagan, S., Yamane, T., Shirai, T., Kobayashi, T., Ito, S. & Velmurugan, D. (2004). J. Synchrotron Rad. 11, 358–362. Web of Science CrossRef CAS IUCr Journals Google Scholar
Read, R. J. (1986). Acta Cryst. A42, 140–149. CrossRef CAS Web of Science IUCr Journals Google Scholar
Refaat, L. S., Tate, C. & Woolfson, M. M. (1996). Acta Cryst. D52, 1119–1124. CrossRef CAS Web of Science IUCr Journals Google Scholar
Sheldrick, G. M. (2002). Z. Kristallogr. 217, 644–650. Web of Science CrossRef CAS Google Scholar
Sim, G. A. (1960). Acta Cryst. 13, 511–512. CrossRef IUCr Journals Web of Science Google Scholar
Srinivasan, R. (1966). Acta Cryst. 20, 143–144. CrossRef CAS IUCr Journals Web of Science Google Scholar
Wang, B.-C. (1981). Acta Cryst. A37, C11. CrossRef IUCr Journals Google Scholar
Wang, B.-C. (1985). Methods Enzymol. 115, 90–112. CrossRef CAS PubMed Google Scholar
Yao, J.-X. (2002). Acta Cryst. D58, 1941–1947. Web of Science CrossRef CAS IUCr Journals Google Scholar
Yao, J.-X., Woolfson, M. M., Wilson, K. S. & Dodson, E. J. (2002). Z. Kristallogr. 217, 636–643. Web of Science CrossRef CAS Google Scholar
© International Union of Crystallography. Prior permission is not required to reproduce short quotations, tables and figures from this article, provided the original authors and source are cited. For more information, click here.