research papers
The use of ACORN in solving a 39.5 kDa macromolecule with 1.9 Å resolution laboratory source data
aDepartment of Crystallography and Biophysics, University of Madras, Guindy Campus, Chennai 600 025, India, bDepartment of Biotechnology and Biomaterial Science, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan, cDepartment of Computational Biology, Biomolecular Engineering Research Institute, Furuedai 6-2-3, Suita, Osaka 565-0874, Japan, dTochigi Research Laboratories of Kao Corporation, 2606 Alkabane, Ichikai, Haga, Tochigi 321-3497, Japan, and eJapan Marine Science and Technology Center, 2-15 Natushima, Yokosuka 237-0061, Japan
*Correspondence e-mail: d_velu@yahoo.com
Data from the alkaline cellulase apo form were collected at a resolution of 1.9 Å using an in-house X-ray source (Cu Kα). By using different fragments of helices from the model solved by macromolecular crystallographic means, the direct-methods program ACORN was used to arrive at the complete model. Attempts have been made to use various percentages of input phasing information from these helices. The minimum input phasing required in feeding the fragments was about 14% of the whole structure. The phases obtained from ACORN were of superb quality, allowing automated model building to be carried out using ARP/wARP. Minimal manual model building was required and the was completed using the program REFMAC. The whole process, starting from the running of ACORN and ending with the refined model, took nearly 15 h of CPU time using a Pentium III PC.
Keywords: alkaline cellulase; ACORN; 1.9 Å resolution; macromolecules; ARP/wARP; in-house X-ray sources.
1. Introduction
Ab initio solutions of the crystal structures of small molecules are possible by using atomic-resolution diffraction data, usually at ∼0.8 Å. Most of these small molecular crystal structures are usually solved using direct-methods programs. The estimation of the linear relationships among phases forms a major part of However, the probabilistic estimates of these relationships become invalid as the number of atoms in the becomes very large (e.g. thousands of atoms). Macromolecules come under this category. Also, the diffraction data available for macromolecular crystals are not usually at an atomic resolution. For these reasons, cannot be used to solve macromolecules.
During the last decade, admirable advances have taken place in the data-collection facilities and techniques available to the macromolecular crystallographer. With advances such as more intense X-ray sources, in particular dedicated synchrotron beamlines, highly efficient two-dimensional detectors in the form of imaging plates and, more recently, charged-couple devices, and cryogenic et al., 2002) per week is around 75 at present, and around 25000 structures have so far been deposited. With the above advances, more data sets appear to be coming from atomic-resolution data. The above possibility of gaining atomic-resolution data even for macromolecules prompted the direct-methods practitioners to make attempts to extend the using other macromolecular techniques (using anomalous scattering/density modification approaches etc.) to enable them to tackle the structure solution of macromolecules. ACORN is a comprehensive and efficient phasing procedure involving for the determination of protein structures when atomic-resolution data are available (better than 1.2 Å).
to alleviate the effects of radiation damage and extend the resolution of data accessible, the average number of structures deposited in the Protein Data Bank (Berman2. Description of the program
Foadi et al. (2000), Foadi (2003) and Yao (2002) have described how ACORN can generate phases that are good enough to reveal the whole molecule from a slightly `better-than-random' starting set. This initial phasing can be generated: (i) from a correctly positioned structural fragment (obtained from methods), (ii) from a few heavier atoms such as a metal or the sulphur atoms of a molecule, (iii) in favourable cases by testing many randomly positioned atoms or selecting the most promising for phase extension, (iv) from a small fragment of a known structure with some sequence homology or (v) from an idealized secondary structural element such as an alpha helix or small section of a beta sheet. The use of ACORN to determine substructures has recently been detailed (Dodson & Yao, 2003). It has even been successful in locating 155 Se sites. Foadi (2003) has detailed the general underlying concepts of ACORN for the solution of protein structures. Using data, a can be determined (e.g. one containing S or Se atoms) in order to be used for the determination of whole protein structures. ACORN needs an atomic resolution higher than 1.3 Å, but for determination of substructures the resolution can be as low as 3 Å (see Mukherjee et al., 1989). Dauter & Adamiak (2001) and Dauter et al. (2002) have described the question of accuracy and the attainment of this in measurements in macromolecules at higher resolutions where the errors in measurements are often of the same order as the free signal.
Ramagopal et al. (2003) have detailed how even a small Bijvoet ratio of 0.6% arising from sulfur can be used favourably in solving a macromolecule. By using any of the above models, ACORN then uses a combination of approaches, most importantly dynamic density modification (DDM), to set up a refined set of phases. Foadi (2003) has given a detailed explanation of the reasons for the failure of ACORN when the resolution is below 1.2 Å. In ACORN the DDM modifies the density by sharpening or broadening. The whole purpose of doing this is to build up individual atomic peaks. Such a process is clearly suitable when the map shows details at atomic resolution; it is less clear how the density can be properly modified when it does not show atomic details. At atomic resolution, two neighboring atomic peaks will be two separate entities and the DDM will enhance both of them. At lower resolutions, these two peaks will merge into a single peak and the DDM, when acting on this peak, will just enhance it and hence no positive phase can be expected in this situation. As the height of the various initial density peaks will be different at different resolutions, the shape of the density modification curve will have to be properly modified and, as such, this part in ACORN is likely to be difficult at resolutions which do not show `peakiness'. Our present work, reported in this paper, overcomes this problem at low resolution (1.9 Å) by using fragments as `seed phasing' information. As detailed papers on the ACORN program have already appeared in the literature (McAuley et al., 2001; Banumathi et al., 2002; Rajakannan et al., 2002, 2003; Velmurugan et al., 2002), including its various options and its use in determining substructures, and the ab initio of macromolecules (Rajakannan et al., 2003), this paper mainly focuses on the discussion of the applications of ACORN to the structure elucidation of a 357 residue containing the alkaline cellulase apo form diffracting at 1.9 Å resolution with laboratory source Cu Kα data (Shirai et al., 2001).
3. Materials and methods
3.1. Crystallization
A crystal was grown using the hanging-drop vapor-diffusion method at 291 K. The initial conditions were 1 ml of 40 mM cadmium sulfate hydrate and 0.5 M sodium acetate in 0.1 M HEPES buffer (pH 7.5) for the reservoir, and a mixture of 4 ml of the reservoir solution and 4 ml of 2% (w/v) protein solution for the drop. The crystal grew to an approximate size of 0.3 mm × 0.3 mm × 0.4 mm in one week.
3.2. Intensity data collection
The X-ray diffraction experiment was carried out by using an R-AXIS IV imaging-plate detector with a mirror–monochromator mounted on a Rigaku RU-300 copper rotating-anode X-ray generator (wavelength = 1.54 Å). The crystal was flash frozen in liquid nitrogen and kept in a 100 K dry-N2 stream (Oxford Cryosystems) during the experiment. The crystal belonged to trigonal P3121 and diffraction images were processed using the DENZO and SCALEPACK programs (Otwinowski & Minor, 1997). The statistics for the data collection are summarized in Table 1.
|
4. Overview of the method
Table 1 lists the crystallographic data and the various helices used as `seeds' in ACORN, along with the total number of residues in each of these helices. This table also defines the various sets, which contain different combinations of these helices, along with the CPU time for running ACORN in each case. ACORN output which yielded successful and unsuccessful models are denoted by Y and N, respectively, in the `Result' column.
For set 1, 11 helices containing 102 residues were given as input to ACORN. Here, the ACORN PHASE option was selected for the structure solution. The R-factor and for the medium reflections with the normalized structure factors E from 0.1 to 1.2 of the initial model are 49.5% and 0.1034, respectively. Within ten cycles of the DDM the R-factor and attained 47.9% and 0.1437 indicating a good solution. A map was calculated for the ACORN output phases and we were able to find 546 peaks which were above the 5σ cut-off. The phases were then fed to ARP/wARP (Perrakis et al., 1999) and REFMAC (Murshudov et al., 1999). After the initial model building by ARP/wARP, the Rw and Rf values were 41.8 and 42.3%. This initial model was refined and ten cycles of auto-building along with five cycles of REFMAC in each auto-building cycle were performed. Finally, ARP/wARP was able to build 346 out of 357 residues in five chains. At this stage the Rw and Rf values were 19 and 24.5%, respectively. The map also showed the densities in the missing region, so the manual model building was carried out for the remaining 11 residues. After the manual model building, 20 cycles of were performed using REFMAC and solvent atoms were updated after the using the ARP/wARP `build solvent atoms' script. The final Rw and Rf values were 15.7 and 18.4%, respectively. The backbone of this final model was superimposed with structure conventionally solved by the method. The root-mean-square deviation was 0.29 Å and the details are shown in Table 2. The results for sets 2 to 8 are also shown.
|
5. Results and discussion
Figs. 1, 2 and 3 show sections of the final for sets 1, 4 and 8, respectively. Table 2 lists the ACORN statistics and the ARP/wARP running details for these cases. The final results obtained in each case are also given. The results show that a minimum of 321 residues can be built automatically when the `seed phasing' information comes from the helices corresponding to sets 1 to 7. Although only 16 residues in three chains were initially built in the case of set 8 when given as `seed', an iterative cycle carried out with these output phases (16 residues plus 4012 dummy atoms) revealed around 346 residues with a high value of 0.98 for the connectivity index. Table 2 also lists the root-mean-square deviations of the backbone atoms with the reported structure for each final model. The correlation coefficients of the maps before and after ACORN for the final map for all the sets are also shown in Table 2.
6. Conclusions
Based on the published work and the work carried out by our group (Rajakannan et al., 2004), it has now become clear that very little initial information is needed to determine the structure of a protein using ACORN. Among the multiple solutions, the correct solutions can be obtained in all trials with high reliability by the working of the and hence high resolution and fairly complete diffraction data enable one to solve a protein ab initio, in a relatively short amount of time. Thus ACORN has the great potential to establish itself as a program for high-throughput structure determination.
The success of ACORN relies mainly on the DDM, and the work carried out by us so far clearly substantiates the fact that automatic model building of ACORN output phases was possible in most cases because the quality of the output by ACORN was superb. To substantiate our finding we have superposed the final model obtained with the electron density from ACORN with the background. The final model in most of the cases fits with the electron density output from ACORN. The essence of this paper is in the applicability of ACORN to real data from laboratory sources even at a resolution of 1.9 Å. Success was obtained when the initial percentage of input information was just 14%. Currently, in order to extend the applicability of ACORN to lower resolutions, the seed phasing has been obtained from the native structure itself (as the structure had already been solved by traditional macromolecular crystallographic methods). We are at present working on the seed-feeding aspect of ACORN using a data-mining approach of the secondary structural elements which can be obtained from the data deposited in the Protein Data Bank.
Given the success of ab initio using ACORN at medium high resolution (1.9 Å) with laboratory source Cu Kα data, this result, if reproduced with other protein crystals needing to be solved de novo, would alter the nature of the balance between synchrotron- and laboratory-based protein crystalography research.
Acknowledgements
VR and DV thank the Department of Science and Technology, Government of India, for funding a major project, which enabled them to carry out this work. Part of this work was carried out at the Department of Biotechnology, School of Engineering, Nagoya University, Nagoya, Japan, under the VBL Visiting Professorship Scheme awarded to one of the authors (DV).
References
Banumathi, S., Rajakannan, V., Velmurugan, D., Dauter, Z., Dauter, M., Tsai, M. D. & Sekar, K. (2002). Japanese Crystallographic Society Meeting, Poster, P3-II-27, 123.
Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N. & Bourne, P. E. (2000). Nucl. Acids Res. 28, 235–242. Web of Science CrossRef PubMed CAS
Dauter, Z. (2002). Acta Cryst. D58, 1958–1967. Web of Science CrossRef CAS IUCr Journals
Dauter, Z. & Adamiak, D. A. (2001). Acta Cryst. D57, 990–995. Web of Science CrossRef CAS IUCr Journals
Dauter, Z., Dauter, M. & Dodson, E. J. (2002). Acta Cryst. D58, 494–506. Web of Science CrossRef CAS IUCr Journals
Dodson, E. J. & Yao, J. X. (2003). Crystallogr. Rev. 9, 67–72. CrossRef CAS
Foadi, J. (2003). Crystallogr. Rev. 9, 43–65. CrossRef CAS
Foadi, J., Woolfson, M. M., Dodson, E. J., Wilson, K. S., Jia-xing, Y. & Chao-de, Z. (2000). Acta Cryst. D56, 1137–1147. Web of Science CrossRef CAS IUCr Journals
McAuley, K. E., Yao, J. X., Dodson, E. J., Lehmbeck, J., Astergaard, P. R. & Wilson, K. S. (2001). Acta Cryst. D57, 1571–1578. Web of Science CrossRef CAS IUCr Journals
Mukherjee, A. K., Helliwell, J. R. & Main, P. (1989). Acta Cryst. A45, 715–718. CrossRef CAS Web of Science IUCr Journals
Murshudov, G. N., Lebedev, A., Vagin, A. A., Wilson, K. S. & Dodson, E. J. (1999). Acta Cryst. D55, 247–255. Web of Science CrossRef CAS IUCr Journals
Otwinowski, Z. & Minor, W. (1997). Methods Enzymol. 276, 307–326. CrossRef CAS Web of Science
Perrakis, A., Morris, R. M. & Lamzin, V. S. (1999). Nature Struct. Biol. 6, 458–463. Web of Science CrossRef PubMed CAS
Rajakannan, V., Velmurugan, D., Yamane, T., Dauter, Z., Dauter, M., Tsai, M. D. & Sekar, K. (2002). Japanese Crystallograhic Society Meeting, Poster P3-I-22, 84.
Rajakannan, V., Yamane, T., Shirai, T., Kobayashi, T., Ito, S. & Velmurugan, D. (2003). International Symposium on Diffraction Structural Biology, Tsukuba, Japan, 28–31 May 2003, Poster P-085.
Rajakannan, V., Yamane, T., Shirai, T., Kobayashi, T., Ito, S. & Velmurugan, D. (2004). J. Synchrotron Rad. 11, 64–67. Web of Science CrossRef CAS IUCr Journals
Ramagopal, U. A., Dauter, M. & Dauter, Z. (2003). Acta Cryst. D59, 868–875. Web of Science CrossRef CAS IUCr Journals
Shirai, T., Ishida, J., Noda, H., Yamane, T., Ozaki, K., Hamada, M. & Ito, S. (2001). J. Mol. Biol. 310, 1079–1087. Web of Science CrossRef PubMed CAS
Velmurugan, D., Rajakannan, V., Yamane, T., Dauter, Z. & Sekar, K. (2002). Japanese Crystallographic Society Meeting, Poster P3-II-26, 122.
Yao, J. X. (2002). Acta Cryst. D58, 1941–1947. Web of Science CrossRef CAS IUCr Journals
© International Union of Crystallography. Prior permission is not required to reproduce short quotations, tables and figures from this article, provided the original authors and source are cited. For more information, click here.