short communications
On the possibility of using polycrystalline material in the development of structure-based generic assays
aNational Synchrotron Light Source, Brookhaven National Laboratory, Upton, NY 11973-5000, USA, bDepartment of Physics and Astronomy, Stony Brook University, Stony Brook, NY 11794-3800, USA, and cDepartment of Biomedical Engineering, Stony Brook University, Stony Brook, NY 11794-2580, USA
*Correspondence e-mail: allaire@bnl.gov
The discovery of ligands that bind specifically to a targeted protein benefits from the development of generic assays for high-throughput screening of a library of chemicals. Protein powder diffraction (PPD) has been proposed as a potential method for use as a structure-based assay for high-throughput screening applications. Building on this effort, powder samples of bound/unbound states of soluble hen-egg white lysozyme precipitated with sodium chloride were compared. The correlation coefficients calculated between the raw diffraction profiles were consistent with the known binding properties of the ligands and suggested that the PPD approach can be used even prior to a full description using stereochemically restrained Rietveld refinement.
Keywords: protein–ligand complexes; assays; powder diffraction; lysozyme.
1. Introduction
The availability of gene sequences, such as the completion of the Human Genome Project, has revolutionized our understanding of life sciences at a molecular level. From sequences, we can generate a list of all the potential components of the cell. This list includes the entire possible open reading frame that could generate proteins. A grand challenge in the genomic era is to elucidate the function of these hypothetical proteins. In addition, for some of these proteins we can anticipate multiple functions, which could also involve more than one biological component. It is imperative to pursue the development of methods to characterize the role of proteins.
Numerous approaches have been developed to identify the functions of hypothetical proteins. Major developments in bio-informatics have provided tools for the analysis and comparison of gene sequences. The concept behind this approach is to infer the potential function of hypothetical proteins from their sequence homology with proteins of known function. This method is extremely powerful, especially for recognizing functions when comparing sequences from different species. Another approach is structural genomics, which aims at identifying the three-dimensional structures of most proteins that are easily obtainable from knowledge of their corresponding DNA sequences. The long-range goals of this initiative are to find all possible protein folds and to provide structural descriptions of (hypothetical) proteins. In this approach, the functions of a protein can be inferred from structural homology with another protein structure of known functionality. Nonetheless, there are a number of `orphan' structures for which no known function can be assigned. In all cases the functions of these proteins remain to be further characterized in vitro and, more importantly, in vivo.
In recent years, chemical genetics has been proposed for the study of gene-product functions in the cellular or organismal context using exogenous ligands. In this approach, small molecules that bind specifically to targeted proteins are used to alter protein functions, enabling a kinetic analysis of potential phenotypes in vivo. In order to be successful, the chemical genetics approach requires the identification of protein ligands. The strategy employed for the discovery of ligands is similar to that exploited by the pharmaceutical industry in the development of new drugs. It consists of high-throughput screening (HTS) of a library of chemicals (>100 000 compounds) using a particular assay that can identify the specificity of a compound for binding to a target protein. Optimizing the chemistry of an initial hit is generally required in order to increase the affinity of a ligand for its binding partner. A major limitation of the chemical genetics approach is that the function of the target protein is not known a priori and therefore an HTS generic assay should be used.
Generic assays take advantage of the biophysical properties of proteins when binding to a ligand. A well known example is the release of thermodynamic binding energy during the formation of protein–ligand complexes. Alternative approaches could use structure-based assays, such as a change in diffraction in macromolecular crystallography. This method of screening chemicals is limited by the tedious preparation of the large number of crystals required for HTS applications. Recently, protein powder diffraction (reviewed in Von Dreele, 2003; Margiolaki & Wright, 2008) has been proposed as a structure-based method that could facilitate this step. Pioneering work by R. B. Von Dreele has identified the potential for extracting structural information from protein powder diffraction profiles (Von Dreele et al., 2000) and for the identification of ligand-bound states of lysozyme (Von Dreele, 2001, 2005; D'Amico & Von Dreele, 2006). Building on this effort, we report here that differences between bound and unbound states of lysozyme can be recognized by comparison of the powder diffraction profiles even prior to a full description following stereochemically restrained Rietveld refinements (Rietveld, 1969; Von Dreele, 1999; Larson & Von Dreele, 2004).
2. Experimental
The promise of protein powder diffraction is that it has the potential to distinguish bound from unbound forms of a protein when mixed with different ligands. The rationale for this experiment is based on the idea that the diffraction of protein powder would differ when directly comparing the powder profiles of unbound and ligand-bound states of lysozyme. Lysozyme hydrolyzes the β(1–4) glycosidic bond between N-acetylmuramic acid (NAM) and N-acetylglucosamine (NAG) residues in certain In this experiment, we take advantage of the known properties of lysozyme to bind NAG and NAG3 and use glucose as a negative control (Rupley et al., 1967).
In an initial step, we prepared homogenous crystalline powder from soluble lysozyme. An obvious approach in producing polycrystalline powder is to increase the number of nucleation sites by having a super-saturated solution compared with single-crystal growth conditions. Hen egg-white lysozyme (HEWL) was dissolved in 100 mM sodium acetate buffer pH 4.5 to a concentration of 200 mg ml−1. Lysozyme powders were prepared by mixing 200 µl buffered lysozyme solution with an equal volume of 100 mM sodium acetate buffered pH 4.5 precipitant solution containing 1 M sodium chloride. The mixed solutions were stirred constantly at room temperature and microcrystalline precipitate formed within 1 h. In addition to the native lysozyme preparation, ligand-bound and unbound samples were prepared by adding 10 mM glucose, NAG or NAG3 to the buffered lysozyme solution prior to precipitation. In total, data were collected from three samples of native lysozyme, two samples of lysozyme mixed with glucose, two samples of lysozyme mixed with NAG and three samples of lysozyme mixed with NAG3. After 24 h, samples were loaded into 1 mm diameter quartz capillaries and centrifuged for 2–3 min, during which some of the amorphous precipitates enhanced the packing of polycrystalline material. A small volume of air placed between the powder and the mother liquor was added to keep the powder packed together and the capillaries were flame-sealed to prevent evaporation. All powder samples were 0.5–0.8 mm in length.
Powder diffraction profiles were measured from a sample of native lysozyme on the high-resolution powder diffraction beamline X3B1 (now X16C) at the National Synchrotron Light Source, Brookhaven National Laboratory (Upton, New York, USA). Data were collected at room temperature using a wavelength of 0.6995 Å. The Ge(111) analyser crystal was scanned over a 13° 2θ range (0.64–13.64°) counting each 0.002° step for 4 s. During data collection the sample was continuously rocked by ±10° to ensure adequate powder averaging. No signs of radiation damage were visible by comparison with a second scan taken at the end of data collection. For the ligand-bound/unbound comparison, diffraction profiles were collected using a 0.001° step size for two regions of 2θ: 4–5° and 13–14°. Pearson correlation coefficients were calculated between the raw diffraction profiles of the different samples. It should be pointed out that data sets from each sample were acquired in the first instance before the collection of duplicate and triplicates in order to eliminate potential correlations over the time of data acquisition.
3. Results and discussion
A typical lysozyme powder sample is shown in Fig. 1. The samples are composed of a mixture of single crystals, polycrystalline aggregates and amorphous precipitate. The majority of the single crystals are less than 15 µm in their largest dimension. The shapes of the single crystals are similar to the shapes of larger crystals of the tetragonal form (space group P43212) of HEWL also grown from sodium chloride precipitant in acetate buffer. The small crystals from the powder preparation, like the larger P43212 crystals, are birefringent. Preliminary inspection of all diffraction profiles revealed peaks at 2θ angles consistent with the predicted Bragg reflections calculated using the (a = b = 79.3, c = 38.0 Å) of the tetragonal form (P43212) of HEWL.
The quality of the lysozyme powder produced was evaluated using stereochemically restrained ) and used the same lysozyme structure obtained from single crystallography (PDB code 1rfp ; Motoshima et al., 1997) as a starting model. The lysozyme structure was refined against the observed diffraction profile and was consistent with P43212. Differences between the calculated and observed profiles are minimal, as shown in Fig. 2. Even at low resolution, where single non-overlapping reflections are observed, high-intensity reflections (e.g. hkl 210) as well as low-intensity reflections (e.g. hkl 201) are fitted well from the The final refined lysozyme structure is consistent with the accepted region of the Ramachandran plot (Ramakrishnan & Ramachandran, 1965). The converged with a final χ2 of 3.85 and root-mean-square deviations on bond lengths and bond angles of 0.012 Å and 1.8°, respectively. These results suggest that the data are of comparable crystalline quality to previously reported work (Von Dreele, 2001).
The protocol used for was similar to the protocol previously described (Von Dreele, 2001In the protein ligand-bound/unbound experiment, the 2θ ranges 4–5° and 13–14° represent regions of the diffraction profile in which minimal and numerous overlapping reflections exist and correspond to Bragg d-spacings of ∼9 and ∼3 Å, respectively. Overplots of all raw diffraction profiles are shown in Fig. 3. Fig. 4 presents all the calculated correlation coefficients between raw diffraction profiles when comparing all samples. The correlation between profiles collected in the 4–5° 2θ range is remarkable. The comparison of duplicates and triplicates of samples with correlation coefficients in the range 0.91–0.98 suggest that the diffraction profiles are similar and polycrystalline powder can be grown reproducibly. The correlation coefficients obtained when comparing native lysozyme and lysozyme + glucose (0.93–0.98) are comparable to those obtained on comparing duplicates/triplicates and suggest that the presence of glucose does not alter the diffraction profile. In contrast, the correlation coefficients obtained in the comparison of lysozyme + NAG with native lysozyme (0.83–0.86) and lysozyme + glucose (0.82–0.84) suggest that the presence of NAG has an effect on the diffraction profile. More striking is the comparison of lysozyme + NAG3 with native lysozyme and lysozyme + glucose, which gives correlation coefficients in the range 0.36–0.44 and that are slightly different from those obtained for lysozyme + NAG (0.43–0.49). These results show that the presence of NAG3 has an effect on the diffraction profile and that this effect is to some extent comparable to that obtained from lysozyme in the presence of NAG. The correlation coefficients obtained here are in good agreement with the expected binding properties of these compounds to lysozyme. A less obvious but similar pattern is discernable when comparing the correlation coefficients at ∼3 Å (13–14° 2θ), suggesting that the diffraction profiles measured in this resolution range are also sample-dependant. The higher correlation coefficients on comparing samples at ∼3 Å are most likely to be the result of lower signal to noise and highly overlapping reflections (Basso et al., 2005; Von Dreele, 2007).
For further HTS applications, small quantities of sample are required prepared in a high-throughput format. Our initial effort reveals that we can reproducibly grow lysozyme powder in a 96-well plate by mixing 10 µl buffered lysozyme with 10 µl precipitate solution. The diffraction profile acquired by irradiating the lysozyme powder directly through the plate possesses similar characteristics to that collected from samples loaded in capillaries. Furthermore, the data-collection time can be reduced for HTS applications by the utilization of area/strip detectors and a more intense beam. Taken together, these results support the idea that the diffraction profile measured from protein powder samples could be exploited in the development of a generic assay for use in HTS applications and chemical genetics. This approach could exploit the known crystallization conditions of a target for powder preparation, e.g. an `orphan' structure solved from a structural genomics effort. It should be pointed out that the approach of comparing raw diffraction profiles would also detect false positives from a change in the unit-cell lattice parameters even if the chemicals do not bind specifically to the protein. Further characterization of the hits using restrained could then be achieved. In view of the size of the individual lysozyme crystals used in this study, single crystallography and microdiffraction (Sanishvili et al., 2008) could also be considered.
Footnotes
‡Present address: Department of Physics, University of Texas at El Paso, El Paso, TX 79968, USA.
Acknowledgements
This work was supported by the Brookhaven National Laboratory/Laboratory Directed Research and Development Program and the National Institutes of Health/National Institute of General Medical Sciences under agreement Y1 GM-0080-03. Use of the National Synchrotron Light Source, Brookhaven National Laboratory was supported by the US Department of Energy, Office of Science, Office of Basic Energy Sciences under Contract No. DE-AC02-98CH10886.
References
Basso, S., Fitch, A. N., Fox, G. C., Margiolaki, I. & Wright, J. P. (2005). Acta Cryst. D61, 1612–1625. Web of Science CrossRef CAS IUCr Journals Google Scholar
D'Amico, K. & Von Dreele, R. B. (2006). US Patent 7130747. Google Scholar
Larson, A. C. & Von Dreele, R. B. (2004). General Structure Analysis System (GSAS). Los Alamos National Laboratory Report LAUR 86-748. Los Alamos National Laboratory, Los Alamos, USA. Google Scholar
Margiolaki, I. & Wright, J. P. (2008). Acta Cryst. A64, 169–180. Web of Science CrossRef CAS IUCr Journals Google Scholar
Motoshima, H., Mine, S., Masumoto, K., Abe, Y., Iwashita, H., Hashimoto, Y., Chijiiwa, Y., Ueda, T. & Imoto, T. (1997). J. Biochem. 121, 1076–1081. CrossRef CAS PubMed Web of Science Google Scholar
Ramakrishnan, C. & Ramachandran, G. N. (1965). Biophys. J. 5, 909–933. CrossRef CAS PubMed Google Scholar
Rietveld, H. M. (1969). J. Appl. Cryst. 2, 65–71. CrossRef CAS IUCr Journals Web of Science Google Scholar
Rupley, J. A., Butler, L., Gerring, M., Hartdegen, F. J. & Pecoraro, R. (1967). Proc. Natl Acad. Sci. USA, 57, 1088–1094. CrossRef CAS PubMed Web of Science Google Scholar
Sanishvili, R., Nagarajan, V., Yoder, D., Becker, M., Xu, S., Corcoran, S., Akey, D. L., Smith, J. L. & Fischetti, R. F. (2008). Acta Cryst. D64, 425–435. Web of Science CrossRef CAS IUCr Journals Google Scholar
Von Dreele, R. B. (1999). J. Appl. Cryst. 32, 1084–1089. Web of Science CrossRef CAS IUCr Journals Google Scholar
Von Dreele, R. B. (2001). Acta Cryst. D57, 1836–1842. Web of Science CrossRef CAS IUCr Journals Google Scholar
Von Dreele, R. B. (2003). Methods Enzymol. 368, 254–267. Web of Science PubMed CAS Google Scholar
Von Dreele, R. B. (2005). Acta Cryst. D61, 22–32. Web of Science CrossRef CAS IUCr Journals Google Scholar
Von Dreele, R. B. (2007). J. Appl. Cryst. 40, 133–143. Web of Science CrossRef CAS IUCr Journals Google Scholar
Von Dreele, R. B., Stephens, P. W., Smith, G. D. & Blessing, R. H. (2000). Acta Cryst. D56, 1549–1553. Web of Science CrossRef CAS IUCr Journals Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.