diffraction structural biology
Ultrahigh-resolution crystallography and related electron density and electrostatic properties in proteins
aLaboratoire de Cristallographie et de Modélisation des Matériaux Minéraux et Biologiques (LCM3B), CNRS ESA 7036, UHP Faculté des Sciences, BP 239, 54506 Vandoeuvre-les-Nancy CEDEX, France
*Correspondence e-mail: claude.lecomte@lcm3b.uhp-nancy.fr
With an increasing number of biological macromolecular crystal structures measured at ultrahigh resolution (1 Å or better), it is necessary to extend to large systems the experimental valence electron density modelling that is applied to small molecules. A database of average multipole populations has been built, describing the electron density of chemical groups in all 20 amino acids found in proteins. It allows calculation of atomic aspherical scattering factors, which are the starting point for MoPro software. It is shown that the use of non-spherical scattering factors has a major impact on crystallographic statistics and results in a more accurate notably in terms of thermal displacement parameters and bond distances involving H atoms. It is also possible to obtain a realistic valence electron density model, which is used in the calculation of the electrostatic potential and energetic properties of proteins.
of the protein electron density, using theThe details observable in protein crystal structures depend on the experimental diffraction resolution. In X-ray diffraction (XRD), the experimental resolution (Å) is defined by d = λ/(2sinθ), where θ is the and λ is the wavelength. At the usual resolution for macromolecular crystallography, 1.6 < d < 3 Å, the number of Fourier components F(h, k, l) of the electron density (ED) as obtained by
is not high enough to estimate all the atomic parameters. In the ultrahigh-resolution case, we will show that bonding interactions and electrostatic properties may be quantified in relation to biological activity.
In contrast to the independent atom model (IAM), where all atoms are assumed to be neutral, spherical and independent, the crystal static valence electron density is modelled by a sum of multipolar pseudo-atom densities, , located at atomic centres (Hansen & Coppens, 1978), while the atomic core density, , remains unchanged,
where the term ρval represents the spherically averaged free atom Hartree Fock valence density. The third term of the summation describes the non-spherical part, in which the radial functions used are of Slater type: Rnl(r) = Nrnlexp(−ξr). The functions ylm are spherical harmonics in real form; the (θ,φ) coordinates are expressed in an atom-centred local axis system, which facilitates the application of chemical similarity. The refinable parameters are the κ and κ′ coefficients, which describe the expansion–contraction of the perturbed valence ED (Coppens et al., 1979), and the population parameters Pval and Plm.
To apply this formalism to ultrahigh-resolution protein diffraction data, we have proposed a multipolar parameters library (Pichon-Pesme et al., 1995; Zarychta et al., 2007). High-resolution XRD data have been collected at the Laboratoire de Cristallographie et de Modélization des Matériaux Minéraux et Biologiques (LCM3B, Nancy, France) on a large group of mono- or to determine precisely the ED distribution of all natural amino acids (neutral or charged). Multipole of the related structure factors allowed the building of an experimental database of multipolar parameters for each protein atom type in a given chemical environment (Jelsch et al., 1998), from which a specific atom-type nomenclature has been developed. The static electron deformation density,
can be calculated from the experimental database in all amino acids and natural
The database parameters were shown to be transferable to the protein amino acids and permit the calculation of aspherical atomic scattering factors to be used for protein refinement.The validity of these aspherical scattering factors was checked more than ten years ago (Jelsch et al., 1998) and has been confirmed since then by several studies. These factors have been successfully used to refine ultrahigh-resolution protein structures: a scorpion toxin (Housset et al., 2000), crambin (Jelsch et al., 2000) and hAR (Muzet et al., 2003). As shown in these papers, the use of aspherical scattering factors improves all least-squares statistical indices and, consequently, leads to more physically meaningful bond distances and thermal anisotropic displacement parameters. Aspherical features, such as, for instance, non-bonded density on a C=O O atom, are also taken into account.
This finding led us to the development of the MoPro package of crystallographic programs (Guillot et al., 2001; Jelsch et al., 2005). In MoPro, for any chemical type of atom belonging to a protein-type molecule, automatic assignment of the corresponding aspherical scattering factor is performed, and these scattering factors can then be further used in the process. Hence, a stereochemical and anisotropic thermal motion parameters following the assignment of aspherical scattering factors can be performed on protein atoms, leading to the deconvolution of thermal motion and valence electron density deformation effects. This allows the computation of dynamical deformation valence electron density, smeared out by thermal vibrations, as routinely performed in small-molecule crystallography. Fig. 1 shows an example of a dynamical deformation computed in the plane of a well ordered peptide plane of the 0.62 Å structure of an eel pout type III antifreeze protein RD1 (Ko et al., 2003; Protein Data Bank code 1ucs). The bonding electron density features are clearly visible on each as well as electron density peak accumulation corresponding to the carbonyl O atom electron lone pairs.
Such a generalized charge density as the aspherical scattering factors can be fixed parameters (as in the above example), should also be generalized to all small molecules and protein refinements in the future. Furthermore, as shown below, this atom transferability allows estimation of electrostatic properties at least as accurately as when usual force fields are used (Muzet et al., 2003).
which is not necessarily aAnother approach, based on high-level theoretical calculations, was later proposed by Koritsanszky et al. (2002; SUNY Buffalo database). The aim of their theoretical database is different from ours because the also transferable multipolar parameters will be used to estimate properties such as protein–protein or protein–ligand interaction energies based on theoretical ED reconstruction, while our approach is based on experimental ultrahigh-resolution XRD data. A comparison between our experimental database and the SUNY Buffalo theoretical database is in progress (Pichon-Pesme et al., 2004) and will be published soon. A similar project has been proposed by Dittrich et al. (2004).
To summarize, we have at our disposal an experimental database that contains a set of transferable multipolar pseudo-atoms that can be used for protein ).
and electrostatic properties estimation. These transferable pseudo-atoms have their own topology and may therefore be considered as experimental topological atoms as defined by Bader (1990Acknowledgements
Drs A. Podjarny and Virginie Pichon-Pesme are gratefully acknowledged. AL and BF thank the French Ministry of Research for a doctoral fellowship.
References
Bader, R. F. W. (1990). Atoms in Molecules: A Quantum Theory. Oxford University Press.
Coppens, P., Guru Row, T. N., Leung, P., Stevens, E. D., Becker, P. J. & Yang, Y. W. (1979). Acta Cryst. A35, 63–72. CrossRef CAS IUCr Journals Web of Science
Dittrich, B., Koritsanszky, T. & Lüger, P. (2004). Angew. Chem. Int. Ed. 43, 2718–2721. Web of Science CSD CrossRef CAS
Guillot, B., Viry, L., Guillot, R., Lecomte, C. & Jelsch, C. (2001). J. Appl. Cryst. 34, 214–223. Web of Science CrossRef CAS IUCr Journals
Hansen, N. K. & Coppens, P. (1978). Acta Cryst. A34, 909–921. CrossRef CAS IUCr Journals Web of Science
Housset, D., Benabicha, F., Pichon-Pesme, V., Jelsch, C., Maierhofer, A., David, S., Fontecilla-Camps, J. C. & Lecomte, C. (2000). Acta Cryst. D56, 151–160. Web of Science CrossRef CAS IUCr Journals
Jelsch, C., Guillot, B., Lagoutte, A. & Lecomte, C. (2005). J. Appl. Cryst. 38, 38–54. Web of Science CrossRef IUCr Journals
Jelsch, C., Pichon-Pesme, V., Lecomte, C. & Aubry, A. (1998). Acta Cryst. D54, 1306–1318. Web of Science CrossRef CAS IUCr Journals
Jelsch, C., Teeter, M. M., Lamzin, V., Pichon-Pesme, V., Blessing, R. H. & Lecomte, C. (2000). Proc. Natl Acad. Sci. USA, 97, 3171–3176. Web of Science CrossRef PubMed CAS
Ko, T. P., Robinson, H., Gao, Y. G., Cheng, C. H., DeVries, A. L. & Wang, A. H. (2003). Biophys. J. 84, 1228–1237. Web of Science CrossRef PubMed CAS
Koritsanszky, T., Volkov, A. & Coppens, P. (2002). Acta Cryst. A58, 464–472. Web of Science CrossRef CAS IUCr Journals
Muzet, N., Guillot, B., Jelsch, C., Howard, E. & Lecomte, C. (2003). Proc. Natl Acad. Sci. USA, 100, 8742–8747. Web of Science CrossRef PubMed CAS
Pichon-Pesme, V., Jelsch, C., Guillot, B. & Lecomte, C. (2004). Acta Cryst. A60, 204–208. Web of Science CrossRef CAS IUCr Journals
Pichon-Pesme, V., Lecomte, C. & Lachekar, H. (1995). J. Phys. Chem. 99, 6242–6250. CrossRef CAS Web of Science
Zarychta, B., Pichon-Pesme, V., Guillot, B., Lecomte, C. & Jelsch, C. (2007). Acta Cryst. A63, 108–125. Web of Science CrossRef CAS IUCr Journals
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.