research papers
Analysis of errors in the
of MsbAaDepartment of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
*Correspondence e-mail: pjeffrey@princeton.edu
The determination of incorrect structures for the ABC transporter MsbA gave rise to questions of how this could have occurred. Methodological aspects of the MsbA
are explored in light of this error.Keywords: protein crystallography; methodology.
1. Introduction
Evidence of significant errors in the crystal structures of the transporters MsbA and EmrE (Dawson & Locher, 2006; Tate, 2006) led to the retraction of five structure papers (Chang et al., 2006; Chang, 2007; Ma & Chang, 2007), the so-called `great pentaretraction' (Miller, 2007). These structures corresponded to Protein Data Bank (PDB; Berman et al., 2000) entries 1jsq , 1pf4 and 1z2r for MsbA, and 1s7b and 2f2m for EmrE, which were moved to the PDB obsolete archive. These retractions generated much discussion (Miller, 2006, 2007; Petsko, 2007; Matthews, 2007; Jones & Kleywegt, 2007), at least in part because three of the retracted papers were published in a prominent journal in the field of structural biology (Chang & Roth, 2001; Chang, 2003; Reyes & Chang, 2005; Ma & Chang, 2004; Pornillos et al., 2005).
In an attempt to understand how such incorrect structure determinations could occur, examples and conclusions will be presented based on test data and the published structure papers. MsbA and EmrE are unrelated in structure but they shared the same pathology during et al., 2007; Ward et al., 2007).
and the conclusions apply to both structures. The corrected structures were published in late 2007 (Chen2. The initial error
It has been indicated (Chang et al., 2006) that the initial error in the structure determinations was the accidental inversion of the sign of the anomalous difference in a data-conversion step that converted experimental intensity (I) values to structure-factor moduli (F). Specifically, I(h, k, l) and I(−h, −k, −l) were converted to |F(−h, −k, −l)| and |F(h, k, l)|, respectively. This conversion utilized an in-house program for which the source code was unavailable (G. Chang, personal communication).
The anomalous difference (Δano) can be expressed as |F(h, k, l)| − |F(−h, −k, −l)|. Negation of this anomalous difference is equivalent to a centrosymmetric misassignment of the from (h, k, l) to (−h, −k, −l), changing the hand of the reciprocal-lattice indexing. This is unlikely, but possible, during the data-processing step by a misstatement of the detector or goniostat geometry.
In experimental SAD or ). However, it is possible to obtain a superficially interpretable map, albeit a centrosymmetrically inverted one, if an inverted heavy-atom is matched with data with an inverted anomalous sign (Wang et al., 2007).
the combination of inverted anomalous data with the correct heavy-atom substructure leads to an uninterpretable map, as does combination of the centrosymmetrically inverted with anomalous data of the correct sign (Matthews, 20073. examples
The consequences of inverting the sign of the anomalous signal during phase determination have been discussed by Matthews (2007) with reference to the MsbA and EmrE cases. To test how far one might practically proceed with data sets that had inverted anomalous signals, two example MAD data sets at high and low resolution were selected. In both cases the SHELX program suite (Sheldrick, 2008) was used to determine heavy-atom locations and calculate experimental phases. A simple program was written to invert the anomalous sign in otherwise unmodified files output by SCALEPACK (Otwinowski & Minor, 1997) prior to using SHELXC.
In this protocol SHELXC was used to calculate an improved magnitude estimate of the FA, SHELXD determined the heavy-atom and SHELXE calculated phases from the and the MAD data to produce phases corresponding to an electron-density map modified by solvent flattening. The heavy-atom determined by SHELXD is inherently ambiguous, with two possible solutions being equally consistent with the data: one with coordinates at (x, y, z) and the other with centrosymmetrically inverted coordinates at (−x, −y, −z). In favorable cases there is a clear distinction between the correct solution and the incorrect solution during the phasing process in SHELXE, with one heavy-atom producing an interpretable map and the other producing a nonsense map.
3.1. at high resolution
The DED domain of MC159 was a straightforward 1.8 Å resolution multiwavelength et al., 2006). MC159 crystals grew in P212121, with unit-cell parameters a = 35.10, b = 63.50, c = 76.46 Å. yielded a very interpretable experimental map that was amenable to automatic building using ARP/wARP (Perrakis et al., 1999). SHELX was used to prepare the data, find the heavy-atom of four seleniums, generate experimental phases and perform solvent flattening with an assumed solvent content of 35% by volume. The same phasing protocol was run using the original data and using the same data with inverted anomalous signal. The results are summarized in Table 1.
(MAD) using selenomethionine-labeled protein (Li
|
The anomalous signal statistics in SHELXC were essentially identical for the unmodified and inverted data because the magnitudes of the anomalous differences were maintained; only their sign was altered. During the heavy-atom substructure-determination step in SHELXD the for weak reflections was similar in both cases, as were both the contrast and pseudo-free correlation coefficients during the phasing and solvent-flattening steps in SHELXE. These statistics are often those that prove to be most useful for monitoring success in experimental phasing. For these statistical values, the correct and inverted anomalous sign data were almost indistinguishable. Table 1 shows the results for SHELXE runs using the sites found by SHELXD with and without inversion applied, as there is an inherent centrosymmetric ambiguity in the sites located by SHELXD. The heavy-atom substructures corresponding to unmodified and inverted data were found to be related by inversion after compensation for an alternative origin choice and a crystallographic symmetry operator. They were also superimposible and therefore the heavy-atom substructures were consistent apart from the inversion. The experimental phases led to electron-density maps (Fig. 1) that were essentially identical in terms of quality except that they were centrosymmetrically inverted with respect to each other.
This situation was not found to be unique to the use of SHELX for experimental phasing and this same behavior could be reproduced with SOLVE/RESOLVE (Terwilliger & Berendzen, 1999; data not shown).
3.2. at low resolution
The et al., 2007) was an example that was somewhat more representative of the lower resolution of the MsbA and EmrE studies. S2P crystals formed in R3 and were indexed in the hexagonal setting H3 with unit-cell parameters a = b = 123.86, c = 136.47 Å. Experimental MAD data to a maximum resolution of 3.8 Å were collected from selenomethionine-labeled protein. The same protocol was used to generate experimental phases for S2P as was used in the MC159 example above. While the 3.8 Å resolution MAD map predictably lacked the clarity of the 1.8 Å resolution map of MC159, the hand of the helices and many of the side chains are clearly visible in the map. If the same phases are used to calculate a map at the 4.5 Å resolution of the original Escherichia coli MsbA study (Chang & Roth, 2001) the hand of the helices becomes very difficult to detect.
of the intramembrane protease S2P (FengThe results of and as with the higher resolution example there was no distinction on statistical criteria alone between solutions based on data with the correct and inverted anomalous signs. Use of the program SHARP (de La Fortelle & Bricogne, 1997) considerably improved the interpretability of the electron-density maps from both the correct and inverted anomalous data with similar phasing statistics (data not shown). The experimentally phased electron-density maps were similar in superficial interpretability, including the detection of α-helices. Since the heavy-atom from the inverted anomalous signal was the centrosymmetric inverse of the corrected anomalous data, the phases from one solution were the approximate negative of the other. The experimental maps were also the centrosymmetric inverse of each other.
are summarized in Table 13.3. Conclusions from the test cases
The use of conventional phasing protocols and programs on data with an inverted anomalous sign gives rise to phases that have essentially the same statistical quality indicators as phases derived from data with the correct sign. This is unlikely to be influenced by choice of program employed or changes in phasing protocol beyond the initial inversion event. The experimental phasing process applied to the inverted anomalous data first gives rise to a heavy-atom
determination that is centrosymmetrically inverted with respect to the correct one. Combining the inverted data with the inverted during the phase-calculation step leads to phases that are the negative of the correct ones (`inverted phases') in the absence of obscuring factors such as alternative choices of origin.It is noteworthy that the inverted anomalous data and corresponding inverted phases are self-consistent: phased anomalous difference maps gave rise to positive peaks at the inverted Z-score statistics using Phaser (Storoni et al., 2004; data not shown).
locations. Along parallel lines, it also proved possible to solve a structure by in the contrived case of having a centrosymmetrically inverted structure of S2P as a model with essentially the same log-likelihood gain and4. Compounding the error
Although one can obtain a superficially interpretable electron-density map from phases derived from data with an inverted anomalous signal, this map will resist interpretation in terms of protein structure of conventional geometry. It would be difficult not to detect this error in high-resolution cases with high-quality experimental phases. In the retractions (Chang et al., 2006; Chang, 2007; Ma & Chang, 2007) the authors identified two errors with the structures: the model was fitted into an inverted map with a model of non-inverted geometry and connectivity errors in the interpretation led to incorrect topology.
4.1. Chirality
The relatively low resolutions of the native data used in MsbA E. coli and Salmonella typhimurium, respectively) made detection of the map inversion error difficult. Chang & Roth (2001) characterized the experimental MsbA phases as having `…yielded electron-density maps of excellent quality for tracing a polypeptide chain' and the reported figure of merit for the phases was 0.7. averaging and sharpening of the data were apparently insufficient to resolve features that would have indicated that the map was calculated on the incorrect hand.
(4.5 and 4.2 Å forModels with conventional geometry were built into electron-density maps that were inconsistent with this geometry. The incorrect structure can be compared with the correct one upon centrosymmetric inversion of the incorrect structure to put them both in the same hand. Comparison of these structures show that the locations of many of the secondary-structure elements were comparable, allowing inversion. Although the locations of the helices correspond, the polypeptide backbones do not overlay (Fig. 2).
4.2. Sequence assignment
Each of the structures of MsbA and EmrE had experimental MAD or SAD data associated with them; however, these were obtained by soaking osmium, mercury or arsenic compounds into the crystals. There were no data from selenomethionine-labeled protein. Selenomethionine can greatly assist sequence assignment and can be a sensitive test for topology errors (Hunte et al., 2005). Broken density in the loops that interconnected the secondary-structure elements also gave rise to incorrect topology connections (Chang et al., 2006). As an incorrect model was being built into an inverted map, it is not surprising that the topology would also be built incorrectly, making a correct sequence assignment impossible.
4.3. Refinement
The models of MsbA and EmrE were built into inverted maps and with incorrect topology and therefore essentially all the atom locations were wrong. Optimization of the model during
would at best fit the low-resolution features of the map with cylindrical helix density from the model overlapping that of the inverted map. Higher resolution details arising from detailed atomic positions would be impossible to reproduce accurately.This was reflected in the relatively high free R factors for the refined structures as conventional single-model representations for MsbA and EmrE (Table 2). This behavior was rationalized as being a consequence of intrinsic crystal disorder (Chang & Roth, 2001) and multicopy (Pellegrini et al., 1997) was used in to reduce the free R factor. Multicopy replaces the conventional single-model description of a protein structure with an ensemble of non-interacting copies of the model in order to better fit the experimental data. This method has the potential to model disorder and motion not amenable to the single-model description of electron density. However, for an N-copy multicopy the observation-to-parameter ratio is decreased N-fold. For the structure of E. coli MsbA an ensemble of 16 copies of the were refined simultaneously, with noncrystallographic symmetry restraints applied within each set (Chang & Roth, 2001). Since observation-to-parameter ratios are already particularly poor at 4.5 Å resolution, this made a bad situation considerably worse and led to overfitting. Chen & Chapman (2001) have indicated that multicopy refinements show signs of overfitting even at significantly higher resolutions.
‡Root-mean-square deviation between ideal and observed bond-length stereochemistry. §Average B value of the model. |
Table 2 shows that when multicopy was employed with E. coli MsbA the difference between the working-set R factor and test-set R factor increases from 7 to 11%, but the free R factor does decrease by 7%. Although the multicopy represents a better model for the data as atoms are allowed to wander away from the single-model representation, it was not realised at the time that this was because the structure was wrong instead of modelling crystal disorder. The relatively large average deviation of the multicopy models from the mean atomic position is illustrated in supplementary Fig. 1 in Chang & Roth (2001).
Another factor that biased the free R factor was the selection of the test-set reflections in a random manner. In cases where there is significant the random choice of the free R-factor set introduces relationships between the working set and test set that depend on the number of molecules related by and the extent to which they are similar to each other (Fabiola et al., 2006). This cross-talk biases the free R factor to lower values and can be reduced by selecting test-set reflections in shells of constant resolution. However, it is not a simple matter to quantify the magnitude of this effect.
The third factor was the systematic omission of weak data during σ(F) cutoff to the data. This is illustrated by the R factors reported for the corrected structures of MsbA (Ward et al., 2007) refined with either 2σ(F) or 0σ(F) cutoffs. R factors were reduced by up to 4%, while at the same time up to 32% of the weakest reflections were discarded (Table 3). Although the structures had superficially better agreement with the data that remained, it is likely that they were correspondingly less accurate by failing to incorporate the measured weaker data in refinement.
by applying a 2
‡The R factor is R = for working-set data; the free R factor is the same quantity calculated for the test-set reflections. §Root-mean-square deviation between ideal and observed bond-length stereochemistry. ¶Average B value of the model. ††Proportion of residues lying in the most favored region of the Ramachandran plot as calculated by PROCHECK (Laskowski et al., 1993). |
5. Detecting wrong structures at low resolution
The free R factors of the single-model refinements of MsbA (Table 2) were high enough to warrant skepticism as to the accuracy of the structures. The free R factor is of great utility in assessing the overall quality of macromolecular structures (Brünger, 1997), but it is not foolproof. Use of multicopy and a random choice of test-set reflections reduced the Rfree to a value that was not unreasonable considering the resolution. The free R factor of the MsbA multicopy was not outrageously high compared with other PDB files of comparable resolution: as of December 2007, crystal structures in the PDB with resolutions between 4.0 and 4.5 Å had a mean free R factor of 33% with a wide range of variation between 22 and 45%. The free R factor of the single-model lay at the upper end of this range (Table 2), which should have been a warning sign.
Of particular interest is the observation that even with the corrected structures the free R-factor values for the single model are sometimes similar to those of multicopy refinements of the incorrect structures (Tables 3 and 4). The combination of and multicopy substantially compromised the effectiveness of the free R factor as a measure of structure quality. This suggests that in at least some cases the free R factor has its limitations as a structure-quality indicator and is vulnerable to a certain amount of manipulation to achieve values that would allow publication. The agreement of the model with experimental and model-phased electron density is an extremely important factor that was neglected in this case and which is less trivial to present in a paper.
‡The R factor is R = for working-set data; the free R factor is the same quantity calculated for the test-set reflections. §Root-mean-square deviation between ideal and observed bond-length stereochemistry. ¶Average B value of the model. ††Proportion of residues lying in the most favored region of the Ramachandran plot as calculated by PROCHECK (Laskowski et al., 1993). |
Although each subsequent structure of MsbA and EmrE was associated with experimental phases (Chang, 2003; Reyes & Chang, 2005; Pornillos et al., 2005), it is evident that the previously determined topologies were biasing the interpretation of the subsequent experimental electron-density maps. It is noteworthy that the subsequent structures were not determined by despite the availability of these prior structures.
5.1. Conserved substructures
One possibility to test for incorrect structures utilizes domains or subdomains of conserved structure. Such potential existed in MsbA, where the ATPase domain was similar to that of the ATPase domain of S. typhimurium histidine permease (HisP; Hung et al., 1998). For E. coli MsbA, alignments of HisP onto the structurally homologous domain gave a root-mean-square deviation (r.m.s.d.) of 1.5 Å for 90 equivalent Cα atoms for the correct structure. For the incorrect structure the same alignment procedure gave an r.m.s.d. of 1.6 Å for 44 Cα atoms. However, the converse is true for the case of S. typhimurium MsbA: the correct structure has an r.m.s.d. of 1.5 Å for 176 equivalent Cα atoms and the incorrect structure has an r.m.s.d. of 1.3 Å for 195 Cα atoms. This test does not prove to be sensitive for an incorrect structure, although the result may be biased because the sequence homology was known before the models of MsbA were built.
5.2. Validation and data deposition
Critical assessment of structure quality requires access to all the coordinates and at a minimum to the native data used during ; Joosten & Vriend, 2007). This policy has been actively advocated for a number of years, yet some of the MsbA and EmrE structures were deposited as only the Cα atoms, including the coordinate sets corresponding to the corrected structures (Chen et al., 2007; Ward et al., 2007). This substantially undermines the ability to independently assess structure quality, as was clearly necessary in this case.
(Jones & Kleywegt, 20076. Conclusions
The original error causing the inversion of the anomalous sign was a simple mistake (Chang et al., 2006). The examples in this paper demonstrate that it is trivial to propagate this error to the point of generating an experimental electron-density map with superficially acceptable phasing statistics. A straightforward way to minimize these types of errors is to make use of widely deployed and well tested program suites, as exemplified by the CCP4 suite itself (Collaborative Computational Project, Number 4, 1994).
The error was independently repeated for both the MsbA and EmrE structures and should have been caught at the point of map interpretation. The low resolution of the structural studies hampered the interpretation, but these resolutions are by no means unprecedented. It is not easy to understand how the error evaded detection over the course of five papers in four years (Chang & Roth, 2001; Chang, 2003; Ma & Chang, 2004; Reyes & Chang, 2005; Pornillos et al., 2005) even if one allows that existing structures bias the interpretation of subsequent structures. The aggressive use of novel techniques, an ill-advised method of selecting test-set reflections and the habit of truncating the data in at 2σ(F) to remove the weak data all played a role in giving rise to acceptable (Table 2). A focus on getting the free R factor into the `publishable range' appears to have been a larger factor than careful assessment of how the model fitted the experimental electron density.
Low-resolution structures are especially challenging from a technical point of view, but are often very rewarding in the amount of biological insight that they reveal. In the case of MsbA and EmrE the potential impact of the structures appears to have overwhelmed considerations as to their accuracy. Despite advancing standards and tools for quality control of macromolecular structures, the fundamentally incorrect atomic models of MsbA and EmrE made their way into the literature multiple times. The impact of these incorrect structures has extended beyond embarrassment for the scientists concerned and affected research in other laboratories (Petsko, 2007).
Ultimately, the incorrect structures of MsbA and EmrE were uncovered by the careful work of others (Dawson & Locher, 2006; Tate, 2006). The lesson for the rest of us may well be that we can never take too much care with our own structure determinations, lest we attract equally unflattering attention.
Acknowledgements
The author would like to thank M. Neiditch, F. Hughson and G. Kleywegt for thought-provoking discussions.
References
Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N. & Bourne, P. E. (2000). Nucleic Acids Res. 28, 235–242. Web of Science CrossRef PubMed CAS Google Scholar
Brünger, A. T. (1997). Methods Enzymol. 277, 366–396. CrossRef PubMed CAS Web of Science Google Scholar
Chang, G. (2003). J. Mol. Biol. 330, 419–430. Web of Science CrossRef PubMed CAS Google Scholar
Chang, G. (2007). J. Mol. Biol. 369, 596. Web of Science CrossRef PubMed Google Scholar
Chang, G. & Roth, C. B. (2001). Science, 293, 1793–1800. Web of Science CrossRef PubMed CAS Google Scholar
Chang, G., Roth, C. B., Reyes, C. L., Pornillos, O., Chen, Y. J. & Chen, A. P. (2006). Science, 314, 1875. CrossRef PubMed Google Scholar
Chen, Y. J., Pornillos, O., Lieu, S., Ma, C., Chen, A. P. & Chang, G. (2007). Proc. Natl Acad. Sci. USA, 104, 18999–19004. Web of Science CrossRef PubMed CAS Google Scholar
Chen, Z. & Chapman, M. S. (2001). Biophys. J. 80, 1466–1472. CrossRef PubMed CAS Google Scholar
Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760–763. CrossRef IUCr Journals Google Scholar
Dawson, R. J. & Locher, K. P. (2006). Nature (London), 443, 180–185. Web of Science CrossRef PubMed CAS Google Scholar
Fabiola, F., Korostelev, A. & Chapman, M. S. (2006). Acta Cryst. D62, 227–238. Web of Science CrossRef CAS IUCr Journals Google Scholar
Feng, L., Yan, H., Wu, Z., Yan, N., Wang, Z., Jeffrey, P. D. & Shi, Y. (2007). Science, 318, 1608–1612. Web of Science CrossRef PubMed CAS Google Scholar
Hung, L.-W., Wang, I. X., Nikaido, K., Liu, P.-Q., Ames, G. F.-L. & Kim, S.-H. (1998). Nature (London), 396, 703–707. Web of Science CAS PubMed Google Scholar
Hunte, C., Screpanti, E., Venturi, M., Rimon, A., Padan, E. & Michel, H. (2005). Nature (London), 435, 1197–1202. Web of Science CrossRef PubMed CAS Google Scholar
Jones, T. A. & Kleywegt, G. J. (2007). Science, 317, 194–195. CrossRef PubMed CAS Google Scholar
Joosten, R. P. & Vriend, G. (2007). Science, 317, 195–196. Web of Science CrossRef PubMed CAS Google Scholar
Kraulis, P. J. (1991). J. Appl. Cryst. 24, 946–950. CrossRef Web of Science IUCr Journals Google Scholar
La Fortelle, E. de & Bricogne, G. (1997). Methods Enzymol. 276, 472–494. Google Scholar
Laskowski, R. A., MacArthur, M. W., Moss, D. S. & Thornton, J. M. (1993). J. Appl. Cryst. 26, 283–291. CrossRef CAS Web of Science IUCr Journals Google Scholar
Li, F. Y., Jeffrey, P. D., Wu, J. W. & Shi, Y. (2006). J. Biol. Chem 281, 2960–2968. Web of Science CrossRef PubMed CAS Google Scholar
Ma, C. & Chang, G. (2004). Proc. Natl. Acad. Sci. USA, 101, 2852–2857. Web of Science CrossRef PubMed CAS Google Scholar
Ma, C. & Chang, G. (2007). Proc. Natl. Acad. Sci. USA, 104, 3668. Web of Science CrossRef PubMed Google Scholar
Matthews, B. W. (2007). Protein Sci. 16, 1013–1016. Web of Science CrossRef PubMed CAS Google Scholar
Merritt, E. A. & Murphy, M. E. P. (1994). Acta Cryst. D50, 869–873. CrossRef CAS Web of Science IUCr Journals Google Scholar
Miller, C. (2007). Science, 315, 459. CrossRef PubMed Google Scholar
Miller, G. (2006). Science, 314, 1856–1857. Web of Science CrossRef PubMed CAS Google Scholar
Otwinowski, Z. & Minor, W. (1997). Methods Enzymol. 276, 307–326. CrossRef CAS Web of Science Google Scholar
Pellegrini, M., Gronbech-Jensen, N., Kelly, J. A., Pfleugel, G. M. & Yeates, T. O. (1997). Proteins, 29, 426–432. CrossRef CAS PubMed Google Scholar
Perrakis, A., Morris, R. M. & Lamzin, V. S. (1999). Nature Struct. Biol. 6, 458–463. Web of Science CrossRef PubMed CAS Google Scholar
Petsko, G. A. (2007). Genome Biol. 8, 103. Web of Science CrossRef PubMed Google Scholar
Pornillos, O., Chen, Y. J., Chen, A. P. & Chang, G. (2005). Science, 310, 1950–1953. Web of Science CrossRef PubMed CAS Google Scholar
Reyes, C. L. & Chang, G. (2005). Science, 308, 1028–1031. Web of Science CrossRef PubMed CAS Google Scholar
Schneider, T. R. & Sheldrick, G. M. (2002). Acta Cryst. D58, 1772–1779. Web of Science CrossRef CAS IUCr Journals Google Scholar
Sheldrick, G. M. (2008). Acta Cryst. A64, 112–122. Web of Science CrossRef CAS IUCr Journals Google Scholar
Storoni, L. C., McCoy, A. J. & Read, R. J. (2004). Acta Cryst. D60, 432–438. Web of Science CrossRef CAS IUCr Journals Google Scholar
Tate, C. G. (2006). Curr. Opin. Struct. Biol. 16, 457–464. Web of Science CrossRef PubMed CAS Google Scholar
Terwilliger, T. C. & Berendzen, J. (1999). Acta Cryst. D55, 849–861. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wang, J., Wlodawer, A. & Dauter, Z. (2007). Acta Cryst. D63, 751–758. Web of Science CrossRef IUCr Journals Google Scholar
Ward, A., Reyes, C. L., Yu, J., Roth, C. B. & Chang, G. (2007). Proc. Natl Acad. Sci. USA, 104, 19005–19010. Web of Science CrossRef PubMed CAS Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.