S-Adenosyl-S-carboxymethyl-l-homocysteine: a novel cofactor found in the putative tRNA-modifying enzyme CmoA

The putative methyltransferase CmoA is involved in the nucleoside modification of transfer RNA. X-ray crystallography and mass spectrometry are used to show that it contains a novel SAM derivative, S-adenosyl-S-carboxymethyl-l-homocysteine, in which the donor methyl group is replaced by a carboxymethyl group.

Uridine at position 34 of bacterial transfer RNAs is commonly modified to uridine-5-oxyacetic acid (cmo 5 U) to increase the decoding capacity. The protein CmoA is involved in the formation of cmo 5 U and was annotated as an S-adenosyl-lmethionine-dependent (SAM-dependent) methyltransferase on the basis of its sequence homology to other SAMcontaining enzymes. However, both the crystal structure of Escherichia coli CmoA at 1.73 Å resolution and mass spectrometry demonstrate that it contains a novel cofactor, S-adenosyl-S-carboxymethyl-l-homocysteine (SCM-SAH), in which the donor methyl group is substituted by a carboxymethyl group. The carboxyl moiety forms a salt-bridge interaction with Arg199 that is conserved in a large group of CmoA-related proteins but is not conserved in other SAMcontaining enzymes. This raises the possibility that a number of enzymes that have previously been annotated as SAMdependent are in fact SCM-SAH-dependent. Indeed, inspection of electron density for one such enzyme with known X-ray structure, PDB entry 1im8, suggests that the active site contains SCM-SAH and not SAM.

Introduction
Following transcription by RNA polymerase, a transfer RNA transcript is converted into a mature tRNA through processing and nucleoside modification. Processing in bacteria entails the removal of the 5 0 leader and 3 0 tail sequences and, if necessary, the re-synthesis of the 3 0 CCA extension required for aminoacylation. Nucleoside modification is part of the maturation process that extends the physicochemical properties of tRNA by providing a wider complement of nucleosides than the canonical four introduced by RNA polymerase during transcription. Approximately 10% of nucleosides in a typical tRNA are modified, which corresponds to around seven modified nucleosides per tRNA (Jü hling et al., 2009). Of the 109 modified nucleosides of RNA, 93 are found in tRNA, making it the most diversely modified RNA (Cantara et al., 2011). Modified nucleosides confer diverse properties upon tRNA, but they generally fine-tune the structure and stability such that it is optimized for functioning in processes such as aminoacylation and translation (Motorin & Helm, 2010).
The anticodon stem loop (ASL) is one of the most heavily modified regions of tRNA and is frequently modified at positions 34 (the wobble position) and 37 (immediately after the anticodon) (Jü hling et al., 2009). Modification of the ASL reduces its flexibility and shifts its structure towards the U-turn conformation; this has been seen in crystal structures of mature tRNA, solution structures of isolated modified ASL domains and structures of the 70S ribosome in complex with tRNA and mRNA (Selmer et al., 2006;Shi & Moore, 2000;Vendeix et al., 2008). Because modifications impose this conformation upon the ASL, the entropic penalty otherwise associated with remodelling during translation is avoided (Agris, 2008). A number of nucleoside modifications at positions 34 and 37 also expand the decoding capacity beyond that predicted by the 'Wobble hypothesis' (Agris et al., 2007). In bacteria, the 5-oxyuridine derivatives commonly found at position 34 fall into this category because they allow the tRNA to decode not only codons ending in A and G (as predicted by the Wobble hypothesis), but also U and C, so that a single tRNA can consequently decode all four codons within a single box of the genetic code (Nasvall et al., 2004). The derivative uridine-5-oxyacetic acid (cmo 5 U) has so far been found in tRNA Ala , tRNA Pro , tRNA Ser and tRNA Val (Jü hling et al., 2009). These modifications are therefore beneficial because they not only reduce the entropic penalty associated with translation but also reduce the number of different tRNAs that a cell must produce for decoding.
While the function of cmo 5 U within the context of a tRNA is well characterized, its synthesis has received relatively little attention. The proposed biosynthetic pathway involves the stepwise modification of U to cmo 5 U through the intermediates 5-hydroxyuridine (ho 5 U) and 5-methoxyuridine (mo 5 U) (Fig. 1). In keeping with the majority of other nucleoside-modification pathways, it is likely that the substrate base is modified while remaining part of the tRNA and is not simply exchanged for a pre-modified base in a transglycosylation reaction. Mutations in the cmoB and cmoA genes result in accumulation of ho 5 U and mo 5 U, respectively, indicating that CmoB is involved in the modification of ho 5 U to mo 5 U and that CmoA is involved in the modification of mo 5 U to cmo 5 U (Nasvall et al., 2004). Both CmoA and CmoB contain S-adenosylmethionine (SAM) binding motifs, hinting that they are methyltransferases, but only one of the two C atoms in the side chain of cmo 5 U is derived from SAM (Hagervall et al., 1990). Furthermore, the synthesis of 5-oxyuridine derivatives is also dependent upon chorismic acid, although the nature of this dependency has not yet been determined (Hagervall et al., 1990;Nasvall et al., 2004). This suggests that the modification pathway has not been fully elucidated or that parts of the cmo 5 U side chain may be derived from other metabolites.
We decided to investigate the functions of CmoA and CmoB in detail by X-ray crystallography to help fill the gaps in our understanding of cmo 5 U biosynthesis. Although CmoA was expected to be similar to the known structure of its homologue Haemophilus influenzae YecO, there is no sufficiently high-quality model of CmoB in the Protein Data Bank to form structurebased hypotheses about the function of the system. We hope to rectify this by providing high-quality structures of both proteins from the same target organism, allowing detailed models of the enzymatic pathway and RNA interactions to be constructed and further tested. Here, we report the structure of Escherichia coli CmoA, which unexpectedly reveals a cofactor that, to the best of our knowledge, has not been observed before.

Molecular biology and protein production
The coding sequence cmoA was amplified from OmniMax II cells (Invitrogen) and cloned into the vector pOPINF using the In-Fusion method to generate the construct OPPF7299 (Berrow et al., 2007). The final construct has an R100H point mutation with respect to the deposited sequence of CmoA from E. coli K-12 strain MG1655 (UniProt P76290), which may be either a PCR mutation or a genuine difference in this strain. Sequence analysis shows that amino acids with diverse properties are found at this position in other UniProt CmoAfamily members. Once the X-ray structure had been determined, it became clear that this residue is located on the surface of the protein on the side opposite to the dimer interface and at a distance of $15 Å from the nearest atom of the SCM-SAH cofactor. E. coli Rosetta pLysS (DE3) cells were transformed with the resulting vector and grown in Overnight Express Instant TB medium (Merck). The cells were incubated at 310 K until an OD 600 nm of 0.6 was attained, at which point the temperature was reduced to 298 K and the cells were grown for a further 20 h. The cells were then harvested by centrifugation and stored at 193 K.

Protein purification
The cells were resuspended in lysis buffer [500 mM NaCl, 50 mM Tris pH 7.5, 30 mM imidazole, 0.2%(v/v) Tween], lysed The proposed modification pathway of 5-oxyuridine derivatives. CmoA has been implicated in the modification of mo 5 U to cmo 5 U (Nasvall et al., 2004), although this reaction involves more than the addition of a single methyl group, indicating that either additional enzymes and/or cofactors are involved. No enzymes involved in the conversion of U to ho 5 U or cmo 5 U to mcmo 5 U have been identified.
using a Basic Z cell disruptor (Constant Systems) and clarified by centrifugation. The supernatant was loaded onto a 1 ml HisTrap FF column (GE Healthcare) equilibrated with wash buffer (500 mM NaCl, 50 mM Tris pH 7.5, 30 mM imidazole) and bound protein was eluted with elution buffer (500 mM NaCl, 50 mM Tris pH 7.5, 500 mM imidazole). Fractions containing CmoA were concentrated and loaded onto a Superdex 200 HiLoad 16/60 column (GE Healthcare) equilibrated with gel-filtration buffer (200 mM NaCl, 20 mM Tris pH 7.5). Fractions containing CmoA were pooled and the N-terminal hexahistidine tag was removed by digesting the protein with rhinovirus 3C protease. The mixture was then reverse-purified by performing an additional round of Ni 2+affinity chromatography as described above and collecting the flowthrough. This protein was then buffer-exchanged into gelfiltration buffer and concentrated to 20 mg ml À1 for crystallization.

Size-exclusion chromatography coupled with static light scattering (SEC-SLS) analysis
The oligomeric state of CmoA in solution was analysed using size-exclusion chromatography with a Superdex 200 column followed by light scattering using a Viscotek Tetra Array Detector measuring refractive index, right-angle light scattering and absorbance at 280 nm. A 100 ml sample at 0.77 mg ml À1 was applied onto the size-exclusion column and was observed to correspond to dimeric CmoA.

Crystallography
Data were collected on beamline I04 of Diamond Light Source, Didcot, England and were processed with xia2 (Winter, 2010, Evans, 2011Leslie, 2006;Sauter et al., 2004;Zhang et al., 2006). The structure was determined by molecular replacement using the structure of H. influenzae YecO (PDB entry 1im8; chain B; Lim et al., 2001) with both the SAM cofactor and solvent molecules removed and the programs CHAINSAW (Stein, 2008) and MOLREP (Vagin & Teplyakov, 2010) as implemented in the MrBUMP pipeline (Keegan & Winn, 2007). The molecular-replacement solution contained two molecules of CmoA and had initial R work /R free values of 45.9/45.5%. The model was then improved through alternate cycles of manual rebuilding using Coot (Emsley et al., 2010) and restrained refinement with REFMAC5 (Murshudov et al., 2011) using an isotropic B factor for each atom and one TLS group per chain (Winn et al., 2001). A restraints file for S-adenosyl-S-carboxymethyl-l-homocysteine was created using the PRODRG2 server (Schü ttelkopf & van Aalten, 2004).
The final model contains two molecules of CmoA (residues 19-247 in chain A and residues 20-244 in chain B), two molecules of S-adenosyl-S-carboxymethyl-l-homocysteine, two molecules of MPD and 273 water molecules. Model statistics are provided in Table 1. The Ramachandran plot of the model was calculated with RAMPAGE (Lovell et al., 2003) and the figures were created with CCP4mg (McNicholas et al., 2011) and the PoseView server (Stierand et al., 2006). The coordinates and structure factors have been deposited in the Protein Data Bank with accession code 4iwn.

Mass spectrometry
Samples of CmoA were purified as described above and diluted to a concentration of 5 mM in 50%(v/v) aqueous  jF obs j À jF calc j = P hkl jF obs j. R free was calculated from a randomly chosen set of reflections (5% of the total) excluded from the R work set used for refinement. acetonitrile containing 1% formic acid. These samples were introduced into the mass spectrometer using a TriVersa NanoMate ion source (Advion BioSciences) in positive-ion mode. Mass spectra were acquired using a solariX FT-MS (Bruker Daltonics) with a 9.4 T superconducting magnet. Tandem MS of the released ligand was performed by collisioninduced dissociation in the hexapole (Q-CID) with argon collision gas. Spectra were processed using DataAnalysis v.4.0 (Bruker Daltonics). Protein mass deconvolution was performed using v.2.0 of the SNAP algorithm and mass measurements from released ligand and fragmentation spectra were calculated from centroided data.

Results and discussion
The crystal structure of E. coli CmoA was determined by molecular replacement and was refined to R work and R free values of 19.6 and 23.1%, respectively, using data to a resolution of 1.73 Å ( Table 1). The protein copurified with a cofactor from the E. coli cells that we anticipated would be either S-adenosylmethionine (SAM) or S-adenosylhomocysteine (SAH) on the basis of the S-adenosylmethionine-binding motifs present in the sequence of CmoA. Unexpectedly, both molecules of CmoA contain the novel derivative S-adenosyl-S-carboxymethyl-l-homocysteine (SCM-SAH), which differs from SAM by the substitution of the methyl donor group (R = -CH 3 ) by a carboxymethyl group (R = -CH 2 COOH).
There are two molecules of CmoA present in the asymmetric unit that are related to each other by a noncrystallographic twofold rotational axis. Apart from minor differences at the N-and C-termini, the two molecules adopt the same conformation and superpose with an r.m.s.d. of 0.3 Å (225 aligned C atoms). Analysis of the structure with PISA (Krissinel & Henrick, 2007) reveals that the interface between the two molecules is extensive, with 1274 Å 2 of buried surface area per monomer (Fig. 2). In addition to an antiparallel -sheet formed by the 6 strands of both molecules, there are additional interactions between helix 6 of one molecule and helix 2 and strand 6 of the other molecule. Together, these interactions comprise 15 hydrogen bonds and a number of hydrophobic interactions, suggesting that the CmoA dimer present in the asymmetric unit may also represent the oligomeric state of CmoA in solution. To confirm the existence of this dimer in solution, purified CmoA was analysed by sizeexclusion chromatography and static light scattering (SEC-SLS). A single species was visible on the chromatogram and The overall structure of the CmoA dimer. (a) Two orthogonal views of the two monomers forming the dimer (green and blue ribbons). The S-adenosyl-Scarboxymethyl-l-homocysteine (SCM-SAH) cofactor is shown with O atoms in red, N atoms in blue and C atoms in green. (b) The SEC-SLS chromatogram that confirms that CmoA is a dimer in solution. The refractive index (RI; red line) and right-angle light scattering (RALS; green line) traces are displayed. The molecular weight calculated by the OmniSEC software (MW; black line) is shown above the elution peak. The dispersity, Mw/Mn, defined as the ratio of the weight average to number average molecular weights was reported to be 1.001, indicating a highly uniform sample. the molecular weight was calculated to be 52.5 kDa, which is consistent with the theoretical molecular weight of 55.6 kDa for the dimer (Fig. 2). Interestingly, PISA identifies an equivalent dimer in the crystal structure of H. influenzae YecO (PDB entry 1im8) between chain A and its symmetry mate (x À y, Ày, Àz) with an interface area of 1194 Å 2 per monomer (Lim et al., 2001). The conserved nature of this interface indicates that the oligomeric state may be important for the structure and function of CmoA.

CmoA contains a novel S-adenosylmethionine derivative
During the refinement of the structure, it became apparent from inspection of both the 2mF o À DF c and the mF o À DF c electron-density maps that the putative active site of CmoA contained a cofactor that was neither S-adenosylmethionine (SAM) nor S-adenosylhomocysteine (SAH). Prior to the modelling of the ligand, unambiguous positive density was visible in the mF o À DF c electron-density map for all of the features expected for SAM: l-methionine and both the adenine and ribose rings were visible at a contour level of 5, while the S atom was visible at a contour level of 18 (Fig. 3a). However, additional positive density was present at the end of the methyl group, indicating that the cofactor was a covalently modified derivative of SAM. This positive density was bifurcated, planar in shape and visible at a contour level of 7. Because these maps were generated prior to the modelling of any cofactor and no cofactor was present in the search model that was used during molecular replacement, the presence of this additional density was not a consequence of model bias. The shape of the density was most consistent with an S-adenosylmethionine derivative in which the methyl group had been derivatized with a functional group with trigonal planar geometry.
To confirm the presence of the cofactor and investigate its identity, samples of CmoA were analysed by Fourier transform mass spectrometry (FT-MS). To exclude the possibility that the modification occurred during crystallization or data collection (as a result of the chemicals present in the crystallization solution or of exposure to X-rays), the sample used for analysis was not crystallized but was from the same preparation as that used for crystallization. A signal at m/z = 27 764.2 was assigned as a protonated molecular-ion peak for CmoA, in agreement with the value of m/z = 27 763.9 estimated from the sequence alone (a difference of 0.3 Da after accounting for the proton). An additional signal at m/z = 443.1333 was detected and this was isolated and further analysed by collision-induced dissociation. The resulting fragmentation spectrum contained three signals: a parent ion at m/z = 443.1336 and two fragments at m/z = 342.0863 and 250.0925 (Fig. 4a). These are not consistent with the theoretical values for SAM (m/z = 399.1445) or SAH (m/z = 385.1289), and the fragmentation spectrum featured no signals at these values. A search of the PubChem database for compounds structurally similar to SAH or SAM with a molecular weight of between 442.6 and 443.6 Da resulted in single hit: a SAM derivative in which the methyl group is substituted by a carboxymethyl group (CID 11212932; Fig. 4b). We refer to this derivative as S-adenosyl-S-carboxymethyl-l-homocysteine (or S-carboxymethylated SAH), which could be further abbreviated as SCM-SAH; the full IUPAC systematic name is given in x2.7. The m/z values calculated for this compound are in close agreement with those determined experimentally: the parent  Structure of S-adenosyl-S-carboxymethyl-l-homocysteine (SCM-SAH). The final coordinates are displayed with (a) the likelihood-weighted mF o À DF c difference electron-density maps contoured at 3 calculated prior to the modelling of SCM-SAH and (b) the 2mF o À DF c electron-density maps contoured at 1.5. The SCM-SAH model is shown with C atoms in cyan, O atoms in red and N atoms in blue. C atoms of protein residues are shown in green.

Figure 4
Mass spectrum of SCM-SAH. (a) The fragmentation spectrum and (b) the chemical structure of SCM-SAH. Signals consistent with the entire SCM-SAH cofactor as well as two fragments (in the blue and green boxes) were detected. ion has a calculated value of m/z = 443.1336 (À0.70 mDa difference) and two potential fragments may be generated with calculated values of m/z = 342.0867 and 250.0935 (À0.33 and À0.93 mDa difference, respectively; Figs. 4a and 4b).
Restraints for the modelling and refinement of SCM-SAH were generated with PRODRG2 and the ligand was then modelled into the active site of each monomer (Fig. 3b). Following refinement, there was no obvious distortion of the ligand geometry with respect to the ideal geometry and there were no significant positive or negative peaks in the mF o À DF c difference electron-density map in the immediate vicinity of the ligand. Taken together, the crystallographic and massspectrometric data suggest that the active site of CmoA contains an S-adenosylmethionine derivative in which the methyl group is substituted by a carboxymethyl group.

Comparison with H. influenzae YecO
H. influenzae YecO (PDB entry 1im8) was identified as the most similar deposited structure to CmoA by both sequencebased (68% sequence identity) and structure-based (r.m.s.d. = 0.58 Å for 222 aligned C atoms) search methods (Krissinel & Henrick, 2004). The structure of YecO was originally determined by multiple-wavelength anomalous diffraction (MAD) using a selenomethionine derivative produced in E. coli B834 (DE3) cells grown in minimal medium in the presence of l-selenomethionine (Lim et al., 2001). The authors noted that in addition to the number of selenium sites expected on the basis of the protein sequence, one additional selenium site per monomer was detected during structure determination. This was incorporated into the cofactor that copurified with YecO, suggesting that the cofactor was derived from l-methionine (in the case of cells grown in non-labelled medium) or l-selenomethionine (in the case of cells grown in minimal medium with l-selenomethionine). In the deposited structure of YecO the cofactor was modelled as Se-substituted SAH with a Cl À ion 2.9 Å away from the Se atom.
Given the high degree of similarity between CmoA and YecO, we re-examined both the coordinates and the structure factors for YecO deposited in the PDB. Re-refinement of the deposited structure resulted in R work and R free values of 19.2 and 24.8%, respectively, which are comparable with the values of 18.6 and 25.5% originally reported (Fig. 5a). Refinement of the YecO structure without any cofactor modelled results in electron density in both the 2mF o À DF c and the mF o À DF c electron-density maps into which the Se-substituted form of the SCM-SAH cofactor found in CmoA can be modelled, and the electron-density maps after refinement are also compatible with the presence of this cofactor (Fig. 5b). However, the electron density is not defined well enough to allow a distinction between the possibilities of SAH and a Cl À ion (as modelled originally) or of SCM-SAH (as modelled in CmoA). We note, however, that we were unable to find evidence of SAH and an equivalently positioned Cl À ion in a manual inspection of PDB entries that are (i) annotated as methyltransferases (EC 2.1.1) and (ii) contain at least one Cl À ion. Furthermore, in the case of CmoA the electron-density maps are better defined and the mass-spectrometric data argue against the cofactor modelled in YecO.

Overall structure of CmoA
CmoA has a Rossmann fold that comprises seven -strands and eight -helices. The -strands form a single sheet in which all strands are parallel except 7. The majority of the -helices pack against both faces of the -sheet, although helices 2, 6 and 7 form a compact lid-like structure that sits over the region containing the SCM-SAH and renders it almost inaccessible to solvent. Superposition of CmoA with the structures of the other RNA methyltransferases TrmA (PBD entry 3bt7) and RumA (RlmD; PDB entry 2bh2) reveals that while the Rossmann-fold core is conserved between these enzymes, the lid-like structure of CmoA obstructs the region used for the binding of the RNA substrate in these methyltransferases (Lee et al., 2005;Alian et al., 2008). Modelling of SCM-SAH into YecO. The re-refined coordinates with either (a) the originally modelled Se-substituted SAH and Cl À ion or (b) remodelled SCM-SAH are displayed along with the 2mF o À DF c electron-density maps contoured at 1 and the mF o À DF c difference electron-density maps contoured at 3. SCM-SAH is depicted with C atoms in cyan, while C atoms of protein residues are shown in green.
suggests that the binding mode is relatively fixed. In order to place the substrate nucleoside in the corresponding position with respect to the SCM-SAH cofactor in CmoA, the lid would need to undergo a large conformational change to allow access to the cofactor and to prevent significant clashes with the neighbouring nucleotides of the tRNA substrate. An alternative possibility is that CmoA acts using additional factors which assist in the modification of the substrate uridine and does not bind tRNA directly.
In common with many enzymes possessing a Rossmann fold, the majority of the conserved residues are located at the C-terminal ends of the -strands or in the loops which immediately follow. In CmoA these residues are directly involved in contacting the SAM derivative: the adenine ring is hydrogen-bonded by the side chain of Asp117 and the main chains of Asn90, Ile118 and Phe137 (via a water molecule), the ribose is hydrogen bonded by the side chains of Ser66, Asp89 and Asn90 (via a water molecule), and the l-methionine is hydrogen bonded by the side chains of Tyr39, Asp62 (via a water molecule) and Asn132 and the main chains of Gly64, Ala70 (via a water molecule) and Asn132 (Figs. 6a and 6b). The negatively charged carboxylate of the carboxymethyl group interacts with the positively charged guanidinium group of Arg199 through a salt bridge with a length of 2.7 Å . The high conservation of these residues within members of the UniProt CmoA family indicates that they are important for binding the SAM derivative and may also play a role in its biochemistry.

Conclusions
We have determined the structure of E. coli CmoA, a putative methyltransferase that is involved in the post-transcriptional modification of U34 in a number of bacterial tRNAs. While the sequence motifs and Rossmann fold of the enzyme suggest that it is a typical SAM-dependent methyltransferase, analysis of the electron-density maps and mass-spectrometric data revealed that the protein contains an atypical SAM derivative in which the donor methyl group is replaced by a carboxymethyl group. We name this previously unobserved derivative S-adenosyl-S-carboxymethyl-l-homocysteine (SCM-SAH). According to the UniProt database, the CmoA family currently contains 1566 proteins that are currently annotated as putative SAM-dependent methyltransferases. However, conservation of Arg199, the key residue of CmoA that stabilizes the negative charge of the carboxyl group of the SCM-SAH cofactor, suggests that these proteins contain the SCM-SAH cofactor instead of SAM and are currently annotated incorrectly. The equivalent residue in known SAMdependent methyltransferases is not conserved. Although CmoA homologues are only found in bacteria, it is possible that such SAM derivatives are widespread in nature, being present in other enzymes currently annotated as methyltransferases.
Previous genetic studies have indicated that CmoA is involved in the modification of mo 5 U to cmo 5 U, a reaction that involves the addition of a carboxyl group onto the methoxy group of mo 5 U but that cannot be catalysed solely by a methyltransferase (Nasvall et al., 2004). Assuming that the cofactor in CmoA is directly involved in modification of the tRNA, we speculate that it may participate in the formation of cmo 5 U by either (i) the transfer of just the carboxyl group of SCM-SAH onto the methoxy group of mo 5 U or (ii) the substitution of the methyl group of the side chain in mo 5 U by the entire carboxymethyl group from SCM-SAH (Fig. 7). A third possibility is that the carboxymethyl group is transferred directly onto the hydroxyl group of ho 5 U. Although this proposal is not supported by the observation that mutations in cmoA result in accumulation of mo 5 U and not ho 5 U, this has Binding of SCM-SAH by CmoA. (a) SCM-SAH (cyan cylinders) interacts with multiple main-chain and side-chain atoms of CmoA (green cylinders) through hydrogen bonds (blue dashes). (b) A two-dimensional schematic of the active site. been suggested previously (Murao et al., 1978) and a precedent for the chemistry of this reaction can be found in the O-methyltransferases. All three possibilities, however, would be compatible with the observation that only one of the C atoms in the side chain of cmo 5 U is derived from l-methionine. The proposed mechanisms for SAM-dependent methyltransferases often involve a general base. Superpositions of CmoA with the 5-methyluridine methyltransferases TrmA and RumA show that the C5 atom of the substrate uridine is neighboured by Glu164. This residue is highly conserved in members of the CmoA family and the only other amino acid found at this position is aspartic acid. As the side chain of cmo 5 U derivatives is attached to the C5 atom of the pyrimidine ring, this residue would potentially be able to act as a general base during the reaction.
Although DNA and RNA methyltransferases are able to use synthetic S-adenosylmethionine analogues with extended carbon chains both in vitro and in vivo (Schlenk & Dainko, 1975;Klimasauskas & Weinhold, 2007;Motorin et al., 2011), there do not appear to be any reports indicating that DNA or RNA methyltransferases actually use such analogues in vivo for the modification of nucleic acids. We hope that our findings will lead to further characterization of the function and mechanism of CmoA and its SCM-SAH cofactor.
During the final stages of preparation of our manuscript, we became aware of PDB deposition 4gek by the New York Structural Genomics Research Consortium, in which E. coli CmoA is also observed in complex with SCM-SAH. A comparison of these independently determined structures adds support to the conclusions presented in this paper.

Figure 7
Speculative roles for SCM-SAH in the modification of mo 5 U. The modification might involve either (i) the addition of the carboxyl group (red) from SCM-SAH onto the methoxy group of mo 5 U or (ii) the substitution of the methyl group of mo 5 U with the entire carboxymethyl group.