structural communications
Structures of the first representatives of Pfam family PF06684 (DUF1185) reveal a novel variant of the Bacillus chorismate mutase fold and suggest a role in amino-acid metabolism
aJoint Center for Structural Genomics, https://www.jcsg.org , USA,bProgram on Bioinformatics and Systems Biology, Burnham Institute for Medical Research, La Jolla, CA, USA,cStanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, USA,dProtein Sciences Department, Genomics Institute of the Novartis Research Foundation, San Diego, CA, USA,eCenter for Research in Biological Systems, University of California, San Diego, La Jolla, CA, USA,fDepartment of Molecular Biology, The Scripps Research Institute, La Jolla, CA, USA,gUniversity of California, San Diego, La Jolla, CA, USA, and hPhoton Science, SLAC National Accelerator Laboratory, Menlo Park, CA, USA
*Correspondence e-mail: wilson@scripps.edu
The crystal structures of BB2672 and SPO0826 were determined to resolutions of 1.7 and 2.1 Å by single-wavelength Bacillus chorismate mutase fold (BCM). The biological unit of both proteins is a hexamer and analysis of homologs indicates that the oligomer interface residues are highly conserved. The conformation of the critical regions for appears to be dependent on pH or salt concentration, suggesting that this protein might be subject to environmental regulation. Structural similarities to BCM and genome-context analysis suggest a function in amino-acid synthesis.
and multiple-wavelength respectively, using the semi-automated high-throughput pipeline of the Joint Center for Structural Genomics (JCSG) as part of the NIGMS Protein Structure Initiative (PSI). These proteins are the first structural representatives of the PF06684 (DUF1185) Pfam family. Structural analysis revealed that both structures adopt a variant of theKeywords: domain of unknown function; structural genomics; chorismate mutase; amino acids; pH-dependent; salt-dependent.
3D view: 3byq,2qtp
1. Introduction
To extend the structural coverage of proteins of unknown function, we targeted Pfam protein family PF06684 (domain of unknown function 1185; DUF1185) and determined the crystal structures of two representative members. The BB2672 gene of Bordetella bronchiseptica, which is a causative agent of infectious bronchitis in domestic mammals, encodes a protein with a molecular weight of 20.9 kDa (residues 1–192) and a calculated of 7.1. The SPO0826 gene of the marine α-proteobacterium Silicibacter pomeroyi encodes a protein with a molecular weight of 20.5 kDa (residues 1–193) and a calculated of 6.2.
Here, we report the crystal structures of BB2672 and SPO0826, which were determined using the semi-automated high-throughput pipeline of the Joint Center for Structural Genomics (JCSG; https://www.jcsg.org ; Lesley et al., 2002) as part of the NIGMS Protein Structure Initiative (PSI). Despite lacking any recognizable sequence similarity to other protein families, both proteins show significant structural similarity to proteins with a Bacillus chorismate mutase-like (BCM-like) fold characterized by a β-α-β-α-β-β core that includes a mixed β-sheet (order 1423) with the β4 strand antiparallel to the rest. In bacteria, fungi and higher plants, chorismate mutase (EC 5.4.99.5) catalyzes the isomerization of chorismate to prephenate in the first committed step in the biosynthesis of the aromatic amino acids phenylalanine and tyrosine. While bifunctional chorismate mutases are all-α proteins that form dimers and exhibit feedback inhibition and allostery (Schmidheini et al., 1990), the monofunctional chorismate mutase adopts a BCM-like fold, is trimeric and follows Michaelis–Menten kinetics. Analysis of the BB2672 crystallographic hexamer reveals a trimer of dimers, with two distinct types of dimerization interface created by an extra N-terminal strand and a β-hairpin insertion between strands β3 and β4 in the core BCM-like fold. The more extensive interface, which is located at the C-terminus, contains a network of histidine and salt-bridged residues that may indicate environmental regulation of hexamer assembly. The smaller interface at the N-terminus could present a potential ligand-binding site for the DUF1185 family, the genetic context of which supports a role in amino-acid metabolism.
2. Materials and methods
2.1. Protein production and crystallization
Clones were generated using the Polymerase Incomplete Primer Extension (PIPE) cloning method (Klock et al., 2008). The genes encoding B. bronchiseptica RB50 BB2672 (GenBank NP_889209.1, gi:33601649, Swiss-Prot Q7WJ28) and S. pomeroyi DSS-3 SPO0826 (GenBank YP_166079.1, gi:56695728, Swiss-Prot Q5LV76) were amplified by (PCR) from genomic DNA using PfuTurbo DNA polymerase (Stratagene) and I-PIPE (Insert) primers (BB2672 forward primer, 5′-ctgtacttccagggcATGTCTTTGTACGAAATCCGCAAGCGC-3′; BB2672 reverse primer, 5′-aattaagtcgcgttaGCGCTGGCCGTCGTGCACCGAGACGGC-3′; SPO0826 forward primer, 5′-ctgtacttccagggcATGACCAAGATCCGCAAGATCGCTG-3′; SPO0826 reverse primer, 5′-aattaagtcgcgttaTCTCAGGCCGTCCTTGCCTTCGGCCGCG-3′; target sequences are shown in upper case) that included sequences for the predicted 5′ and 3′ ends. The expression vector pSpeedET, which encodes an amino-terminal tobacco etch virus (TEV) protease-cleavable expression and purification tag (MGSDKIHHHHHHENLYFQ/G), was PCR-amplified with V-PIPE (Vector) primers (forward primer, 5′-taacgcgacttaattaactcgtttaaacggtctccagc-3′; reverse primer, 5′-gccctggaagtacaggttttcgtgatgatgatgatgatg-3′). V-PIPE and I-PIPE PCR products were mixed to anneal the amplified DNA fragments together. Escherichia coli GeneHogs (Invitrogen) competent cells were transformed with the V-PIPE/I-PIPE mixture and dispensed onto selective LB–agar plates. The cloning junctions were confirmed by DNA sequencing. Protein expression was performed in selenomethionine-containing medium at 310 K with suppression of normal methionine synthesis. At the end of lysozyme was added to the cultures to a final concentration of 250 µg ml−1 and the cells were harvested. After one freeze–thaw cycle, the cells were homogenized in lysis buffer [50 mM HEPES pH 8.0, 50 mM NaCl, 10 mM imidazole, 1 mM tris(2-carboxyethyl)phosphine hydrochloride (TCEP)] and passed through a Microfluidizer (Microfluidics). The lysates were clarified by centrifugation at 32 500g for 30 min and loaded onto nickel-chelating resin (GE Healthcare) pre-equilibrated with lysis buffer; the resin was washed with wash buffer [50 mM HEPES pH 8.0, 300 mM NaCl, 40 mM imidazole, 10%(v/v) glycerol, 1 mM TCEP] and the proteins were eluted with elution buffer [20 mM HEPES pH 8.0, 300 mM imidazole, 10%(v/v) glycerol, 1 mM TCEP]. The eluates were buffer-exchanged with crystallization buffer (20 mM HEPES pH 8.0, 200 mM NaCl, 40 mM imidazole, 1 mM TCEP) using PD-10 columns (GE Healthcare). The BB2672 PD-10 and SPO0826 eluates were treated with 1 mg TEV protease per 15 mg of eluted protein. The digested protein was passed over nickel-chelating resin pre-equilibrated with crystallization buffer and the resin was washed with the same buffer. The flowthrough and wash fractions were combined and concentrated to 20 mg ml−1 by centrifugal ultrafiltration (Millipore) for crystallization assays. For SPO0826, TEV cleavage of the expression and purification tag was unsuccessful and the PD-10 was therefore further purified on a HiLoad 16/60 Superdex 200 column (GE Healthcare) using crystallization buffer as the mobile phase. The peak fractions were pooled and concentrated to 15 mg ml−1 by centrifugal ultrafiltration for crystallization assays. The two proteins were crystallized by mixing 100 nl protein solution with 100 nl crystallization solution with a 50 µl reservoir volume using the nanodrop vapor-diffusion method (Santarsiero et al., 2002) with standard JCSG crystallization protocols (Lesley et al., 2002). Screening for diffraction was carried out using the Stanford Automated Mounting system (SAM; Cohen et al., 2002) at the Stanford Synchrotron Radiation Lightsource (SSRL, Menlo Park, California, USA). For BB2672, the crystallization reagent consisted of 20%(w/v) polyethylene glycol 8000 and 0.1 M CHES pH 9.5. Ethylene glycol (1,2-ethanediol) was added to a final concentration of 10%(v/v) as a cryoprotectant. A plate-shaped crystal of approximate dimensions 0.1 × 0.04 × 0.02 mm was harvested after 26 d at 277 K for data collection. The BB2672 diffraction data were indexed in the orthorhombic C2221 (Table 1). For SPO0826, the crystallization reagent consisted of 0.1 M NaH2PO4, 0.1 M KH2PO4, 2 M NaCl and 0.1 M MES pH 6.5. Glycerol was added to a final concentration of 15%(v/v) as a cryoprotectant. A crystal of approximate dimensions 0.3 × 0.2 × 0.2 mm was harvested after 15 d at 277 K for data collection. The SPO0826 diffraction data were indexed in the tetragonal P43212 (Table 2). The oligomeric states of BB2672 and SPO0826 were determined using a 1 × 30 cm Superdex 200 size-exclusion column (GE Healthcare) coupled with miniDAWN static light-scattering (SEC/SLS) and Optilab differential refractive-index detectors (Wyatt Technology). The mobile phase consisted of 20 mM Tris pH 8.0, 150 mM NaCl and 0.02%(w/v) sodium azide. The molecular weight was calculated using ASTRA v.5.1.5 software (Wyatt Technology).
‡The number of unique reflections used in is typically slightly less than the total number that were integrated and scaled. Reflections are excluded owing to negative intensities and rounding errors in the resolution limits and unit-cell parameters. §Rcryst = , where Fcalc and Fobs are the calculated and observed structure-factor amplitudes, respectively. ¶Rfree is the same as Rcryst but for 5.1% of the total reflections chosen at random and omitted from ††This value represents the total B that includes TLS and residual B components. ‡‡Estimated overall coordinate error (Collaborative Computational Project, Number 4, 1994; Cruickshank, 1999). |
‡The number of unique reflections used in is typically slightly less than the total number that were integrated and scaled. Reflections are excluded owing to negative intensities and rounding errors in the resolution limits and unit-cell parameters. §Rcryst = , where Fcalc and Fobs are the calculated and observed structure-factor amplitudes, respectively. ¶Rfree is the same as Rcryst but for 4.9% of the total reflections chosen at random and omitted from ††This value represents the total B that includes TLS and residual B components. ‡‡Estimated overall coordinate error (Collaborative Computational Project, Number 4, 1994; Cruickshank, 1999). |
2.2. Data collection, structure solution and refinement
For BB2672, single-wavelength anomalous diffraction (SAD) data were collected on beamline BL11-1 at SSRL at a wavelength corresponding to the peak of a selenium SAD experiment. The data were collected at 100 K on a MAR Mosaic 325 mm CCD detector using the Blu-Ice (McPhillips et al., 2002) data-collection environment. The SAD data were integrated and reduced using MOSFLM (Leslie, 1992) and scaled with the program SCALA (Collaborative Computational Project, Number 4, 1994). Selenium-substructure solution and phasing were performed with SHELXD (Sheldrick, 2008) and autoSHARP (Bricogne et al., 2003; the mean figure of merit was 0.32 with 25 selenium sites). Automatic model building was performed with ARP/wARP (Cohen et al., 2004). Model completion and were performed with Coot (Emsley & Cowtan, 2004) and REFMAC v.5.4 (Winn et al., 2003). The included experimental phase restraints (Pannu et al., 1998) in the form of Hendrickson–Lattman coefficients from SHARP and TLS with one TLS group per chain. Data and for BB2672 are summarized in Table 1.
For SPO0826, multiple-wavelength anomalous diffraction (MAD) data were collected on beamline BL9-2 at SSRL at wavelengths corresponding to the high-energy remote (λ1), inflection (λ2) and peak (λ3) of a selenium MAD experiment. The data sets were collected at 100 K on a MAR Mosaic 325 mm CCD detector using Blu-Ice (McPhillips et al., 2002). The MAD data were integrated and reduced using XDS and scaled with the program XSCALE (Kabsch, 1993). Selenium-substructure solution and phasing were performed with SHELXD (Sheldrick, 2008) and autoSHARP (Bricogne et al., 2003; the mean figure of merit was 0.60 with four selenium sites) and automatic model building was performed with iterative RESOLVE (Terwilliger, 2003). Model completion and were performed with Coot (Emsley & Cowtan, 2004) and REFMAC v.5.2 (Winn et al., 2003) using the high-energy remote (λ1) data set. The included experimental phase restraints (Pannu et al., 1998) in the form of Hendrickson–Lattman coefficients from SHARP and TLS with one TLS group. Data and for SPO0826 are summarized in Table 2.
2.3. Validation and deposition
The quality of the https://smb.slac.stanford.edu/jcsg/QC ). This server processes the coordinates and data through a variety of validation tools including AutoDepInputTool (Yang et al., 2004), MolProbity (Davis et al., 2007), WHAT IF v.5.0 (Vriend, 1990), RESOLVE (Terwilliger, 2003) and MOLEMAN2 (Kleywegt, 2000), as well as several in-house scripts, and summarizes the results. Protein quaternary-structure analysis used the PISA server (Krissinel & Henrick, 2007). Fig. 1(b) was adapted from an analysis using PDBsum (Laskowski et al., 2005) and all other figures were prepared with PyMOL (DeLano Scientific). Atomic coordinates and experimental structure factors for BB2672 at 1.7 Å resolution and for SP0826 at 2.1 Å resolution have been deposited in the PDB (https://www.pdb.org ) and are accessible under codes 3byq and 2qtp , respectively.
was analyzed using the JCSG Quality Control server (3. Results and discussion
3.1. Overall structure
The ) was determined to 1.7 Å resolution using the SAD method (Table 1). The final model included three protomers (residues 2–192 of chain A, residues 1–192 of chain B and residues 2–192 of chain C), two tetraethylene glycol (PEG) molecules, 27 ethylene glycol molecules, four chloride ions and 566 water molecules in the No electron density was observed for Gly0 (which remained from the N-terminal expression and purification tag after TEV cleavage) in all three chains or for SeMet1 of chains A and C. Side-chain atoms of Lys73 in chain A, SeMet1, Gln26, Lys73 and Asp74 in chain B, and Gln26 and Lys73 in chain C had poorly defined electron density and were omitted from the model. The Matthews coefficient (VM; Matthews, 1968) for BB2672 was 2.3 Å3 Da−1 and the estimated solvent content was 46.9%. The Ramachandran plot produced by MolProbity (Davis et al., 2004) showed that 97.1% of the residues were in favored regions, with no outliers.
of BB2672 (Fig. 1The ). The final model included one protomer (147 of 194 residues) and 42 water molecules in the No electron density was observed for residues from the expression and purification tag, SeMet1–Thr2, Gly103–Ala115, Lys137–His145 or Gly172–Arg193. Side-chain atoms of Lys3, Ala18, Arg20, Arg72, Glu74, Glu77, Glu87, Lys101, Leu116 and Lys122 had poorly defined electron density and were omitted from the model. The Matthews coefficient (VM; Matthews, 1968) for SPO0826 was 2.5 Å3 Da−1 and the estimated solvent content was 51.6%. The Ramachandran plot (Davis et al., 2004) showed that 95.8% of the residues were in favored regions, with no outliers.
of SPO0826 was determined to 2.1 Å resolution using the MAD method (Table 2BB2672 forms a single domain composed of seven β-strands (β1–β7; residues 6–16, 28–38, 78–85, 118–122, 129–135, 146–150 and 160–168), three α-helices (H1, H3 and H4; residues 50–71, 92–95 and 97–108) and three 310-helices (H2, H5 and H6; residues 73–75, 142–144 and 182–184) (Fig. 1). The total β-sheet, α-helical and 310-helical contents were 29.3, 19.9 and 4.7%, respectively. BB2672 contains a central five-stranded mixed β-sheet with 12734 topology that packs against helices H1, H3 and H4. Two additional strands (β5–β6) flank the sheet from the opposite side.
SPO0826 also forms a single domain composed of seven β-strands (β1–β7; residues 4–17, 20–36, 78–85, 119–123, 130–134, 147–151 and 161–170), but with only two α-helices (H1 and H3; residues 48–69 and 91–97) and one 310-helix (H2; residues 73–75). The total β-sheet, α-helical and 310-helical contents were 43.5, 19.7 and 2.0%, respectively.
3.2. Similarity between BB2672 and SPO0826
BB2672 and SPO0826 are closely homologous, with a sequence identity of 39%, and share the same overall fold and α atoms, with an r.m.s.d. of 1.4 Å (Fig. 2a). The disordered residues in the SPO0826 structure correspond to a large C-terminal region of BB2672 that encompasses helix H4 (residues 97–109), most of the H4–β4 loop (residues 110–115), the β5–β6 loop (residues 137–145), including the 310-helix H5, and the C-terminal region (residues 172–193) after β7, including 310-helix H6. In BB2672, consecutive glycines (residues 101–102) located after the first turn of helix H4 (Fig. 1b) result in a break in the main-chain hydrogen bonding of the helix, indicating a degree of structural plasticity in this region. Gly102 is highly conserved among BB2672 homologs, suggesting that the conformational flexibility in H4 might have functional implications. The region C-terminal of β7, including 310-helix H6, has high B values but is stabilized through interactions with the β5–β6 loop that appear to be critical for formation of the hexamer.
Superposition of the two structures extends over 142 equivalent C3.3. Similarity to other proteins
BB2672 and SPO0826 both show significant structural similarity to proteins with the Bacillus chorismate mutase-like (BCM-like) fold (https://scop.mrc-lmb.cam.ac.uk/scop/data/scop.b.e.bie.j.b.b.html ), a `circular permutation' variant of the thioredoxin-like fold (Qi & Grishin, 2005). A FATCAT (Ye & Godzik, 2004) database search identified several proteins with this fold that had significant structural similarities to BB2672. Among these, the top hit was the C-terminal domain of RmpM, an outer membrane protein from Neisseria meningitidis (PDB code 1r1m ; Grizot & Buchanan, 2004), with a Cα r.m.s.d. of 3.1 Å and 7% sequence identity over 96 residues (Fig. 2b). Superposition with the monofunctional chorismate mutase from Bacillus subtilis (PDB code 2cht ; Chook et al., 1993) resulted in similar r.m.s.d. values (2.5 Å with 10% sequence identity over 86 residues; Fig. 2c). In addition to the core BCM-like fold, BB2672 contains an extra N-terminal strand (β1) and a β-hairpin (strands β5–β6) inserted between the third and fourth strands of the BCM β-sheet (Fig. 2c). In RmpM, BB2672 strands β5–β6 are replaced by two helices that extend out in the opposite direction from the β-strands (Fig. 2b), while the region C-terminal of BB2672 strand β7 is absent from both structures (Figs. 2b and 2c).
Analysis of BB2672 using the PISA server (Krissinel & Henrick, 2007) indicated that a hexamer was the likely quaternary form. Analytical in combination with static indicated that both BB2672 and SPO0826 were hexamers in solution. The BB2672 hexamer corresponds to a `trimer of dimers' with approximate dimensions of 50 × 70 × 80 Å, with the helices positioned on the outside and the β-strands forming the inner core (Fig. 3a). This arrangement creates two main types of interface. The interface between chains B and C (and equivalents between A and A′ and between C′ and B′; Fig. 3a) is extensive, burying a surface area of 1960 Å2 per protomer with 32 hydrogen bonds and six salt bridges. This interface involves strands β4–β6 and helix H5 from both protomers, and the first turn of helix H4, loops β2–H1 and β3–H3 from one protomer that contacts the region C-terminal of β7 in the other protomer. Two strictly conserved pairs of salt bridges (Glu90–Arg176 and Glu92–Arg172) account for the majority of the electrostatic interactions along this interface. Both sets of salt bridges are almost entirely shielded from solvent upon Residues in the first half-turn of helix H3, loop β2–H1, loop β3–H3 and the region C-terminal of β7 are also highly conserved, suggesting that this interface serves a conserved functional purpose. Analysis of the BB2672 hexamer structure indicates that this `C-terminal' or `dimer' interface may provide dimers as the initial building blocks which then assemble to form the hexamer. However, this interface was not observed in the SPO0826 structure, which crystallized with a single molecule in the asymmetric unit.
The second interface, which is observed in both the BB2672 and the SPO0826 structures, is formed between chains A and B (and equivalently between C and C′ and between B′ and A′; Fig. 3a). This `N-terminal' interface mainly involves interaction of the β1 strands of adjacent chains in an antiparallel manner to form a continuous β-sheet across interacting protomers. In BB2672 this ,interface buries a surface area of 950 Å2 per protomer with 18 hydrogen bonds and four salt bridges. Pairs of intermolecular salt bridges (Lys8–Glu14 and Glu19–Arg7) anchor the adjacent ends of the corresponding β1 strands. With the exception of the strictly conserved Arg7–Glu19 salt bridge, no other highly conserved intermolecular interactions are observed in this interface. In BB2672, the N-terminal interface is involved in the formation of the hexamer, while in SPO0826, it is only involved in crystal contacts. This interface is not predicted by PISA (Krissinel & Henrick, 2007) to form a stable dimer and thus seems to only be stable when the molecules assemble as a hexamer. Both the β1 strand and the region C-terminal of strand β7 are absent in classic BCM folds, where the subunits are arranged in parallel to form homotrimers (Chook et al., 1993; Fig. 3b). Thus, the two distinct types of interface observed in the BB2672 hexamer could result from these two novel additions to the classic BCM-like fold.
Analysis of the BB2672 hexamer structure using the CastP server (Binkowski et al., 2003) revealed a large cavity (∼1000 Å3) located along the C-terminal or dimer interface and including residues from helix H4, the H4–β4 loop, β5, the β5–H5 loop, the 310-helix H5 and the region C-terminal of β7. Proximal to this cavity, a highly conserved cluster of four histidines, His93, His98, His144, His174, is located within hydrogen-bonding distance of residues involved in the formation of intermolecular salt bridges (Fig. 4). All four histidines are located in potentially flexible regions of the molecule: His144 and His174 in loops and His87 and His98 at the beginning of helices H3 and H4, respectively. Helix H3 is a very short 310-helix, whereas His98 is located on a turn before a break in the main-chain hydrogen-bonding pattern of helix H4. In SPO0826, most of these regions are disordered, indicating flexibility. This clustering of histidine residues in regions of some conformational variability and the proximity of salt bridges involved in suggests that assembly of the BB2672 hexamer could be modulated by pH or salt concentration. The imidazole of the strictly conserved His144 is the only solvent-exposed side chain of these His residues in the structure, suggesting that it is the less solvent-accessible histidines that are likely to control pH-dependent The observation that SPO0826 crystallizes as a monomer in high salt and acidic pH (2 M NaCl pH 6.5), while demonstrating the same hexameric state in solution as BB2672 under alkaline and low-salt conditions (150 mM NaCl pH 8.0), lends further support to the hypothesis that pH or salt concentration might be a key factor in oligomerization.
RmpM, the most similar structure to BB2672, is a putative peptidoglycan-binding protein that interacts with integral outer membrane proteins such as porins and with transporters implicated in iron acquisition and bacterial pathogenesis (Grizot & Buchanan, 2004). The peptidoglycan-binding site is located along a highly conserved hydrophilic groove that partially overlaps with a cavity containing highly conserved residues along the C-terminal dimerization interface of BB2672 (His93, Arg176 and Glu90; Grizot & Buchanan, 2004). However, structural alignment shows no sequence similarity between BB2672 and the proposed binding residues in RmpM, and access to the cavity in BB2672 is partially occluded by the β2–H1 and β3–H3 loops.
In the monofunctional chorismate mutase from Bacillus subtilis (PDB code 2cht ), the active sites are located at each of the three subunit interfaces and the reaction involves stabilization of the charged transition state of chorismate via the formation of ionic bonds with a number of acidic and basic side chains in BCM (Chook et al., 1993). A number of studies have established that these electrostatic effects provide the main drive in catalysis (Kast et al., 1996). Electrostatic catalysis can additionally be mediated by metal ions and histidine proton shuttling (Christianson & Cox, 1999). Although the C-terminal interface in BB2672 displays conservation of an electrostatic surface analogous to monofunctional BCM, the lack of sequence similarity between the two proteins, as well as their different states, makes it unlikely that they share a common ligand. Assuming that the C-terminal interface plays a role in environmental regulation of the BB2672 hexamer assembly, ligand binding might occur along the less extensive and presumably less stable N-terminal interface that is likely not formed prior to BB2672 hexamerization.
Several genes predicted (https://string.embl.de ) to have functional associations with BB2672 include putative exported proteins (BB2673, BB0971 and BB4692), transcriptional regulators implicated in pathogenesis and multiple antibiotic resistance (BB2675 and BB1771) and various proteins (BB1772, BB1773 and BB1774) involved in branched-chain amino-acid (BCAA; i.e. leucine, isoleucine and valine) metabolism. A similar pattern is observed in the genomic neighborhood of SPO0826, with several genes (SPO0822, SPO0823, SPO0824 and SPO0825) involved in ATP-binding cassette (ABC) type BCAA transport and this pattern is repeated again in other BB2672 orthologs, lending further support to an association of this family with amino-acid metabolism.
The BB2672 protein family (DUF1185; PF06684) contains more than 200 sequence homologs, all of which are approximately 180 residues in length; they are mostly found in proteobacteria, but also in firmicutes and uncultured bacteria from ocean, soil and human microbiota. Members of the DUF1185 family are also present in several human and animal pathogens from burkholderia and bordetella, as well as clostridia, where their genome location on virulence islands suggests a possible role in pathogenesis (Nierman et al., 2004). In some pathogenic bacteria, one of the modes of BCAA transportation is thought to be sensitive to stress induced by changes in pH or temperature (Vijaranakul et al., 1998). In this context, BB2672 and its homologs might present attractive drug targets since, similar to chorismate mutases, DUF1185 homologs are not found in mammals.
4. Conclusion
The structural similarities between BB2672 and monofunctional BCM, as well the genomic neighborhood of BB2672 and homologs, appear to suggest involvement of the DUF1185 (PF06684) family in amino-acid metabolism. However, the presence of additional secondary-structure elements in BB2672 and differences in
states (hexamer for BB2672, trimer for BCM) and the lack of detectable sequence similarity between BB2672 and BCM make it unlikely that BB2672 and BCM share the same ligands. Finally, comparison of the BB2672 and SPO0826 structures led us to propose that and potentially function in the DUF1185 family could be subject to pH or salt regulation. Thus, BB2672 homologs may also act as environmental sensors, enabling bacterial adaptation and survival under a range of conditions.The availability of more DUF1185 sequences and structures should shed light on the evolutionary history of this intriguing protein family. The information presented here, in combination with further biochemical and biophysical studies, should yield valuable insights into the functional role of BB2672. Models for BB2672 homologs can be accessed at https://www1.jcsg.org/cgi-bin/models/get_mor.pl?key=3byqA . Additional information about BB2672 and SPO0826 is available from TOPSAN (Krishna et al., 2010) at https://www.topsan.org/explore?PDBid=3byq and https://www.topsan.org/explore?PDBid=2qtp , respectively.
Acknowledgements
This work was supported by National Institutes of Health Protein Structure Initiative grant No. U54 GM074898 from the National Institute of General Medical Sciences (https://www.nigms.nih.gov ). Portions of this research were carried out at the Stanford Synchrotron Radiation Lightsource (SSRL). The SSRL is a national user facility operated by Stanford University on behalf of the US Department of Energy, Office of Basic Energy Sciences. The SSRL Structural Molecular Biology Program is supported by the Department of Energy, Office of Biological and Environmental Research and by the National Institutes of Health (National Center for Research Resources, Biomedical Technology Program and the National Institute of General Medical Sciences). Genomic DNA from B. bronchiseptica RB50 and S. pomeroyi DSS-3 were obtained from the American Type Culture Collection (ATCC Nos. BAA-588D and 700808D, respectively). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institute of General Medical Sciences or the National Institutes of Health.
References
Binkowski, T. A., Naghibzadeh, S. & Liang, J. (2003). Nucleic Acids Res. 31, 3352–3355. Web of Science CrossRef PubMed CAS Google Scholar
Bricogne, G., Vonrhein, C., Flensburg, C., Schiltz, M. & Paciorek, W. (2003). Acta Cryst. D59, 2023–2030. Web of Science CrossRef CAS IUCr Journals Google Scholar
Chook, Y. M., Ke, H. & Lipscomb, W. N. (1993). Proc. Natl Acad. Sci. USA, 90, 8600–8603. CrossRef CAS PubMed Web of Science Google Scholar
Christianson, D. W. & Cox, J. D. (1999). Annu. Rev. Biochem. 68, 33–57. Web of Science CrossRef PubMed CAS Google Scholar
Cohen, A. E., Ellis, P. J., Miller, M. D., Deacon, A. M. & Phizackerley, R. P. (2002). J. Appl. Cryst. 35, 720–726. Web of Science CrossRef CAS IUCr Journals Google Scholar
Cohen, S. X., Morris, R. J., Fernandez, F. J., Ben Jelloul, M., Kakaris, M., Parthasarathy, V., Lamzin, V. S., Kleywegt, G. J. & Perrakis, A. (2004). Acta Cryst. D60, 2222–2229. Web of Science CrossRef CAS IUCr Journals Google Scholar
Collaborative Computational Project, Number 4 (1994). Acta Cryst. D50, 760–763. CrossRef IUCr Journals Google Scholar
Cruickshank, D. W. J. (1999). Acta Cryst. D55, 583–601. Web of Science CrossRef CAS IUCr Journals Google Scholar
Davis, I. W., Leaver-Fay, A., Chen, V. B., Block, J. N., Kapral, G. J., Wang, X., Murray, L. W., Arendall, W. B. III, Snoeyink, J., Richardson, J. S. & Richardson, D. C. (2007). Nucleic Acids Res. 35, W375–W383. Web of Science CrossRef PubMed Google Scholar
Davis, I. W., Murray, L. W., Richardson, J. S. & Richardson, D. C. (2004). Nucleic Acids Res. 32, W615–W619. Web of Science CrossRef PubMed CAS Google Scholar
Emsley, P. & Cowtan, K. (2004). Acta Cryst. D60, 2126–2132. Web of Science CrossRef CAS IUCr Journals Google Scholar
Grizot, S. & Buchanan, S. K. (2004). Mol. Microbiol. 51, 1027–1037. Web of Science CrossRef PubMed CAS Google Scholar
Kabsch, W. (1993). J. Appl. Cryst. 26, 795–800. CrossRef CAS Web of Science IUCr Journals Google Scholar
Kast, P., Asif-Ullah, M., Jiang, N. & Hilvert, D. (1996). Proc. Natl Acad. Sci. USA, 93, 5043–5048. CrossRef CAS PubMed Web of Science Google Scholar
Kleywegt, G. J. (2000). Acta Cryst. D56, 249–265. Web of Science CrossRef CAS IUCr Journals Google Scholar
Klock, H. E., Koesema, E. J., Knuth, M. W. & Lesley, S. A. (2008). Proteins, 71, 982–994. Web of Science CrossRef PubMed CAS Google Scholar
Krishna, S. S., Weekes, D., Bakolitsa, C., Elsliger, M.-A., Wilson, I. A., Godzik, A. & Wooley, J. (2010). Acta Cryst. F66, 1143–1147. Web of Science CrossRef IUCr Journals Google Scholar
Krissinel, E. & Henrick, K. (2007). J. Mol. Biol. 372, 774–797. Web of Science CrossRef PubMed CAS Google Scholar
Laskowski, R. A., Chistyakov, V. V. & Thornton, J. M. (2005). Nucleic Acids Res. 33, D266–D268. Web of Science CrossRef PubMed CAS Google Scholar
Lesley, S. A. et al. (2002). Proc. Natl Acad. Sci. USA, 99, 11664–11669. Web of Science CrossRef PubMed CAS Google Scholar
Leslie, A. G. W. (1992). Jnt CCP4/ESF–EACBM Newsl. Protein Crystallogr. 26. Google Scholar
Matthews, B. W. (1968). J. Mol. Biol. 33, 491–497. CrossRef CAS PubMed Web of Science Google Scholar
McPhillips, T. M., McPhillips, S. E., Chiu, H.-J., Cohen, A. E., Deacon, A. M., Ellis, P. J., Garman, E., Gonzalez, A., Sauter, N. K., Phizackerley, R. P., Soltis, S. M. & Kuhn, P. (2002). J. Synchrotron Rad. 9, 401–406. Web of Science CrossRef CAS IUCr Journals Google Scholar
Nierman, W. C. et al. (2004). Proc. Natl Acad. Sci. USA, 101, 14246–14251. Web of Science CrossRef PubMed CAS Google Scholar
Pannu, N. S., Murshudov, G. N., Dodson, E. J. & Read, R. J. (1998). Acta Cryst. D54, 1285–1294. Web of Science CrossRef CAS IUCr Journals Google Scholar
Qi, Y. & Grishin, N. V. (2005). Proteins, 58, 376–388. Web of Science CrossRef PubMed CAS Google Scholar
Santarsiero, B. D., Yegian, D. T., Lee, C. C., Spraggon, G., Gu, J., Scheibe, D., Uber, D. C., Cornell, E. W., Nordmeyer, R. A., Kolbe, W. F., Jin, J., Jones, A. L., Jaklevic, J. M., Schultz, P. G. & Stevens, R. C. (2002). J. Appl. Cryst. 35, 278–281. Web of Science CrossRef CAS IUCr Journals Google Scholar
Schmidheini, T., Mosch, H. U., Evans, J. N. & Braus, G. (1990). Biochemistry, 29, 3660–3668. CrossRef CAS PubMed Web of Science Google Scholar
Sheldrick, G. M. (2008). Acta Cryst. A64, 112–122. Web of Science CrossRef CAS IUCr Journals Google Scholar
Terwilliger, T. C. (2003). Acta Cryst. D59, 1174–1182. Web of Science CrossRef CAS IUCr Journals Google Scholar
Vijaranakul, U., Xiong, A., Lockwood, K. & Jayaswal, R. K. (1998). Appl. Environ. Microbiol. 64, 763–767. Web of Science CAS PubMed Google Scholar
Vriend, G. (1990). J. Mol. Graph. 8, 52–56. CrossRef CAS PubMed Web of Science Google Scholar
Winn, M. D., Murshudov, G. N. & Papiz, M. Z. (2003). Methods Enzymol. 374, 300–321. Web of Science CrossRef PubMed CAS Google Scholar
Yang, H., Guranovic, V., Dutta, S., Feng, Z., Berman, H. M. & Westbrook, J. D. (2004). Acta Cryst. D60, 1833–1839. Web of Science CrossRef CAS IUCr Journals Google Scholar
Ye, Y. & Godzik, A. (2004). Nucleic Acids Res. 32, W582–W585. Web of Science CrossRef PubMed CAS Google Scholar
This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.