crystallization communications
Crystallization of hepatocyte nuclear factor 4α (HNF4α) in complex with the HNF1α promoter element
aDepartment of Molecular and Cellular Biochemistry and Center for Structural Biology, University of Kentucky, Lexington, KY 40536, USA
*Correspondence e-mail: ychi@uky.edu
Hepatocyte nuclear factor 4α (HNF4α) is a member of the nuclear receptor superfamily that plays a central role in organ development and metabolic functions. Mutations on HNF4α cause maturity-onset diabetes of the young (MODY), a dominant monogenic cause of diabetes. In order to understand the molecular mechanism of promoter recognition and the molecular basis of disease-causing mutations, the recombinant HNF4α DNA-binding domain was prepared and used in a study of its binding properties and in crystallization with a 21-mer DNA fragment that contains the promoter element of another MODY gene, HNF1α. The HNF4α protein displays a cooperative and specific DNA-binding activity towards its target gene-recognition elements. Crystals of the complex diffract to 2.0 Å using a synchrotron-radiation source under cryogenic (100 K) conditions and belong to C2, with unit-cell parameters a = 121.63, b = 35.43, c = 70.99 Å, β = 119.36°. A molecular-replacement solution has been obtained and structure is in progress. This structure and the binding studies will provide the groundwork for detailed functional and biochemical studies of the MODY mutants.
1. Introduction
Hepatocyte nuclear factor 4α (HNF4α) is a tissue-specific transcription factor that plays an essential role in early vertebrate development and embryonic survival. It regulates the expression of a wide variety of essential genes, including those involved in liver and pancreatic cell differentiation (Li et al., 2000; Odom et al., 2004), embryogenesis and early development (Duncan et al., 1994; Lausen et al., 2000), glucose metabolism (Stoffel & Duncan, 1997), lipid homeostasis (Hayhurst et al., 2001) and amino-acid metabolism (Schrem et al., 2002). Mutations in HNF4α cause a dominantly inherited form of diabetes known as maturity-onset diabetes of the young (MODY; Yamagata et al., 1996). These mutations cause the loss of function of the gene product (Lausen et al., 2000), which leads to impaired insulin secretion and defects in metabolic pathways (Miura et al., 2006).
HNF4α is a prototypical member of a unique nuclear receptor superfamily (NR2A1; Nuclear Receptors Nomenclature Committee, 1999) and exclusively functions as a homodimer (Jiang et al., 1995), despite its sequence homology to retinoic X receptor (RXR), which can readily heterodimerize with a related nuclear receptor (Szanto et al., 2004). HNF4α consists of distinctive functional domains including a DNA-binding domain (DBD), a ligand-binding domain (LBD) and additional domains with transcription-activation functions (AF; Schrem et al., 2002). However, the identity of its bona fide ligand is still under dispute (Hertz et al., 1998; Petrescu et al., 2005), even though its apparent ligand has been identified from structural studies (Dhe-Paganon et al., 2002; Wisely et al., 2002). HNF4α-DBD contains two zinc-finger motifs that specifically recognize and bind as a homodimer to a direct repeat of two hexameric half-sites separated by one (DR1; in the majority) or two (DR2) (Jiang et al., 1995; Rajas et al., 2002). Five MODY1 missense mutations (on four different residues) are found within the region of our HNF4α-DBD construct (Fig. 1a) and an additional MODY mutation is found in the HNF4α-binding site within the promoter of another MODY (MODY3) culprit gene HNF1α (Fig. 1b; Gragnoli et al., 1997). Analysis of the structural consequences of each amino-acid substitution should be instructive as to the functional role of each residue. In order to elucidate the molecular basis of HNF4α function and the monogenic causes of diabetes, we have prepared and crystallized the human HNF4α DNA-binding domain in complex with a high-affinity HNF1α promoter element containing the HNF4α recognition sequence.
2. Materials and methods
2.1. Construction, expression and purification of HNF4α DNA-binding domain
The cDNA harboring the full-length human HNF-4αB splice variant (Kritis et al., 1996) was a kind gift from Dr Steve Shoelson of Joslin Diabetes Center. A fragment of human HNF4α cDNA (amino acids 46–126) was subcloned by standard PCR into a pET41a vector (GE Healthcare). HNF4α was overexpressed in Escherichia coli BL21-Gold (Novagen) with induction by 0.5 mM IPTG at an OD600 of 0.8–1.0 at 310 K and harvested after culturing for an additional 3–4 h. No zinc solution was added during the purification since there should be a sufficient amount of Zn atoms in the medium to be incorporated into the protein. The cells were lysed by sonication and the expressed GST-fusion proteins were isolated with the use of glutathione-agarose beads (Invitrogen) in bulk plus washing in the presence of 0.6 M NaCl to prevent nonspecific binding to bacterial DNA. HNF4α was released by thrombin digestion from the resin after overnight incubation at 277 K and was further purified by (Mono-S FPLC). Thrombin digestion produced a two-residue remnant (Gly-Ser) at the N-terminal end (Fig. 1a). The purified protein was estimated to be at least 98% pure as judged by staining with Coomassie on 8–25% gradient SDS–PAGE gel (Fig. 2). Fractions were pooled and stored at 193 K as a 10%(v/v) glycerol stock.
2.2. Gel filtration of HNF4α DNA-binding domain
Gel filtration was performed on a Superdex 75 HR 10/30 column (GE Healthcare) equilibrated with running buffer containing 20 mM Tris pH 7.5, 200 mM NaCl, 1 mM EDTA and 10 mM 2-mercaptoethanol. Elution was performed at a flow rate of 0.5 ml min−1. The apparent molecular weight of HNF4α-DBD was determined using the same column calibrated previously with a range of reference proteins (Bio-Rad): thyroglobulin (670 kDa), bovine γ-globulin (158 kDa), chicken ovalbumin (44 kDa), equine myoglobin (17 kDa) and vitamin B12 (1.4 kDa). Blue dextran was used to determine the void volume of the column.
2.3. Preparation of DNA oligomers
Tritylated v/v) acetonitrile gradient in 50 mM triethylamine acetate buffer pH 7.0. Excess mobile phase containing acetonitrile was removed using HiTrapQ (GE Healthcare) and the trityl groups were removed with 80%(v/v) acetic acid. The deprotected were precipitated with 75%(v/v) ethanol, dissolved in water for concentration measurement by A260 and lyophilized before storage at 193 K. Double-stranded DNAs were generated for crystallization by heating equimolar amounts of complementary to 358 K for 10 min and slowly cooling to 277 K. The annealing buffer condition was 20 mM Tris pH 8.0, 200 mM NaCl and 1 mM EDTA.
were purchased from the Midland Certified Reagent Company (Midland, Texas, USA) and further purified by reverse-phase HPLC on a C8 XTerra prep column (Waters) using a linear 5–50%(2.4. Electrophoretic mobility-shift assay (EMSA)
Single-stranded b) were dissolved in 10 mM Tris (pH 7.5 at 293 K) and 1 mM EDTA. Oligonucleotide 1 was 5′-end labeled with 32P as described by Maxam & Gilbert (1977). Labeled oligonucleotide 1 was mixed with a 1.1-fold molar excess of oligonucleotide 2 and the samples were heated to 363 K and cooled slowly to 293 K. DNA was transferred by dialysis into binding buffer [10 mM Tris pH 8.0, 1 mM EDTA, 100 mM NaCl, 1 mM MgCl2, 1 mM DTT, 4%(v/v) glycerol]. DNA concentrations were measured by absorbance using a molar extinction coefficient ∊2601cm of 2.57 × 105. Samples were stored at 253 K until use. EMSAs were carried out as described by Hellman & Fried (2007) using 10%(w/v) polyacrylamide gels cast and run in 45 mM Tris–borate, 2.5 mM EDTA (pH 8.3 at 293 K). Autoradiographic images were captured on storage phosphor screens (type GP, GE Healthcare), detected with a Typhoon phosphorimager and quantitated with Image-Quanta software (GE Healthcare). Data from serial dilution experiments were analyzed using the equations
1 and 2 (Fig. 1and
Here, n is the binding stoichiometry, [PnD] and [D] the concentrations of complex and free DNA, respectively, and Kobs = [PnD]/[P]n[D]. When both n and Kobs are unknown (as in this case), iterative calculation of n and [P] using (1) and (2) results in convergence on self-consistent values of n and Kobs (Adams & Fried, 2007; Fried & Crothers, 1984).
2.5. Dynamic light-scattering measurement
The effective molecular radius and the homogeneity/monodispersity of the complex within various particular buffer conditions were measured using the Solubility Screening Kit (Jena Biosciences) in conjunction with a Dynapro-99 dynamic light-scattering instrument (Proterion Corporation) and a DynaPro-MSTC200 microsampler (Protein Solutions). The results were analyzed using DYNAMICS v.5.26.60 (Protein Solutions). 20 µl of sample was inserted into the cuvette with the temperature control set to 293 K. The light-scattering signal was collected at a wavelength of 830.7 nm. Protein concentrations were about 2 mg ml−1 in each buffer and an average of 15 readings were recorded for each measurement.
2.6. Crystallization and optimization
Protein–DNA complexes were dialyzed in 20 mM Tris pH 7.5, 75 mM NaCl and 1 mM DTT at 277 K for 2.5 h and concentrated to at least 10 mg ml−1. The initial crystallization trials were carried out at 295 K in 24-well plates using the hanging-drop vapor-diffusion method with a sparse-matrix screen (Jancarik & Kim, 1991) and similar commercially available versions such as Crystal Screens I and II (Hampton Research), Natrix and PEG/Ion Screens (Hampton Research), Cryo I and II (Molecular Dimensions) and Wizard I and II (deCODE Genetics).
Drops consisting of 0.5 µl protein–DNA solution were mixed with an equal volume of reservoir solution and equilibrated against 500 µl reservoir solution. Although many different DNA constructs varying in length and the nature of the ending (blunt end versus overhang) were screened, diffraction-quality crystals were only reproducibly obtained using the overhang 21-mer shown in Fig. 1(b) (the two HNF4α direct-repeat recognition sites are indicated by the boxes). Conditions yielding small crystals were further optimized by variation of the crystallization parameters and additives. The final condition, which produced somewhat flat bipyramidal crystals at 295 K, contained 26%(v/v) PEG 4000, 80 mM magnesium acetate and 50 mM sodium citrate pH 4.8.
2.7. Data collection and processing
The crystals were transferred into mother liquor containing an additional 15%(v/v) glycerol as a cryoprotectant before being directly plunged into liquid nitrogen and stored for data collection. The native data were collected at 100 K at APS (SER-CAT 22BM) using a MAR 225 CCD detector and an oscillation angle of 1° with 2 s exposure and were processed using HKL-2000 (Otwinowski & Minor, 1997). The wavelength used was 0.92017 Å.
3. Results and discussion
Recombinant HNF4α-DBD (amino acids 46–126; Fig. 1a) was purified to and mixed with pure DNA for subsequent studies. Gel-filtration experiments showed that the HNF4α-DBD protein existed as a monomer in solution (Fig. 2). Purified HNF4α-DBD protein forms a single complex with DNA containing its target sequence (Figs. 3a and 3b). Serial dilution analysis (Figs. 3b and 3c) revealed that the stoichiometry of the complex was 2:1 HNF4α:dsDNA, with an association constant Kobs of 8.48 ± 0.67 × 1010 M−2. The corresponding monomer equivalent dissociation constant was 3.43 ± 0.13 × 10−6 M. The formation of a 2:1 complex without the accumulation of detectable levels of the 1:1 intermediate indicates that binding is cooperative. These features will serve as a reference when we study the effects of MODY mutations on DNA binding in the near future.
Dynamic ). We used the Solubility Screening Kit (Jena Biosiences) in conjunction with DLS (Jancarik et al., 2004) in order to identify the optimal buffer conditions for complex formation and crystallization. The best polydispersity value of 0.06 was obtained with a buffer containing 20 mM Tris pH 7.5 and 75 mM NaCl and this optimal buffer was used for subsequent crystallization.
(DLS) is a useful tool to monitor protein solubility behavior and to predict favorable crystallization conditions (Wilson, 2003For crystallization, purified HNF4α 46–126 and various DNAs were simply mixed in a 2:1.2 molar ratio, dialyzed against the optimal binding buffer (20 mM Tris pH 7.5, 75 mM NaCl) and concentrated using 10 kDa molecular-weight cutoff concentrators (Millipore). The protein–DNA concentration was 10 mg ml−1 for initial screenings and 20 mg ml−1 for final optimization. Crystals with the overhang 21-mer DNA (Fig. 1b) were grown at 295 K using the hanging-drop vapor-diffusion method and the presence of the HNF4α–DNA complex in the crystals was confirmed by running SDS–PAGE and 0.5%(w/v) agarose gels (data not shown). Crystals initially appeared within 2 d and continued to grow until they reached average dimensions of 0.05 × 0.1 × 0.2 mm (Fig. 4). A range of solution conditions varying the pH, temperature and concentrations of additives such as organic solvents, divalent cations and polyamines were used to attempt to improve the crystal quality. The final optimized condition contains 26%(v/v) PEG 4000, 80 mM magnesium acetate and 50 mM sodium citrate pH 4.8. The best crystal diffracted to 2.0 Å at the synchrotron source and belongs to C2, with unit-cell parameters a = 121.63, b = 35.43, c = 70.99 Å, β = 119.36° (Fig. 5). The value of the Matthews coefficient (Matthews, 1968) is 2.12 Å3 Da−1 for one complex (two HNF4α and one dsDNA) in the and the estimated solvent content is 41.6% based on a protein specific density of 1.34. Final native data-collection statistics are summarized in Table 1.
|
The structure was determined by 1dsz ) as a search model and the program MOLREP (Vagin & Teplyakov, 1997) from the CCP4 suite (Winn, 2003). An unambiguous solution was found that gave an initial R value of 51.4% and a of 0.38 using data in the resolution range 15–3.0 Å. The subsequent σA-weighted 2Fo − Fc map after rigid-body clearly revealed density corresponding to the structural differences between the search model and the HNF4α–DNA complex. Model improvement and of the structure are in progress.
using the RXR–RAR–DNA complex structure (PDB codeAcknowledgements
We wish to thank the staff at Advanced Photon Source beamline 22-BM (SER-CAT) for their help with data collection and David Rodgers for the use of the tissue-culture and protein-production core facility. Use of the Advanced Photon Source is supported by the US Department of Energy. This work was funded by a Juvenile Diabetes Research Foundation grant (2004-503) and NIH Grant P20RR20171 from the COBRE program of the National Center for Research Resources to YIC and NIH grant GM070662 to MGF.
References
Adams, C. & Fried, M. G. (2007). Protein Interactions: Biophysical Approaches for the Study of Multicomponent Systems, edited by P. Schuck, pp. 417–446. New York: Academic Press. Google Scholar
Dhe-Paganon, S., Duda, K., Iwamoto, M., Chi, Y. I. & Shoelson, S. E. (2002). J. Biol. Chem. 277, 37973–37976. Web of Science CrossRef PubMed CAS Google Scholar
Duncan, S. A., Manova, K., Chen, W. S., Hoodless, P., Weinstein, D. C., Bachvarova, R. F. & Darnell, J. E. Jr (1994). Proc. Natl Acad. Sci. USA, 91, 7598–7602. CrossRef CAS PubMed Web of Science Google Scholar
Fried, M. G. & Crothers, D. M. (1984). J. Mol. Biol. 172, 241–262. CrossRef CAS PubMed Web of Science Google Scholar
Gragnoli, C., Lindner, T., Cockburn, B. N., Kaisaki, P. J., Gragnoli, F., Marozzi, G. & Bell, G. I. (1997). Diabetes, 46, 1648–1651. CrossRef CAS PubMed Web of Science Google Scholar
Hayhurst, G. P., Lee, Y. H., Lambert, G., Ward, J. M. & Gonzalez, F. J. (2001). Mol. Cell. Biol. 21, 1393–1403. Web of Science CrossRef PubMed CAS Google Scholar
Hellman, L. M. & Fried, M. G. (2007). Nature Protocols, 2, 1849–1861. Web of Science CrossRef PubMed CAS Google Scholar
Hertz, R., Magenheim, J., Berman, I. & Bar-Tana, J. (1998). Nature (London), 392, 512–516. Web of Science CAS PubMed Google Scholar
Jancarik, J. & Kim, S.-H. (1991). J. Appl. Cryst. 24, 409–411. CrossRef CAS Web of Science IUCr Journals Google Scholar
Jancarik, J., Pufan, R., Hong, C., Kim, S.-H. & Kim, R. (2004). Acta Cryst. D60, 1670–1673. Web of Science CrossRef CAS IUCr Journals Google Scholar
Jiang, G., Nepomuceno, L., Hopkins, K. & Sladek, F. M. (1995). Mol. Cell. Biol. 15, 5131–5143. CAS PubMed Web of Science Google Scholar
Kritis, A. A., Argyrokastritis, A., Moschonas, N. K., Power, S., Katrakili, N., Zannis, V. I., Cereghini, S. & Talianidis, I. (1996). Gene, 173, 275–280. CrossRef CAS PubMed Web of Science Google Scholar
Lausen, J., Thomas, H., Lemm, I., Bulman, M., Borgschulze, M., Lingott, A., Hattersley, A. T. & Ryffel, G. U. (2000). Nucleic Acids Res. 28, 430–437. Web of Science CrossRef PubMed CAS Google Scholar
Li, J., Ning, G. & Duncan, S. A. (2000). Genes Dev. 14, 464–474. Web of Science PubMed CAS Google Scholar
Matthews, B. W. (1968). J. Mol. Biol. 33, 491–497. CrossRef CAS PubMed Web of Science Google Scholar
Maxam, A. M. & Gilbert, W. (1977). Proc. Natl Acad. Sci. USA, 74, 560–564. CrossRef CAS PubMed Web of Science Google Scholar
Miura, A., Yamagata, K., Kakei, M., Hatakeyama, H., Takahashi, N., Fukui, K., Nammo, T., Yoneda, K., Inoue, Y., Sladek, F. M., Magnuson, M. A., Kasai, H., Miyagawa, J., Gonzalez, F. J. & Shimomura, I. (2006). J. Biol. Chem. 281, 5246–5257. Web of Science CrossRef PubMed CAS Google Scholar
Nuclear Receptors Nomenclature Committee (1999). Cell, 97, 161–163. Google Scholar
Odom, D. T., Zizlsperger, N., Gordon, D. B., Bell, G. W., Rinaldi, N. J., Murray, H. L., Volkert, T. L., Schreiber, J., Rolfe, P. A., Gifford, D. K., Fraenkel, E., Bell, G. I. & Young, R. A. (2004). Science, 303, 1378–1381. Web of Science CrossRef PubMed CAS Google Scholar
Otwinowski, Z. & Minor, W. (1997). Methods Enzymol. 276, 307–326. CrossRef CAS Web of Science Google Scholar
Petrescu, A. D., Hertz, R., Bar-Tana, J., Schroeder, F. & Kier, A. B. (2005). J. Biol. Chem. 280, 16714–16727. Web of Science CrossRef PubMed CAS Google Scholar
Rajas, F., Gautier, A., Bady, I., Montano, S. & Mithieux, G. (2002). J. Biol. Chem. 277, 15736–15744. Web of Science CrossRef PubMed CAS Google Scholar
Schrem, H., Klempnauer, J. & Borlak, J. (2002). Pharmacol. Rev. 54, 129–158. Web of Science CrossRef PubMed CAS Google Scholar
Stoffel, M. & Duncan, S. A. (1997). Proc. Natl Acad. Sci. USA, 94, 13209–13214. CrossRef CAS PubMed Web of Science Google Scholar
Szanto, A., Narkar, V., Shen, Q., Uray, I. P., Davies, P. J. & Nagy, L. (2004). Cell Death Differ. 11, Suppl. 2, S126–S143. Google Scholar
Vagin, A. & Teplyakov, A. (1997). J. Appl. Cryst. 30, 1022–1025. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wilson, W. W. (2003). J. Struct. Biol. 142, 56–65. Web of Science CrossRef PubMed Google Scholar
Winn, M. D. (2003). J. Synchrotron Rad. 10, 23–25. Web of Science CrossRef CAS IUCr Journals Google Scholar
Wisely, G. B., Miller, A. B., Davis, R. G., Thornquest, A. D. Jr, Johnson, R., Spitzer, T., Sefler, A., Shearer, B., Moore, J. T., Willson, T. M. & Williams, S. P. (2002). Structure, 10, 1225–1234. Web of Science CrossRef PubMed CAS Google Scholar
Yamagata, K., Furuta, H., Oda, N., Kaisaki, P. J., Menzel, S., Cox, N. J., Fajans, S. S., Signorini, S., Stoffel, M. & Bell, G. I. (1996). Nature (London), 384, 458–460. CrossRef CAS PubMed Web of Science Google Scholar
© International Union of Crystallography. Prior permission is not required to reproduce short quotations, tables and figures from this article, provided the original authors and source are cited. For more information, click here.