research papers\(\def\hfill{\hskip 5em}\def\hfil{\hskip 3em}\def\eqno#1{\hfil {#1}}\)

Journal logoSTRUCTURAL
BIOLOGY
ISSN: 2059-7983

Breaking barriers: transitioning from X-ray crystallography to cryo-EM for structural studies

crossmark logo

aDepartment of Pharmacology, Larner College of Medicine, University of Vermont, Burlington, VT 05405, USA, bUniversity of Vermont Cancer Center, University of Vermont, Burlington, VT 05405, USA, cLife Sciences Institute, Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA, and dDepartment of Biochemistry, Larner College of Medicine, University of Vermont, Burlington, VT 05405, USA
*Correspondence e-mail: [email protected]

Edited by S. Antonyuk, Institute of Integrative Biology, University of Liverpool, United Kingdom (Received 14 October 2025; accepted 26 January 2026; online 19 February 2026)

Cryo-electron microscopy (cryo-EM) has transformed structural biology by enabling near-atomic resolution of large macromolecular complexes without the need for crystallization. Here, we describe our laboratory's transition from X-ray crystallography to single-particle cryo-EM to investigate the ATPase family AAA+ domain-containing protein 2B (ATAD2B), a chromatin regulator implicated in epigenetic signaling. We outline the challenges encountered during protein expression, purification and sample preparation, including co-purification of the chaperonin GroEL, and the strategies employed to overcome these obstacles. Our workflow highlights critical steps in sample optimization, grid vitrification and data processing using CryoSPARC, cisTEM and Topaz, as well as computational requirements for high-resolution reconstructions. We also discuss model-building, refinement and validation approaches, emphasizing best practices for new cryo-EM users. This work provides practical insights for structural biologists adopting cryo-EM, particularly for large, flexible protein complexes, and underscores the importance of integrated approaches combining biochemical, computational and imaging strategies.

1. The expanding toolkit of structural biology

The landscape of structural biology is constantly evolving, and significant advancements in cryo-electron microscopy (cryo-EM) methodologies have enabled us to study large macromolecular complexes of proteins and nucleic acids at near-atomic resolution (Chua et al., 2022View full citation). Structural biology techniques such as X-ray crystallography, nuclear magnetic resonance and electron microscopy have provided invaluable insights into the molecular machines that drive biological processes in the cell (Shoemaker & Ando, 2018View full citation). X-ray crystallography has long been the backbone of high-resolution structure determination, offering exquisite detail when well diffracting crystals are available (Smyth & Martin, 2000View full citation; Zheng et al., 2015View full citation). However, the crystallization of many target biomolecules, particularly membrane proteins and large multicomponent complexes, has remained a challenge (Birch et al., 2018View full citation).

In cryo-EM, biological samples are applied onto specialized grids, which are then rapidly plunge-frozen (Thompson et al., 2016View full citation). Techniques such as single-particle analysis have allowed researchers to achieve resolutions comparable to X-ray crystallography, while also revealing structural heterogeneities and dynamic intermediate states of molecular pathways (Cheng, 2015View full citation). However, the initial bottlenecks of employing cryo-EM include large investments in highly specialized electron microscopes and computational infrastructure (Meng et al., 2023View full citation). Additionally, the cryo-EM workflow, as illustrated in Fig. 1[link], can be quite daunting.

[Figure 1]
Figure 1
Overview of the cryo-EM workflow. Step 1: evaluation of protein sample purity and quality to determine suitability for structural characterization by cryo-EM. Step 2: vitrification of samples for cryo-EM imaging by applying the purified protein to a cryo-EM grid followed by plunge-freezing. Step 3: screening of vitrified cryo-EM grids to identify optimal freezing conditions for the purified sample. Parameters such as grid type, buffer components, additives and plunge-freezing time/force are assessed. Once sample preparation is optimized, larger cryo-EM data sets are collected. Step 4: data processing using specialized software, including motion correction, CTF estimation, particle picking, 2D classification and 3D refinement, to obtain a high-resolution 3D map. Step 5: model building and refinement to fit a model into the map. Model validation is performed to assess the quality of the map and the model fit. Note: the cryo-EM workflow often requires iterative optimization at multiple stages, and users may need to revisit earlier steps to achieve a final structure at the desired resolution.

Often, researchers must use an integrated approach, combining multiple techniques to obtain structural information on large biomolecular complexes (Nogales & Scheres, 2015View full citation). For instance, studies on large ribonucleoprotein complexes used the high-resolution information garnered with X-ray crystallography for individual subunits in conjunction with lower resolution cryo-EM maps of the multimeric complex to generate composite models (Bai et al., 2015View full citation). This synergy not only validates the individual findings of each method but also creates a more robust picture of the larger molecular architecture, which provides information that is essential for our understanding of biological mechanisms and for the development of novel therapeutic interventions (Bijak et al., 2023View full citation).

2. Rationale for developing expertise in cryo-EM for structure determination

2.1. Why cryo-EM is critical for studying ATAD2B

In the Glass laboratory, we are interested in understanding how epigenetic signaling regulates gene expression and how alterations in these pathways are involved in disease development. Our research on bromodomain-containing proteins has focused on revealing the molecular mechanisms driving the recognition of post-translational modifications found on histone proteins in the nucleosome. Bromodomains are conserved structural motifs that function as a chromatin reader domain to recognize acetylated lysine residues on histone proteins (Zeng & Zhou, 2002View full citation). Binding to specific acetyllysine modifications on histones often bridges the associated bromodomain-containing protein to chromatin, where it can carry out its molecular activities. For the past several years, our research has focused on the human ATPase family AAA+ domain-containing protein 2 (ATAD2) and its closely related paralog ATAD2B (Gay et al., 2019View full citation; Lloyd et al., 2020View full citation; Evans et al., 2021View full citation; Phillips, Malone et al., 2024View full citation). We have employed a variety of approaches including structural biology, molecular biology, genomics, biochemistry, biophysics and proteomics to determine how physiologically abundant combinations of histone modifications regulate the reader activity of the ATAD2 and ATAD2B bromodomains. However, how the bromodomain region contributes to the cellular and molecular activities of ATAD2 and ATAD2B is unknown. These proteins are thought to function as regulators of chromatin architecture (Morozumi et al., 2016View full citation; Koo et al., 2016View full citation; Lazarchuk et al., 2020View full citation), and we hypothesized that the recognition of acetyllysine residues in the flexible N-terminal region of histones plays an important role in directing the AAA+ ATPase activity of ATAD2B to its chromatin substrates. We quickly realized that using a structure–function approach to understand how ATAD2B contributes to these fundamental cellular mechanisms, designed to maintain chromatin organization, would necessitate isolating the full-length protein to study its activity in vitro.

The ATAD2B protein is 1458 amino-acid residues long and contains an unstructured N-terminal region, two AAA+ ATPase domains, a bromodomain and an ordered C-terminal domain (Fig. 2[link]a; Puchades et al., 2020View full citation; Khan et al., 2022View full citation). Previous studies on human ATAD2 and a yeast homolog Abo1 indicated that truncating the unstructured N-terminus would increase our ability to successfully express and purify the ATAD2B protein for biochemical and structural studies, without significantly impacting its enzymatic ATPase function (Cho et al., 2019View full citation, 2023View full citation). Thus, we codon-optimized the DNA encoding residues 380–1458 of the human ATAD2B protein for expression in Escherichia coli and inserted it into the pGEX-6P-1 vector to add an N-terminal GST-affinity tag. We started with our well established system in E. coli due to its ease of use and our previous experience purifying the ATAD2B bromodomain. However, the GST-ΔN-ATAD2B residue 380–1458 construct has a predicted molecular weight of 150 kDa, which is considerably larger than any other protein we had purified previously (Singh et al., 2022View full citation; Poplawski et al., 2014View full citation; Phillips, Malone et al., 2024View full citation; Obi et al., 2020View full citation; Lubula, Eckenroth et al., 2014View full citation; Lubula, Poplawaski et al., 2014View full citation; Lloyd et al., 2020View full citation; Gay et al., 2019View full citation; Evans et al., 2021View full citation). For the remainder of the text, this GST-ΔN-ATAD2B construct (residues 380–1458) will be referred to as ATAD2B for clarity. Not surprisingly, the expression of ATAD2B in E. coli was very low, but after optimizing several factors including the IPTG concentration, buffer components and the E. coli cell line we were able to reliably obtain ∼500 µl 2.5 mg ml−1 ATAD2B from a four-litre culture. However, there was a considerable amount of contaminating proteins remaining in the sample, even after GST-affinity and size-exclusion column chromatography (Fig. 2[link]b).

[Figure 2]
Figure 2
Evaluation of ATAD2B oligomerization. (a) Domain organization of the full-length human ATAD2B protein (amino acids 1–1458) and the GST-tagged, N-terminally truncated ATAD2B expression construct (amino acids 380–1458). For clarity, this GST-ΔN-ATAD2B construct is referred to as ATAD2B throughout the text. (b) 10% SDS–PAGE gel stained with Coomassie Blue showing the molecular-weight (MW) standard and a lane corresponding to the size-exclusion chromatogram peak eluting at ∼9 ml (lane 2). In lane 2, ATAD2B appears as the upper band, migrating above its predicted molecular weight of 150 kDa. Additional bands in the same lane indicate sample heterogeneity. (c) Size-exclusion chromatography traces from a Superdex 200 Increase 10/300 GL column (ÄKTAprime, GE/Cytiva) for the ATAD2B sample (blue) and a MW standard mixture (orange). Monomeric ATAD2B is expected at 150 kDa. The first peak (∼9 ml) elutes near thyroglobulin (670 kDa) consistent with ATAD2B forming an oligomeric complex. The second peak (∼15 ml), eluting before ovalbumin at 44 kDa, corresponds to dimerized GST-tags (∼50 kDa).

This sample was not pure enough or in the milligram quantities needed for the X-ray crystallization approach that we used previously with the ATAD2 and ATAD2B bromo­domains (Phillips, Malone et al., 2024View full citation; Lubula, Eckenroth et al., 2014View full citation; Lubula, Poplawaski et al., 2014View full citation; Lloyd et al., 2020View full citation; Gay et al., 2019View full citation; Evans et al., 2021View full citation). Additionally, the 150 kDa ATAD2B AAA+ ATPase is expected to hexamerize into a ∼900 kDa complex upon ATP nucleotide binding (Puchades et al., 2020View full citation; Khan et al., 2022View full citation), and this size is well above the upper limit for structure determination using solution NMR, which is another structural determination approach that we are familiar with (Poplawski et al., 2014View full citation; Kim et al., 2016View full citation; Obi et al., 2020View full citation; Evans et al., 2021View full citation; Singh et al., 2022View full citation; Phillips, Cook et al., 2024View full citation). The structure of the closely related yeast homolog of ATAD2B (Abo1) was determined using cryo-EM (Cho et al., 2019View full citation), which prompted us to begin learning more about this technique by reaching out to the National Center for Cryo-EM Access and Training (NCCAT) in New York. We first heard about the NCCAT from attending structural biology meetings such as those of the American Crystallographic Association (ACA) and the Biophysical Society, along with word of mouth from colleagues. The NCCAT staff pointed us to several online cryo-EM resources (Table 1[link]), including the Caltech Getting Started in Cryo-EM course that includes lecture videos by Dr Grant Jensen (https://cryo-em-course.caltech.edu/). Fortuitously, we also attended the ACA annual meeting in Portland, Oregon, where we participated in a workshop on `Hands-on Single-Particle CryoEM Data Analysis with cryoEDU' that was organized by Dr Michael Cianfrocco. Additionally, a large portion of the scientific sessions at that meeting were focused on cryo-EM, including `New developments in cryo-EM', `Machine learning in cryo-EM' and `Structures of very large assemblies'.

Table 1
Helpful resources for getting started in cryo-EM

Name Brief summary Quick link Expertise needed
Cryo-EM (NIH Common Fund) The home page for the National Centers for Cryo-EM in California, New York and Oregon Broadening access to high-resolution cryo-electron microscopy and tomography: https://www.cryoemcenters.org Resource
Cryo-EM Centers List (PNCC) The Pacific Northwest Center for Cryo-EM has compiled a global list of cryo-EM instrumentation and resources Cryo-EM Service Centers, a working list: https://pncc.labworks.org/team/cryoem-service-centers-working-list Resource
Cryo-EM Centers World Wide Map The Pacific Northwest Center for Cryo-EM has compiled a global list of cryo-EM instrumentation and resources Cryo-EM in a Worldwide Map: https://www.google.com/maps/d/viewer?mid=1eQ1r8BiDYfaK7D1S9EeFJEgkLggMyoaT Resource
National Center Resources: Short Courses Fantastic series of lectures on theory with sections directly involved in data processing from experts in the field. Includes links to lectures and handouts from each year a short course is held. NCCAT classes, workshops and short courses homepage: https://nccat.nysbc.org/activities/courses/ and https://www.youtube.com/@NRAMMSEMC Beginner
Intermediate
Advanced
cryoEDU Information on the theory and emphasis on why to process a certain way. Provides examples of what to look for. Hands-on cryo-EM educational tool: https://cryoedu.org Beginner
Intermediate
cryoEM101.org Explains concepts in an accessible, hands-on way with example images and situations Cryo-EM 101 website: https://cryoem101.org Beginner
CryoSPARC A tutorial/how-to guide to use one of the data-processing software programs Getting Started with CryoSPARC Introductory Tutorial: https://guide.cryosparc.com/processing-data/get-started-with-cryosparc-introductory-tutorial Beginner
Intermediate
Advanced
Grant Jensen YouTube Videos: https://www.youtube.com/playlist?list=PLhiuGaXlZZenm7lu5qv_A59zEWkRKkBn5 Long-form lectures on foundational theory with PDF handouts of the slides. Very technical, but very rewarding. Caltech Getting Started in Cryo-EM: https://cryo-em-course.caltech.edu Beginner
Intermediate
Advanced
Merit Badge Homepage Informative videos to watch for when preparing to begin your hands-on experience US Cryo-EM Center Merit Badges: https://www.cryoemcenters.org/merit-badge/ Intermediate
ThermoFisher Cryo-EM Learning Center Multi-resource area for the latest news, articles, information etc. Life Sciences Electron Microscopy Resource Center: https://www.thermofisher.com/us/en/home/electron-microscopy/life-sciences/learning-center.html Beginner
ThermoFisher Cryo-EM University Great resource for highly informative videos regarding each step of the cryo-EM workflow Cryo-EM University: https://www.thermofisher.com/us/en/home/electron-microscopy/life-sciences/learning-center/cryo-em-university.html Beginner
Yale University Fred Sigworth's Cryo-EM Principles Long-form lectures on foundational theory with PDF handouts. More videos expected… Cryo-EM Principles Home Page: https://cryoemprinciples.yale.edu Beginner
Intermediate
Advanced

The ACA meeting exposed us to the capabilities of cryo-EM, and we quickly discovered that cryo-EM requires significantly less protein than crystallography, that it can tolerate heterogenous samples and that the ATAD2B AAA+ ATPase complex is an ideal size for structure determination using this method (Vénien-Bryan et al., 2017View full citation). While structural biology techniques such as X-ray crystallography and NMR require milligram quantities of highly purified protein samples to obtain high-quality/high-resolution data, it appeared to us that cryo-EM could handle sample heterogeneity a bit better (Takizawa et al., 2017View full citation; Wang & Wang, 2017View full citation), thanks to the downstream data-processing workflow. Therefore, we decided it would be worthwhile to pursue learning cryo-EM to study the structure and function of human ATAD2B.

Early in this process, a postdoctoral trainee in the Glass laboratory had the opportunity to attend the cryo-EM short course at the NCCAT. As part of this course they were able to image the ATAD2B protein sample using negative-stain microscopy (Fig. 1[link], step 1), and the NCCAT staff also thought ATAD2B would be a good candidate to study using cryo-EM methods. To move forward, we needed access to a plunge-freezing system and a transmission electron microscope (TEM) to make and screen cryo-EM grids. However, this expensive and specialized equipment was not available locally, so we applied to the TP1 training program at the NCCAT in New York. To gain comprehensive, hands-on experience with every stage of cryo-EM sample preparation and data processing, four members of the Glass laboratory traveled in pairs to the NCCAT for week-long visits each month over the course of a year. When each team of two returned, they shared what was learned and their experiences troubleshooting with the rest of the laboratory. This allowed us to maximize our time with the NCCAT experts to learn the techniques necessary to determine the structure of the ATAD2B complex.

2.2. Helpful resources for when you decide cryo-EM is right for you

For those interested in or new to cryo-EM, we highly recommend viewing introductory videos on cryo-EM theory and sample preparation. There are a plethora of different resources, and we have compiled many of them in Table 1[link]. Additionally, virtual seminars, webinars and scientific meetings are a great place to begin learning about cryo-EM. For those who want to learn more about cryo-EM facilities located nearby prior to investing in this technique, we would encourage readers to read the information available at each of the National Cryo-EM Centers that are funded by the NIH–NIGMS. Moreover, the Pacific Northwest Cryo-EM Center (PNCC) has created a global list of available cryo-EM centers (see Table 1[link]). We also encourage readers to visit the interactive map of `HighEnd CryoEM Worldwide' that displays type of microscope, camera and operation style, if appliable, in addition to contact information (Table 1[link]). Reaching out to a facility directly will allow you to learn whether the services that they offer will fit your needs. For example, the national cryo-EM centers provide on-site training in addition to core services, whereas more regional institutions may accept the submission of a purified protein sample or a protein sample frozen on a grid.

3. Acquiring technical skills for sample preparation and optimization

The transition from X-ray crystallography to cryo-EM involves modifications in sample-preparation protocols, as the requirements for optimal imaging differ between the two techniques (Stark & Chari, 2016View full citation). In X-ray crystallography, the main goal is coaxing proteins or nucleic acids to form well ordered crystals (Smyth & Martin, 2000View full citation). This requires the optimization of various parameters such as pH, temperature, precipitating agents and chemical additives to discover the optimal conditions that promote crystal lattice formation. Well ordered crystals with good diffraction yield high-resolution structural data (McPherson & Cudney, 2014View full citation).

For single-particle cryo-EM experiments, crystallization of samples is not required, although proteins and nucleic acids are often purified using conventional approaches (Saibil, 2022View full citation). Purified samples are applied to cryo-EM grids that feature a thick holey support film, typically made of carbon or gold, which provides a scaffold to support the formation of vitreous ice. During plunge-freezing, forming a uniform and thin layer of vitreous ice is essential for capturing high-resolution images of macromolecules in a near-native state and for preserving the structural integrity of the sample under the electron beam (Passmore & Russo, 2016View full citation). In some cases, an additional ultrathin layer of continuous carbon or graphene is added to directly support the biomolecules within the ice, helping to anchor them in place and maintain the integrity of fragile samples (de Martin Garrido et al., 2021View full citation). The choice of film material can influence particle distribution, orientation and image contrast, and is often tailored to the specific requirements of the experiment (Lyumkis, 2019View full citation). Extensive optimization of the sample-freezing conditions is often necessary to achieve uniform particle distribution and the desired ice thickness on the grid, both of which are critical for obtaining high-quality cryo-EM micrographs (Table 2[link]; Han et al., 2023View full citation). The quality of the cryo-grid sample directly impacts the success of downstream imaging and data analysis, thus making this step as critical in cryo-EM as crystallization is in X-ray studies.

Table 2
Factors to consider during cryo-EM sample preparation

Category Condition/instrument
Sample type Cryo-EM is ideal for large proteins and macromolecular complexes (e.g. over 50 kDa) that are difficult to crystallize
Initial screening Acquire a small set of micrographs from negatively stained samples or frozen cryo-EM grids to evaluate particle integrity, homogeneity and potential aggregation
Buffer optimization Optimize salt (typically 150–500 mM NaCl), glycerol (typically 0–5%) and pH (typically 6.5–8.5); consider adding reducing agents (DTT, TCEP), detergents (e.g. OG) to reduce aggregation or preferred orientation or cofactors or ligands (e.g. ATP, Mg2+) to stabilize functional complexes
Protein concentration Test multiple concentrations to avoid aggregation (too high) or sparsity (too low)
Vitrification devices Compare methods (Vitrobot, Leica GP2, Chameleon) to minimize air–water interface damage
Grid-type selection Use different grids (Quantifoil, UltrAuFoil, graphene-coated) to optimize particle orientation and distribution
Ice quality control Maintain a clean, dry environment; use dried tools, LN2 transfer boxes and wear face masks to prevent frost and contamination

Transitioning to cryo-EM demands not only technical skill in operating TEMs, but also familiarity with the specialized software used for data collection and analysis. Researchers must become adept at adjusting a wide range of settings, such as electron beam parameters, detector configurations and imaging conditions, all of which play a crucial role in determining the quality of the resulting data. This involves understanding complex concepts such as defocus optimization, astigmatism correction and dose management to minimize beam damage while maximizing signal quality. Additionally, practitioners must assess ice quality and particle distribution to optimize data-collection strategies. This technical expertise represents a substantial investment in skill development that is essential for successful cryo-EM research. However, many national cryo-EM centers provide critical support to new­comers by offering guidance on selecting optimal experimental conditions, such as pixel size, total exposure and target defocus, based on the size and characteristics of the protein under study. Facility staff also assist with assessing sample quality and advising on sample optimization, which is one of the primary motivations for establishing these centers. Such resources help ensure that researchers can achieve high-quality results even without extensive prior experience.

3.1. Gaining expertise in electron microscopy

Transitioning from X-ray crystallography to cryo-EM is like stepping into a new world, but one that speaks a familiar language. Our structural studies on ATAD2B began with crystallization attempts, which failed due to the inability to obtain high concentrations of purified ATAD2B. This challenge led us to pursue cryo-EM as an alternative approach. We initiated our efforts with negative staining to evaluate sample homogeneity, integrity and concentration, key indicators for cryo-EM readiness, using the Jeol 1400 TEM 120 keV microscope, equipped with an AMT XR611 CCD camera, available through the Microscopy Imaging Center at the University of Vermont Larner College of Medicine. As we lacked in-house vitrification tools, negative staining was the most useful way to initially evaluate sample quality. However, many facilities successfully produce cryo-EM structures without this step. When we brought the ATAD2B sample to the NCCAT for plunge-freezing, we quickly realized that optimizing the sample conditions for cryo-EM grid preparation is as critical as optimizing buffer conditions for crystal growth. Variables such as salt, detergent and glycerol can dramatically affect vitrification quality and particle distribution. High protein concentrations can lead to aggregation, while low concentrations result in sparse particle visibility. Vitrification of grids for cryo-EM introduces another level of complexity. Parameters such as blotting time, blot force, temperature and humidity all required optimization. Even the type of grid significantly influenced the particle orientation and distribution (Weissenberger et al., 2021View full citation; Wang & Zimanyi, 2024View full citation). Recent advances in grid-preparation strategies, including approaches to minimize air–water interface damage and improve particle distribution, are reviewed in Haynes et al. (2025View full citation). After testing multiple conditions, we found that a buffer containing 50 mM Tris pH 7.5, 150 mM NaCl, 5% glycerol, vitrified on a 1.2/1.3 UltrAuFoil grid using a Vitrobot Mark IV, provided the best results in terms of particle orientation and distribution. Every step from ice thickness to grid clipping required precise, careful handling. These seemingly minor details were essential to enable successful high-resolution cryo-EM imaging.

3.2. Computational requirements are an important factor for cryo-EM data sets

Cryo-EM data processing involves the handling of large data sets and requires substantial computational resources to perform key tasks such as motion correction, contrast transfer function (CTF) estimation, particle picking, 2D classification and 3D refinement (Baldwin et al., 2018View full citation). For any laboratory entering the cryo-EM field, one of the first and most critical questions is which data-processing suite(s) should we use and how do we meet the computational demands of these programs? When we began transitioning into cryo-EM, we faced the same challenge: should we invest in our own graphics processing unit (GPU) workstations, rely on a cloud-based service, use a national shared facility or work with the university's high-performance computing (HPC) cluster? After evaluating our needs and budget constraints, we opted to use the Vermont Advanced Computing Center (VACC), which offered a cost-effective and scalable solution with routine data backups. To assist others in making informed decisions, Table 3[link] summarizes the core computational requirements for commonly used software suites and platforms. These specifications represent minimum requirements for each program, and data-processing performance can be significantly improved by upgrading hardware components such as GPUs, random access memory (RAM) and file-storage space.

Table 3
Core computation requirements for popular cryo-EM image-processing platform

(a) Computational requirements.

Component (minimum requirements) CryoSPARC (Punjani et al., 2017View full citation) RELION (Scheres, 2012View full citation) cisTEM (Grant et al., 2018View full citation)
CPU ≥16 cores (Intel i9/AMD Ryzen) ≥16 cores (Intel i9/AMD Ryzen) ≥16 cores (Intel i9/AMD Ryzen)
RAM (GB) 64 64 64
GPU 1× NVIDIA RTX 3060/A2000 1× NVIDIA RTX 3060/A2000 Not supported
Fast storage (SSD) 1–2 TB NVMe SSD 1–2 TB NVMe SSD 1–2 TB NVMe SSD
Bulk storage ≥10 TB HDD or NAS ≥10 TB HDD or NAS ≥10 TB HDD or NAS
Network (Gbps) 1 1 1
Operating system Linux (Ubuntu, CentOS, RHEL) Linux (Ubuntu, CentOS, RHEL) Linux (Ubuntu, CentOS, RHEL)

(b) Cryo-EM platforms.

Platform Type Notes Quick link
CryoSPARC Cloud Cloud Fully cloud-hosted by Structura Biotechnology Inc.  
Open Science Grid (OSG) Shared facility HPC cluster National academic HPC resource https://osg-htc.org
COSMIC2 Platform Shared facility HPC cluster Centralized national resource https://www.cosmic2.org
AWS Batch/EC2 Cloud Commercial cloud infrastructure https://aws.amazon.com/products/
ScipionCloud (Cuenca-Alba et al., 2017View full citation) Hybrid (cloud + shared/local HPC) Web interface; jobs can run on cloud or connect to local/university cluster  

4. Common pitfalls and problems

4.1. ATAD2B sample heterogeneity caused by a common contaminating protein

Early on in our data-processing workflow, we observed heptameric particles in both our micrographs and 2D class averages. However, the cryo-EM structures of ATAD2 homologs were all hexameric (Cho et al., 2019View full citation, 2023View full citation; Wang et al., 2023View full citation). Initially, we hypothesized that these heptameric particles may support a new function for ATAD2B (Cho et al., 2019View full citation, 2023View full citation; Wang et al., 2023View full citation). To ensure that the heptameric particles were indeed ATAD2B, and not some unwanted protein, we sent both a liquid sample of ATAD2B and cutout bands from the SDS–PAGE gel of the ATAD2B sample shown in Fig. 2[link](b) to a mass-spectrometry core (see also the supporting information). In each sample, human ATAD2B was present. Because we were pushing the size limit for soluble protein expression in E. coli (Chae et al., 2017View full citation), these results suggested that the heterogeneity could be coming from sample degradation or internal cleavage of the ATAD2B protein. The aforementioned positive results gave us the confidence to continue with high-resolution data collection. Our goal was to collect data for ATAD2B bound in three different nucleotide states to capture any large secondary/tertiary-structural conformational changes that occur between these states, and gain information on how nucleotide coordination changes within the binding pocket.

The NCCAT collected three Krios data sets: ATAD2B-apo (9541 micrographs), ATAD2B–ADP (12 945 micrographs) and ATAD2B–γATP (10 880 micrographs) (Fig. 1[link], step 3). Since ATAD2B is an ATPase, we used γATP, which is a commonly used slowly hydrolyzable ATP analog, to `trap' ATAD2B in a nucleotide-bound state (Cho et al., 2019View full citation, 2023View full citation; Wang et al., 2023View full citation). The best data were obtained for the γATP data set, and we started processing (Fig. 1[link], step 4). After pre-processing, 2D classification and a few rounds of refinement to remove junk, an initial heptameric 3D map to 3.3 Å resolution was obtained. As our very first attempt at solving a cryo-EM structure, generation of this 3D map was an exciting milestone for our laboratory. We were eager to start building the ATAD2B protein model into this novel heptameric map (Fig. 1[link], step 5). The predicted AlphaFold structure for the ATAD2B monomer (AF-Q9ULI0-F1-v4) was manually docked into the initial map using ChimeraX. However, despite trying multiple manual docking strategies, such as docking each domain separately into the map, making dimers, trimers and tetramers of the ATAD2B model for docking and running fitinmap commands in ChimeraX, we were unable to fit any region of ATAD2B into the density map.

With progress in data processing halted, we decided it was time to reach out to a cryo-EM expert for help. We shared our 2D class averages and initial 3D model while explaining the docking issues, thinking that maybe our collaborator would have a better idea for fitting the ATAD2B model into the map (Fig. 3[link]a). The collaborator immediately identified our 2D class averages and 3D map as GroEL. GroEL is a ring-shaped E. coli chaperonin protein complex that helps mediate ATP-dependent polypeptide folding. It is an abundant, ubiquitous and stress-induced protein that assists in folding proteins by interacting with incompletely folded polypeptides (Braig et al., 1994View full citation; Roh et al., 2017View full citation; Grimm et al., 1993View full citation; Fenton et al., 1994View full citation; Weissman et al., 1995View full citation). The presence of GroEL as a stress-induced protein made sense to us because we were pushing the upper limits of E. coli expression with the 150 kDa ATAD2B protein, which was expected to oligomerize into a 900 kDa complex (Khan et al., 2012View full citation). However, we were puzzled by the mass-spectrometry results, which returned every single band cut out from the SDS–PAGE gel, and the liquid sample, as ATAD2B (see supporting information). After reaching back out to the mass-spectrometry core, we learned that their analysis only searched our data against human protein databases. Once they opened up their search to include E. coli proteins, the analysis indicated the presence of GroEL in addition to ATAD2B.

[Figure 3]
Figure 3
Cryo-EM data-processing workflow for separating GroEL contaminants from ATAD2B particles. (a) Initial 2D class averages from blob picking show predominantly heptameric GroEL particles (top). Further inspection reveals hexameric ATAD2B classes (marked with green dots, middle). A representative micrograph indicates the presence of both GroEL (blue circles) and ATAD2B particles (cyan circles). (b) Topaz was trained to identify GroEL particles in raw micrographs. (c) Topaz extraction recovered more ATAD2B particles than expected. 2D class averages display GroEL (top row) and ATAD2B particles (middle and bottom rows) within the same data set. Heterogenous refinement was used to separate the ATAD2B and GroEL particles, yielding a final ATAD2B map at 5.1 Å resolution from 45 889 particles.

Our analysis of the purified ATAD2B protein clearly indicates that the sample is heterogeneous. A band is visible on the gel at the expected molecular weight for GroEL at 58 kDa (Fig. 4[link], top left panel), which was confirmed via mass spectrometry. GroEL forms a stacked heptamer formed by 14 subunits, with a molecular weight of ∼812 kDa (Braig et al., 1994View full citation). The hexameric ATAD2B is a AAA+ ATPase that is expected to have a molecular weight of ∼900 kDa. Therefore, these two large macromolecular complexes did not separate during analytical size-exclusion chromatography, and Gro-EL co-purified with ATAD2B throughout the entire process (Fig. 4[link], left panels). The SDS–PAGE gel and mass-spectrometry data indicate that ATAD2B and GroEL exist in similar quantities in the sample (Fig. 4[link], top left, and supporting information). We had the unfortunate realization that our protein-purification, sample-preparation and data-collection strategies (Fig. 1[link], steps 1–3) were optimized for imaging GroEL particles instead of ATAD2B. However, we did have a minor amount of ATAD2B particles visible in the micrographs, so the next step was to return to data processing (Fig. 1[link], step 4) to determine whether we could salvage our cryo-EM data and identify more ATAD2B particles.

[Figure 4]
Figure 4
Comparison of ATAD2B expression and purification from E. coli and Sf9 cells. Left panels. Top: schematic of E. coli expression and Coomassie-stained 10% SDS–PAGE gel of peak samples from size-exclusion chromatography (SEC). The molecular-weight (MW) ladder is shown in the first lane with the mass of each band labeled. Lane 2 shows protein sample from peak 1 of the SEC chromatogram below eluting at 9 ml that contains ATAD2B along with GroEL contaminant. Lane 3 shows protein sample from peak 2 eluting at ∼15 ml that contains the GST-tag. Middle: SEC profile (Superdex 200 Increase 10/300 GL) showing a peak for ATAD2B and GroEL (9 ml). The additional peak at ∼15 ml corresponds to the cleaved GST-tag (observed as 50 kDa on SEC and 25 kDa on SDS–PAGE). Bottom: cryo-EM micrograph confirming the presence of heptameric GroEL contaminant in the purified sample. Right panels. Top: schematic of Sf9 expression and Coomassie-stained 10% SDS–PAGE gel showing greater than 90% pure ATAD2B collected from peak 1 of the SEC chromatogram below. The peak 2 lane contains the GST-tag. Middle: SEC profile (Superdex 200 Increase 10/300 GL) of the Sf9 purified sample with peaks for ATAD2B (∼9 ml) and the cleaved GST-tag (∼15 ml). Bottom: cryo-EM micrograph confirming hexameric ATAD2B in the purified sample.

Before completely starting over in the data-processing workflow, we reanalyzed the initial extracted stack of particles (Fig. 1[link], step 4). After a few expansions of 2D class averages, additional top and side hexameric 2D class averages of ATAD2B were found (Fig. 3[link]a). In total, this particle stack contained ∼20 000 initial ATAD2B particles and ∼200 000 initial GroEL particles. To find more ATAD2B particles, we went back a step further to explore different particle-picking strategies. Altering the settings for the standard blob-picker and template-picker jobs yielded no improvements in ATAD2B particle number. However, we had the good fortune to attend a seminar on Topaz, which is neural network-based particle-picking software that is `trained' on a small subset of particles known to be your protein of interest to then find these particles in the full data set (Bepler et al., 2019View full citation). We started to use Topaz with our ATAD2B particle stacks, but because of how the training works, it was nearly impossible to get an accurate picking model. To avoid introducing noise or false picks for ATAD2B, the higher quality and easier to find GroEL particles were used in the Topaz training step. We also decided it would be helpful to continue learning cryo-EM data processing with the GroEL particles, so we could apply this workflow later, once we had better quality data on ATAD2B. Incredibly, training Topaz on the GroEL particles not only picked GroEL particles, but also picked more ATAD2B particles than any other strategy tried thus far (Fig. 3[link]b). Topaz found 127 733 initial ATAD2B particles and 294 420 initial GroEL particles, indicating that we had many ATAD2B particles in this sample despite the GroEL contamination. However, after filtering to remove junk particles from the Topaz picks, only <100 000 ATAD2B particles remained to use in reconstructions. Unfortunately, there were not enough ATAD2B particles present to generate a map beyond mid-resolution (∼5 Å) that would be needed to observe the details of nucleotide coordination in the ATAD2B binding pocket (Fig. 3[link]c).

We realized that it was time to return to the biochemistry (Fig. 1[link], step 1) to obtain a more homogenous protein sample for cryo-EM structural investigation, without GroEL. There are a few protocols available to remove the GroEL contamination from a protein sample. One includes the addition of unfolded bacterial lysate supplemented with ATP to try and outcompete GroEL from the protein of interest (Rohman & Harrison-Lavoie, 2000View full citation). We tried this purification method with a few rounds of optimization; however, GroEL still appeared at the same intensity on SDS–PAGE gels after purification. The final test was another cryo-EM screen and small overnight data-set collection, which confirmed that GroEL was present in the same amount prior to the addition of the unfolded bacterial lysate.

We had to decide whether we should switch protein-expression systems or collect enough cryo-EM data with the GroEL contaminant to obtain a higher resolution structure. In the γATP data set, 10 880 micrographs were collected and less than 200 000 initial ATAD2B particles were observed before all of the junk was removed. Taking these numbers into consideration, we decided that it would require too much microscope time to make it worthwhile to continue with this sample. It was time to switch expression systems and return, again, to the biochemistry (Fig. 1[link], step 1). In the literature, cryo-EM structures of the ATAD2 homologs Abo1 and Yta7 were successfully determined from protein samples generated with the Sf9 insect-cell expression system (Cho et al., 2019View full citation, 2023View full citation; Wang et al., 2023View full citation). We formed a new collaboration with the Trybus laboratory at the University of Vermont, who generously taught us how to express the ATAD2B protein using Sf9 insect cells. Within four months we obtained pure ATAD2B protein, free of GroEL, suitable for downstream cryo-EM studies (Fig. 4[link]). Obtaining a high-quality protein sample was essential for determining the structure of ATAD2B at near-atomic resolution, enabling us to characterize the ligand coordination and protein–protein inter­actions that underpin its mechanism of action. This requirement parallels X-ray crystallography, where the quality of the crystal dictates the clarity of the diffraction data. Similarly, in cryo-EM, the purity and stability of the protein govern the resolution and reliability of the final structure, underscoring that sample preparation is the foundation for revealing atomic-level details in both techniques.

4.2. Protein expression with chaperone and other common contaminants

The presence of chaperones and other contaminants are a frequent challenge in cryo-EM sample preparation that complicate the data-collection and analysis pipeline (Table 4[link]; Zielinski et al., 2021View full citation). Molecular chaperones such as the chaperonin GroEL can co-purify with target proteins, as they bind unfolded or partially folded polypeptides, making them unwanted entities during purification (Weissman et al., 1995View full citation; Rutledge et al., 2022View full citation). Expression and purification of the AAA+ ATPase ATAD2B complex in E. coli resulted in GroEL contamination that co-purified with our target protein. Due to the similar sizes of both proteins, we did not realize that the chaperone was present in our data until the 2D classification stage in data processing. Our attempts to separate the GroEL protein from ATAD2B during purification were unsuccessful. Therefore, we reverted to using Spodoptera frugiperda (Sf9) insect-cell lines of instead of bacterial protein-expression systems for protein expression and purification. Moreover, based on our experiences, it can be safely stated that effective cryo-EM studies require careful purification strategies, such as additional chromatography steps and nuclease treatments, as well as vigilant screening of grids to identify and minimize these potential contaminants.

Table 4
Common cryo-EM contaminants

Protein contaminant Notes
ArnA, SlyD and AcrB (Andersen et al., 2013View full citation; Caliseki et al., 2025View full citation) Frequently contaminate samples with His-tagged fusion proteins expressed in E. coli due to their natural histidine-rich sequences
GroEL/GroES (Nain et al., 2022View full citation) Very common chaperonin contaminant from E. coli expression systems
Ferritin (Wenborn et al., 2015View full citation) Typically encountered in samples derived from animal tissues or produced using mammalian expression systems
Ribosomal proteins/fragments (Bolanos-Garcia & Davies, 2006View full citation; Singh et al., 2020View full citation; Amunts et al., 2014View full citation) Can be found in protein samples purified from bacterial or mammalian cells
Heat-shock proteins (Hsp70, Hsp90, sHsps; Carr et al., 2025View full citation; Bolanos-Garcia & Davies, 2006View full citation) These chaperones often bind unfolded proteins and co-purify with them
Affinity-tag fragments [glutathione S-transferase (GST)/maltose-binding protein (MBP)] (Waugh, 2011View full citation) Tag remnants are common contaminants in proteins purified using affinity tags

To help future laboratories convert to this new technique and avoid spending a lot of time troubleshooting the removal of a contaminating protein, we would like to pass on what was helpful to us during this process. In Table 5[link], we have included a list of specialized data-processing programs. The most common cryo-EM data-processing software or platforms are found in Table 3[link], while a collection of helpful resources for learning various aspects of cryo-EM methods are mentioned in Table 1[link]. We have included a list of common contaminants in Table 4[link] to help assess micrographs for their presence. We also encourage readers to investigate specific contaminants and why they may occur. Knowing when to switch expression systems will vary between laboratories. We chose to switch due to the amount of microscope time that would be required to collect a high-quality data set. We needed to apply for time at the National CryoEM Center, which is a limited resource, and we felt that it would be better to create a better biochemical sample to collect a high-quality data set. Despite a higher upfront investment in time to switch expression systems, we decided moving to Sf9 cells would set the laboratory up for future studies on large complexes that interact with chromatin. For more information on undesirable contamination, and other issues that may impact image quality, such as ice thickness, contamination, preferred orientation etc., we refer the reader to the reviews by Tan et al. (2017View full citation), Cianfrocco & Kellogg (2020View full citation), Neselu et al. (2023View full citation) and Wang & Zimanyi (2024View full citation) and to chapter 3 of CryoEM 101 (https://cryoem101.org/chapter-3/) for guidance.

Table 5
Specialized modular programs for data processing

Name Brief description Quick link
AlphaFold (Jumper et al., 2021View full citation) AI system for predicting protein 3D structures from amino-acid sequences https://alphafold.ebi.ac.uk/
ChimeraX (Pettersen et al., 2021View full citation) Molecular-visualization program for displaying and analyzing 3D biological structures, including atomic models and cryo-EM maps https://www.cgl.ucsf.edu/chimerax/
CryoDRGN (Zhong et al., 2021View full citation) Deep learning for heterogeneous cryo-EM data https://cryodrgn.csail.mit.edu/
crYOLO (Wagner et al., 2019View full citation) Neural network-based particle picker for cryo-EM micrographs https://cryolo.readthedocs.io/en/stable/
DynaMight (Schwab et al., 2024View full citation) A tool to model continuous conformational changes in cryo-EM data sets https://github.com/3dem/DynaMight/
EMAN2 (Tang et al., 2007View full citation) Suite for single-particle reconstruction and electron microscopy image processing https://blake.bcm.edu/emanwiki/EMAN2
Phenix (Adams et al., 2010View full citation; Afonine et al., 2018View full citation; Liebschner et al., 2019View full citation) Software suite for macromolecular structure determination by X-ray crystallography and cryo-EM https://phenix-online.org/
ModelAngelo (Jamali et al., 2024View full citation) Automated atomic model building for cryo-EM maps https://github.com/3dem/model-angelo/
Topaz (Bepler et al., 2019View full citation) Machine-learning particle picking for cryo-EM https://cb.csail.mit.edu/topaz/

5. Data-processing software for single-particle cryo-EM workflows

Data processing for single-particle cryo-EM is a multi-step, computationally intensive workflow that requires specialized software and robust infrastructure. Each stage, ranging from motion correction and CTF estimation to particle classification and high-resolution refinement, demands significant computing power and storage. Commonly used software suites such as CryoSPARC (Punjani et al., 2017View full citation), cisTEM (Grant et al., 2018View full citation; Lucas et al., 2021View full citation) and RELION (Scheres, 2012View full citation; Kimanius et al., 2016View full citation; Zivanov et al., 2022View full citation) (summarized in Table 3[link]) often operate in tandem, generating terabytes of data from raw movies to final reconstructions. These requirements necessitate high-performance computing (HPC) clusters equipped with powerful GPUs, multi-terabyte storage and high-speed networking. At the University of Vermont, collaboration with VACC staff was critical for building and maintaining this infrastructure, enabling parallel processing and efficient large-scale data transfers. Given the size of cryo-EM data sets, effective data-management strategies are critical, and should include long-term storage plans for older data sets because downstream analyses may require access to raw data. Emerging trends such as cloud-based solutions and AI-driven tools further highlight the evolving nature of cryo-EM data processing.

5.1. Tutorial on data processing using the E. coli GroEL data sets to generate map reconstructions

The GroEL-containing samples, expressed and purified from E. coli, were processed for cryo-EM analysis. The initial GroEL–γATP data set was analyzed using CryoSPARC v.4.5.31 (Punjani et al., 2017View full citation). All 15 169 movies underwent pre-processing, including patch-based motion correction. Patch contrast transfer function (CTF) estimation was performed on the motion-corrected micrographs. This is essential for high-resolution reconstructions, as it enables more precise correction of image distortions. The two parameters of ice thickness and CTF fit were checked for the selection of 14 568 micrographs for further processing. The next step included picking particles from the micrographs. In CryoSPARC there are different methods to do this such as manual picking, blob picking, template and deep-learning methods such as Topaz (Bepler et al., 2019View full citation). The circular blob picker in CryoSPARC identifies particles in micrographs by detecting circular features of a specified size. It provides a fast, template-free method for initial particle selection. We used the blob picker to pick 10 124 313 particles. The initially picked particles were filtered using defocus-adjusted power and pick scores, which help assess the quality and reliability of each pick. These metrics allow the exclusion of low-quality or spurious particles before extraction. The selected particles were then extracted from the micrographs using a box size of 300 pixels, ensuring that each particle is fully captured for downstream processing. A stack of 3 056 751 particles was subjected to reference-free 2D classification. This type of classification groups particles into a set number of classes by aligning and averaging them in-plane, improving the signal-to-noise ratio of each class average. This makes it easier to identify and remove poor-quality or contaminant classes. A total of 317 080 particles were used to generate six ab initio classes. Three of the six ab initio classes, comprising at total of 262 461 particles, resembled GroEL and were selected to generate a homogeneous reconstruction using D7 symmetry. A homogeneous map with D7 symmetry is a three-dimensional reconstruction in which all selected particles are refined together under the constraint of sevenfold dihedral symmetry, which can enhance resolution by leveraging the inherent symmetry of the complex. GroEL is known to exhibit D7 symmetry, as confirmed by previously published studies (Torino et al., 2023View full citation). To further analyze structural heterogeneity, the selected particles were divided into four distinct 3D classes without applying a focused mask, allowing unbiased classification across the entire volume. This approach facilitates the separation of different conformational states and the removal of heterogeneous or junk particles. One of the resulting classes was identified as junk and excluded from further processing. The other three classes comprising 261 106 particles were combined to generate a final non-uniform map at 3.3 Å resolution, according to the gold-standard 0.143 FSC cutoff criterion. A scheme for the data processing is shown in Fig. 5[link].

[Figure 5]
Figure 5
Single-particle cryo-EM data-processing scheme for the GroEL–γATP complex. A total of 317 080 GroEL particles were selected following 2D classification and subjected to ab initio reconstruction into six classes. Three of these ab initio classes were retained for further processing and used to generate a homogeneous 3D map. Subsequent 3D classification yielded four classes, with one class identified as junk and excluded from further analysis. The remaining three classes were combined, resulting in a final reconstruction comprising 261 106 particles with a CryoSPARC gold-standard FSC resolution of 2.95 Å; however, we report the no-mask resolution of 3.3 Å for the final map. Local resolution estimation was performed in CryoSPARC and the resulting map was visualized using UCSF ChimeraX.

The apo GroEL data set comprising 9541 micrographs was also processed using CryoSPARC v.4.5.31 (Punjani et al., 2017View full citation). Patch contrast transfer function (CTF) estimation was carried out on the motion-corrected micrographs. The processing scheme was similar to the GroEL–γATP data set. However, in this data set we first performed manual picking from selected micrographs to create a high-quality training set. This manually curated set of particles was then used to train a Topaz model (Bepler et al., 2019View full citation), enabling more accurate automated particle picking for the entire data set. A total of 513 428 particles were extracted from the micrographs with a box size of 256 pixels. From the 2D classification, a total of 148 146 particles were used to generate one ab nitio class. This class was used to generate a homogeneous map with D7 symmetry and was further classified into five classes. The final map generated from non-uniform refinement had a resolution of 3.7 Å according to the 0.143 FSC cutoff criterion. The processing scheme is shown in Fig. 6[link].

[Figure 6]
Figure 6
Single-particle cryo-EM data-processing scheme for the apo GroEL complex. A total of 148 187 GroEL particles were selected after 2D classification and subjected to ab initio reconstruction. Homogeneous refinement was performed on the resulting single class. Subsequent 3D classification yielded five classes; adopting a conservative approach, only the highest quality class was selected for final map generation. The final reconstruction comprised 42 776 particles and had a final resolution of 3.32 Å according to the CryoSPARC gold-standard FSC. However, we report our final map as 3.7 Å resolution with no mask applied. Local resolution estimation was performed in CryoSPARC and the resulting map was visualized using UCSF ChimeraX.

For the GroEL–ADP data set we chose to use a different software called cisTEM (Grant et al., 2018View full citation) for data processing. 11 46 722 particles were picked using the ab initio template picker and were subjected to reference-free 2D classification. Particles in the 2D classes resembling GroEL were selected, and a stack of 158 542 particles was exported into FREALIGN (Grigorieff, 2016View full citation). The particle stack was binned 8×, 4× and 2× using the command resample.exe. This binning of the particle stacks is performed to reduce the computational processing time. To prevent the introduction of high-resolution bias a low-pass filtered (20 Å) map of GroEL was generated from PDB entry 9c0c and was used for initial particle alignment. The 2× binned stack was then 3D-classified into eight classes. The classes with the largest number of particles were combined to generate a final 1× map at a resolution of 4.2 Å. The processing scheme is shown in Fig. 7[link].

[Figure 7]
Figure 7
Single-particle cryo-EM data-processing scheme for the GroEL–ADP complex. The particle stack, comprising 125 716 GroEL particles selected from a 2D classification in cisTEM, was exported for further processing in FREALIGN. An initial alignment was performed using a low-pass filtered GroEL template. Subsequently, the aligned particles underwent 3D classification over 50 rounds to sort them into distinct conformational states. However, no major confirmational changes were observed in the three best classes. These classes were combined to generate a final map at 4.2 Å resolution. The two half-maps generated from the final 3D reconstruction were used to compute a Fourier shell correlation (FSC) curve by using the EMDB validation tool (https://www.ebi.ac.uk/emdb/validation/fsc/).

6. Nuances of model building, refinement, cryo-EM structure validation and deposition

Once the highest quality reconstruction has been achieved, the next step is to interpret the cryo-EM density map by building an atomic model. This model may be derived from previously solved structures obtained through techniques such as X-ray crystallography or cryo-EM, or predicted using computational tools such as AlphaFold. Regardless of the source, model building should be guided by prior biological knowledge and experimental context to ensure accurate interpretation. An integrated approach is essential, often combining structural, biochemical and biophysical data to inform model construction and validation. Techniques such as mass spectrometry, cross-linking mass spectrometry (XL-MS), small-angle X-ray scattering (SAXS), nuclear magnetic resonance (NMR), size-exclusion chromatography (SEC) and SEC coupled with multi-angle light scattering (SEC-MALS) can provide valuable insights into the composition, organization and conformational states of the molecular complex.

Macromolecular crystallography and cryo-EM differ fundamentally in how structural information is obtained. In X-ray crystallography, diffraction patterns obtained from highly ordered crystals are used to compute electron-density maps, which provide atomic-level detail when crystals diffract well. A major challenge is the `phase problem'. While diffraction experiments measure the intensities of scattered X-rays, they do not capture the corresponding phase information, which is essential for calculating an accurate electron-density map. Because phases cannot be obtained directly from the experiment, specialized experimental strategies or computational methods must be used to estimate or recover them in order to solve crystal structures (Adams et al., 2009View full citation).

In contrast, cryo-EM reconstructs three-dimensional Coulomb potential maps from many thousands of particle images of flash-cooled molecules captured in random orientations. Unlike X-ray crystallography, cryo-EM micrographs retain the phase information, avoiding the crystallographic phase problem. While cryo-EM maps often exhibit lower signal-to-noise ratios and variable resolution across flexible regions (Punjani & Fleet, 2023View full citation), high-resolution areas can be interpreted with reduced model bias. Model building in cryo-EM typically involves docking known protein fragments or predicted structures into the density, followed by iterative refinement to improve the fit and identify secondary-structure elements (Vénien-Bryan et al., 2017View full citation).

6.1. Model building of GroEL in different nucleotide-bound states

The initial steps of model building for X-ray crystallography and cryo-EM differ significantly due to the nature of the data (Terwilliger et al., 2020View full citation). However, after the initial fitting of the starting model in the final map (Fig. 1[link], step 5), these two techniques become similar. For our three GroEL structures, we used PDB entries 8bl7 and 9c0c as starting models. Here, we will focus on the model building of GroEL–γATP (scheme shown in Fig. 5[link]). The initial model was rigid-body fitted into the final map using UCSF ChimeraX (Pettersen et al., 2004View full citation). As the name indicates, it is just placing the initial rigid model in the map based on optimal position and coordinates without changes to the shape of the model. The coordinates for the γATP ligand (AGS) were taken from PDB entry 5dac and it was rigid-body fitted into the map. Further manual adjustments were performed using Coot (Crystallographic Object-Oriented Toolkit; Emsley & Cowtan, 2004View full citation), a molecular-graphics program for building and validating macromolecular models. Our GroEL model was then structurally refined in the cryo-EM map using phenix.real_space_refine in Phenix (Liebschner et al., 2019View full citation). This program refines the atomic coordinates while simultaneously enforcing good stereochemistry, such as ideal bond lengths, angles and proper protein backbone and side-chain conformations.

During the refinement steps, secondary restraints files were generated for the model and were used in subsequent refinement cycles. The restraints prevent the model from deviating from physically and chemically plausible conformations. Without these restraints, the refinement software would be at risk of overfitting the model to noise or artifacts in the cryo-EM map, leading to a geometrically unreliable structure. During refinement in Phenix, the resolution-dependent model-to-map correlation was evaluated using the correlation-coefficient plots generated at the end of the process. These plots confirmed that all three final atomic models exhibited high correlation with their respective experimental density maps, indicating an excellent fit without evidence of overfitting. Separately, validation of the resulting models indicated favorable stereochemical parameters, including minimal deviation from ideal bond lengths and angles, suggesting high quality throughout the model geometry. Structural details of GroEL–γATP are also shown in Fig. 8[link], with closer views of the nucleotide-binding site of chain A. Figures were generated using UCSF ChimeraX (Pettersen et al., 2004View full citation) and PyMOL (version 2.3.1; Schrödinger; DeLano, 2002View full citation).

[Figure 8]
Figure 8
Model-building and structural details of the GroEL–γATP complex. (a) Schematic workflow of the model-building and refinement process. The starting model for structural refinement was PDB entry 9c0c (apo GroEL). Initially, the model was fitted into the final map using UCSF ChimeraX, followed by manual adjustments in Coot. Iterative refinement cycles were performed in Phenix (phenix.real_space_refine) and structure validation was carried out using MolProbity. The resulting models have favorable stereochemical parameters, including minimal deviation from ideal bond lengths and angles, and a low number of macromolecular backbone outliers. (b) A side view of the multimeric GroEL complex is shown on the left, with each of the 14 subunits colored differently. The middle and right panels show top and bottom views of the complex, respectively, to highlight the overall architecture and subunit organization. (c) The left panel displays subunit A of GroEL bound to γATP (purple), with a boxed region indicating the ATPase active site. The right panel highlights this region in detail, showing the molecular surface enveloping the bound γATP. Key residues involved in ATP hydrolysis are labeled, including Lys51 (involved in ATP phosphate binding), Asp87 and Asp97 (which coordinate the essential Mg2+ ion and help position water for nucleophilic attack), Asp398 (which contributes to allosteric signaling and conformational changes) and Gly414 (which provides structural flexibility for proper positioning of catalytic residues).

6.2. Structure validation

During model validation Phenix gives a wide range of statistics for validation, as MolProbity is incorporated into the software (Davis et al., 2007View full citation). However, the model can also be validated using the MolProbity web server at http://molprobity.biochem.duke.edu/. An overview of data-collection and validation statistics for all three structures is shown in Table 6[link]. Here, we will go through the validation procedure used for the GroEL–γATP complex. During model refinement in Phenix (Adams et al., 2010View full citation), global refinement statistics were monitored after each cycle to assess model quality and convergence. Upon completion of the refinement process, the final model-to-map cross-correlation coefficient (CCC) was 0.83, indicating strong agreement between the atomic model and the experimental cryo-EM map. For geometry restraints, the root-mean-square deviation (r.m.s.d.) for bonds and angles quantifies how much the geometry of the model deviates from established ideal values. The low r.m.s.d. values for bonds (0.003 Å) and angles (0.599°) demonstrated that the geometry of the model was close to the expected ideal values.

Table 6
Details of structural refinement of all three GroEL structures/models

Structure GroEL–γATP GroEL–ADP Apo GroEL
PDB code 9ykc 9ynj 9yke
EMDB code EMD-73044 EMD-73200 EMD-73045
Data collection and processing
 Magnification 81000 81000 81000
 Voltage (kV) 300 300 300
 Electron exposure (e Å−2) 64 64 52.5
 Defocus range (µm) −0.8 to −2.5 −0.8 to −2.5 −0.8 to −2.5
 Pixel size 1.058 1.069 1.069
 Symmetry imposed D7 D7 D7
 Final map particles 261106 78179 42776
 Final map resolution (Å) 3.3 4.2 3.7
 FSC threshold 0.143 0.143 0.143
Refinement details
 Model used (PDB code) 8bl7 9c0c 9c0c
 Resolution of starting model (Å) 4.40 3.41 3.41
 Correlation coefficient (cc_mask) 0.83 0.87 0.87
 Map sharpening B factor (Å2) −60 −100 −40
MolProbity score 1.30 1.38 1.29
 Clashscore 3.1 4.0 >3.0
 Poor rotamers 0.1 0 0
 Ramachandran plot
  Favored 97.4 98.0 97.2
  Allowed 2.6 2.0 2.8
  Disallowed 0 0 0
CaBLAM outliers 2.3 1.4 1.6
Q-score 0.443 0.305 0.396

In addition, Z-scores (0.223 and 0.390, respectively) further confirm this, as they are very low and well within an acceptable range.

The all-atom clashscore for GroEL–γATP was 3.1, which indicated very few steric clashes between atoms. The Ramachandran plot is a graphical way to visualize the energetically favored and allowed backbone conformations of amino-acid residues in a protein. It plots the two main dihedral angles of the protein backbone, phi (φ) and psi (ψ). The Ramachandran plot had 0.00% outliers and a very high percentage of residues in the `favored' region (97.4% in this case). The rotamer outliers are amino-acid side-chain conformations that are found in energetically unfavorable positions. Their presence could indicate potential errors in the model, as side chains typically adopt a limited set of stable conformations known as rotamers to minimize steric clashes. There were 0.1% rotamer outliers, which indicated favorable side-chain conformations.

After reviewing the validation statistics, we checked each residue of our model to ensure its quality and adherence to stereochemical principles. This process helps to identify potential errors or issues that could lead to future validation problems. Afterwards, we started the PDB validation; for this purpose the refined model (GroEL–γATP) and final map were uploaded to https://validate-rcsb-2.wwpdb.org/. The PDB validation report provides an overall summary, using color-coded flags to highlight potential issues. A green flag indicates a metric is within expected ranges, yellow suggests a potential issue and red flags a significant problem. Another metric in the validation report was the Q-score (Pintilie et al., 2020View full citation), which quantifies the resolvability of individual atoms in a 3D density map. It measures how well the map values around the position of a specific atom correlate with an ideal, well resolved Gaussian-like function, with a score of 1 indicating a perfect fit and a value closer to 0 or negative indicating poor resolvability or a poor fit; for all 14 chains in the our GroEL–γATP model, the Q-score was 0.443.

6.3. Structure deposition and repositories for cryo-EM data

Best practices for cryo-EM data deposition emphasize transparency, reproducibility and community access. Final atomic models should be deposited in the Protein Data Bank (PDB; https://www.wwpdb.org/), while the corresponding 3D density maps must be submitted to the Electron Microscopy Data Bank (EMDB; https://www.ebi.ac.uk/emdb/). For raw image data, deposition in the Electron Microscopy Public Image Archive (EMPIAR; https://www.ebi.ac.uk/empiar/) is strongly encouraged, as it enables method development, validation and training of new algorithms. These repositories provide persistent identifiers and standardized validation reports, ensuring compliance with funding, journal and community guidelines. Using the OneDep system, models and maps can be deposited to the PDB and EMDB simultaneously through a single deposition workflow, streamlining the process and ensuring consistency across related data sets. Detailed instructions for deposition, including file formats and validation requirements, are available on the respective websites.

Following these practices aligns with the NIH 2023 Data Management and Sharing Policy and the NSF Public Access and Data Management and Sharing Plan requirements, promoting timely public access to publications and supporting data, and strengthening the transparency and reproducibility of federally funded research. It also supports the broader structural biology community in benchmarking and advancing the field.

7. Conclusions and future directions

Our transition from X-ray crystallography to single-particle cryo-EM provided critical insights into the structural characterization of ATAD2B, a large AAA+ ATPase- and bromodomain-containing protein involved in chromatin regulation. This journey highlighted both the transformative potential of cryo-EM and the practical challenges associated with adopting this technique, including sample heterogeneity, co-purification of chaperones such as GroEL and the steep learning curve for data processing and computational infrastructure. Despite these obstacles, we successfully established a workflow that integrates biochemical optimization, advanced vitrification strategies and state-of-the-art image-processing tools.

Our experience with ATAD2B highlights a critical question for cryo-EM practitioners: how much contamination can be tolerated before a targeted reconstruction becomes impractical? The contamination threshold is variable and can sometimes be overcome by collecting enough data, even if your particles are scarce (Ho et al., 2018View full citation). However, our experience suggests that when contaminant particles outnumber the target by an order of magnitude or more, as in our initial data set where GroEL particles exceeded ATAD2B particles by nearly tenfold, the likelihood of achieving a high-resolution reconstruction of the target diminishes substantially. In such cases, even aggressive classification and filtering strategies may fail to recover sufficient particles for near-atomic resolution. Ultimately, when contaminant particles comprise more than ∼70–80% of the data set, the time and computational cost required to isolate enough target particles becomes prohibitive, and a return to sample optimization is warranted. We chose to go back and improve the quality of the ATAD2B protein sample because 300 kV Krios data-collection time is precious and our access was limiting. We also planned to continue structural studies on the ATAD2B AAA+ ATPase complex in the long term, so it made sense to move to a different expression system to eliminate the contaminant. In other scenarios, a brute-force data-collection strategy may make sense, particularly if microscope time and storage resources are abundant. This decision depends on several factors, including how much microscope time you have, what resolution you need in specific regions of the protein and the intrinsic properties of your macromolecular complex. Highly symmetric contaminants, such as GroEL with D7 symmetry, refine rapidly and dominate classification, making it difficult to isolate minority species (Scheres, 2016View full citation). Highly dynamic complexes, flexible proteins and heterogeneous samples will require more particles to reach high resolution, further increasing the impact of contamination. While data-set size and computational resources can partially offset these challenges, their success depends on practical constraints. Importantly, these factors can be manipulated, and adjusting the biochemical strategies remains the most effective approach. In our case switching expression systems from E. coli to Sf9 insect cells eliminated GroEL contamination and enabled the collection of a homogeneous data set suitable for high-resolution reconstructions. Additional troubleshooting approaches include editing the expression construct, altering the purification strategy, varying the buffer components and optimizing grid-preparation parameters by testing alternative grid types, or employing additives such as detergents or graphene oxide supports to improve particle behavior (An et al., 2025View full citation). If cryo-EM grids exhibit poor particle distribution or preferred orientation, adjusting the blotting conditions and plunge-freezing strategy may be useful (for example, Vitrobot versus Chameleon; Levitz et al., 2022View full citation). An advantage in cryo-EM is that contaminants and heterogeneity can also be reduced computationally by employing selective particle-picking workflows (including neural network-based tools such as Topaz; Bepler et al., 2019View full citation). Topaz uses machine learning to selectively pick particles that match the target protein, reducing sample heterogeneity by excluding contaminants and junk particles. CryoDRGN and RECOVAR are additional deep-learning approaches that can addresses conformational heterogeneity by modeling continuous structural variability and conformational landscapes (Zhong et al., 2021View full citation). Despite these computational breakthroughs, structure determination by cryo-EM is still an iterative process and typically requires the collection of multiple data sets (often three to four per structure) to achieve the desired resolution for a new structural target. In our case, improving the ATAD2B sample was the most efficient path forward, given our long-term goals and limited access to high-end instrumentation.

Looking forward, cryo-EM is poised to redefine structural biology by enabling the visualization of macromolecular assemblies in unprecedented detail and in increasingly native contexts. Advances in detector technology, sample preparation and computational methods, including machine learning-driven particle picking, heterogeneity analysis and integrative modeling, will accelerate data processing and interpretation. Furthermore, the expansion of cryo-electron tomography and correlative workflows will bridge the gap between isolated complexes and cellular environments, offering a holistic view of molecular machines in action.

Together, these developments position cryo-EM to deliver not only higher resolution but higher information content: dynamic ensembles, contextualized structures and integrative models aligned with cellular function. For the crystallographic community, the opportunity is clear: leveraging cryo-EM alongside crystallography and complementary biophysical approaches will deepen mechanistic understanding across scales, inform structure-guided discovery and establish robust, community-vetted standards for the next decade of structural biology.

Supporting information


Acknowledgements

We gratefully acknowledge the staff at the National Center for Cryo-EM Access and Training (NCCAT) at the New York Structural Biology Center for their exceptional support and guidance throughout our training in single-particle cryo-electron microscopy. Their expertise, patience and dedication were instrumental in helping us develop the skills necessary for high-quality sample preparation, data collection and analysis. We especially thank them for their hands-on instruction and thoughtful mentorship during our visits, which greatly enriched our understanding and capabilities in cryo-EM. Special thanks to Christina Zimanyi and Eugene Chua for their critical feedback and thoughtful editing of this manuscript. Author contributions were as follows. HZ, KLM, AKS, MAC and KCG conceptualized the work. All authors contributed to the literature review and analysis of current research, and all authors drafted, revised and approved the manuscript. All authors approved the submission and agreed to be responsible for their contributions.

Conflict of interest

The authors declare no competing interests.

Funding information

This work was supported by a mid-career advancement award from the Molecular and Cellular Biosciences Division of the National Science Foundation to KCG and MAC under award No. 2321501. This work was also supported by the National Cancer Institute and the National Institute of General Medical Sciences of the National Institutes of Health under award Nos. P01CA240685, P20GM113131-07S1 and R01GM129338 to KCG. AKS was supported by an American Heart Association Postdoctoral Fellowship 24POST1194310 and an Early Career Research Award from the Cardiovascular Research Institute. The authors acknowledge the Vermont Advanced Computing Center (VACC) at the University of Vermont for providing computational resources that have contributed to the research results reported within this paper. Specifically, computations were supported by the National Science Foundation under Award Nos. 1827314 and 2510406. Creation of the UVM Center for Biomedical Shared Resources, which houses the Microscopy Imaging Center, was supported by NIH award 1C06OD030087-01. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Institutes of Health, the National Science Foundation or the American Heart Association. This research was also supported by the University of Vermont Cancer Center and the University of Vermont Larner College of Medicine. Some of this work was performed at the National Center for CryoEM Access and Training (NCCAT) and the Simons Electron Microscopy Center located at the New York Structural Biology Center, supported by NIH (Common Fund U24GM129539, NIGMS R24GM154192), the Simons Foundation (SF349247) and NY State Assembly.

References

Return to citationAdams, P. D., Afonine, P. V., Bunkóczi, G., Chen, V. B., Davis, I. W., Echols, N., Headd, J. J., Hung, L.-W., Kapral, G. J., Grosse-Kunstleve, R. W., McCoy, A. J., Moriarty, N. W., Oeffner, R., Read, R. J., Richardson, D. C., Richardson, J. S., Terwilliger, T. C. & Zwart, P. H. (2010). Acta Cryst. D66, 213–221.  Web of Science CrossRef CAS IUCr Journals Google Scholar
Return to citationAdams, P. D., Afonine, P. V., Grosse-Kunstleve, R. W., Read, R. J., Richardson, J. S., Richardson, D. C. & Terwilliger, T. C. (2009). Curr. Opin. Struct. Biol. 19, 566–572.  Web of Science CrossRef PubMed CAS Google Scholar
Return to citationAfonine, P. V., Klaholz, B. P., Moriarty, N. W., Poon, B. K., Sobolev, O. V., Terwilliger, T. C., Adams, P. D. & Urzhumtsev, A. (2018). Acta Cryst. D74, 814–840.  Web of Science CrossRef IUCr Journals Google Scholar
Return to citationAmunts, A., Brown, A., Bai, X. C., Llácer, J. L., Hussain, T., Emsley, P., Long, F., Murshudov, G., Scheres, S. H. W. & Ramakrishnan, V. (2014). Science, 343, 1485–1489.  CrossRef PubMed Google Scholar
Return to citationAn, S., Ahn, E., Koo, T., Park, S., Suh, B., Rengasamy, K. P., Lyu, G., Kim, C., Kim, B., Kim, H., Park, S., Tan, D. & Cho, U. S. (2025). bioRxiv, 2025.02.22.63868.  Google Scholar
Return to citationAndersen, K. R., Leksa, N. C. & Schwartz, T. U. (2013). Proteins, 81, 1857–1861.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationBai, X. C., Yan, C., Yang, G., Lu, P., Ma, D., Sun, L., Zhou, R., Scheres, S. H. W. & Shi, Y. (2015). Nature, 525, 212–217.  CrossRef PubMed Google Scholar
Return to citationBaldwin, P. R., Tan, Y. Z., Eng, E. T., Rice, W. J., Noble, A. J., Negro, C. J., Cianfrocco, M. A., Potter, C. S. & Carragher, B. (2018). Curr. Opin. Microbiol. 43, 1–8.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationBepler, T., Morin, A., Rapp, M., Brasch, J., Shapiro, L., Noble, A. J. & Berger, B. (2019). Nat. Methods, 16, 1153–1160.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationBijak, V., Szczygiel, M., Lenkiewicz, J., Gucwa, M., Cooper, D. R., Murzyn, K. & Minor, W. (2023). Exp. Opin. Drug. Discov. 18, 1221–1230.  Web of Science CrossRef Google Scholar
Return to citationBirch, J., Axford, D., Foadi, J., Meyer, A., Eckhardt, A., Thielmann, Y. & Moraes, I. (2018). Methods, 147, 150–162.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationBolanos-Garcia, V. M. & Davies, O. R. (2006). Biochim. Biophys. Acta, 1760, 1304–1313.  Web of Science PubMed CAS Google Scholar
Return to citationBraig, K., Otwinowski, Z., Hegde, R., Boisvert, D. C., Joachimiak, A., Horwich, A. L. & Sigler, P. B. (1994). Nature, 371, 578–586.  CrossRef CAS PubMed Web of Science Google Scholar
Return to citationCaliseki, M., Borucu, U., Yadav, S. K. N., Schaffitzel, C. & Kabasakal, B. V. (2025). Acta Cryst. D81, 545–557.  CrossRef IUCr Journals Google Scholar
Return to citationCarr, K. D., Zambrano, D. E. D., Weidle, C., Goodson, A., Eisenach, H. E., Pyles, H., Courbet, A., King, N. P. & Borst, A. J. (2025). J. Struct. Biol. X, 11, 100120.  PubMed Google Scholar
Return to citationChae, Y. K., Kim, S. H. & Markley, J. L. (2017). PLoS One, 12, e0177233.  CrossRef PubMed Google Scholar
Return to citationCheng, Y. (2015). Cell, 161, 450–457.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationCho, C., Ganser, C., Uchihashi, T., Kato, K. & Song, J. J. (2023). Commun. Biol. 6, 993.  CrossRef PubMed Google Scholar
Return to citationCho, C., Jang, J., Kang, Y., Watanabe, H., Uchihashi, T., Kim, S. J., Kato, K., Lee, J. Y. & Song, J. J. (2019). Nat. Commun. 10, 5764.  CrossRef PubMed Google Scholar
Return to citationChua, E. Y. D., Mendez, J. H., Rapp, M., Ilca, S. L., Tan, Y. Z., Maruthi, K., Kuang, H., Zimanyi, C. M., Cheng, A., Eng, E. T., Noble, A. J., Potter, C. S. & Carragher, B. (2022). Annu. Rev. Biochem. 91, 1–32.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationCianfrocco, M. A. & Kellogg, E. H. (2020). J. Chem. Inf. Model. 60, 2458–2469.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationCuenca-Alba, J., del Cano, L., Gómez Blanco, J., de la Rosa Trevín, J. M., Conesa Mingo, P., Marabini, R., Sorzano, C. O. & Carazo, J. M. (2017). J. Struct. Biol. 200, 20–27.  PubMed Google Scholar
Return to citationDavis, I. W., Leaver-Fay, A., Chen, V. B., Block, J. N., Kapral, G. J., Wang, X., Murray, L. W., Arendall, W. B., Snoeyink, J., Richardson, J. S. & Richardson, J. S. (2007). Nucleic Acids Res. 35, W375–W383.  Web of Science CrossRef PubMed Google Scholar
Return to citationDeLano, W. L. (2002). The PyMOL Molecular Graphics System. DeLano Scientific, Palo Alto, California, USA.  Google Scholar
Return to citationde Martin Garrido, N., Ramlaul, K. & Aylett, C. H. S. (2021). J. Vis. Exp., e62321.  Google Scholar
Return to citationEmsley, P. & Cowtan, K. (2004). Acta Cryst. D60, 2126–2132.  Web of Science CrossRef CAS IUCr Journals Google Scholar
Return to citationEvans, C. M., Phillips, M., Malone, K. L., Tonelli, M., Cornilescu, G., Cornilescu, C., Holton, S. J., Gorjánácz, M., Wang, L., Carlson, S., Gay, J. C., Nix, J. C., Demeler, B., Markley, J. L. & Glass, K. C. (2021). Int. J. Mol. Sci. 22, 9128.  CrossRef PubMed Google Scholar
Return to citationFenton, W. A., Kashi, Y., Furtak, K. & Norwich, A. L. (1994). Nature, 371, 614–619.  CrossRef PubMed Google Scholar
Return to citationGay, J. C., Eckenroth, B. E., Evans, C. M., Langini, C., Carlson, S., Lloyd, J. T., Caflisch, A. & Glass, K. C. (2019). Proteins, 87, 157–167.  CrossRef PubMed Google Scholar
Return to citationGrant, T., Rohou, A. & Grigorieff, N. (2018). eLife, 7, e35383.  Web of Science CrossRef PubMed Google Scholar
Return to citationGrigorieff, N. (2016). Methods Enzymol. 579, 191–226.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationGrimm, R., Donaldson, G. K., van der Vies, S. M., Schäfer, E. & Gatenby, A. A. (1993). J. Biol. Chem. 268, 5220–5226.  CrossRef PubMed Google Scholar
Return to citationHan, B. G., Avila-Sakar, A., Remis, J. & Glaeser, R. M. (2023). Curr. Opin. Struct. Biol. 81, 102646.  CrossRef PubMed Google Scholar
Return to citationHaynes, R. M., Myers, J., López, C. S., Evans, J., Davulcu, O. & Yoshioka, C. (2025). J. Struct. Biol. 217, 108068.  CrossRef PubMed Google Scholar
Return to citationHo, C.-M., Beck, J. R., Lai, M., Cui, Y., Goldberg, D. E., Egea, P. F. & Zhou, Z. H. (2018). Nature, 561, 70–75.  CrossRef PubMed Google Scholar
Return to citationJamali, K., Käll, L., Zhang, R., Brown, A., Kimanius, D. & Scheres, S. H. W. (2024). Nature, 628, 450–457.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationJumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., Tunyasuvunakool, K., Bates, R., Žídek, A., Potapenko, A., Bridgland, A., Meyer, C., Kohl, S. A. A., Ballard, A. J., Cowie, A., Romera-Paredes, B., Nikolov, S., Jain, R., Adler, J., Back, T., Petersen, S., Reiman, D., Clancy, E., Zielinski, M., Steinegger, M., Pacholska, M., Berghammer, T., Bodenstein, S., Silver, D., Vinyals, O., Senior, A. W., Kavukcuoglu, K., Kohli, P. & Hassabis, D. (2021). Nature, 596, 583–589.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationKhan, Y. A., White, K. I. & Brunger, A. T. (2022). Crit. Rev. Biochem. Mol. Biol. 57, 156–187.  CrossRef PubMed Google Scholar
Return to citationKim, S., Natesan, S., Cornilescu, G., Carlson, S., Tonelli, M., McClurg, U. L., Binda, O., Robson, C. N., Markley, J. L., Balaz, S. & Glass, K. C. (2016). J. Biol. Chem. 291, 18326–18341.  CrossRef PubMed Google Scholar
Return to citationKimanius, D., Forsberg, B. O., Scheres, S. H. W. & Lindahl, E. (2016). eLife, 5, e18722.  Web of Science CrossRef PubMed Google Scholar
Return to citationKoo, S. J., Fernández-Montalván, A. E., Badock, V., Ott, C. J., Holton, S. J., von Ahsen, O., Toedling, J., Vittori, S., Bradner, J. E. & Gorjánácz, M. (2016). Oncotarget, 7, 70323–70335.  CrossRef PubMed Google Scholar
Return to citationLazarchuk, P., Hernandez-Villanueva, J., Pavlova, M. N., Federation, A., MacCoss, M. & Sidorova, J. M. (2020). Mol. Cell. Biol. 40, e00421-19.  CrossRef PubMed Google Scholar
Return to citationLevitz, T. S., Brignole, E. J., Fong, I., Darrow, M. C. & Drennan, C. L. (2022). J. Struct. Biol. 214, 107825.  Web of Science CrossRef PubMed Google Scholar
Return to citationLiebschner, D., Afonine, P. V., Baker, M. L., Bunkóczi, G., Chen, V. B., Croll, T. I., Hintze, B., Hung, L.-W., Jain, S., McCoy, A. J., Moriarty, N. W., Oeffner, R. D., Poon, B. K., Prisant, M. G., Read, R. J., Richardson, J. S., Richardson, D. C., Sammito, M. D., Sobolev, O. V., Stockwell, D. H., Terwilliger, T. C., Urzhumtsev, A. G., Videau, L. L., Williams, C. J. & Adams, P. D. (2019). Acta Cryst. D75, 861–877.  Web of Science CrossRef IUCr Journals Google Scholar
Return to citationLloyd, J. T., McLaughlin, K., Lubula, M. Y., Gay, J. C., Dest, A., Gao, C., Phillips, M., Tonelli, M., Cornilescu, G., Marunde, M. R., Evans, C. M., Boyson, S. P., Carlson, S., Keogh, M. C., Markley, J. L., Frietze, S. & Glass, K. C. (2020). J. Med. Chem. 63, 12799–12813.  CrossRef PubMed Google Scholar
Return to citationLubula, M. Y., Eckenroth, B. E., Carlson, S., Poplawski, A., Chruszcz, M. & Glass, K. C. (2014). FEBS Lett. 588, 3844–3854.  CrossRef PubMed Google Scholar
Return to citationLubula, M. Y., Poplawaski, A. & Glass, K. C. (2014). Acta Cryst. F70, 1389–1393.  CrossRef IUCr Journals Google Scholar
Return to citationLucas, B. A., Himes, B. A., Xue, L., Grant, T., Mahamid, J. & Grigorieff, N. (2021). eLife, 10, e68946.  Web of Science CrossRef PubMed Google Scholar
Return to citationLyumkis, D. (2019). J. Biol. Chem. 294, 5181–5197.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationMcPherson, A. & Cudney, B. (2014). Acta Cryst. F70, 1445–1467.  Web of Science CrossRef IUCr Journals Google Scholar
Return to citationMeng, X., Ratnayake, I., Escobar Galvis, M. L., Kotecki, J., Ramjan, Z. & Zhao, G. (2023). Front. Mol. Biosci. 10, 1302680.  CrossRef PubMed Google Scholar
Return to citationMorozumi, Y., Boussouar, F., Tan, M., Chaikuad, A., Jamshidikia, M., Colak, G., He, H., Nie, L., Petosa, C., de Dieuleveult, M., Curtet, S., Vitte, A. L., Rabatel, C., Debernardi, A., Cosset, F. L., Verhoeyen, E., Emadali, A., Schweifer, N., Gianni, D., Gut, M., Guardiola, P., Rousseaux, S., Gérard, M., Knapp, S., Zhao, Y. & Khochbin, S. (2016). J. Mol. Cell Biol. 8, 349–362.  CrossRef PubMed Google Scholar
Return to citationNain, A., Kumar, M. & Banerjee, M. (2022). Microb. Cell Fact. 21, 53.  CrossRef PubMed Google Scholar
Return to citationNeselu, K., Wang, B., Rice, W. J., Potter, C. S., Carragher, B. & Chua, E. Y. D. (2023). J. Struct. Biol. X, 7, 100085.  Web of Science PubMed Google Scholar
Return to citationNogales, E. & Scheres, S. H. W. (2015). Mol. Cell, 58, 677–689.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationObi, J. O., Lubula, M. Y., Cornilescu, G., Henrickson, A., McGuire, K., Evans, C. M., Phillips, M., Boyson, S. P., Demeler, B., Markley, J. L. & Glass, K. C. (2020). Curr. Res. Struct. Biol. 2, 104–115.  CrossRef PubMed Google Scholar
Return to citationPassmore, L. A. & Russo, C. J. (2016). Methods Enzymol. 579, 51–86.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationPettersen, E. F., Goddard, T. D., Huang, C. C., Couch, G. S., Greenblatt, D. M., Meng, E. C. & Ferrin, T. E. (2004). J. Comput. Chem. 25, 1605–1612.  Web of Science CrossRef PubMed CAS Google Scholar
Return to citationPettersen, E. F., Goddard, T. D., Huang, C. C., Meng, E. C., Couch, G. S., Croll, T. I., Morris, J. H. & Ferrin, T. E. (2021). Protein Sci. 30, 70–82.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationPhillips, M., Cook, E. D., Marunde, M. R., Tonelli, M., Khan, L., Hendrickson, A., Lignos, J. M., Stein, J. L., Stein, G. S., Frietze, S., Demeler, B. & Glass, K. C. (2024). bioRxiv, 2024.12.09.627393.  Google Scholar
Return to citationPhillips, M., Malone, K. L., Boyle, B. W., Montgomery, C., Kressy, I. A., Joseph, F. M., Bright, K. M., Boyson, S. P., Chang, S., Nix, J. C., Young, N. L., Jeffers, V., Frietze, S. & Glass, K. C. (2024). J. Med. Chem. 67, 8186–8200.  CrossRef PubMed Google Scholar
Return to citationPintilie, G., Zhang, K., Su, Z., Li, S., Schmid, M. F. & Chiu, W. (2020). Nat. Methods, 17, 328–334.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationPoplawski, A., Hu, K., Lee, W., Natesan, S., Peng, D., Carlson, S., Shi, X., Balaz, S., Markley, J. L. & Glass, K. C. (2014). J. Mol. Biol. 426, 1661–1676.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationPuchades, C., Sandate, C. R. & Lander, G. C. (2020). Nat. Rev. Mol. Cell Biol. 21, 43–58.  CrossRef PubMed Google Scholar
Return to citationPunjani, A. & Fleet, D. J. (2023). Nat. Methods, 20, 860–870.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationPunjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. (2017). Nat. Methods, 14, 290–296.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationRoh, S. H., Hryc, C. F., Jeong, H. H., Fei, X., Jakana, J., Lorimer, G. H. & Chiu, W. (2017). Proc. Natl Acad. Sci. USA, 114, 8259–8264.  CrossRef PubMed Google Scholar
Return to citationRohman, M. & Harrison-Lavoie, K. J. (2000). Protein Expr. Purif. 20, 45–47.  CrossRef PubMed Google Scholar
Return to citationRutledge, B. S., Choy, W. Y. & Duennwald, M. L. (2022). J. Biol. Chem. 298, 101905.  CrossRef PubMed Google Scholar
Return to citationSaibil, H. R. (2022). Mol. Cell, 82, 274–284.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationScheres, S. H. W. (2012). J. Struct. Biol. 180, 519–530.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationScheres, S. H. W. (2016). Methods Enzymol. 579, 125–157.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationSchwab, J., Kimanius, D., Burt, A., Dendooven, T. & Scheres, S. H. W. (2024). Nat. Methods, 21, 1855–1862.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationShoemaker, S. C. & Ando, N. (2018). Biochemistry, 57, 277–285.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationSingh, A. K., Datta, A., Jobichen, C., Luan, S. & Vasudevan, D. (2020). Nucleic Acids Res. 48, 1531–1550.  CrossRef PubMed Google Scholar
Return to citationSingh, A. K., Phillips, M., Alkrimi, S., Tonelli, M., Boyson, S. P., Malone, K. L., Nix, J. C. & Glass, K. C. (2022). Int. J. Biol. Macromol. 223, 316–326.  CrossRef PubMed Google Scholar
Return to citationSmyth, M. S. & Martin, J. H. J. (2000). Mol. Pathol. 53, 8–14.  Web of Science CrossRef PubMed CAS Google Scholar
Return to citationStark, H. & Chari, A. (2016). Microscopy (Tokyo), 65, 23–34.  Web of Science CrossRef CAS Google Scholar
Return to citationTakizawa, Y., Binshtein, E., Erwin, A. L., Pyburn, T. M., Mittendorf, K. F. & Ohi, M. D. (2017). Protein Sci. 26, 69–81.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationTan, Y. Z., Baldwin, P. R., Davis, J. H., Williamson, J. R., Potter, C. S., Carragher, B. & Lyumkis, D. (2017). Nat. Methods, 14, 793–796.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationTang, G., Peng, L., Baldwin, P. R., Mann, D. S., Jiang, W., Rees, I. & Ludtke, S. J. (2007). J. Struct. Biol. 157, 38–46.  Web of Science CrossRef PubMed CAS Google Scholar
Return to citationTerwilliger, T. C., Adams, P. D., Afonine, P. V. & Sobolev, O. V. (2020). Protein Sci. 29, 87–99.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationThompson, R. F., Walker, M., Siebert, C. A., Muench, S. P. & Ranson, N. A. (2016). Methods, 100, 3–15.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationTorino, S., Dhurandhar, M., Stroobants, A., Claessens, R. & Efremov, R. G. (2023). Nat. Methods, 20, 1400–1408.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationVénien-Bryan, C., Li, Z., Vuillard, L. & Boutin, J. A. (2017). Acta Cryst. F73, 174–183.  Web of Science CrossRef IUCr Journals Google Scholar
Return to citationWagner, T., Merino, F., Stabrin, M., Moriya, T., Antoni, C., Apelbaum, A., Hagel, P., Sitsel, O., Raisch, T., Prumbaum, D., Quentin, D., Roderer, D., Tacke, S., Siebolds, B., Schubert, E., Shaikh, T. R., Lill, P., Gatsogiannis, C. & Raunser, S. (2019). Commun. Biol. 2, 218.  Web of Science CrossRef PubMed Google Scholar
Return to citationWang, F., Feng, X., He, Q., Li, H. & Li, H. (2023). J. Biol. Chem. 299, 102852.  CrossRef PubMed Google Scholar
Return to citationWang, H. W. & Wang, J. W. (2017). Protein Sci. 26, 32–39.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationWang, L. & Zimanyi, C. M. (2024). Acta Cryst. F80, 74–81.  Web of Science CrossRef IUCr Journals Google Scholar
Return to citationWaugh, D. S. (2011). Protein Expr. Purif. 80, 283–293.  CrossRef PubMed Google Scholar
Return to citationWeissenberger, G., Henderikx, R. J. M. & Peters, P. J. (2021). Nat. Methods, 18, 463–471.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationWeissman, J. S., Hohl, C. M., Kovalenko, O., Kashi, Y., Chen, S., Braig, K., Saibil, H. R., Fenton, W. A. & Norwich, A. L. (1995). Cell, 83, 577–587.  CrossRef PubMed Google Scholar
Return to citationWenborn, A., Terry, C., Gros, N., Joiner, S., D'Castro, L., Panico, S., Sells, J., Cronier, S., Linehan, J. M., Brandner, S., Saibil, H. R., Collinge, J. & Wadsworth, J. D. (2015). Sci. Rep. 5, 10062.  CrossRef PubMed Google Scholar
Return to citationZeng, L. & Zhou, M. M. (2002). FEBS Lett. 513, 124–128.  Web of Science CrossRef PubMed CAS Google Scholar
Return to citationZheng, H., Handing, K. B., Zimmerman, M. D., Shabalin, I. G., Almo, S. C. & Minor, W. (2015). Exp. Opin. Drug. Discov. 10, 975–989.  Web of Science CrossRef Google Scholar
Return to citationZhong, E. D., Bepler, T., Berger, B. & Davis, J. H. (2021). Nat. Methods, 18, 176–185.  Web of Science CrossRef CAS PubMed Google Scholar
Return to citationZielinski, M., Röder, C. & Schröder, G. F. (2021). J. Biol. Chem. 297, 100938.  CrossRef PubMed Google Scholar
Return to citationZivanov, J., Oton, J., Ke, Z., von Kugelgen, A., Pyle, E., Qu, K., Morado, D., Castano-Diez, D., Zanetti, G., Bharat, T. A. M., Briggs, J. A. G. & Scheres, S. H. W. (2022). eLife, 11, e83724.  CrossRef PubMed Google Scholar

This is an open-access article distributed under the terms of the Creative Commons Attribution (CC-BY) Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original authors and source are cited.

Journal logoSTRUCTURAL
BIOLOGY
ISSN: 2059-7983
Follow Acta Cryst. D
Sign up for e-alerts
Follow Acta Cryst. on Twitter
Follow us on facebook
Sign up for RSS feeds