175 research outputs found

    Novel computational methods for in vitro and in situ cryo-electron microscopy

    Get PDF
    Over the past decade, advances in microscope hardware and image data processing algorithms have made cryo-electron microscopy (cryo-EM) a dominant technique for protein structure determination. Near-atomic resolution can now be obtained for many challenging in vitro samples using single-particle analysis (SPA), while sub-tomogram averaging (STA) can obtain sub-nanometer resolution for large protein complexes in a crowded cellular environment. Reaching high resolution requires large amounts of im-age data. Modern transmission electron microscopes (TEMs) automate the acquisition process and can acquire thousands of micrographs or hundreds of tomographic tilt se-ries over several days without intervention. In a first step, the data must be pre-processed: Micrographs acquired as movies are cor-rected for stage and beam-induced motion. For tilt series, additional alignment of all micrographs in 3D is performed using gold- or patch-based fiducials. Parameters of the contrast-transfer function (CTF) are estimated to enable its reversal during SPA refine-ment. Finally, individual protein particles must be located and extracted from the aligned micrographs. Current pre-processing algorithms, especially those for particle picking, are not robust enough to enable fully unsupervised operation. Thus, pre-processing is start-ed after data collection, and takes several days due to the amount of supervision re-quired. Pre-processing the data in parallel to acquisition with more robust algorithms would save time and allow to discover bad samples and microscope settings early on. Warp is a new software for cryo-EM data pre-processing. It implements new algorithms for motion correction, CTF estimation, tomogram reconstruction, as well as deep learn-ing-based approaches to particle picking and image denoising. The algorithms are more accurate and robust, enabling unsupervised operation. Warp integrates all pre-processing steps into a pipeline that is executed on-the-fly during data collection. Inte-grated with SPA tools, the pipeline can produce 2D and 3D classes less than an hour into data collection for favorable samples. Here I describe the implementation of the new algorithms, and evaluate them on various movie and tilt series data sets. I show that un-supervised pre-processing of a tilted influenza hemagglutinin trimer sample with Warp and refinement in cryoSPARC can improve previously published resolution from 3.9 Å to 3.2 Å. Warp’s algorithms operate in a reference-free manner to improve the image resolution at the pre-processing stage when no high-resolution maps are available for the particles yet. Once 3D maps have been refined, they can be used to go back to the raw data and perform reference-based refinement of sample motion and CTF in movies and tilt series. M is a new tool I developed to solve this task in a multi-particle framework. Instead of following the SPA assumption that every particle is single and independent, M models all particles in a field of view as parts of a large, physically connected multi-particle system. This allows M to optimize hyper-parameters of the system, such as sample motion and deformation, or higher-order aberrations in the CTF. Because M models these effects accurately and optimizes all hyper-parameters simultaneously with particle alignments, it can surpass previous reference-based frame and tilt series alignment tools. Here I de-scribe the implementation of M, evaluate it on several data sets, and demonstrate that the new algorithms achieve equally high resolution with movie and tilt series data of the same sample. Most strikingly, the combination of Warp, RELION and M can resolve 70S ribosomes bound to an antibiotic at 3.5 Å inside vitrified Mycoplasma pneumoniae cells, marking a major advance in resolution for in situ imaging

    Cross-validation tests for cryo-electron microscopy using an independent set of images

    Get PDF
    ABSTRACT: In addition to the chemical composition, information about the three-dimensional structure of a biomolecule is vital for understanding its biological function. For many years, resolv- ing structures of biomolecules was exclusive of X-ray crystallography and nuclear magnetic resonance (NMR) techniques. However, due to technological and software improvements, cryo-electron microscopy (cryo-EM) has emerged as an alternative for resolving complexes that were infeasible for crystallization or too large for NMR. Currently, cryo-EM is able to provide near-atomic resolution and close-to-native structures . Moreover, it enables extracting dynamical information, such as free-energy landscapes, from thermal states in the micrographs. The “resolution revolution” in cryo-EM has provoked an avalanche of reported cryo-EM maps. Recent statistics show an exponentially-growing number of reported maps spatially resolved by cryo-EM with their mean resolution decreasing from ∌ 10 Å (in 2013) to 4 Å (for 2018). The resolution revolution brings with it the need of creating robust and reliable methodolo- gies to validate the increasingly large number of maps. Some advances have been done along these lines: the tilt-pair analysis , the gold-standard procedure and the high-frequency randomization have shown to be reliable validation tools. However, it has recently been shown that these methods remain sensitive to overfitting (treating noise as true signal) and subjective criteria. In this work, I will present a novel methodology for validating cryo-EM maps. The method is based on cross-validation criteria where the reconstructed maps are compared against a set of experimental images (raw data) not used in the reconstruction procedure. Such comparison is carried out by calculating the probability that an image is the projection of a given map. The information from these probabilities led us to propose two validation criteria, which are tested over three well-behaved systems and two systems that present overfitting. The results prove that our methodology is able to identify overfitted maps

    Gold nanomaterials and their potential use as cryo-electron tomography labels

    Full text link
    Rapid advances in cryo-electron tomography (cryo-ET) are driving a revolution in cellular structural biology. However, unambiguous identification of specific biomolecules within cellular tomograms remains challenging. Overcoming this obstacle and reliably identifying targets in the crowded cellular environment is of major importance for the understanding of cellular function and is a pre-requisite for high-resolution structural analysis. The use of highly-specific, readily visualised and adjustable labels would help mitigate this issue, improving both data quality and sample throughput. While progress has been made in cryo-CLEM and in the development of cloneable high-density tags, technical issues persist and a robust 'cryo-GFP' remains elusive. Readily-synthesized gold nanomaterials conjugated to small 'affinity modules' may represent a solution. The synthesis of materials including gold nanoclusters (AuNCs) and gold nanoparticles (AuNPs) is increasingly well understood and is now within the capabilities of non-specialist laboratories. The remarkable chemical and photophysical properties of <3nm diameter nanomaterials and their emergence as tools with widespread biomedical application presents significant opportunities to the cryo-microscopy community. In this review, we will outline developments in the synthesis, functionalisation and labelling uses of both AuNPs and AuNCs in cryo-ET, while discussing their potential as multi-modal probes for cryo-CLEM

    Single particle 2D Electron crystallography for membrane protein structure determination

    Get PDF
    Proteins embedded into or attached to the cellular membrane perform crucial biological functions. Despite such importance, they remain among the most challenging targets of structural biology. Dedicated methods for membrane protein structure determination have been devised since decades, however with only partial success if compared to soluble proteins. One of these methods is 2D electron crystallography, in which the proteins are periodically arranged into a lipid bilayer. Using transmission electron microscopy to acquire projection images of samples containing such 2D crystals, which are embedded into a thin vitreous ice layer for radiation protection (cryo-EM), computer algorithms can be used to generate a 3D reconstruction of the protein. Unfortunately, in nearly every case, the 2D crystals are not flat and ordered enough to yield high-resolution reconstructions. Single particle analysis, on the other hand, is a technique that aligns projections of proteins isolated in solution in order to obtain a 3D reconstruction with a high success rate in terms of high resolution structures. In this thesis, we couple 2D crystal data processing with single particle analysis algorithms in order to perform a local correction of crystal distortions. We show that this approach not only allows reconstructions of much higher resolution than expected from the diffraction patterns obtained, but also reveals the existence of conformational heterogeneity within the 2D crystals. This structural variability can be linked to protein function, providing novel mechanistic insights and an explanation for why 2D crystals do not diffract to high resolution, in general. We present the computational methods that enable this hybrid approach, as well as other tools that aid several steps of cryo-EM data processing, from storage to postprocessing

    Vers la classification non-supervisée des complexes macromoléculaires en cryo-tomographie électronique : Défis et opportunités

    Get PDF
    International audienceBackground and Objectives: Cryo electron tomography visualizes native cells at nanometer resolution, but analysis is challenged by noise and artifacts. Recently, supervised deep learning methods have been applied to decipher the 3D spatial distribution of macromolecules. However, in order to discover unknown objects, unsupervised classification techniques are necessary. In this paper, we provide an overview of unsupervised deep learning techniques, discuss the challenges to analyze cryo-ET data, and provide a proof-of-concept on real data. Methods: We propose a weakly supervised subtomogram classification method based on transfer learning. We use a deep neural network to learn a clustering friendly representation able to capture 3D shapes in the presence of noise and artifacts. This representation is learned here from a synthetic data set. Results: We show that when applying k-means clustering given a learning-based representation, it becomes possible to satisfyingly classify real subtomograms according to structural similarity. It is worth noting that no manual annotation is used for performing classification. Conclusions: We describe the advantages and limitations of our proof-of-concept and raise several perspectives for improving classification performance

    Cryo-EM Studies of Vesicle Inducing Protein in Plastids (Vipp1)

    Get PDF

    A guide to machine learning for biologists

    Get PDF
    The expanding scale and inherent complexity of biological data have encouraged a growing use of machine learning in biology to build informative and predictive models of the underlying biological processes. All machine learning techniques fit models to data; however, the specific methods are quite varied and can at first glance seem bewildering. In this Review, we aim to provide readers with a gentle introduction to a few key machine learning techniques, including the most recently developed and widely used techniques involving deep neural networks. We describe how different techniques may be suited to specific types of biological data, and also discuss some best practices and points to consider when one is embarking on experiments involving machine learning. Some emerging directions in machine learning methodology are also discussed
    • 

    corecore