4,370 research outputs found

    Cryo-EM map interpretation and protein model-building using iterative map segmentation.

    Get PDF
    A procedure for building protein chains into maps produced by single-particle electron cryo-microscopy (cryo-EM) is described. The procedure is similar to the way an experienced structural biologist might analyze a map, focusing first on secondary structure elements such as helices and sheets, then varying the contour level to identify connections between these elements. Since the high density in a map typically follows the main-chain of the protein, the main-chain connection between secondary structure elements can often be identified as the unbranched path between them with the highest minimum value along the path. This chain-tracing procedure is then combined with finding side-chain positions based on the presence of density extending away from the main path of the chain, allowing generation of a Cα model. The Cα model is converted to an all-atom model and is refined against the map. We show that this procedure is as effective as other existing methods for interpretation of cryo-EM maps and that it is considerably faster and produces models with fewer chain breaks than our previous methods that were based on approaches developed for crystallographic maps

    RIBFIND: a web server for identifying rigid bodies in protein structures and to aid flexible fitting into cryo EM maps

    Get PDF
    Motivation: To better analyze low-resolution cryo electron microscopy maps of macromolecular assemblies, component atomic structures frequently have to be flexibly fitted into them. Reaching an optimal fit and preventing the fitting process from getting trapped in local minima can be significantly improved by identifying appropriate rigid bodies in the fitted component. Results: Here we present the RIBFIND server, a tool for identifying rigid bodies in protein structures. The server identifies rigid bodies in proteins by calculating spatial proximity between their secondary structural elements. Availability: The RIBFIND web server and its standalone program are available at http://ribfind.ismb.lon.ac.uk

    Deep Learning for Segmentation Of 3D Cryo-EM Images

    Get PDF
    Cryo-electron microscopy (cryo-EM) is an emerging biophysical technique for structural determination of protein complexes. However, accurate detection of secondary structures is still challenging when cryo-EM density maps are at medium resolutions (5-10 Å). Most existing methods are image processing methods that do not fully utilize available images in the cryo-EM database. In this paper, we present a deep learning approach to segment secondary structure elements as helices and β-sheets from medium- resolution density maps. The proposed 3D convolutional neural network is shown to detect secondary structure locations with an F1 score between 0.79 and 0.88 for six simulated test cases. The architecture was also applied to experimentally-derived cryo- EM density regions of 571 protein chains. . The average F1 score for helix detection is 0.747 and 0.674 for β-sheets in a test involving seven cryo-EM density regions. Additionally, we extend an arc-length association method to β -strands and show that this method for measuring error is superior to many popular methods. An interactive tool is also presented that can visualize the results of this arc-length association method

    Computational Development for Secondary Structure Detection From Three-Dimensional Images of Cryo-Electron Microscopy

    Get PDF
    Electron cryo-microscopy (cryo-EM) as a cutting edge technology has carved a niche for itself in the study of large-scale protein complex. Although the protein backbone of complexes cannot be derived directly from the medium resolution (5-10 Å) of amino acids from three-dimensional (3D) density images, secondary structure elements (SSEs) such as alpha-helices and beta-sheets can still be detected. The accuracy of SSE detection from the volumetric protein density images is critical for ab initio backbone structure derivation in cryo-EM. So far it is challenging to detect the SSEs automatically and accurately from the density images at these resolutions. This dissertation presents four computational methods - SSEtracer, SSElearner, StrandTwister and StrandRoller for solving this critical problem. An effective approach, SSEtracer, is presented to automatically identify helices and β- sheets from the cryo-EM three-dimensional maps at medium resolutions. A simple mathematical model is introduced to represent the β-sheet density. The mathematical model can be used for β-strand detection from medium resolution density maps. A machine learning approach, SSElearner, has also been developed to automatically identify helices and β-sheets by using the knowledge from existing volumetric maps in the Electron Microscopy Data Bank (EMDB). The approach has been tested using simulated density maps and experimental cryo-EM maps of EMDB. The results of SSElearner suggest that it is effective to use one cryo-EM map for learning in order to detect the SSE in another cryo-EM map of similar quality. Major secondary structure elements such as a-helices and β-sheets can be computationally detected from cryo-EM density maps with medium resolutions of 5-10Å. However, a critical piece of information for modeling atomic structures is missing, since there are no tools to detect β-strands from cryo-EM maps at medium resolutions. A new method, StrandTwister, has been proposed to detect the traces of β-strands through the analysis of twist, an intrinsic nature of β-sheet. StrandTwister has been tested using 100 β-sheets simulated at 10Å resolution and 39 β-sheets computationally detected from cryoEM density maps at 4.4-7.4Å resolutions. StrandTwister appears to detect the traces of β-strands on major β-sheets quite accurately, particularly at the central area of a β-sheet. β-barrel is a structure feature that is formed by multiple β-strands in a barrel shape. There is no existing method to derive the β-strands from the 3D image of β-barrel. A new method, StrandRoller, has been proposed to generate small sets of possible β-traces from the density images at medium resolutions of 5-10Å. The results of StrandRoller suggest that it is possible to derive a small set of possible β-traces from the β-barrel cryo-EM image at medium resolutions even when it is not possible to visualize the separation of β-strands

    Determining Alpha-Helix Correspondence for Protein Structure Prediction from Cryo-EM Density Maps, Master\u27s Thesis, May 2007

    Get PDF
    Determining protein structure is an important problem for structural biologists, which has received a significant amount of attention in the recent years. In this thesis, we describe a novel, shape-modeling approach as an intermediate step towards recovering 3D protein structures from volumetric images. The input to our method is a sequence of alpha-helices that make up a protein, and a low-resolution volumetric image of the protein where possible locations of alpha-helices have been detected. Our task is to identify the correspondence between the two sets of helices, which will shed light on how the protein folds in space. The central theme of our approach is to cast the correspondence problem as that of shape matching between the 3D volume and the 1D sequence. We model both the shapes as attributed relational graphs, and formulate a constrained inexact graph matching problem. To compute the matching, we developed an optimal algorithm based on the A*-search with several choices of heuristic functions. As demonstrated in a suite of real protein data, the shape-modeling approach is capable of correctly identifying helix correspondences in noise-abundant volumes with minimal or no user intervention

    A Tool for Segmentation of Secondary Structures in 3D Cryo-EM Density Map Components Using Deep Convolutional Neural Networks

    Get PDF
    Although cryo-electron microscopy (cryo-EM) has been successfully used to derive atomic structures for many proteins, it is still challenging to derive atomic structures when the resolution of cryo-EM density maps is in the medium resolution range, such as 5–10 Å. Detection of protein secondary structures, such as helices and β-sheets, from cryo-EM density maps provides constraints for deriving atomic structures from such maps. As more deep learning methodologies are being developed for solving various molecular problems, effective tools are needed for users to access them. We have developed an effective software bundle, DeepSSETracer, for the detection of protein secondary structure from cryo-EM component maps in medium resolution. The bundle contains the network architecture and a U-Net model trained with a curriculum and gradient of episodic memory (GEM). The bundle integrates the deep neural network with the visualization capacity provided in ChimeraX. Using a Linux server that is remotely accessed by Windows users, it takes about 6 s on one CPU and one GPU for the trained deep neural network to detect secondary structures in a cryo-EM component map containing 446 amino acids. A test using 28 chain components of cryo-EM maps shows overall residue-level F1 scores of 0.72 and 0.65 to detect helices and β-sheets, respectively. Although deep learning applications are built on software frameworks, such as PyTorch and Tensorflow, our pioneer work here shows that integration of deep learning applications with ChimeraX is a promising and effective approach. Our experiments show that the F1 score measured at the residue level is an effective evaluation of secondary structure detection for individual classes. The test using 28 cryo-EM component maps shows that DeepSSETracer detects β-sheets more accurately than Emap2sec+, with a weighted average residue-level F1 score of 0.65 and 0.42, respectively. It also shows that Emap2sec+ detects helices more accurately than DeepSSETracer with a weighted average residue-level F1 score of 0.77 and 0.72 respectively

    Molecular Basis for poly(A) RNP Architecture and Recognition by the Pan2-Pan3 Deadenylase

    No full text
    The stability of eukaryotic mRNAs is dependent on a ribonucleoprotein (RNP) complex of poly(A)-binding proteins (PABPC1/Pab1) organized on the poly(A) tail. This poly(A) RNP not only protects mRNAs from premature degradation but also stimulates the Pan2-Pan3 deadenylase complex to catalyze the first step of poly(A) tail shortening. We reconstituted this process in vitro using recombinant proteins and show that Pan2-Pan3 associates with and degrades poly(A) RNPs containing two or more Pab1 molecules. The cryo-EM structure of Pan2-Pan3 in complex with a poly(A) RNP composed of 90 adenosines and three Pab1 protomers shows how the oligomerization interfaces of Pab1 are recognized by conserved features of the deadenylase and thread the poly(A) RNA substrate into the nuclease active site. The structure reveals the basis for the periodic repeating architecture at the 3' end of cytoplasmic mRNAs. This illustrates mechanistically how RNA-bound Pab1 oligomers act as rulers for poly(A) tail length over the mRNAs' lifetime.We would like to thank ... the MPIB cryo-EM, and core facilities ..

    An Approach to Developing Benchmark Datasets for Protein Secondary Structure Segmentation from Cryo-EM Density Maps

    Get PDF
    More and more deep learning approaches have been proposed to segment secondary structures from cryo-electron density maps at medium resolution range (5--10Ã…). Although the deep learning approaches show great potential, only a few small experimental data sets have been used to test the approaches. There is limited understanding about potential factors, in data, that affect the performance of segmentation. We propose an approach to generate data sets with desired specifications in three potential factors - the protein sequence identity, structural contents, and data quality. The approach was implemented and has generated a test set and various training sets to study the effect of secondary structure content and data quality on the performance of DeepSSETracer, a deep learning method that segments regions of protein secondary structures from cryo-EM map components. Results show that various content levels in the secondary structure and data quality influence the performance of segmentation for DeepSSETracer

    Intensity-Based Skeletonization of CryoEM Gray-Scale Images Using a True Segmentation-Free Algorithm

    Get PDF
    Cryo-electron microscopy is an experimental technique that is able to produce 3D gray-scale images of protein molecules. In contrast to other experimental techniques, cryo-electron microscopy is capable of visualizing large molecular complexes such as viruses and ribosomes. At medium resolution, the positions of the atoms are not visible and the process cannot proceed. The medium-resolution images produced by cryo-electron microscopy are used to derive the atomic structure of the proteins in de novo modeling. The skeletons of the 3D gray-scale images are used to interpret important information that is helpful in de novo modeling. Unfortunately, not all features of the image can be captured using a single segmentation. In this paper, we present a segmentation-free approach to extract the gray-scale curve-like skeletons. The approach relies on a novel representation of the 3D image, where the image is modeled as a graph and a set of volume trees. A test containing 36 synthesized maps and one authentic map shows that our approach can improve the performance of the two tested tools used in de novo modeling. The improvements were 62 and 13 percent for Gorgon and DP-TOSS, respectively
    • …
    corecore