826 research outputs found

    Accurate Structural Correlations from Maximum Likelihood Superpositions

    Get PDF
    The cores of globular proteins are densely packed, resulting in complicated networks of structural interactions. These interactions in turn give rise to dynamic structural correlations over a wide range of time scales. Accurate analysis of these complex correlations is crucial for understanding biomolecular mechanisms and for relating structure to function. Here we report a highly accurate technique for inferring the major modes of structural correlation in macromolecules using likelihood-based statistical analysis of sets of structures. This method is generally applicable to any ensemble of related molecules, including families of nuclear magnetic resonance (NMR) models, different crystal forms of a protein, and structural alignments of homologous proteins, as well as molecular dynamics trajectories. Dominant modes of structural correlation are determined using principal components analysis (PCA) of the maximum likelihood estimate of the correlation matrix. The correlations we identify are inherently independent of the statistical uncertainty and dynamic heterogeneity associated with the structural coordinates. We additionally present an easily interpretable method (“PCA plots”) for displaying these positional correlations by color-coding them onto a macromolecular structure. Maximum likelihood PCA of structural superpositions, and the structural PCA plots that illustrate the results, will facilitate the accurate determination of dynamic structural correlations analyzed in diverse fields of structural biology

    Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation By Use Of Convolutional Neural Networks

    Full text link
    We present a semi-blind, spatially-variant deconvolution technique aimed at optical microscopy that combines a local estimation step of the point spread function (PSF) and deconvolution using a spatially variant, regularized Richardson-Lucy algorithm. To find the local PSF map in a computationally tractable way, we train a convolutional neural network to perform regression of an optical parametric model on synthetically blurred image patches. We deconvolved both synthetic and experimentally-acquired data, and achieved an improvement of image SNR of 1.00 dB on average, compared to other deconvolution algorithms.Comment: 2018/02/11: submitted to IEEE ICIP 2018 - 2018/05/04: accepted to IEEE ICIP 201

    Calculating the structure-based phylogenetic relationship of distantly related homologous proteins utilizing maximum likelihood structural alignment combinatorics and a novel structural molecular clock hypothesis

    Get PDF
    A dissertation in Molecular Biology and Biochemistry and Cell Biology and BiophysicsIncludes bibliographical references (pages 113-116)Dendrograms establish the evolutionary relationships and homology of species, proteins, or genes. Homology modeling, ligand binding, and pharmaceutical testing all depend upon the homology ascertained by dendrograms. Regardless of the specific algorithm, all dendrograms that ascertain protein evolutionary homology are generated utilizing polypeptide sequences. However, because protein structures superiorly conserve homology and contain more biochemical information than their associated protein sequences, I hypothesize that utilizing the structure of a protein instead of its sequence will generate a superior dendrogram. Generating a dendrogram utilizing protein structure requires a unique methodology and novel bioinformatic programs to implement this methodology. Contained within this dissertation is an original methodology that permits the aforementioned structure-based iv dendrogram generation hypothesis. Additionally, I have scripted three novel bioinformatics programs required by this proposed methodology: a protein structure alignment program that proficiently superimposes distant homologs, an accurate structure-dependent sequence alignment program, and a dendrogram generation program that employs a novel structural molecular clock hypothesis. The results from this methodology support the proposed hypothesis by demonstrating that generating dendrograms utilizing protein structures is superior to those generated utilizing exclusively protein sequences.Introduction -- Sable: Structural alignment by maximum likelihood -- UNITS: Universal true SDSA (Structure-dependent sequience alignment) -- Push: phlyogenetic tree using structural homology) -- Push discussion and general conclusion -- Generic sorting algorithm -- Template protein selection -- Units and chimera SDSAs -- References -- Vit

    Structural insights into Clostridium perfringens delta toxin pore formation

    Get PDF
    Clostridium perfringens Delta toxin is one of the three hemolysin-like proteins produced by C. perfringens type C and possibly type B strains. One of the others, NetB, has been shown to be the major cause of Avian Nectrotic Enteritis, which following the reduction in use of antibiotics as growth promoters, has become an emerging disease of industrial poultry. Delta toxin itself is cytotoxic to the wide range of human and animal macrophages and platelets that present GM2 ganglioside on their membranes. It has sequence similarity with Staphylococcus aureus β-pore forming toxins and is expected to heptamerize and form pores in the lipid bilayer of host cell membranes. Nevertheless, its exact mode of action remains undetermined. Here we report the 2.4 Å crystal structure of monomeric Delta toxin. The superposition of this structure with the structure of the phospholipid-bound F component of S. aureus leucocidin (LukF) revealed that the glycerol molecules bound to Delta toxin and the phospholipids in LukF are accommodated in the same hydrophobic clefts, corresponding to where the toxin is expected to latch onto the membrane, though the binding sites show significant differences. From structure-based sequence alignment with the known structure of staphylococcal α-hemolysin, a model of the Delta toxin pore form has been built. Using electron microscopy, we have validated our model and characterized the Delta toxin pore on liposomes. These results highlight both similarities and differences in the mechanism of Delta toxin (and by extension NetB) cytotoxicity from that of the staphylococcal pore-forming toxins

    Time series forecasting with the WARIMAX-GARCH method

    Get PDF
    It is well-known that causal forecasting methods that include appropriately chosen Exogenous Variables (EVs) very often present improved forecasting performances over univariate methods. However, in practice, EVs are usually difficult to obtain and in many cases are not available at all. In this paper, a new causal forecasting approach, called Wavelet Auto-Regressive Integrated Moving Average with eXogenous variables and Generalized Auto-Regressive Conditional Heteroscedasticity (WARIMAX-GARCH) method, is proposed to improve predictive performance and accuracy but also to address, at least in part, the problem of unavailable EVs. Basically, the WARIMAX-GARCH method obtains Wavelet “EVs” (WEVs) from Auto-Regressive Integrated Moving Average with eXogenous variables and Generalized Auto-Regressive Conditional Heteroscedasticity (ARIMAX-GARCH) models applied to Wavelet Components (WCs) that are initially determined from the underlying time series. The WEVs are, in fact, treated by the WARIMAX-GARCH method as if they were conventional EVs. Similarly to GARCH and ARIMA-GARCH models, the WARIMAX-GARCH method is suitable for time series exhibiting non-linear characteristics such as conditional variance that depends on past values of observed data. However, unlike those, it can explicitly model frequency domain patterns in the series to help improve predictive performance. An application to a daily time series of dam displacement in Brazil shows the WARIMAX-GARCH method to remarkably outperform the ARIMA-GARCH method, as well as the (multi-layer perceptron) Artificial Neural Network (ANN) and its wavelet version referred to as Wavelet Artificial Neural Network (WANN) as in [1], on statistical measures for both in-sample and out-of-sample forecasting

    MultiSeq: unifying sequence and structure data for evolutionary analysis

    Get PDF
    BACKGROUND: Since the publication of the first draft of the human genome in 2000, bioinformatic data have been accumulating at an overwhelming pace. Currently, more than 3 million sequences and 35 thousand structures of proteins and nucleic acids are available in public databases. Finding correlations in and between these data to answer critical research questions is extremely challenging. This problem needs to be approached from several directions: information science to organize and search the data; information visualization to assist in recognizing correlations; mathematics to formulate statistical inferences; and biology to analyze chemical and physical properties in terms of sequence and structure changes. RESULTS: Here we present MultiSeq, a unified bioinformatics analysis environment that allows one to organize, display, align and analyze both sequence and structure data for proteins and nucleic acids. While special emphasis is placed on analyzing the data within the framework of evolutionary biology, the environment is also flexible enough to accommodate other usage patterns. The evolutionary approach is supported by the use of predefined metadata, adherence to standard ontological mappings, and the ability for the user to adjust these classifications using an electronic notebook. MultiSeq contains a new algorithm to generate complete evolutionary profiles that represent the topology of the molecular phylogenetic tree of a homologous group of distantly related proteins. The method, based on the multidimensional QR factorization of multiple sequence and structure alignments, removes redundancy from the alignments and orders the protein sequences by increasing linear dependence, resulting in the identification of a minimal basis set of sequences that spans the evolutionary space of the homologous group of proteins. CONCLUSION: MultiSeq is a major extension of the Multiple Alignment tool that is provided as part of VMD, a structural visualization program for analyzing molecular dynamics simulations. Both are freely distributed by the NIH Resource for Macromolecular Modeling and Bioinformatics and MultiSeq is included with VMD starting with version 1.8.5. The MultiSeq website has details on how to download and use the software

    Unfolding simulations reveal the mechanism of extreme unfolding cooperativity in the kinetically stable alpha-lytic protease.

    Get PDF
    Kinetically stable proteins, those whose stability is derived from their slow unfolding kinetics and not thermodynamics, are examples of evolution's best attempts at suppressing unfolding. Especially in highly proteolytic environments, both partially and fully unfolded proteins face potential inactivation through degradation and/or aggregation, hence, slowing unfolding can greatly extend a protein's functional lifetime. The prokaryotic serine protease alpha-lytic protease (alphaLP) has done just that, as its unfolding is both very slow (t(1/2) approximately 1 year) and so cooperative that partial unfolding is negligible, providing a functional advantage over its thermodynamically stable homologs, such as trypsin. Previous studies have identified regions of the domain interface as critical to alphaLP unfolding, though a complete description of the unfolding pathway is missing. In order to identify the alphaLP unfolding pathway and the mechanism for its extreme cooperativity, we performed high temperature molecular dynamics unfolding simulations of both alphaLP and trypsin. The simulated alphaLP unfolding pathway produces a robust transition state ensemble consistent with prior biochemical experiments and clearly shows that unfolding proceeds through a preferential disruption of the domain interface. Through a novel method of calculating unfolding cooperativity, we show that alphaLP unfolds extremely cooperatively while trypsin unfolds gradually. Finally, by examining the behavior of both domain interfaces, we propose a model for the differential unfolding cooperativity of alphaLP and trypsin involving three key regions that differ between the kinetically stable and thermodynamically stable classes of serine proteases
    corecore