66 research outputs found

    GIS: a comprehensive source for protein structure similarities

    Get PDF
    A web service for analysis of protein structures that are sequentially or non-sequentially similar was generated. Recently, the non-sequential structure alignment algorithm GANGSTA+ was introduced. GANGSTA+ can detect non-sequential structural analogs for proteins stated to possess novel folds. Since GANGSTA+ ignores the polypeptide chain connectivity of secondary structure elements (i.e. α-helices and β-strands), it is able to detect structural similarities also between proteins whose sequences were reshuffled during evolution. GANGSTA+ was applied in an all-against-all comparison on the ASTRAL40 database (SCOP version 1.75), which consists of >10 000 protein domains yielding about 55 × 106 possible protein structure alignments. Here, we provide the resulting protein structure alignments as a public web-based service, named GANGSTA+ Internet Services (GIS). We also allow to browse the ASTRAL40 database of protein structures with GANGSTA+ relative to an externally given protein structure using different constraints to select specific results. GIS allows us to analyze protein structure families according to the SCOP classification scheme. Additionally, users can upload their own protein structures for pairwise protein structure comparison, alignment against all protein structures of the ASTRAL40 database (SCOP version 1.75) or symmetry analysis. GIS is publicly available at http://agknapp.chemie.fu-berlin.de/gplus

    Quantifying the geographical distribution effect on decreasing aggregated nitrogen oxides intensity in the Chinese electrical generation system

    Get PDF
    Over the past 20 years, the spatial distribution of electrical generation and its relationship to cross-regional power transmission has impacted China's power generation system and significantly affected the total amount of NO x and the aggregated nitrogen oxides intensity (ANI) of the system. An investigation of the driving mechanisms of ANI that considers the unevenness of regional electricity generation will be crucial to future improvements in the NO x efficiency of the electrical generation system in China. In this study, we built a decomposition model for ANI by incorporating the spatial distribution of electrical generation and found that the spatial distribution of electricity generation together with energy-related factors gradually caused decreases in ANI. The efficiency of electricity generation presented the dominant inhibitory effect on ANI, but its effect size has weakened since 2010. In contrast, the fossil fuel structure of thermal power shows an increasingly positive effect on changes in ANI. The primary energy composition only slightly affected changes in ANI. Moreover, the changed geographical distribution of electricity generation is non-negligible and has a positive effect on reduction of the ANI of the Chinese electrical generation system. The transferred amount of local NO x emissions by cross-provincial electricity transmission, however, could cause lead to additional environmental costs for generators. This issue should receive more attention in the future

    Superimposé: a 3D structural superposition server

    Get PDF
    The Superimposé webserver performs structural similarity searches with a preference towards 3D structure-based methods. Similarities can be detected between small molecules (e.g. drugs), parts of large structures (e.g. binding sites of proteins) and entire proteins. For this purpose, a number of algorithms were implemented and various databases are provided. Superimposé assists the user regarding the selection of a suitable combination of algorithm and database. After the computation on our server infrastructure, a visual assessment of the results is provided. The structure-based in silico screening for similar drug-like compounds enables the detection of scaffold-hoppers with putatively similar effects. The possibility to find similar binding sites can be of special interest in the functional analysis of proteins. The search for structurally similar proteins allows the detection of similar folds with different backbone topology. The Superimposé server is available at: http://bioinformatics.charite.de/superimpose

    Tableau-based protein substructure search using quadratic programming

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Searching for proteins that contain similar substructures is an important task in structural biology. The exact solution of most formulations of this problem, including a recently published method based on tableaux, is too slow for practical use in scanning a large database.</p> <p>Results</p> <p>We developed an improved method for detecting substructural similarities in proteins using tableaux. Tableaux are compared efficiently by solving the quadratic program (QP) corresponding to the quadratic integer program (QIP) formulation of the extraction of maximally-similar tableaux. We compare the accuracy of the method in classifying protein folds with some existing techniques.</p> <p>Conclusion</p> <p>We find that including constraints based on the separation of secondary structure elements increases the accuracy of protein structure search using maximally-similar subtableau extraction, to a level where it has comparable or superior accuracy to existing techniques. We demonstrate that our implementation is able to search a structural database in a matter of hours on a standard PC.</p

    Deciphering the Preference and Predicting the Viability of Circular Permutations in Proteins

    Get PDF
    Circular permutation (CP) refers to situations in which the termini of a protein are relocated to other positions in the structure. CP occurs naturally and has been artificially created to study protein function, stability and folding. Recently CP is increasingly applied to engineer enzyme structure and function, and to create bifunctional fusion proteins unachievable by tandem fusion. CP is a complicated and expensive technique. An intrinsic difficulty in its application lies in the fact that not every position in a protein is amenable for creating a viable permutant. To examine the preferences of CP and develop CP viability prediction methods, we carried out comprehensive analyses of the sequence, structural, and dynamical properties of known CP sites using a variety of statistics and simulation methods, such as the bootstrap aggregating, permutation test and molecular dynamics simulations. CP particularly favors Gly, Pro, Asp and Asn. Positions preferred by CP lie within coils, loops, turns, and at residues that are exposed to solvent, weakly hydrogen-bonded, environmentally unpacked, or flexible. Disfavored positions include Cys, bulky hydrophobic residues, and residues located within helices or near the protein's core. These results fostered the development of an effective viable CP site prediction system, which combined four machine learning methods, e.g., artificial neural networks, the support vector machine, a random forest, and a hierarchical feature integration procedure developed in this work. As assessed by using the hydrofolate reductase dataset as the independent evaluation dataset, this prediction system achieved an AUC of 0.9. Large-scale predictions have been performed for nine thousand representative protein structures; several new potential applications of CP were thus identified. Many unreported preferences of CP are revealed in this study. The developed system is the best CP viability prediction method currently available. This work will facilitate the application of CP in research and biotechnology

    A novel method to compare protein structures using local descriptors

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Protein structure comparison is one of the most widely performed tasks in bioinformatics. However, currently used methods have problems with the so-called "difficult similarities", including considerable shifts and distortions of structure, sequential swaps and circular permutations. There is a demand for efficient and automated systems capable of overcoming these difficulties, which may lead to the discovery of previously unknown structural relationships.</p> <p>Results</p> <p>We present a novel method for protein structure comparison based on the formalism of local descriptors of protein structure - DEscriptor Defined Alignment (DEDAL). Local similarities identified by pairs of similar descriptors are extended into global structural alignments. We demonstrate the method's capability by aligning structures in difficult benchmark sets: curated alignments in the SISYPHUS database, as well as SISY and RIPC sets, including non-sequential and non-rigid-body alignments. On the most difficult RIPC set of sequence alignment pairs the method achieves an accuracy of 77% (the second best method tested achieves 60% accuracy).</p> <p>Conclusions</p> <p>DEDAL is fast enough to be used in whole proteome applications, and by lowering the threshold of detectable structure similarity it may shed additional light on molecular evolution processes. It is well suited to improving automatic classification of structure domains, helping analyze protein fold space, or to improving protein classification schemes. DEDAL is available online at <url>http://bioexploratorium.pl/EP/DEDAL</url>.</p

    Characterization of a Novel Binding Protein for Fortilin/TCTP — Component of a Defense Mechanism against Viral Infection in Penaeus monodon

    Get PDF
    The Fortilin (also known as TCTP) in Penaeus monodon (PmFortilin) and Fortilin Binding Protein 1 (FBP1) have recently been shown to interact and to offer protection against the widespread White Spot Syndrome Virus infection. However, the mechanism is yet unknown. We investigated this interaction in detail by a number of in silico and in vitro analyses, including prediction of a binding site between PmFortilin/FBP1 and docking simulations. The basis of the modeling analyses was well-conserved PmFortilin orthologs, containing a Ca2+-binding domain at residues 76–110 representing a section of the helical domain, the translationally controlled tumor protein signature 1 and 2 (TCTP_1, TCTP_2) at residues 45–55 and 123–145, respectively. We found the pairs Cys59 and Cys76 formed a disulfide bond in the C-terminus of FBP1, which is a common structural feature in many exported proteins and the “x–G–K–K” pattern of the amidation site at the end of the C-terminus. This coincided with our previous work, where we found the “x–P–P–x” patterns of an antiviral peptide also to be located in the C-terminus of FBP1. The combined bioinformatics and in vitro results indicate that FBP1 is a transmembrane protein and FBP1 interact with N-terminal region of PmFortilin
    corecore