89 research outputs found

    Sequential, structural and functional properties of protein complexes are defined by how folding and binding intertwine.

    Get PDF
    Intrinsically Disordered Proteins (IDPs) fulfill critical biological roles without having the potential to fold on their own. While lacking inherent structure, the majority of IDPs do reach a folded state via interaction with a protein partner, presenting a deep entanglement of the folding and binding process. Protein disorder has been recognized as a major determinant in several properties of proteins, such as sequence, adopted structure upon binding, and function. Yet, the way the binding process is reflected in these features in general lacks a detailed description. Here, we defined three categories of protein complexes depending on the unbound structural state of the interactors, and analyzed them in detail. We found that strikingly, the properties of interactors in terms of sequence and adopted structure are defined not only by the intrinsic structural state of the protein itself, but also to a comparable extent by the structural state of the binding partner. The three different types of interactions are also regulated through divergent molecular tactics of post-translational modifications. This not only widens the range of biologically relevant sequence and structure spaces defined by ordered proteins, but also presents distinct molecular mechanisms compatible with specific biological processes, separately for each interaction type. The distinct attributes of different binding modes identified in this study can help to understand how various types of interactions serve as building blocks for the assembly of tightly regulated and highly intertwined regulatory networks

    The Membrane Protein Data Bank

    Get PDF
    The Membrane Protein Data Bank (MPDB) is an online, searchable, relational database of structural and functional information on integral, anchored and peripheral membrane proteins and peptides. Data originates from the Protein Data Bank and other databases, and from the literature. Structures are based on X-ray and electron diffraction, nuclear magnetic resonance and cryoelectron microscopy. The MPDB is searchable online by protein characteristic, structure determination method, crystallization technique, detergent, temperature, pH, author, etc. Record entries are hyperlinked to the PDB and Pfam for viewing sequence, three-dimensional structure and domain architecture, and for downloading coordinates. Links to PubMed are also provided. The MPDB is updated weekly in parallel with the Protein Data Bank. Statistical analysis of MPDB records can be performed and viewed online. A summary of the statistics as applied to entries in the MPDB is presented. The data suggest conditions appropriate for crystallization trials with novel membrane proteins

    TMFoldRec: a statistical potential-based transmembrane protein fold recognition tool.

    Get PDF
    BACKGROUND: Transmembrane proteins (TMPs) are the key components of signal transduction, cell-cell adhesion and energy and material transport into and out from the cells. For the deep understanding of these processes, structure determination of transmembrane proteins is indispensable. However, due to technical difficulties, only a few transmembrane protein structures have been determined experimentally. Large-scale genomic sequencing provides increasing amounts of sequence information on the proteins and whole proteomes of living organisms resulting in the challenge of bioinformatics; how the structural information should be gained from a sequence. RESULTS: Here, we present a novel method, TMFoldRec, for fold prediction of membrane segments in transmembrane proteins. TMFoldRec based on statistical potentials was tested on a benchmark set containing 124 TMP chains from the PDBTM database. Using a 10-fold jackknife method, the native folds were correctly identified in 77 % of the cases. This accuracy overcomes the state-of-the-art methods. In addition, a key feature of TMFoldRec algorithm is the ability to estimate the reliability of the prediction and to decide with an accuracy of 70 %, whether the obtained, lowest energy structure is the native one. CONCLUSION: These results imply that the membrane embedded parts of TMPs dictate the TM structures rather than the soluble parts. Moreover, predictions with reliability scores make in this way our algorithm applicable for proteome-wide analyses. AVAILABILITY: The program is available upon request for academic use

    Identification of Extracellular Segments by Mass Spectrometry Improves Topology Prediction of Transmembrane Proteins

    Get PDF
    Transmembrane proteins play crucial role in signaling, ion transport, nutrient uptake, as well as in maintaining the dynamic equilibrium between the internal and external environment of cells. Despite their important biological functions and abundance, less than 2% of all determined structures are transmembrane proteins. Given the persisting technical difficulties associated with high resolution structure determination of transmembrane proteins, additional methods, including computational and experimental techniques remain vital in promoting our understanding of their topologies, 3D structures, functions and interactions. Here we report a method for the high-throughput determination of extracellular segments of transmembrane proteins based on the identification of surface labeled and biotin captured peptide fragments by LC/MS/MS. We show that reliable identification of extracellular protein segments increases the accuracy and reliability of existing topology prediction algorithms. Using the experimental topology data as constraints, our improved prediction tool provides accurate and reliable topology models for hundreds of human transmembrane proteins

    Transmembrane protein topology prediction using support vector machines

    Get PDF
    Background: Alpha-helical transmembrane (TM) proteins are involved in a wide range of important biological processes such as cell signaling, transport of membrane-impermeable molecules, cell-cell communication, cell recognition and cell adhesion. Many are also prime drug targets, and it has been estimated that more than half of all drugs currently on the market target membrane proteins. However, due to the experimental difficulties involved in obtaining high quality crystals, this class of protein is severely under-represented in structural databases. In the absence of structural data, sequence-based prediction methods allow TM protein topology to be investigated.Results: We present a support vector machine-based (SVM) TM protein topology predictor that integrates both signal peptide and re-entrant helix prediction, benchmarked with full cross-validation on a novel data set of 131 sequences with known crystal structures. The method achieves topology prediction accuracy of 89%, while signal peptides and re-entrant helices are predicted with 93% and 44% accuracy respectively. An additional SVM trained to discriminate between globular and TM proteins detected zero false positives, with a low false negative rate of 0.4%. We present the results of applying these tools to a number of complete genomes. Source code, data sets and a web server are freely available from http://bioinf.cs.ucl.ac.uk/psipred/.Conclusion: The high accuracy of TM topology prediction which includes detection of both signal peptides and re-entrant helices, combined with the ability to effectively discriminate between TM and globular proteins, make this method ideally suited to whole genome annotation of alpha-helical transmembrane proteins

    Fast and accurate mutation detection in whole genome sequences of multiple isogenic samples with IsoMut

    Get PDF
    Background: Detection of somatic mutations is one of the main goals of next generation DNA sequencing. A wide range of experimental systems are available for the study of spontaneous or environmentally induced mutagenic processes. However, most of the routinely used mutation calling algorithms are not optimised for the simultaneous analysis of multiple samples, or for non-human experimental model systems with no reliable databases of common genetic variations. Most standard tools either require numerous in-house post filtering steps with scarce documentation or take an unpractically long time to run. To overcome these problems, we designed the streamlined IsoMut tool which can be readily adapted to experimental scenarios where the goal is the identification of experimentally induced mutations in multiple isogenic samples. Methods: Using 30 isogenic samples, reliable cohorts of validated mutations were created for testing purposes. Optimal values of the filtering parameters of IsoMut were determined in a thorough and strict optimization procedure based on these test sets. Results: We show that IsoMut, when tuned correctly, decreases the false positive rate compared to conventional tools in a 30 sample experimental setup; and detects not only single nucleotide variations, but short insertions and deletions as well. IsoMut can also be run more than a hundred times faster than the most precise state of art tool, due its straightforward and easily understandable filtering algorithm. Conclusions: IsoMut has already been successfully applied in multiple recent studies to find unique, treatment induced mutations in sets of isogenic samples with very low false positive rates. These types of studies provide an important contribution to determining the mutagenic effect of environmental agents or genetic defects, and IsoMut turned out to be an invaluable tool in the analysis of such data. © 2017 The Author(s)
    corecore