2,261 research outputs found

    CCharPPI web server: computational characterization of protein–protein interactions from structure

    Get PDF
    The atomic structures of protein–protein interactions are central to understanding their role in biological systems, and a wide variety of biophysical functions and potentials have been developed for their characterization and the construction of predictive models. These tools are scattered across a multitude of stand-alone programs, and are often available only as model parameters requiring reimplementation. This acts as a significant barrier to their widespread adoption. CCharPPI integrates many of these tools into a single web server. It calculates up to 108 parameters, including models of electrostatics, desolvation and hydrogen bonding, as well as interface packing and complementarity scores, empirical potentials at various resolutions, docking potentials and composite scoring functions.The research leading to these results has received funding from the People Programme (Marie Curie Actions) of the European Unions Seventh Framework Programme (FP7/2007- 2013) under REA grant agreement PIEF-GA-2012-327899 and grant BIO2013-48213-R from Spanish Ministry of Economy and Competitiveness.Peer ReviewedPostprint (published version

    Exploring the potential of 3D Zernike descriptors and SVM for protein\u2013protein interface prediction

    Get PDF
    Abstract Background The correct determination of protein–protein interaction interfaces is important for understanding disease mechanisms and for rational drug design. To date, several computational methods for the prediction of protein interfaces have been developed, but the interface prediction problem is still not fully understood. Experimental evidence suggests that the location of binding sites is imprinted in the protein structure, but there are major differences among the interfaces of the various protein types: the characterising properties can vary a lot depending on the interaction type and function. The selection of an optimal set of features characterising the protein interface and the development of an effective method to represent and capture the complex protein recognition patterns are of paramount importance for this task. Results In this work we investigate the potential of a novel local surface descriptor based on 3D Zernike moments for the interface prediction task. Descriptors invariant to roto-translations are extracted from circular patches of the protein surface enriched with physico-chemical properties from the HQI8 amino acid index set, and are used as samples for a binary classification problem. Support Vector Machines are used as a classifier to distinguish interface local surface patches from non-interface ones. The proposed method was validated on 16 classes of proteins extracted from the Protein–Protein Docking Benchmark 5.0 and compared to other state-of-the-art protein interface predictors (SPPIDER, PrISE and NPS-HomPPI). Conclusions The 3D Zernike descriptors are able to capture the similarity among patterns of physico-chemical and biochemical properties mapped on the protein surface arising from the various spatial arrangements of the underlying residues, and their usage can be easily extended to other sets of amino acid properties. The results suggest that the choice of a proper set of features characterising the protein interface is crucial for the interface prediction task, and that optimality strongly depends on the class of proteins whose interface we want to characterise. We postulate that different protein classes should be treated separately and that it is necessary to identify an optimal set of features for each protein class

    Understanding Stability of Protein-Protein Complexes

    Get PDF
    For all living organisms, macromolecular interactions facilitate most of their natural functions. Alterations to macromolecular structures through mutations, can affect the stability of their interactions, which may lead to unfavourable phenotypes and disease. Presented here, are a number of computational methods aimed at uncovering the principles behind complex stability - as described by binding affinity and dissociation rate constants. Several factors are known to govern the stability of protein-protein interactions, however, no one factor dominates, and it is the synergistic effect of a number of contributions, which amount to the affinity, and stability of a complex. The characterization of complex stability can thus be presented as a two-fold problem; modelling the individual factors and modelling the synergistic effect of the combination of such individual factors. Using machine learning as a central framework, empirical functions are designed for estimating affinity, dissociation rates and the effects of mutations on these properties. The performance of all models is in turn benchmarked on experimental data available from the literature and carefully curated datasets. Firstly, a wild-type binding free energy prediction model is designed, composed of a diverse set of stability descriptors, which account for flexibility and conformational changes undergone by the complex in question. Similarly, models for estimating the effects of mutations on binding affinity are also designed and benchmarked in a community-wide blind trial. Emphasis here is on the detection of a small subset of mutations that are able to enhance the stability of two de novo protein drugs targeting the flu virus hemagglutinin. Probing further the determinants of stability, a set of descriptors that link hotspot residues with the off-rate of a complex are designed, and applied to models predicting changes in off-rate upon mutation. Finally, the relationship between the distribution of hotspots at protein interfaces, and the rate of dissociation of such interfaces, is investigated

    Concepts to Interfere with Protein-Protein Complex Formations: Data Analysis, Structural Evidence and Strategies for Finding Small Molecule Modulators

    Get PDF
    (1) Analyzing protein-protein interactions at the atomic level is critical for our understanding of the principles governing the interactions involved in protein-protein recognition. For this purpose descriptors explaining the nature of different protein-protein complexes are desirable. In this work, we introduce Epic Protein Interface Classification (EPIC) as a framework handling the preparation, processing, and analysis of protein-protein complexes for classification with machine learning algorithms. We applied four different machine learning algorithms: Support Vector Machines (SVM), C4.5 Decision Trees, K Nearest Neighbors (KNN), and NaĂŻve Bayes (NB) algorithm in combination with three feature selection methods, Filter (Relief F), Wrapper, and Genetic Algorithms (GA) to extract discriminating features from the protein-protein complexes. To compare protein-protein complexes to each other, we represented the physicochemical characteristics of their interfaces in four different ways, using two different atomic contact vectors (ACVs), DrugScore pair potential vectors (DPV) and SFCscore descriptor vectors (SDV). We classified two different datasets: (A) 172 protein-protein complexes comprising 96 monomers, forming contacts enforced by the crystallographic packing environment (crystal contacts), and 76 biologically functional homodimer complexes; (B) 345 protein-protein complexes containing 147 permanent complexes and 198 transient complexes. We were able to classify up to 94.8% of the packing enforced/functional and up to 93.6% of the permanent/transient complexes correctly. Furthermore, we were able to extract relevant features from the different protein-protein complexes and introduce an approach for scoring the importance of the extracted features. (2) Since protein-protein interactions play pivotal role in the communication on the molecular level in virtually every biological system and process, the search and design for modulators of such interactions is of utmost interest. In recent years many inhibitors for specific protein-protein interactions have been developed, however, in only a few cases, small and druglike molecules are able to interfere the complex formation of proteins. On the other hand, there a several small molecules known to modulate protein-protein interactions by means of stabilizing an already assembled complex. To achieve this goal, a ligand is binding to a pocket, which is located rim-exposed at the interface of the interacting proteins, e.g. as the phytotoxin Fusicoccin, which stabilizes the interaction of plant H+-ATPase and 14-3-3 protein by nearly a factor of 100. To suggest alternative leads, we performed a virtual screening campaign to discover new molecules putatively stabilizing this complex. Furthermore, we screen a dataset of 198 transient recognition protein-protein complexes for cavities, which are located rim-exposed at their interfaces. We provide evidence for high similarity between such rim-exposed cavities and usual ligand accommodating active sites of enzymes. This analysis suggests that rim-exposed cavities at protein-protein interfaces are druggable targets. Therefore, the principle of stabilizing protein-protein interactions seems to be a promising alternative to the approach of the competitive inhibition of such interactions by small molecules. (3) AffinDB is a database of affinity data for structurally resolved protein-ligand complexes from the PDB. It is freely accessible at http://www.agklebe.de/affinity. Affinity data are collected from the scientific literature, both from primary sources describing the original experimental work of affinity determination and from secondary references which report affinity values determined by others. AffinDB currently contains over 730 affinity entries covering more than 450 different protein-ligand complexes. Besides the affinity value, PDB summary information and additional data are provided, including the experimental conditions of the affinity measurement (if available in the corresponding reference); 2D drawing, SMILES code, and molecular weight of the ligand; links to other databases, and bibliographic information. AffinDB can be queried by PDB code or by any combination of affinity range, temperature and pH-value of the measurement, ligand molecular weight, and publication data (author, journal, year). Search results can be saved as tabular reports in text files. The database is supposed to be a valuable resource for researchers interested in biomolecular recognition and the development of tools for correlating structural data with affinities, as needed, for example, in structure-based drug design

    Concepts to Interfere with Protein-Protein Complex Formations: Data Analysis, Structural Evidence and Strategies for Finding Small Molecule Modulators

    Get PDF
    (1) Analyzing protein-protein interactions at the atomic level is critical for our understanding of the principles governing the interactions involved in protein-protein recognition. For this purpose descriptors explaining the nature of different protein-protein complexes are desirable. In this work, we introduce Epic Protein Interface Classification (EPIC) as a framework handling the preparation, processing, and analysis of protein-protein complexes for classification with machine learning algorithms. We applied four different machine learning algorithms: Support Vector Machines (SVM), C4.5 Decision Trees, K Nearest Neighbors (KNN), and NaĂŻve Bayes (NB) algorithm in combination with three feature selection methods, Filter (Relief F), Wrapper, and Genetic Algorithms (GA) to extract discriminating features from the protein-protein complexes. To compare protein-protein complexes to each other, we represented the physicochemical characteristics of their interfaces in four different ways, using two different atomic contact vectors (ACVs), DrugScore pair potential vectors (DPV) and SFCscore descriptor vectors (SDV). We classified two different datasets: (A) 172 protein-protein complexes comprising 96 monomers, forming contacts enforced by the crystallographic packing environment (crystal contacts), and 76 biologically functional homodimer complexes; (B) 345 protein-protein complexes containing 147 permanent complexes and 198 transient complexes. We were able to classify up to 94.8% of the packing enforced/functional and up to 93.6% of the permanent/transient complexes correctly. Furthermore, we were able to extract relevant features from the different protein-protein complexes and introduce an approach for scoring the importance of the extracted features. (2) Since protein-protein interactions play pivotal role in the communication on the molecular level in virtually every biological system and process, the search and design for modulators of such interactions is of utmost interest. In recent years many inhibitors for specific protein-protein interactions have been developed, however, in only a few cases, small and druglike molecules are able to interfere the complex formation of proteins. On the other hand, there a several small molecules known to modulate protein-protein interactions by means of stabilizing an already assembled complex. To achieve this goal, a ligand is binding to a pocket, which is located rim-exposed at the interface of the interacting proteins, e.g. as the phytotoxin Fusicoccin, which stabilizes the interaction of plant H+-ATPase and 14-3-3 protein by nearly a factor of 100. To suggest alternative leads, we performed a virtual screening campaign to discover new molecules putatively stabilizing this complex. Furthermore, we screen a dataset of 198 transient recognition protein-protein complexes for cavities, which are located rim-exposed at their interfaces. We provide evidence for high similarity between such rim-exposed cavities and usual ligand accommodating active sites of enzymes. This analysis suggests that rim-exposed cavities at protein-protein interfaces are druggable targets. Therefore, the principle of stabilizing protein-protein interactions seems to be a promising alternative to the approach of the competitive inhibition of such interactions by small molecules. (3) AffinDB is a database of affinity data for structurally resolved protein-ligand complexes from the PDB. It is freely accessible at http://www.agklebe.de/affinity. Affinity data are collected from the scientific literature, both from primary sources describing the original experimental work of affinity determination and from secondary references which report affinity values determined by others. AffinDB currently contains over 730 affinity entries covering more than 450 different protein-ligand complexes. Besides the affinity value, PDB summary information and additional data are provided, including the experimental conditions of the affinity measurement (if available in the corresponding reference); 2D drawing, SMILES code, and molecular weight of the ligand; links to other databases, and bibliographic information. AffinDB can be queried by PDB code or by any combination of affinity range, temperature and pH-value of the measurement, ligand molecular weight, and publication data (author, journal, year). Search results can be saved as tabular reports in text files. The database is supposed to be a valuable resource for researchers interested in biomolecular recognition and the development of tools for correlating structural data with affinities, as needed, for example, in structure-based drug design
    • …
    corecore