1,772 research outputs found

    Homology Inference of Protein-Protein Interactions via Conserved Binding Sites

    Get PDF
    The coverage and reliability of protein-protein interactions determined by high-throughput experiments still needs to be improved, especially for higher organisms, therefore the question persists, how interactions can be verified and predicted by computational approaches using available data on protein structural complexes. Recently we developed an approach called IBIS (Inferred Biomolecular Interaction Server) to predict and annotate protein-protein binding sites and interaction partners, which is based on the assumption that the structural location and sequence patterns of protein-protein binding sites are conserved between close homologs. In this study first we confirmed high accuracy of our method and found that its accuracy depends critically on the usage of all available data on structures of homologous complexes, compared to the approaches where only a non-redundant set of complexes is employed. Second we showed that there exists a trade-off between specificity and sensitivity if we employ in the prediction only evolutionarily conserved binding site clusters or clusters supported by only one observation (singletons). Finally we addressed the question of identifying the biologically relevant interactions using the homology inference approach and demonstrated that a large majority of crystal packing interactions can be correctly identified and filtered by our algorithm. At the same time, about half of biological interfaces that are not present in the protein crystallographic asymmetric unit can be reconstructed by IBIS from homologous complexes without the prior knowledge of crystal parameters of the query protein

    Global Functional Atlas of \u3cem\u3eEscherichia coli\u3c/em\u3e Encompassing Previously Uncharacterized Proteins

    Get PDF
    One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans). Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans’ biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a “systems-wide” functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins

    Structure-Templated Predictions of Novel Protein Interactions from Sequence Information

    Get PDF
    The multitude of functions performed in the cell are largely controlled by a set of carefully orchestrated protein interactions often facilitated by specific binding of conserved domains in the interacting proteins. Interacting domains commonly exhibit distinct binding specificity to short and conserved recognition peptides called binding profiles. Although many conserved domains are known in nature, only a few have well-characterized binding profiles. Here, we describe a novel predictive method known as domain–motif interactions from structural topology (D-MIST) for elucidating the binding profiles of interacting domains. A set of domains and their corresponding binding profiles were derived from extant protein structures and protein interaction data and then used to predict novel protein interactions in yeast. A number of the predicted interactions were verified experimentally, including new interactions of the mitotic exit network, RNA polymerases, nucleotide metabolism enzymes, and the chaperone complex. These results demonstrate that new protein interactions can be predicted exclusively from sequence information

    SPPS: A Sequence-Based Method for Predicting Probability of Protein-Protein Interaction Partners

    Get PDF
    Background: The molecular network sustained by different types of interactions among proteins is widely manifested as the fundamental driving force of cellular operations. Many biological functions are determined by the crosstalk between proteins rather than by the characteristics of their individual components. Thus, the searches for protein partners in global networks are imperative when attempting to address the principles of biology. Results: We have developed a web-based tool ‘‘Sequence-based Protein Partners Search’ ’ (SPPS) to explore interacting partners of proteins, by searching over a large repertoire of proteins across many species. SPPS provides a database containing more than 60,000 protein sequences with annotations and a protein-partner search engine in two modes (Single Query and Multiple Query). Two interacting proteins of human FBXO6 protein have been found using the service in the study. In addition, users can refine potential protein partner hits by using annotations and possible interactive network in the SPPS web server. Conclusions: SPPS provides a new type of tool to facilitate the identification of direct or indirect protein partners which may guide scientists on the investigation of new signaling pathways. The SPPS server is available to the public a

    Elucidating glycosaminoglycan–protein–protein interactions using carbohydrate microarray and computational approaches

    Get PDF
    Glycosaminoglycan polysaccharides play critical roles in many cellular processes, ranging from viral invasion and angiogenesis to spinal cord injury. Their diverse biological activities are derived from an ability to regulate a remarkable number of proteins. However, few methods exist for the rapid identification of glycosaminoglycan–protein interactions and for studying the potential of glycosaminoglycans to assemble multimeric protein complexes. Here, we report a multidisciplinary approach that combines new carbohydrate microarray and computational modeling methodologies to elucidate glycosaminoglycan–protein interactions. The approach was validated through the study of known protein partners for heparan and chondroitin sulfate, including fibroblast growth factor 2 (FGF2) and its receptor FGFR1, the malarial protein VAR2CSA, and tumor necrosis factor-α (TNF-α). We also applied the approach to identify previously undescribed interactions between a specific sulfated epitope on chondroitin sulfate, CS-E, and the neurotrophins, a critical family of growth factors involved in the development, maintenance, and survival of the vertebrate nervous system. Our studies show for the first time that CS is capable of assembling multimeric signaling complexes and modulating neurotrophin signaling pathways. In addition, we identify a contiguous CS-E-binding site by computational modeling that suggests a potential mechanism to explain how CS may promote neurotrophin-tyrosine receptor kinase (Trk) complex formation and neurotrophin signaling. Together, our combined microarray and computational modeling methodologies provide a general, facile means to identify new glycosaminoglycan–protein–protein interactions, as well as a molecular-level understanding of those complexes

    Semi-supervised multi-task learning for predicting interactions between HIV-1 and human proteins

    Get PDF
    Motivation: Protein–protein interactions (PPIs) are critical for virtually every biological function. Recently, researchers suggested to use supervised learning for the task of classifying pairs of proteins as interacting or not. However, its performance is largely restricted by the availability of truly interacting proteins (labeled). Meanwhile, there exists a considerable amount of protein pairs where an association appears between two partners, but not enough experimental evidence to support it as a direct interaction (partially labeled)
    corecore