49,128 research outputs found

    High-throughput identification of interacting protein-protein binding sites

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>With the advent of increasing sequence and structural data, a number of methods have been proposed to locate putative protein binding sites from protein surfaces. Therefore, methods that are able to identify whether these binding sites interact are needed.</p> <p>Results</p> <p>We have developed a new method using a machine learning approach to detect if protein binding sites, once identified, interact with each other. The method exploits information relating to sequence and structural complementary across protein interfaces and has been tested on a non-redundant data set consisting of 584 homo-dimers and 198 hetero-dimers extracted from the PDB. Results indicate 87.4% of the interacting binding sites and 68.6% non-interacting binding sites were correctly identified. Furthermore, we built a pipeline that links this method to a modified version of our previously developed method that predicts the location of binding sites.</p> <p>Conclusion</p> <p>We have demonstrated that this high-throughput pipeline is capable of identifying binding sites for proteins, their interacting binding sites and, ultimately, their binding partners on a large scale.</p

    Annotating Protein Functional Residues by Coupling High-Throughput Fitness Profile and Homologous-Structure Analysis.

    Get PDF
    Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available.ImportanceTo fully comprehend the diverse functions of a protein, it is essential to understand the functionality of individual residues. Current methods are highly dependent on evolutionary sequence conservation, which is usually limited by sampling size. Sequence conservation-based methods are further confounded by structural constraints and multifunctionality of proteins. Here we present a method that can systematically identify and annotate functional residues of a given protein. We used a high-throughput functional profiling platform to identify essential residues. Coupling it with homologous-structure comparison, we were able to annotate multiple functions of proteins. We demonstrated the method with the PB1 protein of influenza A virus and identified novel functional residues in addition to its canonical function as an RNA-dependent RNA polymerase. Not limited to virology, this method is generally applicable to other proteins that can be functionally selected and about which homologous-structure information is available

    Structural Prediction of Protein–Protein Interactions by Docking: Application to Biomedical Problems

    Get PDF
    A huge amount of genetic information is available thanks to the recent advances in sequencing technologies and the larger computational capabilities, but the interpretation of such genetic data at phenotypic level remains elusive. One of the reasons is that proteins are not acting alone, but are specifically interacting with other proteins and biomolecules, forming intricate interaction networks that are essential for the majority of cell processes and pathological conditions. Thus, characterizing such interaction networks is an important step in understanding how information flows from gene to phenotype. Indeed, structural characterization of protein–protein interactions at atomic resolution has many applications in biomedicine, from diagnosis and vaccine design, to drug discovery. However, despite the advances of experimental structural determination, the number of interactions for which there is available structural data is still very small. In this context, a complementary approach is computational modeling of protein interactions by docking, which is usually composed of two major phases: (i) sampling of the possible binding modes between the interacting molecules and (ii) scoring for the identification of the correct orientations. In addition, prediction of interface and hot-spot residues is very useful in order to guide and interpret mutagenesis experiments, as well as to understand functional and mechanistic aspects of the interaction. Computational docking is already being applied to specific biomedical problems within the context of personalized medicine, for instance, helping to interpret pathological mutations involved in protein–protein interactions, or providing modeled structural data for drug discovery targeting protein–protein interactions.Spanish Ministry of Economy grant number BIO2016-79960-R; D.B.B. is supported by a predoctoral fellowship from CONACyT; M.R. is supported by an FPI fellowship from the Severo Ochoa program. We are grateful to the Joint BSC-CRG-IRB Programme in Computational Biology.Peer ReviewedPostprint (author's final draft

    PeptideMine - A webserver for the design of peptides for protein-peptide binding studies derived from protein-protein interactomes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Signal transduction events often involve transient, yet specific, interactions between structurally conserved protein domains and polypeptide sequences in target proteins. The identification and validation of these associating domains is crucial to understand signal transduction pathways that modulate different cellular or developmental processes. Bioinformatics strategies to extract and integrate information from diverse sources have been shown to facilitate the experimental design to understand complex biological events. These methods, primarily based on information from high-throughput experiments, have also led to the identification of new connections thus providing hypothetical models for cellular events. Such models, in turn, provide a framework for directing experimental efforts for validating the predicted molecular rationale for complex cellular processes. In this context, it is envisaged that the rational design of peptides for protein-peptide binding studies could substantially facilitate the experimental strategies to evaluate a predicted interaction. This rational design procedure involves the integration of protein-protein interaction data, gene ontology, physico-chemical calculations, domain-domain interaction data and information on functional sites or critical residues.</p> <p>Results</p> <p>Here we describe an integrated approach called "PeptideMine" for the identification of peptides based on specific functional patterns present in the sequence of an interacting protein. This approach based on sequence searches in the interacting sequence space has been developed into a webserver, which can be used for the identification and analysis of peptides, peptide homologues or functional patterns from the interacting sequence space of a protein. To further facilitate experimental validation, the PeptideMine webserver also provides a list of physico-chemical parameters corresponding to the peptide to determine the feasibility of using the peptide for <it>in vitro </it>biochemical or biophysical studies.</p> <p>Conclusions</p> <p>The strategy described here involves the integration of data and tools to identify potential interacting partners for a protein and design criteria for peptides based on desired biochemical properties. Alongside the search for interacting protein sequences using three different search programs, the server also provides the biochemical characteristics of candidate peptides to prune peptide sequences based on features that are most suited for a given experiment. The PeptideMine server is available at the URL: <url>http://caps.ncbs.res.in/peptidemine</url></p

    The ever-evolving concept of the gene: The use of RNA/Protein experimental techniques to understand genome functions

    Get PDF
    The completion of the human genome sequence together with advances in sequencing technologies have shifted the paradigm of the genome, as composed of discrete and hereditable coding entities, and have shown the abundance of functional noncoding DNA. This part of the genome, previously dismissed as "junk" DNA, increases proportionally with organismal complexity and contributes to gene regulation beyond the boundaries of known protein-coding genes. Different classes of functionally relevant nonprotein-coding RNAs are transcribed from noncoding DNA sequences. Among them are the long noncoding RNAs (lncRNAs), which are thought to participate in the basal regulation of protein-coding genes at both transcriptional and post-transcriptional levels. Although knowledge of this field is still limited, the ability of lncRNAs to localize in different cellular compartments, to fold into specific secondary structures and to interact with different molecules (RNA or proteins) endows them with multiple regulatory mechanisms. It is becoming evident that lncRNAs may play a crucial role in most biological processes such as the control of development, differentiation and cell growth. This review places the evolution of the concept of the gene in its historical context, from Darwin's hypothetical mechanism of heredity to the post-genomic era. We discuss how the original idea of protein-coding genes as unique determinants of phenotypic traits has been reconsidered in light of the existence of noncoding RNAs. We summarize the technological developments which have been made in the genome-wide identification and study of lncRNAs and emphasize the methodologies that have aided our understanding of the complexity of lncRNA-protein interactions in recent years

    Interaction and cross-talk between non-coding RNAs.

    Get PDF
    Non-coding RNA (ncRNA) has been shown to regulate diverse cellular processes and functions through controlling gene expression. Long non-coding RNAs (lncRNAs) act as a competing endogenous RNAs (ceRNAs) where microRNAs (miRNAs) and lncRNAs regulate each other through their biding sites. Interactions of miRNAs and lncRNAs have been reported to trigger decay of the targeted lncRNAs and have important roles in target gene regulation. These interactions form complicated and intertwined networks. Certain lncRNAs encode miRNAs and small nucleolar RNAs (snoRNAs), and may regulate expression of these small RNAs as precursors. SnoRNAs have also been reported to be precursors for PIWI-interacting RNAs (piRNAs) and thus may regulate the piRNAs as a precursor. These miRNAs and piRNAs target messenger RNAs (mRNAs) and regulate gene expression. In this review, we will present and discuss these interactions, cross-talk, and co-regulation of ncRNAs and gene regulation due to these interactions

    Protein-protein interactions: network analysis and applications in drug discovery

    Get PDF
    Physical interactions among proteins constitute the backbone of cellular function, making them an attractive source of therapeutic targets. Although the challenges associated with targeting protein-protein interactions (PPIs) -in particular with small molecules are considerable, a growing number of functional PPI modulators is being reported and clinically evaluated. An essential starting point for PPI inhibitor screening or design projects is the generation of a detailed map of the human interactome and the interactions between human and pathogen proteins. Different routes to produce these biological networks are being combined, including literature curation and computational methods. Experimental approaches to map PPIs mainly rely on the yeast two-hybrid (Y2H) technology, which have recently shown to produce reliable protein networks. However, other genetic and biochemical methods will be essential to increase both coverage and resolution of current protein networks in order to increase their utility towards the identification of novel disease-related proteins and PPIs, and their potential use as therapeutic targets

    A bidirectional fluorescent two-hybrid system for monitoring protein–protein interactions

    No full text
    Two-hybrid systems have been the cornerstone of research into protein–protein interactions, but these systems typically rely on life/death reporters that put additional selective pressure on the host organism, and potentially lead to false positives. Here we report a bidirectional fluorescence-based bacterial two- hybrid system that enables both the association and dissociation of a given protein–protein interaction to be monitored. The functionality of this system and its compatibility with FACS screening are demon- strated in the forward and reverse direction using known interacting protein-partners and their cyclic peptide inhibitors. The reported fluorescent two-hybrid system may be used in the forward direction for the identification of interacting protein partners, or as a reverse two-hybrid system for the high- throughput identification of protein–protein interaction inhibitors
    corecore