57 research outputs found

    A computational approach for genome-wide mapping of splicing factor binding sites

    Get PDF
    A computational method is presented for genome-wide mapping of splicing factor binding sites that considers both the genomic environment and evolutionary conservation

    SFmap: a web server for motif analysis and prediction of splicing factor binding sites

    Get PDF
    Alternative splicing (AS) is a post-transcriptional process considered to be responsible for the huge diversity of proteins in higher eukaryotes. AS events are regulated by different splicing factors (SFs) that bind to sequence elements on the RNA. SFmap is a web server for predicting putative SF binding sites in genomic data (http://sfmap.technion.ac.il). SFmap implements the COS(WR) algorithm, which computes similarity scores for a given regulatory motif based on information derived from its sequence environment and its evolutionary conservation. Input for SFmap is a human genomic sequence or a list of sequences in FASTA format that can either be uploaded from a file or pasted into a window. SFmap searches within a given sequence for significant hits of binding motifs that are either stored in our database or defined by the user. SFmap results are provided both as a text file and as a graphical web interface

    Prediction of DNA-binding propensity of proteins by the ball-histogram method using automatic template search

    Get PDF
    We contribute a novel, ball-histogram approach to DNA-binding propensity prediction of proteins. Unlike state-of-the-art methods based on constructing an ad-hoc set of features describing physicochemical properties of the proteins, the ball-histogram technique enables a systematic, Monte-Carlo exploration of the spatial distribution of amino acids complying with automatically selected properties. This exploration yields a model for the prediction of DNA binding propensity. We validate our method in prediction experiments, improving on state-of-the-art accuracies. Moreover, our method also provides interpretable features involving spatial distributions of selected amino acids

    Predicting and controlling the reactivity of immune cell populations against cancer

    Get PDF
    Heterogeneous cell populations form an interconnected network that determine their collective output. One example of such a heterogeneous immune population is tumor-infiltrating lymphocytes (TILs), whose output can be measured in terms of its reactivity against tumors. While the degree of reactivity varies considerably between different TILs, ranging from null to a potent response, the underlying network that governs the reactivity is poorly understood. Here, we asked whether one can predict and even control this reactivity. To address this we measured the subpopulation compositions of 91 TILs surgically removed from 27 metastatic melanoma patients. Despite the large number of subpopulations compositions, we were able to computationally extract a simple set of subpopulation-based rules that accurately predict the degree of reactivity. This raised the conjecture of whether one could control reactivity of TILs by manipulating their subpopulation composition. Remarkably, by rationally enriching and depleting selected subsets of subpopulations, we were able to restore anti-tumor reactivity to nonreactive TILs. Altogether, this work describes a general framework for predicting and controlling the output of a cell mixture

    Classifying RNA-Binding Proteins Based on Electrostatic Properties

    Get PDF
    Protein structure can provide new insight into the biological function of a protein and can enable the design of better experiments to learn its biological roles. Moreover, deciphering the interactions of a protein with other molecules can contribute to the understanding of the protein's function within cellular processes. In this study, we apply a machine learning approach for classifying RNA-binding proteins based on their three-dimensional structures. The method is based on characterizing unique properties of electrostatic patches on the protein surface. Using an ensemble of general protein features and specific properties extracted from the electrostatic patches, we have trained a support vector machine (SVM) to distinguish RNA-binding proteins from other positively charged proteins that do not bind nucleic acids. Specifically, the method was applied on proteins possessing the RNA recognition motif (RRM) and successfully classified RNA-binding proteins from RRM domains involved in protein–protein interactions. Overall the method achieves 88% accuracy in classifying RNA-binding proteins, yet it cannot distinguish RNA from DNA binding proteins. Nevertheless, by applying a multiclass SVM approach we were able to classify the RNA-binding proteins based on their RNA targets, specifically, whether they bind a ribosomal RNA (rRNA), a transfer RNA (tRNA), or messenger RNA (mRNA). Finally, we present here an innovative approach that does not rely on sequence or structural homology and could be applied to identify novel RNA-binding proteins with unique folds and/or binding motifs

    Mechanism of the Interaction between the Intrinsically Disordered C-Terminus of the Pro-Apoptotic ARTS Protein and the Bir3 Domain of XIAP

    Get PDF
    ARTS (Sept4_i2) is a mitochondrial pro-apoptotic protein that functions as a tumor suppressor. Its expression is significantly reduced in leukemia and lymphoma patients. ARTS binds and inhibits XIAP (X-linked Inhibitor of Apoptosis protein) by interacting with its Bir3 domain. ARTS promotes degradation of XIAP through the proteasome pathway. By doing so, ARTS removes XIAP inhibition of caspases and enables apoptosis to proceed. ARTS contains 27 unique residues in its C-terminal domain (CTD, residues 248–274) which are important for XIAP binding. Here we characterized the molecular details of this interaction. Biophysical and computational methods were used to show that the ARTS CTD is intrinsically disordered under physiological conditions. Direct binding of ARTS CTD to Bir3 was demonstrated using NMR and fluorescence spectroscopy. The Bir3 interacting region in ARTS CTD was mapped to ARTS residues 266–274, which are the nine C-terminal residues in the protein. Alanine scan of ARTS 266–274 showed the importance of several residues for Bir3 binding, with His268 and Cys273 contributing the most. Adding a reducing agent prevented binding to Bir3. A dimer of ARTS 266–274 formed by oxidation of the Cys residues into a disulfide bond bound with similar affinity and was probably required for the interaction with Bir3. The detailed analysis of the ARTS – Bir3 interaction provides the basis for setting it as a target for anti cancer drug design: It will enable the development of compounds that mimic ARTS CTD, remove IAPs inhibition of caspases, and thereby induce apoptosis

    EL_PSSM-RT:DNA-binding residue prediction by integrating ensemble learning with PSSM Relation Transformation

    Get PDF
    Background: Prediction of DNA-binding residue is important for understanding the protein-DNA recognition mechanism. Many computational methods have been proposed for the prediction, but most of them do not consider the relationships of evolutionary information between residues. Results: In this paper, we first propose a novel residue encoding method, referred to as the Position Specific Score Matrix (PSSM) Relation Transformation (PSSM-RT), to encode residues by utilizing the relationships of evolutionary information between residues. PDNA-62 and PDNA-224 are used to evaluate PSSM-RT and two existing PSSM encoding methods by five-fold cross-validation. Performance evaluations indicate that PSSM-RT is more effective than previous methods. This validates the point that the relationship of evolutionary information between residues is indeed useful in DNA-binding residue prediction. An ensemble learning classifier (EL_PSSM-RT) is also proposed by combining ensemble learning model and PSSM-RT to better handle the imbalance between binding and non-binding residues in datasets. EL_PSSM-RT is evaluated by five-fold cross-validation using PDNA-62 and PDNA-224 as well as two independent datasets TS-72 and TS-61. Performance comparisons with existing predictors on the four datasets demonstrate that EL_PSSM-RT is the best-performing method among all the predicting methods with improvement between 0.02-0.07 for MCC, 4.18-21.47% for ST and 0.013-0.131 for AUC. Furthermore, we analyze the importance of the pair-relationships extracted by PSSM-RT and the results validates the usefulness of PSSM-RT for encoding DNA-binding residues. Conclusions: We propose a novel prediction method for the prediction of DNA-binding residue with the inclusion of relationship of evolutionary information and ensemble learning. Performance evaluation shows that the relationship of evolutionary information between residues is indeed useful in DNA-binding residue prediction and ensemble learning can be used to address the data imbalance issue between binding and non-binding residues. A web service of EL_PSSM-RT ( http://hlt.hitsz.edu.cn:8080/PSSM-RT_SVM/ ) is provided for free access to the biological research community

    Computational Structural Analysis: Multiple Proteins Bound to DNA

    Get PDF
    BACKGROUND: With increasing numbers of crystal structures of proteinratioDNA and proteinratioproteinratioDNA complexes publically available, it is now possible to extract sufficient structural, physical-chemical and thermodynamic parameters to make general observations and predictions about their interactions. In particular, the properties of macromolecular assemblies of multiple proteins bound to DNA have not previously been investigated in detail. METHODOLOGY/PRINCIPAL FINDINGS: We have performed computational structural analyses on macromolecular assemblies of multiple proteins bound to DNA using a variety of different computational tools: PISA; PROMOTIF; X3DNA; ReadOut; DDNA and DCOMPLEX. Additionally, we have developed and employed an algorithm for approximate collision detection and overlapping volume estimation of two macromolecules. An implementation of this algorithm is available at http://promoterplot.fmi.ch/Collision1/. The results obtained are compared with structural, physical-chemical and thermodynamic parameters from proteinratioprotein and single proteinratioDNA complexes. Many of interface properties of multiple proteinratioDNA complexes were found to be very similar to those observed in binary proteinratioDNA and proteinratioprotein complexes. However, the conformational change of the DNA upon protein binding is significantly higher when multiple proteins bind to it than is observed when single proteins bind. The water mediated contacts are less important (found in less quantity) between the interfaces of components in ternary (proteinratioproteinratioDNA) complexes than in those of binary complexes (proteinratioprotein and proteinratioDNA).The thermodynamic stability of ternary complexes is also higher than in the binary interactions. Greater specificity and affinity of multiple proteins binding to DNA in comparison with binary protein-DNA interactions were observed. However, protein-protein binding affinities are stronger in complexes without the presence of DNA. CONCLUSIONS/SIGNIFICANCE: Our results indicate that the interface properties: interface area; number of interface residues/atoms and hydrogen bonds; and the distribution of interface residues, hydrogen bonds, van der Walls contacts and secondary structure motifs are independent of whether or not a protein is in a binary or ternary complex with DNA. However, changes in the shape of the DNA reduce the off-rate of the proteins which greatly enhances the stability and specificity of ternary complexes compared to binary ones
    corecore