7 research outputs found

    DeePromoter: Robust Promoter Predictor Using Deep Learning

    Get PDF
    The promoter region is located near the transcription start sites and regulates transcription initiation of the gene by controlling the binding of RNA polymerase. Thus, promoter region recognition is an important area of interest in the field of bioinformatics. Numerous tools for promoter prediction were proposed. However, the reliability of these tools still needs to be improved. In this work, we propose a robust deep learning model, called DeePromoter, to analyze the characteristics of the short eukaryotic promoter sequences, and accurately recognize the human and mouse promoter sequences. DeePromoter combines a convolutional neural network (CNN) and a long short-term memory (LSTM). Additionally, instead of using non-promoter regions of the genome as a negative set, we derive a more challenging negative set from the promoter sequences. The proposed negative set reconstruction method improves the discrimination ability and significantly reduces the number of false positive predictions. Consequently, DeePromoter outperforms the previously proposed promoter prediction tools. In addition, a web-server for promoter prediction is developed based on the proposed methods and made available at https://home.jbnu.ac.kr/NSCL/deepromoter.htm

    From protein-protein to isoform-isoform interactions: the toolkit to map alternative splicing to interactome

    Get PDF
    Alternative splicing (AS) can impact protein structure and lead to protein-protein interaction (PPI) rewiring. Available PPI networks neglect alternative splicing isoforms: as interactions might happen only between a subset of isoforms, the PPI network contains both false-positive and false-negative interactions. Since it is not feasible to validate all isoform-isoform interactions experimentally, we present a set of tools to investigate AS on a network level: DIGGER to map splicing to the PPI network, as well as NEASE and Spycone to evaluate the functional consequences of network rewiring. DIGGER (https://exbio.wzw.tum.de/digger) integrates PPIs, domain-domain, and residuelevel interactions - the structures that might be spliced in or out and result in interaction gain or loss. Users can explore possible rewiring for an isoform or exon of interest and extract relevant subnetworks. NEASE (https://github.com/louadi/NEASE) identifies pathways that are significantly affected by network rewiring. NEASE extends classic gene set enrichment analysis by considering isoform-specific interactions affecting pathways. Spycone (https://github.com/yollct/spycone) addresses the time-course changes in AS. It searches for isoforms that demonstrate similar temporal splicing patterns and reflect the splicing co-regulation. Spycone further integrates gene set, network, and splicing-aware NEASE enrichment. Overall, we offer a splicing-focused network analysis toolkit that allows for studying the mechanistic consequences of AS.Book of abstract: 4th Belgrade Bioinformatics Conference, June 19-23, 202

    Deep Splicing Code: Classifying Alternative Splicing Events Using Deep Learning

    No full text
    Alternative splicing (AS) is the process of combining different parts of the pre-mRNA to produce diverse transcripts and eventually different protein products from a single gene. In computational biology field, researchers try to understand AS behavior and regulation using computational models known as “Splicing Codes”. The final goal of these algorithms is to make an in-silico prediction of AS outcome from genomic sequence. Here, we develop a deep learning approach, called Deep Splicing Code (DSC), for categorizing the well-studied classes of AS namely alternatively skipped exons, alternative 5’ss, alternative 3’ss, and constitutively spliced exons based only on the sequence of the exon junctions. The proposed approach significantly improves the prediction and the obtained results reveal that constitutive exons have distinguishable local characteristics from alternatively spliced exons. Using the motif visualization technique, we show that the trained models learned to search for competitive alternative splice sites as well as motifs of important splicing factors with high precision. Thus, the proposed approach greatly expands the opportunities to improve alternative splicing modeling. In addition, a web-server for AS events prediction has been developed based on the proposed method

    DIGGER: exploring the functional role of alternative splicing in protein interactions.

    No full text
    Alternative splicing plays a major role in regulating the functional repertoire of the proteome. However, isoform-specific effects to protein-protein interactions (PPIs) are usually overlooked, making it impossible to judge the functional role of individual exons on a systems biology level. We overcome this barrier by integrating protein-protein interactions, domain-domain interactions and residue-level interactions information to lift exon expression analysis to a network level. Our user-friendly database DIGGER is available at https://exbio.wzw.tum.de/digger and allows users to seamlessly switch between isoform and exon-centric views of the interactome and to extract sub-networks of relevant isoforms, making it an essential resource for studying mechanistic consequences of alternative splicing
    corecore