117 research outputs found

    Online resources for microRNA analysis

    Get PDF

    Accurate microRNA Target Prediction Using Detailed Binding Site Accessibility and Machine Learning on Proteomics Data

    Get PDF
    MicroRNAs (miRNAs) are a class of small regulatory genes regulating gene expression by targeting messenger RNA. Though computational methods for miRNA target prediction are the prevailing means to analyze their function, they still miss a large fraction of the targeted genes and additionally predict a large number of false positives. Here we introduce a novel algorithm called DIANA-microT-ANN which combines multiple novel target site features through an artificial neural network (ANN) and is trained using recently published high-throughput data measuring the change of protein levels after miRNA overexpression, providing positive and negative targeting examples. The features characterizing each miRNA recognition element include binding structure, conservation level, and a specific profile of structural accessibility. The ANN is trained to integrate the features of each recognition element along the 3′untranslated region into a targeting score, reproducing the relative repression fold change of the protein. Tested on two different sets the algorithm outperforms other widely used algorithms and also predicts a significant number of unique and reliable targets not predicted by the other methods. For 542 human miRNAs DIANA-microT-ANN predicts 120000 targets not provided by TargetScan 5.0. The algorithm is freely available at http://microrna.gr/microT-ANN

    miRGen: a database for the study of animal microRNA genomic organization and function

    Get PDF
    miRGen is an integrated database of (i) positional relationships between animal miRNAs and genomic annotation sets and (ii) animal miRNA targets according to combinations of widely used target prediction programs. A major goal of the database is the study of the relationship between miRNA genomic organization and miRNA function. This is made possible by three integrated and user friendly interfaces. The Genomics interface allows the user to explore where whole-genome collections of miRNAs are located with respect to UCSC genome browser annotation sets such as Known Genes, Refseq Genes, Genscan predicted genes, CpG islands and pseudogenes. These miRNAs are connected through the Targets interface to their experimentally supported target genes from TarBase, as well as computationally predicted target genes from optimized intersections and unions of several widely used mammalian target prediction programs. Finally, the Clusters interface provides predicted miRNA clusters at any given inter-miRNA distance and provides specific functional information on the targets of miRNAs within each cluster. All of these unique features of miRGen are designed to facilitate investigations into miRNA genomic organization, co-transcription and targeting. miRGen can be freely accessed at

    Microbial signatures in human periodontal disease: a metatranscriptome meta-analysis

    Get PDF
    The characterization of oral microbial communities and their functional potential has been shaped by metagenomics and metatranscriptomics studies. Here, a meta-analysis of four geographically and technically diverse oral shotgun metatranscriptomics studies of human periodontitis was performed. In total, 54 subgingival plaque samples, 27 healthy and 27 periodontitis, were analyzed. The core microbiota of the healthy and periodontitis group encompassed 40 and 80 species, respectively, with 38 species being common to both microbiota. The differential abundance analysis identified 23 genera and 26 species, that were more abundant in periodontitis. Our results not only validated previously reported genera and species associated with periodontitis with heightened statistical significance, but also elucidated additional genera and species that were overlooked in the individual studies. Functional analysis revealed a significant up-regulation in the transcription of 50 gene families (UniRef-90) associated with transmembrane transport and secretion, amino acid metabolism, surface protein and flagella synthesis, energy metabolism, and DNA supercoiling in periodontitis samples. Notably, the overwhelming majority of the identified gene families did not exhibit differential abundance when examined across individual datasets. Additionally, 4 bacterial virulence factor genes, including TonB dependent receptor from P. gingivalis, surface antigen BspA from T. forsynthia, and adhesin A (PsaA) and Type I glyceraldehyde-3-phosphate dehydrogenase (GAPDH) from the Streptococcus genus, were also found to be significantly more transcribed in periodontitis group. Microbial co-occurrence analysis demonstrated that the periodontitis microbial network was less dense compared to the healthy network, but it contained more positive correlations between the species. Furthermore, there were discernible disparities in the patterns of interconnections between the species in the two networks, denoting the rewiring of the whole microbial network during the transition to the disease state. In summary, our meta-analysis has provided robust insights into the oral active microbiome and transcriptome in both health and disease

    Global Discriminative Learning for Higher-Accuracy Computational Gene Prediction

    Get PDF
    Most ab initio gene predictors use a probabilistic sequence model, typically a hidden Markov model, to combine separately trained models of genomic signals and content. By combining separate models of relevant genomic features, such gene predictors can exploit small training sets and incomplete annotations, and can be trained fairly efficiently. However, that type of piecewise training does not optimize prediction accuracy and has difficulty in accounting for statistical dependencies among different parts of the gene model. With genomic information being created at an ever-increasing rate, it is worth investigating alternative approaches in which many different types of genomic evidence, with complex statistical dependencies, can be integrated by discriminative learning to maximize annotation accuracy. Among discriminative learning methods, large-margin classifiers have become prominent because of the success of support vector machines (SVM) in many classification tasks. We describe CRAIG, a new program for ab initio gene prediction based on a conditional random field model with semi-Markov structure that is trained with an online large-margin algorithm related to multiclass SVMs. Our experiments on benchmark vertebrate datasets and on regions from the ENCODE project show significant improvements in prediction accuracy over published gene predictors that use intrinsic features only, particularly at the gene level and on genes with long introns

    Isoform Specific Gene Auto-Regulation via miRNAs: A Case Study on miR-128b and ARPP-21

    Get PDF
    In this study, we investigate whether miRNAs located within “host” protein-coding genes may regulate the expression of their host genes. We find that 43 of 174 miRNAs encoded within RefSeq genes are predicted to target their host genes. Statistical analysis of this phenomenon suggests that gene auto-regulation via miRNAs may be under positive selective pressure. Our analysis also indicates that several of the 43 miRNAs have a much lower expectation of targeting their host genes by chance than others. Among these examples, we identify miR-128b:ARPP-21 (cyclic AMP-regulated phosphoprotein, 21 kD) as a case in which both the miRNA and the target site are also evolutionarily conserved. We provide experimental support for this miRNA:target interaction via reporter silencing assays, and present evidence that this isoform-specific gene auto-regulation has been preserved in vertebrate species in order to prevent detrimental consequences of ARPP-21 over-expression in brain

    The DIANA-mirExTra Web Server: From Gene Expression Data to MicroRNA Function

    Get PDF
    Background: High-throughput gene expression experiments are widely used to identify the role of genes involved in biological conditions of interest. MicroRNAs (miRNA) are regulatory molecules that have been functionally associated with several developmental programs and their deregulation with diverse diseases including cancer. Methodology/Principal Findings: Although miRNA expression levels may not be routinely measured in high-throughput experiments, a possible involvement of miRNAs in the deregulation of gene expression can be computationally predicted and quantified through analysis of overrepresented motifs in the deregulated genes 39 untranslated region (39UTR) sequences. Here, we introduce a user-friendly web-server, DIANA-mirExTra (www.microrna.gr/mirextra) that allows the comparison of frequencies of miRNA associated motifs between sets of genes that can lead to the identification of miRNAs responsible for the deregulation of large numbers of genes. To this end, we have investigated different approaches and measures, and have practically implemented them on experimental data. Conclusions/Significance: On several datasets of miRNA overexpression and repression experiments, our proposed approaches have successfully identified the deregulated miRNA. Beyond the prediction of miRNAs responsible for the deregulation of transcripts, the web-server provides extensive links to DIANA-mirPath, a functional analysis tool incorporating miRNA targets in biological pathways. Additionally, in case information about miRNA expression changes i

    TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support

    Get PDF
    As the relevant literature and the number of experiments increase at a super linear rate, databases that curate and collect experimentally verified microRNA (miRNA) targets have gradually emerged. These databases attempt to provide efficient access to this wealth of experimental data, which is scattered in thousands of manuscripts. Aim of TarBase 6.0 (http://www.microrna.gr/tarbase) is to face this challenge by providing a significant increase of available miRNA targets derived from all contemporary experimental techniques (gene specific and high-throughput), while incorporating a powerful set of tools in a user-friendly interface. TarBase 6.0 hosts detailed information for each miRNA–gene interaction, ranging from miRNA- and gene-related facts to information specific to their interaction, the experimental validation methodologies and their outcomes. All database entries are enriched with function-related data, as well as general information derived from external databases such as UniProt, Ensembl and RefSeq. DIANA microT miRNA target prediction scores and the relevant prediction details are available for each interaction. TarBase 6.0 hosts the largest collection of manually curated experimentally validated miRNA–gene interactions (more than 65 000 targets), presenting a 16.5–175-fold increase over other available manually curated databases
    corecore