77 research outputs found

    Inference of miRNA targets using evolutionary conservation and pathway analysis

    Get PDF
    BACKGROUND: MicroRNAs have emerged as important regulatory genes in a variety of cellular processes and, in recent years, hundreds of such genes have been discovered in animals. In contrast, functional annotations are available only for a very small fraction of these miRNAs, and even in these cases only partially. RESULTS: We developed a general Bayesian method for the inference of miRNA target sites, in which, for each miRNA, we explicitly model the evolution of orthologous target sites in a set of related species. Using this method we predict target sites for all known miRNAs in flies, worms, fish, and mammals. By comparing our predictions in fly with a reference set of experimentally tested miRNA-mRNA interactions we show that our general method performs at least as well as the most accurate methods available to date, including ones specifically tailored for target prediction in fly. An important novel feature of our model is that it explicitly infers the phylogenetic distribution of functional target sites, independently for each miRNA. This allows us to infer species-specific and clade-specific miRNA targeting. We also show that, in long human 3' UTRs, miRNA target sites occur preferentially near the start and near the end of the 3' UTR. To characterize miRNA function beyond the predicted lists of targets we further present a method to infer significant associations between the sets of targets predicted for individual miRNAs and specific biochemical pathways, in particular those of the KEGG pathway database. We show that this approach retrieves several known functional miRNA-mRNA associations, and predicts novel functions for known miRNAs in cell growth and in development. CONCLUSION: We have presented a Bayesian target prediction algorithm without any tunable parameters, that can be applied to sequences from any clade of species. The algorithm automatically infers the phylogenetic distribution of functional sites for each miRNA, and assigns a posterior probability to each putative target site. The results presented here indicate that our general method achieves very good performance in predicting miRNA target sites, providing at the same time insights into the evolution of target sites for individual miRNAs. Moreover, by combining our predictions with pathway analysis, we propose functions of specific miRNAs in nervous system development, inter-cellular communication and cell growth. The complete target site predictions as well as the miRNA/pathway associations are accessible on the ElMMo web server

    Identification of active regulatory regions from DNA methylation data

    Get PDF
    We have recently shown that transcription factor binding leads to defined reduction in DNA methylation, allowing for the identification of active regulatory regions from high-resolution methylomes. Here, we present MethylSeekR, a computational tool to accurately identify such footprints from bisulfite-sequencing data. Applying our method to a large number of published human methylomes, we demonstrate its broad applicability and generalize our previous findings from a neuronal differentiation system to many cell types and tissues. MethylSeekR is available as an R package at www.bioconductor.or

    The mammalian TRIM-NHL protein TRIM71/LIN-41 is a repressor of mRNA function

    Get PDF
    TRIM-NHL proteins are conserved regulators of development and differentiation but their molecular function has remained largely elusive. Here, we report an as yet unrecognized activity for the mammalian TRIM-NHL protein TRIM71 as a repressor of mRNAs. We show that TRIM71 is associated with mRNAs and that it promotes translational repression and mRNA decay. We have identified Rbl1 and Rbl2, two transcription factors whose down-regulation is important for stem cell function, as TRIM71 targets in mouse embryonic stem cells. Furthermore, one of the defining features of TRIM-NHL proteins, the NHL domain, is necessary and sufficient to target TRIM71 to RNA, while the RING domain that confers ubiquitin ligase activity is dispensable for repression. Our results reveal strong similarities between TRIM71 and Drosophila BRAT, the best-studied TRIM-NHL protein and a well-documented translational repressor, suggesting that BRAT and TRIM71 are part of a family of mRNA repressors regulating proliferation and differentiatio

    Overestimation of alternative splicing caused by variable probe characteristics in exon arrays

    Get PDF
    In higher eukaryotes, alternative splicing is a common mechanism for increasing transcriptome diversity. Affymetrix exon arrays were designed as a tool for monitoring the relative expression levels of hundreds of thousands of known and predicted exons with a view to detecting alternative splicing events. In this article, we have analyzed exon array data from many different human and mouse tissues and have uncovered a systematic relationship between transcript-fold change and alternative splicing as reported by the splicing index. Evidence from dilution experiments and deep sequencing suggest that this effect is of technical rather than biological origin and that it is driven by sequence features of the probes. This effect is substantial and results in a 12-fold overestimation of alternative splicing events in genes that are differentially expressed. By cross-species exon array comparison, we could further show that the systematic bias persists even across species boundaries. Failure to consider this effect in data analysis would result in the reproducible false detection of apparently conserved alternative splicing events. Finally, we have developed a software in R called COSIE (Corrected Splicing Indices for Exon arrays) that for any given set of new exon array experiments corrects for the observed bias and improves the detection of alternative splicing (available at www.fmi.ch/groups/gbioinfo

    Engineering of a conditional allele reveals multiple roles of XRN2 in Caenorhabditis elegans development and substrate specificity in microRNA turnover

    Get PDF
    Although XRN2 proteins are highly conserved eukaryotic 5′→3′ exonucleases, little is known about their function in animals. Here, we characterize Caenorhabditis elegans XRN2, which we find to be a broadly and constitutively expressed nuclear protein. An xrn-2 null mutation or loss of XRN2 catalytic activity causes a molting defect and early larval arrest. However, by generating a conditionally mutant xrn-2ts strain de novo through an approach that may be also applicable to other genes of interest, we reveal further functions in fertility, during embryogenesis and during additional larval stages. Consistent with the known role of XRN2 in controlling microRNA (miRNA) levels, we can demonstrate that loss of XRN2 activity stabilizes some rapidly decaying miRNAs. Surprisingly, however, other miRNAs continue to decay rapidly in xrn-2ts animals. Thus, XRN2 has unanticipated miRNA specificity in vivo, and its diverse developmental functions may relate to distinct substrates. Finally, our global analysis of miRNA stability during larval stage 1 reveals that miRNA passenger strands (miR*s) are substantially less stable than guide strands (miRs), supporting the notion that the former are mostly byproducts of biogenesis rather than a less abundant functional specie

    QuasR: quantification and annotation of short reads in R

    Get PDF
    Summary: QuasR is a package for the integrated analysis of high-throughput sequencing data in R, covering all steps from read preprocessing, alignment and quality control to quantification. QuasR supports different experiment types (including RNA-seq, ChIP-seq and Bis-seq) and analysis variants (e.g. paired-end, stranded, spliced and allele-specific), and is integrated in Bioconductor so that its output can be directly processed for statistical analysis and visualization. Availability and implementation: QuasR is implemented in R and C/C++. Source code and binaries for major platforms (Linux, OS X and MS Windows) are available from Bioconductor (www.bioconductor.org/packages/release/bioc/html/QuasR.html). The package includes a ‘vignette' with step-by-step examples for typical work flows. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics onlin

    Functional characterization of C. elegans Y-box-binding proteins reveals tissue-specific functions and a critical role in the formation of polysomes

    Get PDF
    The cold shock domain is one of the most highly conserved motifs between bacteria and higher eukaryotes. Y-box-binding proteins represent a subfamily of cold shock domain proteins with pleiotropic functions, ranging from transcription in the nucleus to translation in the cytoplasm. These proteins have been investigated in all major model organisms except Caenorhabditis elegans. In this study, we set out to fill this gap and present a functional characterization of CEYs, the C. elegans Y-box-binding proteins. We find that, similar to other organisms, CEYs are essential for proper gametogenesis. However, we also report a novel function of these proteins in the formation of large polysomes in the soma. In the absence of the somatic CEYs, polysomes are dramatically reduced with a simultaneous increase in monosomes and disomes, which, unexpectedly, has no obvious impact on animal biology. Because transcripts that are enriched in polysomes in wild-type animals tend to be less abundant in the absence of CEYs, our findings suggest that large polysomes might depend on transcript stabilization mediated by CEY protein

    Overestimation of alternative splicing caused by variable probe characteristics in exon arrays

    Get PDF
    In higher eukaryotes, alternative splicing is a common mechanism for increasing transcriptome diversity. Affymetrix exon arrays were designed as a tool for monitoring the relative expression levels of hundreds of thousands of known and predicted exons with a view to detecting alternative splicing events. In this article, we have analyzed exon array data from many different human and mouse tissues and have uncovered a systematic relationship between transcript-fold change and alternative splicing as reported by the splicing index. Evidence from dilution experiments and deep sequencing suggest that this effect is of technical rather than biological origin and that it is driven by sequence features of the probes. This effect is substantial and results in a 12-fold overestimation of alternative splicing events in genes that are differentially expressed. By cross-species exon array comparison, we could further show that the systematic bias persists even across species boundaries. Failure to consider this effect in data analysis would result in the reproducible false detection of apparently conserved alternative splicing events. Finally, we have developed a software in R called COSIE (Corrected Splicing Indices for Exon arrays) that for any given set of new exon array experiments corrects for the observed bias and improves the detection of alternative splicing (available at www.fmi.ch/groups/gbioinfo)

    Genomic Prevalence of Heterochromatic H3K9me2 and Transcription Do Not Discriminate Pluripotent from Terminally Differentiated Cells

    Get PDF
    Cellular differentiation entails reprogramming of the transcriptome from a pluripotent to a unipotent fate. This process was suggested to coincide with a global increase of repressive heterochromatin, which results in a reduction of transcriptional plasticity and potential. Here we report the dynamics of the transcriptome and an abundant heterochromatic histone modification, dimethylation of histone H3 at lysine 9 (H3K9me2), during neuronal differentiation of embryonic stem cells. In contrast to the prevailing model, we find H3K9me2 to occupy over 50% of chromosomal regions already in stem cells. Marked are most genomic regions that are devoid of transcription and a subgroup of histone modifications. Importantly, no global increase occurs during differentiation, but discrete local changes of H3K9me2 particularly at genic regions can be detected. Mirroring the cell fate change, many genes show altered expression upon differentiation. Quantitative sequencing of transcripts demonstrates however that the total number of active genes is equal between stem cells and several tested differentiated cell types. Together, these findings reveal high prevalence of a heterochromatic mark in stem cells and challenge the model of low abundance of epigenetic repression and resulting global basal level transcription in stem cells. This suggests that cellular differentiation entails local rather than global changes in epigenetic repression and transcriptional activity
    corecore