285 research outputs found

    Computational discovery of animal small RNA genes and targets

    Get PDF
    Though recently discovered, small RNAs appear to play a wealth of regulatory roles, being involved in degradation of target mRNAs, translation silencing of target genes, chromatin remodeling and transposon silencing. Presented here are the computational tools that I developed to annotate and characterize small RNA genes and to identify their targets. One of these tools is oligomap, a novel software for fast and exhaustive identi�cation of nearly-perfect matches of small RNAs in sequence databases. Oligomap is part of an automated annotation pipeline used in our laboratory to annotate small RNA sequences. The application of these tools to samples of small RNAs obtained from mouse and human germ cells together with subsequent computational analyses lead to the discovery of a new class of small RNAs which are now called piRNAs. The computational analysis revealed that piRNAs have a strong uridine preference at their 5' end, that unlike miRNAs, piRNAs are not excised from fold-back precursors but rather from long primary transcripts, and that the genome organization of their genes is conserved between human and mouse even though piRNAs on the sequence level are poorly conserved. In vertebrates, the most studied class of small regulatory RNAs are the miRNAs which bind to mRNAs and block translation. A computational framework is introduced to identify miRNA targets in mammals, ies, worms and �sh. The method uses extensive cross species conservation information to predict miRNA binding sites that are under evolutionary pressure. A downstream analysis of predicted miRNA targets revealed novel properties of miRNA target sites, one of which is a positional bias of miRNA target sites in long mammalian 3' untranslated regions. Intersection of our predictions with biochemical pathway annotation data suggested novel functions for some of the miRNAs. To gain further insights into the mechanism of miRNA targeting, I studied microarray data obtained in siRNA experiments. SiRNAs have been shown to produce o�- targets that resemble miRNA targets. This analysis suggests the presence of additional determinants of miRNA target site functionality (beyond complementarity between the miRNA 5' end and the target) in the close vicinity (about 150 nucleotides) of the miRNA-complementary site. Finally, as part of a study aiming to reduce siRNA o�-target e�ects by introducing chemical modi�cations in the siRNA, I performed microarray data analysis of siRNA transfection experiments. Presented are the methods used to quantify o�- target activity of siRNAs carrying di�erent types of chemical modi�cations. The analysis revealed that o�-targets caused by the passenger strand of the siRNA can be reduced by 5'-O-methylation

    Identification of active regulatory regions from DNA methylation data

    Get PDF
    We have recently shown that transcription factor binding leads to defined reduction in DNA methylation, allowing for the identification of active regulatory regions from high-resolution methylomes. Here, we present MethylSeekR, a computational tool to accurately identify such footprints from bisulfite-sequencing data. Applying our method to a large number of published human methylomes, we demonstrate its broad applicability and generalize our previous findings from a neuronal differentiation system to many cell types and tissues. MethylSeekR is available as an R package at www.bioconductor.or

    The mammalian TRIM-NHL protein TRIM71/LIN-41 is a repressor of mRNA function

    Get PDF
    TRIM-NHL proteins are conserved regulators of development and differentiation but their molecular function has remained largely elusive. Here, we report an as yet unrecognized activity for the mammalian TRIM-NHL protein TRIM71 as a repressor of mRNAs. We show that TRIM71 is associated with mRNAs and that it promotes translational repression and mRNA decay. We have identified Rbl1 and Rbl2, two transcription factors whose down-regulation is important for stem cell function, as TRIM71 targets in mouse embryonic stem cells. Furthermore, one of the defining features of TRIM-NHL proteins, the NHL domain, is necessary and sufficient to target TRIM71 to RNA, while the RING domain that confers ubiquitin ligase activity is dispensable for repression. Our results reveal strong similarities between TRIM71 and Drosophila BRAT, the best-studied TRIM-NHL protein and a well-documented translational repressor, suggesting that BRAT and TRIM71 are part of a family of mRNA repressors regulating proliferation and differentiatio

    Inference of miRNA targets using evolutionary conservation and pathway analysis

    Get PDF
    BACKGROUND: MicroRNAs have emerged as important regulatory genes in a variety of cellular processes and, in recent years, hundreds of such genes have been discovered in animals. In contrast, functional annotations are available only for a very small fraction of these miRNAs, and even in these cases only partially. RESULTS: We developed a general Bayesian method for the inference of miRNA target sites, in which, for each miRNA, we explicitly model the evolution of orthologous target sites in a set of related species. Using this method we predict target sites for all known miRNAs in flies, worms, fish, and mammals. By comparing our predictions in fly with a reference set of experimentally tested miRNA-mRNA interactions we show that our general method performs at least as well as the most accurate methods available to date, including ones specifically tailored for target prediction in fly. An important novel feature of our model is that it explicitly infers the phylogenetic distribution of functional target sites, independently for each miRNA. This allows us to infer species-specific and clade-specific miRNA targeting. We also show that, in long human 3' UTRs, miRNA target sites occur preferentially near the start and near the end of the 3' UTR. To characterize miRNA function beyond the predicted lists of targets we further present a method to infer significant associations between the sets of targets predicted for individual miRNAs and specific biochemical pathways, in particular those of the KEGG pathway database. We show that this approach retrieves several known functional miRNA-mRNA associations, and predicts novel functions for known miRNAs in cell growth and in development. CONCLUSION: We have presented a Bayesian target prediction algorithm without any tunable parameters, that can be applied to sequences from any clade of species. The algorithm automatically infers the phylogenetic distribution of functional sites for each miRNA, and assigns a posterior probability to each putative target site. The results presented here indicate that our general method achieves very good performance in predicting miRNA target sites, providing at the same time insights into the evolution of target sites for individual miRNAs. Moreover, by combining our predictions with pathway analysis, we propose functions of specific miRNAs in nervous system development, inter-cellular communication and cell growth. The complete target site predictions as well as the miRNA/pathway associations are accessible on the ElMMo web server

    Overestimation of alternative splicing caused by variable probe characteristics in exon arrays

    Get PDF
    In higher eukaryotes, alternative splicing is a common mechanism for increasing transcriptome diversity. Affymetrix exon arrays were designed as a tool for monitoring the relative expression levels of hundreds of thousands of known and predicted exons with a view to detecting alternative splicing events. In this article, we have analyzed exon array data from many different human and mouse tissues and have uncovered a systematic relationship between transcript-fold change and alternative splicing as reported by the splicing index. Evidence from dilution experiments and deep sequencing suggest that this effect is of technical rather than biological origin and that it is driven by sequence features of the probes. This effect is substantial and results in a 12-fold overestimation of alternative splicing events in genes that are differentially expressed. By cross-species exon array comparison, we could further show that the systematic bias persists even across species boundaries. Failure to consider this effect in data analysis would result in the reproducible false detection of apparently conserved alternative splicing events. Finally, we have developed a software in R called COSIE (Corrected Splicing Indices for Exon arrays) that for any given set of new exon array experiments corrects for the observed bias and improves the detection of alternative splicing (available at www.fmi.ch/groups/gbioinfo

    Engineering of a conditional allele reveals multiple roles of XRN2 in Caenorhabditis elegans development and substrate specificity in microRNA turnover

    Get PDF
    Although XRN2 proteins are highly conserved eukaryotic 5′→3′ exonucleases, little is known about their function in animals. Here, we characterize Caenorhabditis elegans XRN2, which we find to be a broadly and constitutively expressed nuclear protein. An xrn-2 null mutation or loss of XRN2 catalytic activity causes a molting defect and early larval arrest. However, by generating a conditionally mutant xrn-2ts strain de novo through an approach that may be also applicable to other genes of interest, we reveal further functions in fertility, during embryogenesis and during additional larval stages. Consistent with the known role of XRN2 in controlling microRNA (miRNA) levels, we can demonstrate that loss of XRN2 activity stabilizes some rapidly decaying miRNAs. Surprisingly, however, other miRNAs continue to decay rapidly in xrn-2ts animals. Thus, XRN2 has unanticipated miRNA specificity in vivo, and its diverse developmental functions may relate to distinct substrates. Finally, our global analysis of miRNA stability during larval stage 1 reveals that miRNA passenger strands (miR*s) are substantially less stable than guide strands (miRs), supporting the notion that the former are mostly byproducts of biogenesis rather than a less abundant functional specie

    QuasR: quantification and annotation of short reads in R

    Get PDF
    Summary: QuasR is a package for the integrated analysis of high-throughput sequencing data in R, covering all steps from read preprocessing, alignment and quality control to quantification. QuasR supports different experiment types (including RNA-seq, ChIP-seq and Bis-seq) and analysis variants (e.g. paired-end, stranded, spliced and allele-specific), and is integrated in Bioconductor so that its output can be directly processed for statistical analysis and visualization. Availability and implementation: QuasR is implemented in R and C/C++. Source code and binaries for major platforms (Linux, OS X and MS Windows) are available from Bioconductor (www.bioconductor.org/packages/release/bioc/html/QuasR.html). The package includes a ‘vignette' with step-by-step examples for typical work flows. Contact: [email protected] Supplementary information: Supplementary data are available at Bioinformatics onlin

    Functional characterization of C. elegans Y-box-binding proteins reveals tissue-specific functions and a critical role in the formation of polysomes

    Get PDF
    The cold shock domain is one of the most highly conserved motifs between bacteria and higher eukaryotes. Y-box-binding proteins represent a subfamily of cold shock domain proteins with pleiotropic functions, ranging from transcription in the nucleus to translation in the cytoplasm. These proteins have been investigated in all major model organisms except Caenorhabditis elegans. In this study, we set out to fill this gap and present a functional characterization of CEYs, the C. elegans Y-box-binding proteins. We find that, similar to other organisms, CEYs are essential for proper gametogenesis. However, we also report a novel function of these proteins in the formation of large polysomes in the soma. In the absence of the somatic CEYs, polysomes are dramatically reduced with a simultaneous increase in monosomes and disomes, which, unexpectedly, has no obvious impact on animal biology. Because transcripts that are enriched in polysomes in wild-type animals tend to be less abundant in the absence of CEYs, our findings suggest that large polysomes might depend on transcript stabilization mediated by CEY protein

    Overestimation of alternative splicing caused by variable probe characteristics in exon arrays

    Get PDF
    In higher eukaryotes, alternative splicing is a common mechanism for increasing transcriptome diversity. Affymetrix exon arrays were designed as a tool for monitoring the relative expression levels of hundreds of thousands of known and predicted exons with a view to detecting alternative splicing events. In this article, we have analyzed exon array data from many different human and mouse tissues and have uncovered a systematic relationship between transcript-fold change and alternative splicing as reported by the splicing index. Evidence from dilution experiments and deep sequencing suggest that this effect is of technical rather than biological origin and that it is driven by sequence features of the probes. This effect is substantial and results in a 12-fold overestimation of alternative splicing events in genes that are differentially expressed. By cross-species exon array comparison, we could further show that the systematic bias persists even across species boundaries. Failure to consider this effect in data analysis would result in the reproducible false detection of apparently conserved alternative splicing events. Finally, we have developed a software in R called COSIE (Corrected Splicing Indices for Exon arrays) that for any given set of new exon array experiments corrects for the observed bias and improves the detection of alternative splicing (available at www.fmi.ch/groups/gbioinfo)
    corecore