22 research outputs found

    Biased chromatin signatures around polyadenylation sites and exons

    Get PDF
    Core RNA-processing reactions in eukaryotic cells occur cotranscriptionally in a chromatin context, but the relationship between chromatin structure and pre-mRNA processing is poorly understood. We observed strong nucleosome depletion around human polyadenylation sites (PAS) and nucleosome enrichment just downstream of PAS. In genes with multiple alternative PAS, higher downstream nucleosome affinity was associated with higher PAS usage, independently of known PAS motifs that function at the RNA level. Conversely, exons were associated with distinct peaks in nucleosome density. Exons flanked by long introns or weak splice sites exhibited stronger nucleosome enrichment, and incorporation of nucleosome density data improved splicing simulation accuracy. Certain histone modifications, including H3K36me3 and H3K27me2, were specifically enriched on exons, suggesting active marking of exon locations at the chromatin level. Together, these findings provide evidence for extensive functional connections between chromatin structure and RNA processing

    Patterns of Intron Gain and Loss in Fungi

    Get PDF
    Little is known about the patterns of intron gain and loss or the relative contributions of these two processes to gene evolution. To investigate the dynamics of intron evolution, we analyzed orthologous genes from four filamentous fungal genomes and determined the pattern of intron conservation. We developed a probabilistic model to estimate the most likely rates of intron gain and loss giving rise to these observed conservation patterns. Our data reveal the surprising importance of intron gain. Between about 150 and 250 gains and between 150 and 350 losses were inferred in each lineage. We discuss one gene in particular (encoding 1-phosphoribosyl-5-pyrophosphate synthetase) that displays an unusually high rate of intron gain in multiple lineages. It has been recognized that introns are biased towards the 5′ ends of genes in intron-poor genomes but are evenly distributed in intron-rich genomes. Current models attribute this bias to 3′ intron loss through a poly-adenosine-primed reverse transcription mechanism. Contrary to standard models, we find no increased frequency of intron loss toward the 3′ ends of genes. Thus, recent intron dynamics do not support a model whereby 5′ intron positional bias is generated solely by 3′-biased intron loss

    Spark: A navigational paradigm for genomic data exploration

    Get PDF
    Biologists possess the detailed knowledge critical for extracting biological insight from genome-wide data resources, and yet they are increasingly faced with nontrivial computational analysis challenges posed by genome-scale methodologies. To lower this computational barrier, particularly in the early data exploration phases, we have developed an interactive pattern discovery and visualization approach, Spark, designed with epigenomic data in mind. Here we demonstrate Spark's ability to reveal both known and novel epigenetic signatures, including a previously unappreciated binding association between the YY1 transcription factor and the corepressor CTBP2 in human embryonic stem cells

    Genome-wide identification of Ago2 binding sites from mouse embryonic stem cells with and without mature microRNAs

    Get PDF
    MicroRNAs (miRNAs) are 19–22-nucleotide noncoding RNAs that post-transcriptionally regulate mRNA targets. We have identified endogenous miRNA binding sites in mouse embryonic stem cells (mESCs), by performing photo-cross-linking immunoprecipitation using antibodies to Argonaute (Ago2) followed by deep sequencing of RNAs (CLIP-seq). We also performed CLIP-seq in Dicer[superscript −/−] mESCs that lack mature miRNAs, allowing us to define whether the association of Ago2 with the identified sites was miRNA dependent. A significantly enriched motif, GCACUU, was identified only in wild-type mESCs in 3′ untranslated and coding regions. This motif matches the seed of a miRNA family that constitutes ~68% of the mESC miRNA population. Unexpectedly, a G-rich motif was enriched in sequences cross-linked to Ago2 in both the presence and absence of miRNAs. Expression analysis and reporter assays confirmed that the seed-related motif confers miRNA-directed regulation on host mRNAs and that the G-rich motif can modulate this regulation.Leukemia & Lymphoma Society of AmericaUnited States. Public Health Service (Grant R01-GM34277)United States. Public Health Service (Grant R01-CA133404)National Cancer Institute (U.S.) (Grant P01-CA42063)National Cancer Institute (U.S.) Cancer Center Support (Grant P30-CA14051

    Spark: a navigational paradigm for genomic data exploration.

    No full text

    Determinants of targeting by endogenous and exogenous microRNAs and siRNAs

    No full text
    Vertebrate mRNAs are frequently targeted for post-transcriptional repression by microRNAs (miRNAs) through mechanisms involving pairing of 3′ UTR seed matches to bases at the 5′ end of miRNAs. Through analysis of expression array data following miRNA or siRNA overexpression or inhibition, we found that mRNA fold change increases multiplicatively (i.e., log-additively) with seed match count and that a single 8 mer seed match mediates down-regulation comparable to two 7 mer seed matches. We identified several targeting determinants that enhance seed match-associated mRNA repression, including the presence of adenosine opposite miRNA base 1 and of adenosine or uridine opposite miRNA base 9, independent of complementarity to the siRNA/miRNA. Increased sequence conservation in the ∼50 bases 5′ and 3′ of the seed match and increased AU content 3′ of the seed match were each independently associated with increased mRNA down-regulation. All of these determinants are enriched in the vicinity of conserved miRNA seed matches, supporting their activity in endogenous miRNA targeting. Together, our results enable improved siRNA off-target prediction, allow integrated ranking of conserved and nonconserved miRNA targets, and show that targeting by endogenous and exogenous miRNAs/siRNAs involves similar or identical determinants

    ChAsE: chromatin analysis and exploration tool

    No full text
    Summary: We present ChAsE, a cross-platform desktop application developed for interactive visualization, exploration and clustering of epigenomic data such as ChIP-seq experiments. ChAsE is designed and developed in close collaboration with several groups of biologists and bioinformaticians with a focus on usability and interactivity. Data can be analyzed through k-means clustering, specifying presence or absence of signal in epigenetic data and performing set operations between clusters. Results can be explored in an interactive heat map and profile plot interface and exported for downstream analysis or as high quality figures suitable for publications. Availability and Implementation: Software, source code (MIT License), data and video tutorials available at http://chase.cs.univie.ac.at. Contact: [email protected] or [email protected] Supplementary information: Supplementary data are available at Bioinformatics online

    ChAsE: chromatin analysis and exploration tool:chromatin analysis and exploration tool

    No full text
    Summary: We present ChAsE, a cross-platform desktop application developed for interactive visualization, exploration and clustering of epigenomic data such as ChIP-seq experiments. ChAsE is designed and developed in close collaboration with several groups of biologists and bioinformaticians with a focus on usability and interactivity. Data can be analyzed through k-means clustering, specifying presence or absence of signal in epigenetic data and performing set operations between clusters. Results can be explored in an interactive heat map and profile plot interface and exported for downstream analysis or as high quality figures suitable for publications. Availability and Implementation: Software, source code (MIT License), data and video tutorials available at http://chase.cs.univie.ac.at. Contact: [email protected] or [email protected] Supplementary information: Supplementary data are available at Bioinformatics online
    corecore