452 research outputs found

    Ultraplex- A rapid, flexible, all-in-one fastq demultiplexer [version 1; peer review- 1 approved]

    Get PDF
    BACKGROUND: The first step of virtually all next generation sequencing analysis involves the splitting of the raw sequencing data into separate files using sample-specific barcodes, a process known as “demultiplexing”. However, we found that existing software for this purpose was either too inflexible or too computationally intensive for fast, streamlined processing of raw, single end fastq files containing combinatorial barcodes. RESULTS: Here, we introduce a fast and uniquely flexible demultiplexer, named Ultraplex, which splits a raw FASTQ file containing barcodes either at a single end or at both 5’ and 3’ ends of reads, trims the sequencing adaptors and low-quality bases, and moves unique molecular identifiers (UMIs) into the read header, allowing subsequent removal of PCR duplicates. Ultraplex is able to perform such single or combinatorial demultiplexing on both single- and paired-end sequencing data, and can process an entire Illumina HiSeq lane, consisting of nearly 500 million reads, in less than 20 minutes. CONCLUSIONS: Ultraplex greatly reduces computational burden and pipeline complexity for the demultiplexing of complex sequencing libraries, such as those produced by various CLIP and ribosome profiling protocols, and is also very user friendly, enabling streamlined, robust data processing. Ultraplex is available on PyPi and Conda and via Github

    RGS4 RNA secondary structure mediates Staufen2 RNP assembly in Neurons

    Get PDF
    RNA‐binding proteins (RBPs) act as posttranscriptional regulators controlling the fate of target mRNAs. Unraveling how RNAs are recognized by RBPs and in turn are assembled into neuronal RNA granules is therefore key to understanding the underlying mechanism. While RNA sequence elements have been extensively characterized, the functional impact of RNA secondary structures is only recently being explored. Here, we show that Staufen2 binds complex, long‐ranged RNA hairpins in the 3′‐untranslated region (UTR) of its targets. These structures are involved in the assembly of Staufen2 into RNA granules. Furthermore, we provide direct evidence that a defined Rgs4 RNA duplex regulates Staufen2‐dependent RNA localization to distal dendrites. Importantly, disrupting the RNA hairpin impairs the observed effects. Finally, we show that these secondary structures differently affect protein expression in neurons. In conclusion, our data reveal the importance of RNA secondary structure in regulating RNA granule assembly, localization and even-tually translation. It is therefore tempting to speculate that secondary structures represent an important code for cells to control the intracellular fate of their mRNAs

    Regulation of constitutive and alternative mRNA splicing across the human transcriptome by PRPF8 is determined by 5' splice site strength.

    Get PDF
    BACKGROUND: Sequential assembly of the human spliceosome on RNA transcripts regulates splicing across the human transcriptome. The core spliceosome component PRPF8 is essential for spliceosome assembly through its participation in ribonucleoprotein (RNP) complexes for splice-site recognition, branch-point formation and catalysis. PRPF8 deficiency is linked to human diseases like retinitis pigmentosa or myeloid neoplasia, but its genome-wide effects on constitutive and alternative splicing remain unclear. RESULTS: Here, we show that alterations in RNA splicing patterns across the human transcriptome that occur in conditions of restricted cellular PRPF8 abundance are defined by the altered splicing of introns with weak 5' splice sites. iCLIP of spliceosome components reveals that PRPF8 depletion decreases RNP complex formation at most splice sites in exon-intron junctions throughout the genome. However, impaired splicing affects only a subset of human transcripts, enriched for mitotic cell cycle factors, leading to mitotic arrest. Preferentially retained introns and differentially used exons in the affected genes contain weak 5' splice sites, but are otherwise indistinguishable from adjacent spliced introns. Experimental enhancement of splice-site strength in mini-gene constructs overcomes the effects of PRPF8 depletion on the kinetics and fidelity of splicing during transcription. CONCLUSIONS: Competition for PRPF8 availability alters the transcription-coupled splicing of RNAs in which weak 5' splice sites predominate, enabling diversification of human gene expression during biological processes like mitosis. Our findings exemplify the regulatory potential of changes in the core spliceosome machinery, which may be relevant to slow-onset human genetic diseases linked to PRPF8 deficiency

    A computationally-enhanced hiCLIP atlas reveals Staufen1-RNA binding features and links 3′ UTR structure to RNA metabolism

    Get PDF
    The structure of mRNA molecules plays an important role in its interactions with trans-acting factors, notably RNA binding proteins (RBPs), thus contributing to the functional consequences of this interplay. However, current transcriptome-wide experimental methods to chart these interactions are limited by their poor sensitivity. Here we extend the hiCLIP atlas of duplexes bound by Staufen1 (STAU1) ∼10-fold, through careful consideration of experimental assumptions, and the development of bespoke computational methods which we apply to existing data. We present Tosca, a Nextflow computational pipeline for the processing, analysis and visualisation of proximity ligation sequencing data generally. We use our extended duplex atlas to discover insights into the RNA selectivity of STAU1, revealing the importance of structural symmetry and duplex-span-dependent nucleotide composition. Furthermore, we identify heterogeneity in the relationship between transcripts with STAU1-bound 3′ UTR duplexes and metabolism of the associated RNAs that we relate to RNA structure: transcripts with short-range proximal 3′ UTR duplexes have high degradation rates, but those with long-range duplexes have low rates. Overall, our work enables the integrative analysis of proximity ligation data delivering insights into specific features and effects of RBP-RNA structure interactions

    Sixty years of genome biology

    Get PDF
    Sixty years after Watson and Crick published the double helix model of DNA's structure, thirteen members of Genome Biology's Editorial Board select key advances in the field of genome biology subsequent to that discovery

    iCLIP identifies novel roles for SAFB1 in regulating RNA processing and neuronal function

    Get PDF
    BACKGROUND: SAFB1 is a RNA binding protein implicated in the regulation of multiple cellular processes such as the regulation of transcription, stress response, DNA repair and RNA processing. To gain further insight into SAFB1 function we used iCLIP and mapped its interaction with RNA on a genome wide level. RESULTS: iCLIP analysis found SAFB1 binding was enriched, specifically in exons, ncRNAs, 3’ and 5’ untranslated regions. SAFB1 was found to recognise a purine-rich GAAGA motif with the highest frequency and it is therefore likely to bind core AGA, GAA, or AAG motifs. Confirmatory RT-PCR experiments showed that the expression of coding and non-coding genes with SAFB1 cross-link sites was altered by SAFB1 knockdown. For example, we found that the isoform-specific expression of neural cell adhesion molecule (NCAM1) and ASTN2 was influenced by SAFB1 and that the processing of miR-19a from the miR-17-92 cluster was regulated by SAFB1. These data suggest SAFB1 may influence alternative splicing and, using an NCAM1 minigene, we showed that SAFB1 knockdown altered the expression of two of the three NCAM1 alternative spliced isoforms. However, when the AGA, GAA, and AAG motifs were mutated, SAFB1 knockdown no longer mediated a decrease in the NCAM1 9–10 alternative spliced form. To further investigate the association of SAFB1 with splicing we used exon array analysis and found SAFB1 knockdown mediated the statistically significant up- and downregulation of alternative exons. Further analysis using RNAmotifs to investigate the frequency of association between the motif pairs (AGA followed by AGA, GAA or AAG) and alternative spliced exons found there was a highly significant correlation with downregulated exons. Together, our data suggest SAFB1 will play an important physiological role in the central nervous system regulating synaptic function. We found that SAFB1 regulates dendritic spine density in hippocampal neurons and hence provide empirical evidence supporting this conclusion. CONCLUSIONS: iCLIP showed that SAFB1 has previously uncharacterised specific RNA binding properties that help coordinate the isoform-specific expression of coding and non-coding genes. These genes regulate splicing, axonal and synaptic function, and are associated with neuropsychiatric disease, suggesting that SAFB1 is an important regulator of key neuronal processes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12915-015-0220-7) contains supplementary material, which is available to authorized users

    The SMAD2/3 interactome reveals that TGFβ controls m6A mRNA methylation in pluripotency.

    Get PDF
    The TGFβ pathway has essential roles in embryonic development, organ homeostasis, tissue repair and disease. These diverse effects are mediated through the intracellular effectors SMAD2 and SMAD3 (hereafter SMAD2/3), whose canonical function is to control the activity of target genes by interacting with transcriptional regulators. Therefore, a complete description of the factors that interact with SMAD2/3 in a given cell type would have broad implications for many areas of cell biology. Here we describe the interactome of SMAD2/3 in human pluripotent stem cells. This analysis reveals that SMAD2/3 is involved in multiple molecular processes in addition to its role in transcription. In particular, we identify a functional interaction with the METTL3-METTL14-WTAP complex, which mediates the conversion of adenosine to N6-methyladenosine (m6A) on RNA. We show that SMAD2/3 promotes binding of the m6A methyltransferase complex to a subset of transcripts involved in early cell fate decisions. This mechanism destabilizes specific SMAD2/3 transcriptional targets, including the pluripotency factor gene NANOG, priming them for rapid downregulation upon differentiation to enable timely exit from pluripotency. Collectively, these findings reveal the mechanism by which extracellular signalling can induce rapid cellular responses through regulation of the epitranscriptome. These aspects of TGFβ signalling could have far-reaching implications in many other cell types and in diseases such as cancer.We thank Cambridge Genomic Services for help in next generation sequencing. The work was 203 supported by the European Research Council starting grant “Relieve IMDs” (L.V., S.B., A.B., 204 P.M.); the Cambridge University Hospitals National Institute for Health Research Biomedical 205 Research Center (L.V., J.K., A.S.L.); the Wellcome Trust PhD program (A.O., L.Y.); a British 206 Heart Foundation PhD studentship (FS/11/77/39327 to A.B.); a Grant-in-Aid for JSPS Fellows 207 (16J08005 to S.N.); and a core support grant from the Wellcome Trust and Medical Research 208 Council to the Wellcome Trust – Medical Research Council Cambridge Stem Cell Institute
    • …
    corecore