22 research outputs found

    Discover hidden splicing variations by mapping personal transcriptomes to personal genomes.

    Get PDF
    RNA-seq has become a popular technology for studying genetic variation of pre-mRNA alternative splicing. Commonly used RNA-seq aligners rely on the consensus splice site dinucleotide motifs to map reads across splice junctions. Consequently, genomic variants that create novel splice site dinucleotides may produce splice junction RNA-seq reads that cannot be mapped to the reference genome. We developed and evaluated an approach to identify 'hidden' splicing variations in personal transcriptomes, by mapping personal RNA-seq data to personal genomes. Computational analysis and experimental validation indicate that this approach identifies personal specific splice junctions at a low false positive rate. Applying this approach to an RNA-seq data set of 75 individuals, we identified 506 personal specific splice junctions, among which 437 were novel splice junctions not documented in current human transcript annotations. 94 splice junctions had splice site SNPs associated with GWAS signals of human traits and diseases. These involve genes whose splicing variations have been implicated in diseases (such as OAS1), as well as novel associations between alternative splicing and diseases (such as ICA1). Collectively, our work demonstrates that the personal genome approach to RNA-seq read alignment enables the discovery of a large but previously unknown catalog of splicing variations in human populations

    αCP binding to a cytosine-rich subset of polypyrimidine tracts drives a novel pathway of cassette exon splicing in the mammalian transcriptome.

    Get PDF
    Alternative splicing (AS) is a robust generator of mammalian transcriptome complexity. Splice site specification is controlled by interactions of cis-acting determinants on a transcript with specific RNA binding proteins. These interactions are frequently localized to the intronic U-rich polypyrimidine tracts (PPT) located 5' to the majority of splice acceptor junctions. αCPs (also referred to as polyC-binding proteins (PCBPs) and hnRNPEs) comprise a subset of KH-domain proteins with high affinity and specificity for C-rich polypyrimidine motifs. Here, we demonstrate that αCPs promote the splicing of a defined subset of cassette exons via binding to a C-rich subset of polypyrimidine tracts located 5' to the αCP-enhanced exonic segments. This enhancement of splice acceptor activity is linked to interactions of αCPs with the U2 snRNP complex and may be mediated by cooperative interactions with the canonical polypyrimidine tract binding protein, U2AF65. Analysis of αCP-targeted exons predicts a substantial impact on fundamental cell functions. These findings lead us to conclude that the αCPs play a direct and global role in modulating the splicing activity and inclusion of an array of cassette exons, thus driving a novel pathway of splice site regulation within the mammalian transcriptome

    Transcriptional regulatory control of mammalian nephron progenitors revealed by multi-factor cistromic analysis and genetic studies

    Get PDF
    Nephron progenitor number determines nephron endowment; a reduced nephron count is linked to the onset of kidney disease. Several transcriptional regulators including Six2, Wt1, Osr1, Sall1, Eya1, Pax2, and Hox11 paralogues are required for specification and/or maintenance of nephron progenitors. However, little is known about the regulatory intersection of these players. Here, we have mapped nephron progenitor-specific transcriptional networks of Six2, Hoxd11, Osr1, and Wt1. We identified 373 multi-factor associated ‘regulatory hotspots’ around genes closely associated with progenitor programs. To examine their functional significance, we deleted ‘hotspot’ enhancer elements for Six2 and Wnt4. Removal of the distal enhancer for Six2 leads to a ~40% reduction in Six2 expression. When combined with a Six2 null allele, progeny display a premature depletion of nephron progenitors. Loss of the Wnt4 enhancer led to a significant reduction of Wnt4 expression in renal vesicles and a mildly hypoplastic kidney, a phenotype also enhanced in combination with a Wnt4 null mutation. To explore the regulatory landscape that supports proper target gene expression, we performed CTCF ChIP-seq to identify insulator-boundary regions. One such putative boundary lies between the Six2 and Six3 loci. Evidence for the functional significance of this boundary was obtained by deep sequencing of the radiation-induced Brachyrrhine (Br) mutant allele. We identified an inversion of the Six2/Six3 locus around the CTCF-bound boundary, removing Six2 from its distal enhancer regulation, but placed next to Six3 enhancer elements which support ectopic Six2 expression in the lens where Six3 is normally expressed. Six3 is now predicted to fall under control of the Six2 distal enhancer. Consistent with this view, we observed ectopic Six3 in nephron progenitors. 4C-seq supports the model for Six2 distal enhancer interactions in wild-type and Br/+ mouse kidneys. Together, these data expand our view of the regulatory genome and regulatory landscape underpinning mammalian nephrogenesis

    Transcriptional regulatory control of mammalian nephron progenitors revealed by multi-factor cistromic analysis and genetic studies.

    Get PDF
    Nephron progenitor number determines nephron endowment; a reduced nephron count is linked to the onset of kidney disease. Several transcriptional regulators including Six2, Wt1, Osr1, Sall1, Eya1, Pax2, and Hox11 paralogues are required for specification and/or maintenance of nephron progenitors. However, little is known about the regulatory intersection of these players. Here, we have mapped nephron progenitor-specific transcriptional networks of Six2, Hoxd11, Osr1, and Wt1. We identified 373 multi-factor associated \u27regulatory hotspots\u27 around genes closely associated with progenitor programs. To examine their functional significance, we deleted \u27hotspot\u27 enhancer elements for Six2 and Wnt4. Removal of the distal enhancer for Six2 leads to a ~40% reduction in Six2 expression. When combined with a Six2 null allele, progeny display a premature depletion of nephron progenitors. Loss of the Wnt4 enhancer led to a significant reduction of Wnt4 expression in renal vesicles and a mildly hypoplastic kidney, a phenotype also enhanced in combination with a Wnt4 null mutation. To explore the regulatory landscape that supports proper target gene expression, we performed CTCF ChIP-seq to identify insulator-boundary regions. One such putative boundary lies between the Six2 and Six3 loci. Evidence for the functional significance of this boundary was obtained by deep sequencing of the radiation-induced Brachyrrhine (Br) mutant allele. We identified an inversion of the Six2/Six3 locus around the CTCF-bound boundary, removing Six2 from its distal enhancer regulation, but placed next to Six3 enhancer elements which support ectopic Six2 expression in the lens where Six3 is normally expressed. Six3 is now predicted to fall under control of the Six2 distal enhancer. Consistent with this view, we observed ectopic Six3 in nephron progenitors. 4C-seq supports the model for Six2 distal enhancer interactions in wild-type and Br/+ mouse kidneys. Together, these data expand our view of the regulatory genome and regulatory landscape underpinning mammalian nephrogenesis

    Site identification in high-throughput RNA-protein interaction data

    Get PDF
    Motivation: Post-transcriptional and co-transcriptional regulation is a crucial link between genotype and phenotype. The central players are the RNA-binding proteins, and experimental technologies [such as cross-linking with immunoprecipitation-(CLIP-) and RIP-seq] for probing their activities have advanced rapidly over the course of the past decade. Statistically robust, flexible computational methods for binding site identification from high-throughput immunoprecipitation assays are largely lacking however.Results: We introduce a method for site identification which provides four key advantages over previous methods: (i) it can be applied on all variations of CLIP and RIP-seq technologies, (ii) it accurately models the underlying read-count distributions, (iii) it allows external covariates, such as transcript abundance (which we demonstrate is highly correlated with read count) to inform the site identification process and (iv) it allows for direct comparison of site usage across cell types or conditions. © The Author 2012. Published by Oxford University Press. All rights reserved

    Discover hidden splicing variations by mapping personal transcriptomes to personal genomes

    Get PDF
    RNA-seq has become a popular technology for studying genetic variation of pre-mRNA alternative splicing. Commonly used RNA-seq aligners rely on the consensus splice site dinucleotide motifs to map reads across splice junctions. Consequently, genomic variants that create novel splice site dinucleotides may produce splice junction RNA-seq reads that cannot be mapped to the reference genome. We developed and evaluated an approach to identify ‘hidden’ splicing variations in personal transcriptomes, by mapping personal RNA-seq data to personal genomes. Computational analysis and experimental validation indicate that this approach identifies personal specific splice junctions at a low false positive rate. Applying this approach to an RNA-seq data set of 75 individuals, we identified 506 personal specific splice junctions, among which 437 were novel splice junctions not documented in current human transcript annotations. 94 splice junctions had splice site SNPs associated with GWAS signals of human traits and diseases. These involve genes whose splicing variations have been implicated in diseases (such as OAS1), as well as novel associations between alternative splicing and diseases (such as ICA1). Collectively, our work demonstrates that the personal genome approach to RNA-seq read alignment enables the discovery of a large but previously unknown catalog of splicing variations in human populations

    αCP binding to a cytosine-rich subset of polypyrimidine tracts drives a novel pathway of cassette exon splicing in the mammalian transcriptome

    Get PDF
    Alternative splicing (AS) is a robust generator of mammalian transcriptome complexity. Splice site specification is controlled by interactions of cis-acting determinants on a transcript with specific RNA binding proteins. These interactions are frequently localized to the intronic U-rich polypyrimidine tracts (PPT) located 5′ to the majority of splice acceptor junctions. αCPs (also referred to as polyC-binding proteins (PCBPs) and hnRNPEs) comprise a subset of KH-domain proteins with high affinity and specificity for C-rich polypyrimidine motifs. Here, we demonstrate that αCPs promote the splicing of a defined subset of cassette exons via binding to a C-rich subset of polypyrimidine tracts located 5′ to the αCP-enhanced exonic segments. This enhancement of splice acceptor activity is linked to interactions of αCPs with the U2 snRNP complex and may be mediated by cooperative interactions with the canonical polypyrimidine tract binding protein, U2AF65. Analysis of αCP-targeted exons predicts a substantial impact on fundamental cell functions. These findings lead us to conclude that the αCPs play a direct and global role in modulating the splicing activity and inclusion of an array of cassette exons, thus driving a novel pathway of splice site regulation within the mammalian transcriptome

    Rbfox Proteins Regulate Splicing as Part of a Large Multiprotein Complex LASR

    No full text
    Rbfox proteins control alternative splicing and posttranscriptional regulation in mammalian brain and are implicated in neurological disease. These proteins recognize the RNA sequence (U)GCAUG, but their structures and diverse roles imply a variety of protein-protein interactions. We find that nuclear Rbfox proteins are bound within a large assembly of splicing regulators (LASR), a multimeric complex containing the proteins hnRNP M, hnRNP H, hnRNP C, Matrin3, NF110/NFAR-2, NF45, and DDX5, all approximately equimolar to Rbfox. We show that splicing repression mediated by hnRNP M is stimulated by Rbfox. Virtually all the intron-bound Rbfox is associated with LASR, and hnRNP M motifs are enriched adjacent to Rbfox crosslinking sites in vivo. These findings demonstrate that Rbfox proteins bind RNA with a defined set of cofactors and affect a broader set of exons than previously recognized. The function of this multimeric LASR complex has implications for deciphering the regulatory codes controlling splicing networks
    corecore