5,057 research outputs found

    Latent rank change detection for analysis of splice-junction microarrays with nonlinear effects

    Full text link
    Alternative splicing of gene transcripts greatly expands the functional capacity of the genome, and certain splice isoforms may indicate specific disease states such as cancer. Splice junction microarrays interrogate thousands of splice junctions, but data analysis is difficult and error prone because of the increased complexity compared to differential gene expression analysis. We present Rank Change Detection (RCD) as a method to identify differential splicing events based upon a straightforward probabilistic model comparing the over- or underrepresentation of two or more competing isoforms. RCD has advantages over commonly used methods because it is robust to false positive errors due to nonlinear trends in microarray measurements. Further, RCD does not depend on prior knowledge of splice isoforms, yet it takes advantage of the inherent structure of mutually exclusive junctions, and it is conceptually generalizable to other types of splicing arrays or RNA-Seq. RCD specifically identifies the biologically important cases when a splice junction becomes more or less prevalent compared to other mutually exclusive junctions. The example data is from different cell lines of glioblastoma tumors assayed with Agilent microarrays.Comment: Published in at http://dx.doi.org/10.1214/10-AOAS389 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Somatic Mutational Landscape of Splicing Factor Genes and Their Functional Consequences across 33 Cancer Types

    Get PDF
    Hotspot mutations in splicing factor genes have been recently reported at high frequency in hematological malignancies, suggesting the importance of RNA splicing in cancer. We analyzed whole-exome sequencing data across 33 tumor types in The Cancer Genome Atlas (TCGA), and we identified 119 splicing factor genes with significant non-silent mutation patterns, including mutation over-representation, recurrent loss of function (tumor suppressor-like), or hotspot mutation profile (oncogene-like). Furthermore, RNA sequencing analysis revealed altered splicing events associated with selected splicing factor mutations. In addition, we were able to identify common gene pathway profiles associated with the presence of these mutations. Our analysis suggests that somatic alteration of genes involved in the RNA-splicing process is common inĀ cancer and may represent an underappreciated hallmark of tumorigenesis

    Characterisation and expression of SPLUNC2, the human orthologue of rodent parotid secretory protein

    Get PDF
    We recently described the Palate Lung Nasal Clone (PLUNC) family of proteins as an extended group of proteins expressed in the upper airways, nose and mouth. Little is known about these proteins, but they are secreted into the airway and nasal lining fluids and saliva where, due to their structural similarity with lipopolysaccharide-binding protein and bactericidal/permeability-increasing protein, they may play a role in the innate immune defence. We now describe the generation and characterisation of novel affinity-purified antibodies to SPLUNC2, and use them to determine the expression of this, the major salivary gland PLUNC. Western blotting showed that the antibodies identified a number of distinct protein bands in saliva, whilst immunohistochemical analysis demonstrated protein expression in serous cells of the major salivary glands and in the ductal lumens as well as in cells of minor mucosal glands. Antibodies directed against distinct epitopes of the protein yielded different staining patterns in both minor and major salivary glands. Using RT-PCR of tissues from the oral cavity, coupled with EST analysis, we showed that the gene undergoes alternative splicing using two 5' non-coding exons, suggesting that the gene is regulated by alternative promoters. Comprehensive RACE analysis using salivary gland RNA as template failed to identify any additional exons. Analysis of saliva showed that SPLUNC2 is subject to N-glycosylation. Thus, our study shows that multiple SPLUNC2 isoforms are found in the oral cavity and suggest that these proteins may be differentially regulated in distinct tissues where they may function in the innate immune response

    Predicting mutually exclusive spliced exons based on exon length, splice site and reading frame conservation, and exon sequence homology

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Alternative splicing of pre-mature RNA is an important process eukaryotes utilize to increase their repertoire of different protein products. Several types of different alternative splice forms exist including exon skipping, differential splicing of exons at their 3'- or 5'-end, intron retention, and mutually exclusive splicing. The latter term is used for clusters of internal exons that are spliced in a mutually exclusive manner.</p> <p>Results</p> <p>We have implemented an extension to the WebScipio software to search for mutually exclusive exons. Here, the search is based on the precondition that mutually exclusive exons encode regions of the same structural part of the protein product. This precondition provides restrictions to the search for candidate exons concerning their length, splice site conservation and reading frame preservation, and overall homology. Mutually exclusive exons that are not homologous and not of about the same length will not be found. Using the new algorithm, mutually exclusive exons in several example genes, a dynein heavy chain, a muscle myosin heavy chain, and Dscam were correctly identified. In addition, the algorithm was applied to the whole <it>Drosophila melanogaster </it>X chromosome and the results were compared to the Flybase annotation and an <it>ab initio </it>prediction. Clusters of mutually exclusive exons might be subsequent to each other and might encode dozens of exons.</p> <p>Conclusions</p> <p>This is the first implementation of an automatic search for mutually exclusive exons in eukaryotes. Exons are predicted and reconstructed in the same run providing the complete gene structure for the protein query of interest. WebScipio offers high quality gene structure figures with the clusters of mutually exclusive exons colour-coded, and several analysis tools for further manual inspection. The genome scale analysis of all genes of the <it>Drosophila melanogaster </it>X chromosome showed that WebScipio is able to find all but two of the 28 annotated mutually exclusive spliced exons and predicts 39 new candidate exons. Thus, WebScipio should be able to identify mutually exclusive spliced exons in any query sequence from any species with a very high probability. WebScipio is freely available to academics at <url>http://www.webscipio.org</url>.</p

    Identification of Conserved Splicing Motifs in Mutually Exclusive Exons of 15 Insect Species

    Get PDF
    Background: During alternative splicing, the inclusion of an exon in the final mRNA molecule is determined by nuclear proteins that bind cis-regulatory sequences in a target pre-mRNA molecule. A recent study suggested that the regulatory codes of individual RNA-binding proteins may be nearly immutable between very diverse species such as mammals and insects. The model system Drosophila melanogaster therefore presents an excellent opportunity for the study of alternative splicing due to the availability of quality EST annotations in FlyBase. Methods: In this paper, we describe an in silico analysis pipeline to extract putative exonic splicing regulatory sequences from a multiple alignment of 15 species of insects. Our method, ESTs-to-ESRs (E2E), uses graph analysis of EST splicing graphs to identify mutually exclusive (ME) exons and combines phylogenetic measures, a sliding window approach along the multiple alignment and the Welchā€™s t statistic to extract conserved ESR motifs. Results: The most frequent 100% conserved word of length 5 bp in different insect exons was ā€œATGGAā€. We identified 799 statistically significant ā€œspikeā€ hexamers, 218 motifs with either a left or right FDR corrected spike magnitude p-value \u3c 0.05 and 83 with both left and right uncorrected p \u3c 0.01. 11 genes were identified with highly significant motifs in one ME exon but not in the other, suggesting regulation of ME exon splicing through these highly conserved hexamers. The majority of these genes have been shown to have regulated spatiotemporal expression. 10 elements were found to match three mammalian splicing regulator databases. A putative ESR motif, GATGCAG, was identified in the ME-13b but not in the ME-13a of Drosophila N-Cadherin, a gene that has been shown to have a distinct spatiotemporal expression pattern of spliced isoforms in a recent study. Conclusions: Analysis of phylogenetic relationships and variability of sequence conservation as implemented in the E2E spikes method may lead to improved identification of ESRs. We found that approximately half of the putative ESRs in common between insects and mammals have a high statistical support (p \u3c 0.01). Several Drosophila genes with spatiotemporal expression patterns were identified to contain putative ESRs located in one exon of the ME exon pairs but not in the other

    Fox-1 family of RNA-binding proteins

    Get PDF
    The Fox-1 family of RNA-binding proteins are evolutionarily conserved regulators of tissue-specific alternative splicing in metazoans. The Fox-1 family specifically recognizes the (U)GCAUG stretch in regulated exons or in flanking introns, and either promotes or represses target exons. Recent unbiased bioinformatics analyses of alternatively spliced exons and comparison of various vertebrate genomes identified the (U)GCAUG stretch as a highly conserved and widely distributed element enriched in intronic regions surrounding exons with altered inclusion in muscle, heart, and brain, consistent with specific expression of Fox-1 and Fox-2 in these tissues. Global identification of Fox-2 target RNAs in living cells revealed that many of the Fox-2 target genes themselves encode splicing regulators. Further systematic elucidation of target genes of the Fox-1 family and other splicing regulators in various tissues will lead to a comprehensive understanding of splicing regulatory networks
    • ā€¦
    corecore