3,757 research outputs found

    regSNPs-splicing: a tool for prioritizing synonymous single-nucleotide substitution

    Get PDF
    While synonymous single-nucleotide variants (sSNVs) have largely been unstudied, since they do not alter protein sequence, mounting evidence suggests that they may affect RNA conformation, splicing, and the stability of nascent-mRNAs to promote various diseases. Accurately prioritizing deleterious sSNVs from a pool of neutral ones can significantly improve our ability of selecting functional genetic variants identified from various genome-sequencing projects, and, therefore, advance our understanding of disease etiology. In this study, we develop a computational algorithm to prioritize sSNVs based on their impact on mRNA splicing and protein function. In addition to genomic features that potentially affect splicing regulation, our proposed algorithm also includes dozens structural features that characterize the functions of alternatively spliced exons on protein function. Our systematical evaluation on thousands of sSNVs suggests that several structural features, including intrinsic disorder protein scores, solvent accessible surface areas, protein secondary structures, and known and predicted protein family domains, show significant differences between disease-causing and neutral sSNVs. Our result suggests that the protein structure features offer an added dimension of information while distinguishing disease-causing and neutral synonymous variants. The inclusion of structural features increases the predictive accuracy for functional sSNV prioritization

    Transcriptional landscape of neuronal and cancer stem cells

    Get PDF
    Tumor mass is composed by heterogeneous cell population including a subset of “cancer stem cells” (CSC). Oncogenic signals foster CSC by transforming tissue stem cells or by reprogramming progenitor/differentiated cells towards stemness. Thus, CSC share features with cancer and stem cells (e.g. self-renewal, hierarchical developmental program leading to differentiated cells, epithelial/mesenchimal transition) and these latter are maintained by the constitutive activation of stemness-promoting signals. CSC could trigger tumor formation, drive to resistance to conventional therapeutics and underlie patients’ relapse. Indeed, stem cell signatures have been associated with poor prognosis in various. This background makes the identification of CSC molecular features mandatory to highlight the survival inner working and to design novel CSC specific therapeutic strategies. Medulloblastoma (MB) is the most common childhood malignant brain tumor and a leading cause of cancerrelated morbidity and mortality. Current multimodal therapies are effective in about 50% of patients but often cause long-term side effects, i.e. developmental, neurological, neuroendocrine and psychosocial deficits (Northcott PA Nature Rev cancer 2012). For many years, MB treated as a single tumor entity despite the divergent tumor histology, patients’ outcome and drug sensitivity, and also by the diversity of the stem cell of origin. Very recently the scenario of human MB has dramatically changed since its heterogeneous biology has been addressed by high-throughput gene expression analysis (oligonucleotide microarrays) or by the powerful genomic next-generation sequencing. These led to the identification of four tumor subgroups (WNT, SHH, Group 3 and Group 4) uncovering the existence of a highly diverse mutational spectra and gene expression. However a quantitative approach has not yet been applied to the transcriptional landscape of Medulloblastoma stem cells (MbSC) through RNA Next Generation Sequencing (RNA-Seq) technology. This is a relevant issue, since RNA-Seq is able to interrogate the genome wide global transcriptome including new transcripts, alternative spliced isoforms and non-coding RNAs. Lower rhombic lip progenitors of the dorsal brainstem are considered the trigger cells in WNT tumors; in SHH subgroup initiation cells are Prominin1+ CD15+ stem cells from the subventricular zone requiring the commitment to Math1+ granule cell progenitors [GCP] of the external granule cell layer [EGL]; while Math1+ or Math1- EGL-GCP or Prominin1+/lineage-negative stem cells sustain the MYC driven Group 3. MbSC derived from SHH tumors and postnatal normal cerebellar stem cells (NcSC) have been reported to share several features. A key signal for both of them is Hedgehog. Furthermore, both NcSC and MbSC display up-regulation of stemness genes (e.g Sox2, Nestin, Nanog, Prom1). Finally, constitutive activation of the Shh pathway by conditional deletion of Ptch1 inhibitory receptor in NcSC, promote medulloblastoma in vivo, producing a mouse model of the human SHH tumor. Acquisition of stemness features may therefore represent the first step of oncogenic conversion. Cooperation with additional oncogenic signals is however needed to enhance MbSC tumorigenicity. In order to understand the MbSCs transcriptional programs, we analyze by RNA-Seq, MbSC derived from Ptch1+/- tumors (Ptch1+/- MbSC). This choice, of a genetically determined model of MB, has allowed us to work with Ptch1+/- MbSC together with appropriate NcSC counterpart, and to analyze biological replicates doing statistical analysis. We identify a number of transcripts, annotated ones, novel isoforms, and long non-coding RNAs, characterizing MbSC and/or NcSC. Some of these genes control stemness or are cancer related and conserved in human medulloblastomas. Interestingly a subset of them, belonging to cell stress response, are of prognostic relevance being significantly related to clinical outcome. Correlation of genes expression characterizing MbSC with survival information from our human medulloblastomas database further demonstrates the significance of these findings. Our data suggest that the modulation of normal and cancer stem cell functions observed in vitro is effective in dissecting the transcriptional programs underlying the in vivo behavior of human medulloblastomas

    Alu Exonization Events Reveal Features Required for Precise Recognition of Exons by the Splicing Machinery

    Get PDF
    Despite decades of research, the question of how the mRNA splicing machinery precisely identifies short exonic islands within the vast intronic oceans remains to a large extent obscure. In this study, we analyzed Alu exonization events, aiming to understand the requirements for correct selection of exons. Comparison of exonizing Alus to their non-exonizing counterparts is informative because Alus in these two groups have retained high sequence similarity but are perceived differently by the splicing machinery. We identified and characterized numerous features used by the splicing machinery to discriminate between Alu exons and their non-exonizing counterparts. Of these, the most novel is secondary structure: Alu exons in general and their 5′ splice sites (5′ss) in particular are characterized by decreased stability of local secondary structures with respect to their non-exonizing counterparts. We detected numerous further differences between Alu exons and their non-exonizing counterparts, among others in terms of exon–intron architecture and strength of splicing signals, enhancers, and silencers. Support vector machine analysis revealed that these features allow a high level of discrimination (AUC = 0.91) between exonizing and non-exonizing Alus. Moreover, the computationally derived probabilities of exonization significantly correlated with the biological inclusion level of the Alu exons, and the model could also be extended to general datasets of constitutive and alternative exons. This indicates that the features detected and explored in this study provide the basis not only for precise exon selection but also for the fine-tuned regulation thereof, manifested in cases of alternative splicing

    ExonImpact: prioritizing pathogenic alternative splicing events

    Get PDF
    Alternative splicing (AS) is a closely regulated process that allows a single gene to encode multiple protein isoforms, thereby contributing to the diversity of the proteome. Dysregulation of the splicing process has been found to be associated with many inherited diseases. However, among the pathogenic AS events, there are numerous “passenger” events whose inclusion or exclusion does not lead to significant changes with respect to protein function. In this study, we evaluate the secondary and tertiary structural features of proteins associated with disease-causing and neutral AS events, and show that several structural features are strongly associated with the pathological impact of exon inclusion. We further develop a machine-learning-based computational model, ExonImpact, for prioritizing and evaluating the functional consequences of hitherto uncharacterized AS events. We evaluated our model using several strategies including cross-validation, and data from the Gene-Tissue Expression (GTEx) and ClinVar databases. ExonImpact is freely available at http://watson.compbio.iupui.edu/ExonImpact

    ClinGen Myeloid Malignancy Variant Curation Expert Panel recommendations for germline RUNX1 variants

    Get PDF
    Standardized variant curation is essential for clinical care recommendations for patients with inherited disorders. Clinical Genome Resource (ClinGen) variant curation expert panels are developing disease-associated gene specifications using the 2015 American College of Medical Genetics and Genomics (ACMG) and Association for Molecular Pathology (AMP) guidelines to reduce curation discrepancies. The ClinGen Myeloid Malignancy Variant Curation Expert Panel (MM-VCEP) was created collaboratively between the American Society of Hematology and ClinGen to perform gene- and disease-specific modifications for inherited myeloid malignancies. The MM-VCEP began optimizing ACMG/AMP rules for RUNX1 because many germline variants have been described in patients with familial platelet disorder with a predisposition to acute myeloid leukemia, characterized by thrombocytopenia, platelet functional/ultrastructural defects, and a predisposition to hematologic malignancies. The 28 ACMG/AMP codes were tailored for RUNX1 variants by modifying gene/disease specifications, incorporating strength adjustments of existing rules, or both. Key specifications included calculation of minor allele frequency thresholds, formulating a semi-quantitative approach to counting multiple independent variant occurrences, identifying functional domains and mutational hotspots, establishing functional assay thresholds, and characterizing phenotype-specific guidelines. Preliminary rules were tested by using a pilot set of 52 variants; among these, 50 were previously classified as benign/likely benign, pathogenic/likely pathogenic, variant of unknown significance (VUS), or conflicting interpretations (CONF) in ClinVar. The application of RUNX1-specific criteria resulted in a reduction in CONF and VUS variants by 33%, emphasizing the benefit of gene-specific criteria and sharing internal laboratory data.Xi Luo, Simone Feurstein, Shruthi Mohan, Christopher C. Porter, Sarah A. Jackson, Sioban Keel ... et al

    IDENTIFICATION OF RNA STRUCTURES MODULATING THE EXPRESSION OF THE mRNA BIOGENESIS FACTOR SUS1

    Full text link
    Tesis por compendioSus1 is a conserved protein involved in chromatin remodeling and mRNA biogenesis. The SUS1 gene of Saccharomyces cerevisiae is unusual, as it contains two introns and is alternatively spliced, retaining one or both introns in response to changes in environmental conditions. SUS1 splicing may allow the cell to control Sus1 expression, but the mechanisms that regulate this process remain unknown. In this thesis project, we have investigated whether the structure adopted by SUS1 RNA sequences contributes to regulate the splicing of this gene. Using in silico analyses together with NMR spectroscopy, gel electrophoresis and UV thermal denaturation experiments, we first show that the downstream intron (I2) of SUS1 forms a weakly-stable, 37-nucleotide stem-loop structure containing the branch site near its apical loop and the 3' splice site after the stem terminus. A cellular assay revealed that two of four mutants containing altered I2 structures had significantly impaired SUS1 expression. Semi-quantitative RT-PCR experiments indicated that all mutants accumulated unspliced SUS1 pre-mRNA and/or induced distorted levels of fully spliced mRNA relative to wild-type. Concomitantly, Sus1 cellular functions in histone H2B deubiquitination and mRNA export were affected in I2 hairpin mutants that inhibited splicing. The second part of the thesis project focuses on the exon located between the two introns of the SUS1 gene. This middle exon (E2) can be skipped during splicing, is generated in circular form, and has been found to influence the splicing of the flanking introns, an unusual situation in budding yeast where splicing mainly relies on intron recognition. Using NMR spectroscopy, gel electrophoresis, UV thermal denaturation and ribose 2'-OH modification experiments combined with computational predictions, we show that E2 of SUS1 comprises a conserved double-helical stem topped by a three-way junction. One of the hairpins emerging from the junction exhibited significant thermal stability and was closed by an unusually structured purine-rich loop. This loop contained two consecutive sheared G:A base pairs and was structurally related to the substrate loop of the VS ribozyme. Cellular assays revealed that three mutants containing altered E2 structures had impaired SUS1 expression and that a compensatory mutation restoring the conserved stem recovered expression to wild-type levels. Semi-quantitative RT-PCR experiments indicated that all mutants were capable of altering the quantities of unspliced and/or fully-spliced SUS1 RNA transcripts relative to wildtype. Overall, the results gathered in this thesis project indicate that RNA structures formed by the middle exon and the second intron of the S. cerevisiae SUS1 gene are relevant for splicing and also influence other processes of SUS1 mRNA biogenesis.Sus1 es una proteína conservada implicada en remodelación de cromatina y biogénesis de moléculas de ARNm. El gen SUS1 de Saccharomyces cerevisiae es peculiar, ya que contiene dos intrones y sufre un proceso de ayuste (corte y empalme) alternativo, reteniendo uno o ambos intrones en respuesta a cambios en las condiciones ambientales. El ayuste del ARNpre-m de SUS1 puede permitir a la célula controlar la expresión de la proteína Sus1, pero los mecanismos que regulan este proceso son poco conocidos. En este proyecto de tesis hemos investigado si la estructura adoptada por secuencias de ARN de SUS1 contribuye a regular el proceso de ayuste de este gen. Utilizando análisis in silico junto con espectroscopia de RMN, electroforesis en gel y experimentos de desnaturalización térmica monitorizados por UV, primero demostramos que el ARN del segundo intrón (I2) del gen SUS1 forma una horquilla débilmente estable de 37 nucleótidos. Esta horquilla contiene nucleótidos del sitio de ramificación (branch site) en su bucle apical y nucleótidos del sitio 3' de empalme adyacentes al extremo inferior del tallo. A través de ensayos funcionales descubrimos que dos de cuatro mutantes que alteran la estructura de la horquilla I2 exhibían peor expresión de SUS1. Experimentos de RT-PCR semicuantitativos indicaron que todos los mutantes acumularon ARNpre-m SUS1 no ayustado y/o indujeron cambios en los niveles de ARNm maduro con respecto a la secuencia silvestre. Además, las funciones celulares de Sus1 relativas a desubicuitinación de histona H2B y transporte de ARNm se vieron afectadas en los mutantes de la horquilla I2 que inhibían el proceso de ayuste. La segunda parte de la memoria de tesis se centra en el análisis del exón central (E2) situado entre los dos intrones del gen SUS1. Este exón puede eliminarse durante el proceso de ayuste, se genera en forma circular, e influye en el procesamiento de los intrones adyacentes, una situación inusual para las regiones exónicas de S. cerevisiae, donde el ayuste se basa principalmente en el reconocimiento de intrones. Utilizando experimentos de espectroscopía de RMN, electroforesis en gel, desnaturalización térmica y modificación química combinados con predicciones computacionales, demostramos que el ARN del exón E2 de SUS1 forma un tallo conservado de doble hélice coronado por una intersección de tres hélices. Una de las horquillas que emergen de esta intersección presentó una estabilidad térmica significativa, así como un bucle apical rico en purinas inusualmente estructurado. Este bucle contiene dos pares de bases G:A consecutivos y está estructuralmente relacionado con el bucle de substrato de la ribozima VS. Ensayos celulares revelaron que tres mutantes con estructuras modificadas de E2 exhibían peor expresión de SUS1, y que una mutación compensatoria que restauraba el tallo conservado recuperaba la expresión a los niveles de la secuencia silvestre. Experimentos de RT-PCR semicuantitativos indicaron que todos los mutantes de E2 eran capaces de alterar las cantidades de transcritos ayustados y no ayustados de SUS1 con respecto a la secuencia silvestre. En general, los resultados obtenidos en este proyecto de tesis indican que las estructuras de ARN formadas por el exón central y el segundo intrón del gen SUS1 de S. cerevisiae son relevantes para el ayuste y otros procesos implicados en la biogénesis del ARNm del gen SUS1.Sus1 és una proteïna conservada implicada a la remodelació de la cromatina i la biogènesi de l'ARNm. El gen SUS1 de Saccharomyces cerevisiae és inusual, ja que conté dos introns i s'empalma de manera alternativa, retenint un o ambdós introns en resposta a canvis en les condicions ambientals. L'empalmament de SUS1 pot permetre a la cèl·lula controlar l'expressió de Sus1, però els mecanismes que regulen aquest procés són segueixen sent desconeguts. En aquest projecte de tesi investiguem si l'estructura adoptada per seqüències d'ARN de SUS1 contribueix a regular l'empalmament d'aquest gen. Emprant anàlisi in silico juntament amb espectrometria de RMN, electroforesi en gel i experiments de desnaturalització tèrmica d'UV, es mostra primer que l'intró aigües a baix (I2) de SUS1 forma una estructura de forqueta de 37 nucleòtids feblement estable que conté el lloc de la branca a prop del seu bucle apical; i el lloc d'empalmamnet 3¿ després de l'extrem de la forqueta. Un assaig cel·lular va revelar que dos de quatre mutants que contenien estructures alterades de l'I2 havien modificat significativament l'expressió de SUS1. Els experiments semi-quantitatius de RT-PCR van indicar que tots els mutants acumulaven el pre-ARNm madur respecte al tipus salvatge. Concomitantment, les funcions cel·lulars de Sus1 a la desubiqüitinació de la histona H2B i l'exportació d'ARNm es van veure afectats als mutants de la forqueta d'I2 que inhibeixen l'empalmament. La segona part del projecte de tesi se centra a l'exó situat entre els dos introns del gen SUS1. Aquest exó (E2) es pot ometre durant l'empalmament, es genera amb forma circular, i s'ha trobat que influeix a l'empalmamet dels introns que flanquegen, una situació inusual al llevat on l'empalmament està basat principalment al reconeixement d'introns. Emprant espectroscòpia de RMN, electroforesi en gel, desnaturalització tèrmica d'UV i experiments de modificació de ribosa 2¿-OH combinats amb prediccions computacionals, mostrem que E2 de SUS1 comprén un tall conservat de doble hèlix corornat per una unió de tres vies. Una de les forquetes que emergeixen de la unió, va mostrar una estabilitat tèrmica significativa i va ser tapada per un bucle ric en purina inusualment estructurat. Aquest bucle contenia dos pars de bases G:A tallats consecutivament i estava estructuralment relacionat amb el bucle de substrat del ribozim VS. Els assajos cel·lulars van revelar que tres mutants que contenien estructures alterades de E2 havien alterat l'expressió de SUS1 i que una mutació compensatòria que restaurava el tall conservat recuperava l'expressió a nivells del tipus salvatge. Els assajos cel·lulars van revelat que tres mutants que contenien estructures alterades d'E2 havien alterat l'expressió de SUS1 i que una mutació compensatòria que restaurava el tall conservat recuperava l'expressió a nivell d'un tipus salvatge. Els experiments semi-quantitatius de RT-PCR van indicar que tots els mutants eren capaços d'alterar les quantitats de transcrits d'ARN de SUS1 no empalmats i/o empalmats en relació amb el tipus salvatge. En general, els resultats obtinguts en aquesta investigació indiquen que les estructures d'ARN formades per l'exó mitjà i el segon intró de SUS1 de S. cerevisiae són rellevants per l'empalmament i també influeixen a altres processos de biogènesi de l'ARN de SUS1Abuqattam, AN. (2017). IDENTIFICATION OF RNA STRUCTURES MODULATING THE EXPRESSION OF THE mRNA BIOGENESIS FACTOR SUS1 [Tesis doctoral no publicada]. Universitat Politècnica de València. https://doi.org/10.4995/Thesis/10251/90524TESISCompendi

    Incorporating Evolutionary Information and Functional Domains for Identifying RNA Splicing Factors in Humans

    Get PDF
    Regulation of pre-mRNA splicing is achieved through the interaction of RNA sequence elements and a variety of RNA-splicing related proteins (splicing factors). The splicing machinery in humans is not yet fully elucidated, partly because splicing factors in humans have not been exhaustively identified. Furthermore, experimental methods for splicing factor identification are time-consuming and lab-intensive. Although many computational methods have been proposed for the identification of RNA-binding proteins, there exists no development that focuses on the identification of RNA-splicing related proteins so far. Therefore, we are motivated to design a method that focuses on the identification of human splicing factors using experimentally verified splicing factors. The investigation of amino acid composition reveals that there are remarkable differences between splicing factors and non-splicing proteins. A support vector machine (SVM) is utilized to construct a predictive model, and the five-fold cross-validation evaluation indicates that the SVM model trained with amino acid composition could provide a promising accuracy (80.22%). Another basic feature, amino acid dipeptide composition, is also examined to yield a similar predictive performance to amino acid composition. In addition, this work presents that the incorporation of evolutionary information and domain information could improve the predictive performance. The constructed models have been demonstrated to effectively classify (73.65% accuracy) an independent data set of human splicing factors. The result of independent testing indicates that in silico identification could be a feasible means of conducting preliminary analyses of splicing factors and significantly reducing the number of potential targets that require further in vivo or in vitro confirmation
    • …
    corecore