207 research outputs found

    Identification, improved modeling and integration of signals to predict constitutive and altering splicing

    Get PDF
    Thesis (Ph. D.)--Massachusetts Institute of Technology, Dept. of Brain and Cognitive Sciences, 2004.Includes bibliographical references.(cont.) manipulation of intronic elements that enables fish genes to be spliced properly in mammalian cells; (iii) A computational analysis using EST data, genome sequence data, and microarray expression data of tissue- specific alternative splicing is conducted, which distinguishes human brain, testis and liver as having unusually high levels of AS, highlights differences in the types of AS occurring commonly in different tissues, and identifies candidate cis-regulatory elements and trans-factors likely to play important roles in tissue-specific AS in human cells; (iv) The identification of a set of discriminatory sequence features and their integration into a statistical machine-learning algorithm, ACEScan, which distinguishes exons subject to evolutionarily conserved alternative splicing from constitutively spliced or lineage-specifically-spliced exons is described; (v) The genome-wide search for and experimental validation of exon-skipping events using the combination of two silencing cis-elements, UAGG and GGGG.The regulation of pre-messenger RNA splicing by the spliceosomal machinery via interactions between cis-regulatory elements and splicing trans-factors to generate a specific mRNA i.e. constitutive splicing, or sometimes many distinct mRNA isoforms i.e. alternative splicing, is still a poorly understood process. Progress into illuminating this process is further exacerbated by the variation of splicing in the multitude of tissues and cell types present, as well as the variation of cis and trans elements in different organisms, and the possibility that some alternative splicing events present in expressed sequence tag (EST) databases may constitute biochemical 'noise' or transient evolutionary fluctuations. Several studies, mainly computational in nature, addressing different questions regarding constitutive and alternative splicing are described here, ranging from improved modeling of splicing signals, studying the variation of alternative splicing in various tissues, analyzing evolutionary differences of cis and trans elements of splicing in various vertebrates, and utilizing attributes indicative of alternative splicing events conserved in human and mouse to identify novel alternatively spliced exons. In particular: (i) A general approach for improved modeling of short sequence motifs, based on the Maximum Entropy principle, that incorporates local adjacent and non-adjacent position dependencies is introduced, and applied to understanding splice site signals. The splice site recognition algorithm, MaxENTScan, performs better than previous models that utilize as input similar length sequences; (ii) The first large-scale bioinformatics study is conducted that identifies similarities and differences in candidate cis-regulatory elements and trans-acting splicingby Gene W. Yeo.Ph.D

    Discovery and Analysis of Evolutionarily Conserved Intronic Splicing Regulatory Elements

    Get PDF
    Knowledge of the functional cis-regulatory elements that regulate constitutive and alternative pre-mRNA splicing is fundamental for biology and medicine. Here we undertook a genome-wide comparative genomics approach using available mammalian genomes to identify conserved intronic splicing regulatory elements (ISREs). Our approach yielded 314 ISREs, and insertions of ~70 ISREs between competing splice sites demonstrated that 84% of ISREs altered 5′ and 94% altered 3′ splice site choice in human cells. Consistent with our experiments, comparisons of ISREs to known splicing regulatory elements revealed that 40%–45% of ISREs might have dual roles as exonic splicing silencers. Supporting a role for ISREs in alternative splicing, we found that 30%–50% of ISREs were enriched near alternatively spliced (AS) exons, and included almost all known binding sites of tissue-specific alternative splicing factors. Further, we observed that genes harboring ISRE-proximal exons have biases for tissue expression and molecular functions that are ISRE-specific. Finally, we discovered that for Nova1, neuronal PTB, hnRNP C, and FOX1, the most frequently occurring ISRE proximal to an alternative conserved exon in the splicing factor strongly resembled its own known RNA binding site, suggesting a novel application of ISRE density and the propensity for splicing factors to auto-regulate to associate RNA binding sites to splicing factors. Our results demonstrate that ISREs are crucial building blocks in understanding general and tissue-specific AS regulation and the biological pathways and functions regulated by these AS events

    Inference of Splicing Regulatory Activities by Sequence Neighborhood Analysis

    Get PDF
    Sequence-specific recognition of nucleic-acid motifs is critical to many cellular processes. We have developed a new and general method called Neighborhood Inference (NI) that predicts sequences with activity in regulating a biochemical process based on the local density of known sites in sequence space. Applied to the problem of RNA splicing regulation, NI was used to predict hundreds of new exonic splicing enhancer (ESE) and silencer (ESS) hexanucleotides from known human ESEs and ESSs. These predictions were supported by cross-validation analysis, by analysis of published splicing regulatory activity data, by sequence-conservation analysis, and by measurement of the splicing regulatory activity of 24 novel predicted ESEs, ESSs, and neutral sequences using an in vivo splicing reporter assay. These results demonstrate the ability of NI to accurately predict splicing regulatory activity and show that the scope of exonic splicing regulatory elements is substantially larger than previously anticipated. Analysis of orthologous exons in four mammals showed that the NI score of ESEs, a measure of function, is much more highly conserved above background than ESE primary sequence. This observation indicates a high degree of selection for ESE activity in mammalian exons, with surprisingly frequent interchangeability between ESE sequences

    Patch-Seq Protocol to Analyze the Electrophysiology, Morphology and Transcriptome of Whole Single Neurons Derived From Human Pluripotent Stem Cells

    Get PDF
    The human brain is composed of a complex assembly of about 171 billion heterogeneous cellular units (86 billion neurons and 85 billion non-neuronal glia cells). A comprehensive description of brain cells is necessary to understand the nervous system in health and disease. Recently, advances in genomics have permitted the accurate analysis of the full transcriptome of single cells (scRNA-seq). We have built upon such technical progress to combine scRNA-seq with patch-clamping electrophysiological recording and morphological analysis of single human neurons in vitro. This new powerful method, referred to as Patch-seq, enables a thorough, multimodal profiling of neurons and permits us to expose the links between functional properties, morphology, and gene expression. Here, we present a detailed Patch-seq protocol for isolating single neurons from in vitro neuronal cultures. We have validated the Patch-seq whole-transcriptome profiling method with human neurons generated from embryonic and induced pluripotent stem cells (ESCs/iPSCs) derived from healthy subjects, but the procedure may be applied to any kind of cell type in vitro. Patch-seq may be used on neurons in vitro to profile cell types and states in depth to unravel the human molecular basis of neuronal diversity and investigate the cellular mechanisms underlying brain disorders

    High-Throughput and Cost-Effective Characterization of Induced Pluripotent Stem Cells.

    Get PDF
    Reprogramming somatic cells to induced pluripotent stem cells (iPSCs) offers the possibility of studying the molecular mechanisms underlying human diseases in cell types difficult to extract from living patients, such as neurons and cardiomyocytes. To date, studies have been published that use small panels of iPSC-derived cell lines to study monogenic diseases. However, to study complex diseases, where the genetic variation underlying the disorder is unknown, a sizable number of patient-specific iPSC lines and controls need to be generated. Currently the methods for deriving and characterizing iPSCs are time consuming, expensive, and, in some cases, descriptive but not quantitative. Here we set out to develop a set of simple methods that reduce cost and increase throughput in the characterization of iPSC lines. Specifically, we outline methods for high-throughput quantification of surface markers, gene expression analysis of in vitro differentiation potential, and evaluation of karyotype with markedly reduced cost

    The Period protein homolog LIN-42 negatively regulates microRNA biogenesis in C. elegans

    Get PDF
    AbstractMicroRNAs (miRNAs) are small RNAs that post-transcriptionally regulate gene expression in many multicellular organisms. They are encoded in the genome and transcribed into primary (pri-) miRNAs before two processing steps that ultimately produce the mature miRNA. In order to generate the appropriate amount of a particular miRNA in the correct location at the correct time, proper regulation of miRNA biogenesis is essential. Here we identify the Period protein homolog LIN-42 as a new regulator of miRNA biogenesis in Caenorhabditis elegans. We mapped a spontaneous suppressor of the normally lethal let-7(n2853) allele to the lin-42 gene. Mutations in this allele (ap201) or a second lin-42 allele (n1089) caused increased mature let-7 miRNA levels at most time points when mature let-7 miRNA is normally expressed. Levels of pri-let-7 and a let-7 transcriptional reporter were also increased in lin-42(n1089) worms. These results indicate that LIN-42 normally represses pri-let-7 transcription and thus the accumulation of let-7 miRNA. This inhibition is not specific to let-7, as pri- and mature levels of lin-4 and miR-35 were also increased in lin-42 mutants. Furthermore, small RNA-seq analysis showed widespread increases in the levels of mature miRNAs in lin-42 mutants. Thus, we propose that the period protein homolog LIN-42 is a global regulator of miRNA biogenesis

    Alternative Splicing Events Identified in Human Embryonic Stem Cells and Neural Progenitors

    Get PDF
    Human embryonic stem cells (hESCs) and neural progenitor (NP) cells are excellent models for recapitulating early neuronal development in vitro, and are key to establishing strategies for the treatment of degenerative disorders. While much effort had been undertaken to analyze transcriptional and epigenetic differences during the transition of hESC to NP, very little work has been performed to understand post-transcriptional changes during neuronal differentiation. Alternative RNA splicing (AS), a major form of post-transcriptional gene regulation, is important in mammalian development and neuronal function. Human ESC, hESC-derived NP, and human central nervous system stem cells were compared using Affymetrix exon arrays. We introduced an outlier detection approach, REAP (Regression-based Exon Array Protocol), to identify 1,737 internal exons that are predicted to undergo AS in NP compared to hESC. Experimental validation of REAP-predicted AS events indicated a threshold-dependent sensitivity ranging from 56% to 69%, at a specificity of 77% to 96%. REAP predictions significantly overlapped sets of alternative events identified using expressed sequence tags and evolutionarily conserved AS events. Our results also reveal that focusing on differentially expressed genes between hESC and NP will overlook 14% of potential AS genes. In addition, we found that REAP predictions are enriched in genes encoding serine/threonine kinase and helicase activities. An example is a REAP-predicted alternative exon in the SLK (serine/threonine kinase 2) gene that is differentially included in hESC, but skipped in NP as well as in other differentiated tissues. Lastly, comparative sequence analysis revealed conserved intronic cis-regulatory elements such as the FOX1/2 binding site GCAUG as being proximal to candidate AS exons, suggesting that FOX1/2 may participate in the regulation of AS in NP and hESC. In summary, a new methodology for exon array analysis was introduced, leading to new insights into the complexity of AS in human embryonic stem cells and their transition to neural stem cells

    Patch-Seq Protocol to Analyze the Electrophysiology, Morphology and Transcriptome of Whole Single Neurons Derived From Human Pluripotent Stem Cells

    Get PDF
    This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.The human brain is composed of a complex assembly of about 171 billion heterogeneous cellular units (86 billion neurons and 85 billion non-neuronal glia cells). A comprehensive description of brain cells is necessary to understand the nervous system in health and disease. Recently, advances in genomics have permitted the accurate analysis of the full transcriptome of single cells (scRNA-seq). We have built upon such technical progress to combine scRNA-seq with patch-clamping electrophysiological recording and morphological analysis of single human neurons in vitro. This new powerful method, referred to as Patch-seq, enables a thorough, multimodal profiling of neurons and permits us to expose the links between functional properties, morphology, and gene expression. Here, we present a detailed Patch-seq protocol for isolating single neurons from in vitro neuronal cultures. We have validated the Patch-seq whole-transcriptome profiling method with human neurons generated from embryonic and induced pluripotent stem cells (ESCs/iPSCs) derived from healthy subjects, but the procedure may be applied to any kind of cell type in vitro. Patch-seq may be used on neurons in vitro to profile cell types and states in depth to unravel the human molecular basis of neuronal diversity and investigate the cellular mechanisms underlying brain disorders.This work was supported by the Netherlands Organisation for Scientific Research (NWO), Rubicon Fellowship (019.163LW.032) (to MvdH); the Brain Foundation, the Walker Family, and the Perpetual Impact Philanthropy (grant IPAP2017/0717) (to CB); the G. Harold & Leila Y. Mathers Charitable Foundation, JPB Foundation, and the NIH (Grants MH095741, MH092758, and U01 MH106882) (to FG). GY was supported by R01 grants MH107369, HD085902, and AI095277 from the National Institute of Health and Seed Grant BRFSG-2014-14 from the Brain Research Foundation

    RNA-binding protein CPEB1 remodels host and viral RNA landscapes.

    Get PDF
    Host and virus interactions occurring at the post-transcriptional level are critical for infection but remain poorly understood. Here, we performed comprehensive transcriptome-wide analyses revealing that human cytomegalovirus (HCMV) infection results in widespread alternative splicing (AS), shortening of 3' untranslated regions (3' UTRs) and lengthening of poly(A)-tails in host gene transcripts. We found that the host RNA-binding protein CPEB1 was highly induced after infection, and ectopic expression of CPEB1 in noninfected cells recapitulated infection-related post-transcriptional changes. CPEB1 was also required for poly(A)-tail lengthening of viral RNAs important for productive infection. Strikingly, depletion of CPEB1 reversed infection-related cytopathology and post-transcriptional changes, and decreased productive HCMV titers. Host RNA processing was also altered in herpes simplex virus-2 (HSV-2)-infected cells, thereby indicating that this phenomenon might be a common occurrence during herpesvirus infections. We anticipate that our work may serve as a starting point for therapeutic targeting of host RNA-binding proteins in herpesvirus infections
    corecore