406 research outputs found

    Progression of RNA-sequencing to single-cell applications

    Get PDF
    New methods enable new discoveries. My time as a PhD student has run in parallel with the maturation of the RNA-seq method, and I have used it to discover basic properties of gene expression and transcriptomes. My part has been bioinformatics – the computer analysis of biological data. RNA-seq quantifies gene expression for all genes in one experiment, allowing discoveries without prior knowledge, as opposed to single-gene hypothesis testing. When I started my PhD, this was done by microarray followed by qRT-PCR validation, which can be arduous. In contrast to microarrays, RNA-seq quantifies expression with little ambiguity of which gene each expression value corresponds to, and in absolute terms. But at the time, data analysis of RNA-seq was full of unknowns and there were little software available. Nowadays, partly the result of my work, the data analysis is much less complicated, and RNA-seq can be performed on diminutive samples, down to single cells, which was not viable using microarrays. My first study (Paper I) used one of the very first RNA-seq datasets to study general features of transcriptomes, such as mean mRNA length (~1,500 nt) and the number of genes expressed per tissue (~13,000). I also found special features of some tissues: the liver transcriptome is dominated by a few highly expressed gene, brain expresses especially long mRNAs and testis expresses many more genes than other tissues. Following this tissue RNA-seq study, I evaluated a new library preparation method for single-cell RNA-seq (Paper III), developed before the prevalence of single-cell RNA-seq. I used technical replicates to show that the method was accurate and reliable for the more highly expressed genes at single-cell RNA levels, and with input RNA amounts corresponding to >50 cells it produced as good quality data as bulk RNA-seq. Then the method was applied on melanoma cells isolated from human blood, and I listed surface antigen genes that distinguished these circulating tumour cells from other cells in the blood. This single-cell RNA-seq method was then applied on pre-implantation embryo cells (Paper IV). Using first-generation crosses between two mouse strains, I could separate the expression from the maternal and the paternal copies of the genes. I found that 12-24% of the genes express only one of their two copies in any given cell, in a random manner that affects almost all the expressed genes. I also found that the two copies are expressed independently from each other. Finally, I studied Sox transcription factors during neural development (Paper II), combining RNA-seq and microarray data for different cell types with ChIP-seq data for transcription factor binding and histone modifications. I found that Sox proteins bind to the enhancers active in the stem cells where the Sox proteins are active, but also to enhancers specific to subsequent cells in ii development. I also found that different Sox factors bind to much the same enhancers, and that they can induce histone modifications. In conclusion, my work has advanced the RNA-seq method and increased the understanding of transcriptional regulation and output

    Mouse Model of Alagille Syndrome and Mechanisms of Jagged1 Missense Mutations.

    Get PDF
    BACKGROUND & AIMS: Alagille syndrome is a genetic disorder characterized by cholestasis, ocular abnormalities, characteristic facial features, heart defects, and vertebral malformations. Most cases are associated with mutations in JAGGED1 (JAG1), which encodes a Notch ligand, although it is not clear how these contribute to disease development. We aimed to develop a mouse model of Alagille syndrome to elucidate these mechanisms. METHODS: Mice with a missense mutation (H268Q) in Jag1 (Jag1+/Ndr mice) were outbred to a C3H/C57bl6 background to generate a mouse model for Alagille syndrome (Jag1Ndr/Ndr mice). Liver tissues were collected at different timepoints during development, analyzed by histology, and liver organoids were cultured and analyzed. We performed transcriptome analysis of Jag1Ndr/Ndr livers and livers from patients with Alagille syndrome, cross-referenced to the Human Protein Atlas, to identify commonly dysregulated pathways and biliary markers. We used species-specific transcriptome separation and ligand-receptor interaction assays to measure Notch signaling and the ability of JAG1Ndr to bind or activate Notch receptors. We studied signaling of JAG1 and JAG1Ndr via NOTCH 1, NOTCH2, and NOTCH3 and resulting gene expression patterns in parental and NOTCH1-expressing C2C12 cell lines. RESULTS: Jag1Ndr/Ndr mice had many features of Alagille syndrome, including eye, heart, and liver defects. Bile duct differentiation, morphogenesis, and function were dysregulated in newborn Jag1Ndr/Ndr mice, with aberrations in cholangiocyte polarity, but these defects improved in adult mice. Jag1Ndr/Ndr liver organoids collapsed in culture, indicating structural instability. Whole-transcriptome sequence analyses of liver tissues from mice and patients with Alagille syndrome identified dysregulated genes encoding proteins enriched at the apical side of cholangiocytes, including CFTR and SLC5A1, as well as reduced expression of IGF1. Exposure of Notch-expressing cells to JAG1Ndr, compared with JAG1, led to hypomorphic Notch signaling, based on transcriptome analysis. JAG1-expressing cells, but not JAG1Ndr-expressing cells, bound soluble Notch1 extracellular domain, quantified by flow cytometry. However, JAG1 and JAG1Ndr cells each bound NOTCH2, and signaling from NOTCH2 signaling was reduced but not completely inhibited, in response to JAG1Ndr compared with JAG1. CONCLUSIONS: In mice, expression of a missense mutant of Jag1 (Jag1Ndr) disrupts bile duct development and recapitulates Alagille syndrome phenotypes in heart, eye, and craniofacial dysmorphology. JAG1Ndr does not bind NOTCH1, but binds NOTCH2, and elicits hypomorphic signaling. This mouse model can be used to study other features of Alagille syndrome and organ development

    Loss of CSL Unlocks a Hypoxic Response and Enhanced Tumor Growth Potential in Breast Cancer Cells

    Get PDF
    Notch signaling is an important regulator of stem cell differentiation. All canonical Notch signaling is transmitted through the DNA-binding protein CSL, and hyperactivated Notch signaling is associated with tumor development; thus it may be anticipated that CSL deficiency should reduce tumor growth. In contrast, we report that genetic removal of CSL in breast tumor cells caused accelerated growth of xenografted tumors. Loss of CSL unleashed a hypoxic response during normoxic conditions, manifested by stabilization of the HIF1α protein and acquisition of a polyploid giant-cell, cancer stem cell-like, phenotype. At the transcriptome level, loss of CSL upregulated more than 1,750 genes and less than 3% of those genes were part of the Notch transcriptional signature. Collectively, this suggests that CSL exerts functions beyond serving as the central node in the Notch signaling cascade and reveals a role for CSL in tumorigenesis and regulation of the cellular hypoxic response.</p

    A xandarellid artiopodan from Morocco – a middle Cambrian link between soft-bodied euarthropod communities in North Africa and South China

    Get PDF
    NB. A corrigendum [correction] for this article was published online on 09 May 2017; this has been attached to this article as an additional file. This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ © The Author(s) 2017. The attached file is the published version of the article

    Comparative analysis of RNA sequencing methods for degraded or low-input samples

    Get PDF
    available in PMC 2014 January 01RNA-seq is an effective method for studying the transcriptome, but it can be difficult to apply to scarce or degraded RNA from fixed clinical samples, rare cell populations or cadavers. Recent studies have proposed several methods for RNA-seq of low-quality and/or low-quantity samples, but the relative merits of these methods have not been systematically analyzed. Here we compare five such methods using metrics relevant to transcriptome annotation, transcript discovery and gene expression. Using a single human RNA sample, we constructed and sequenced ten libraries with these methods and compared them against two control libraries. We found that the RNase H method performed best for chemically fragmented, low-quality RNA, and we confirmed this through analysis of actual degraded samples. RNase H can even effectively replace oligo(dT)-based methods for standard RNA-seq. SMART and NuGEN had distinct strengths for measuring low-quantity RNA. Our analysis allows biologists to select the most suitable methods and provides a benchmark for future method development.National Institutes of Health (U.S.) (Pioneer Award DP1-OD003958-01)National Human Genome Research Institute (U.S.) (NHGRI) 1P01HG005062-01)National Human Genome Research Institute (U.S.) (NHGRI Center of Excellence in Genome Science Award 1P50HG006193-01)Howard Hughes Medical Institute (Investigator)Merkin Family Foundation for Stem Cell ResearchBroad Institute of MIT and Harvard (Klarman Cell Observatory)National Human Genome Research Institute (U.S.) (NHGRI grant HG03067)Fonds voor Wetenschappelijk Onderzoek--Vlaandere

    HMMSplicer: A Tool for Efficient and Sensitive Discovery of Known and Novel Splice Junctions in RNA-Seq Data

    Get PDF
    Background: High-throughput sequencing of an organism’s transcriptome, or RNA-Seq, is a valuable and versatile new strategy for capturing snapshots of gene expression. However, transcriptome sequencing creates a new class of alignment problem: mapping short reads that span exon-exon junctions back to the reference genome, especially in the case where a splice junction is previously unknown. Methodology/Principal Findings: Here we introduce HMMSplicer, an accurate and efficient algorithm for discovering canonical and non-canonical splice junctions in short read datasets. HMMSplicer identifies more splice junctions than currently available algorithms when tested on publicly available A. thaliana, P. falciparum, and H. sapiens datasets without a reduction in specificity. Conclusions/Significance: HMMSplicer was found to perform especially well in compact genomes and on genes with low expression levels, alternative splice isoforms, or non-canonical splice junctions. Because HHMSplicer does not rely on prebuilt gene models, the products of inexact splicing are also detected. For H. sapiens, we find 3.6 % of 39 splice sites and 1.4% of 59 splice sites are inexact, typically differing by 3 bases in either direction. In addition, HMMSplicer provides a score for every predicted junction allowing the user to set a threshold to tune false positive rates depending on the needs of the experiment. HMMSplicer is implemented in Python. Code and documentation are freely available a

    A widely employed germ cell marker is an ancient disordered protein with reproductive functions in diverse eukaryotes

    Get PDF
    The advent of sexual reproduction and the evolution of a dedicated germline in multicellular organisms are critical landmarks in eukaryotic evolution. We report an ancient family of GCNA (germ cell nuclear antigen) proteins that arose in the earliest eukaryotes, and feature a rapidly evolving intrinsically disordered region (IDR). Phylogenetic analysis reveals that GCNA proteins emerged before the major eukaryotic lineages diverged; GCNA predates the origin of a dedicated germline by a billion years. Gcna gene expression is enriched in reproductive cells across eukarya – either just prior to or during meiosis in single-celled eukaryotes, and in stem cells and germ cells of diverse multicellular animals. Studies of Gcna-mutant C. elegans and mice indicate that GCNA has functioned in reproduction for at least 600 million years. Homology to IDR-containing proteins implicated in DNA damage repair suggests that GCNA proteins may protect the genomic integrity of cells carrying a heritable genome.Life Sciences Research FoundationHoward Hughes Medical Institut

    Phylogenetic and Biogeographic Analysis of Sphaerexochine Trilobites

    Get PDF
    BACKGROUND: Sphaerexochinae is a speciose and widely distributed group of cheirurid trilobites. Their temporal range extends from the earliest Ordovician through the Silurian, and they survived the end Ordovician mass extinction event (the second largest mass extinction in Earth history). Prior to this study, the individual evolutionary relationships within the group had yet to be determined utilizing rigorous phylogenetic methods. Understanding these evolutionary relationships is important for producing a stable classification of the group, and will be useful in elucidating the effects the end Ordovician mass extinction had on the evolutionary and biogeographic history of the group. METHODOLOGY/PRINCIPAL FINDINGS: Cladistic parsimony analysis of cheirurid trilobites assigned to the subfamily Sphaerexochinae was conducted to evaluate phylogenetic patterns and produce a hypothesis of relationship for the group. This study utilized the program TNT, and the analysis included thirty-one taxa and thirty-nine characters. The results of this analysis were then used in a Lieberman-modified Brooks Parsimony Analysis to analyze biogeographic patterns during the Ordovician-Silurian. CONCLUSIONS/SIGNIFICANCE: The genus Sphaerexochus was found to be monophyletic, consisting of two smaller clades (one composed entirely of Ordovician species and another composed of Silurian and Ordovician species). By contrast, the genus Kawina was found to be paraphyletic. It is a basal grade that also contains taxa formerly assigned to Cydonocephalus. Phylogenetic patterns suggest Sphaerexochinae is a relatively distinctive trilobite clade because it appears to have been largely unaffected by the end Ordovician mass extinction. Finally, the biogeographic analysis yields two major conclusions about Sphaerexochus biogeography: Bohemia and Avalonia were close enough during the Silurian to exchange taxa; and during the Ordovician there was dispersal between Eastern Laurentia and the Yangtze block (South China) and between Eastern Laurentia and Avalonia
    corecore