97 research outputs found

    Methods to study splicing from high-throughput RNA Sequencing data

    The development of novel high-throughput sequencing (HTS) methods for RNA (RNA-Seq) has provided a very powerful means to study splicing under multiple conditions at unprecedented depth. However, the complexity of the information to be analyzed has turned this into a challenging task. In the last few years, a plethora of tools have been developed, allowing researchers to process RNA-Seq data to study the expression of isoforms and splicing events, and their relative changes under different conditions. We provide an overview of the methods available to study splicing from short RNA-Seq data. We group the methods according to the different questions they address: 1) Assignment of the sequencing reads to their likely gene of origin. This is addressed by methods that map reads to the genome and/or to the available gene annotations. 2) Recovering the sequence of splicing events and isoforms. This is addressed by transcript reconstruction and de novo assembly methods. 3) Quantification of events and isoforms. Either after reconstructing transcripts or using an annotation, many methods estimate the expression level or the relative usage of isoforms and/or events. 4) Providing an isoform or event view of differential splicing or expression. These include methods that compare relative event/isoform abundance or isoform expression across two or more conditions. 5) Visualizing splicing regulation. Various tools facilitate the visualization of the RNA-Seq data in the context of alternative splicing. In this review, we do not describe the specific mathematical models behind each method. Our aim is rather to provide an overview that could serve as an entry point for users who need to decide on a suitable tool for a specific analysis. We also attempt to propose a classification of the tools according to the operations they perform, to facilitate the comparison and choice of methods. Comment: 31 pages, 1 figure, 9 tables. Small corrections adde

    Heterogeneity of Glia in the Retina and Optic Nerve of Birds and Mammals

    We have recently described a novel type of glial cell that is scattered across the inner layers of the avian retina [1]. These cells are stimulated by insulin-like growth factor 1 (IGF1) to proliferate, migrate distally into the retina, and up-regulate the nestin-related intermediate filament transitin. These changes in glial activity correspond with increased susceptibility of neurons to excitotoxic damage. This novel cell type has been termed the Non-astrocytic Inner Retinal Glia-like (NIRG) cell. The purpose of the study was to investigate whether the retinas of non-avian species contain cells that resemble NIRG cells. We assayed for NIRG cells by probing for the expression of Sox2, Sox9, Nkx2.2, vimentin and nestin. NIRG cells were distinguished from astrocytes by a lack of expression of Glial Fibrillary Acidic Protein (GFAP). We examined the retinas of adult mice, guinea pigs, dogs and monkeys (Macaca fascicularis). In the mouse retina and optic nerve head, we identified numerous astrocytes that expressed GFAP, S100β, Sox2 and Sox9; however, we found no evidence for NIRG-like cells that were positive for Nkx2.2 and nestin and negative for GFAP. In the guinea pig, we found neither astrocytes nor NIRG cells in the retina, whereas we identified astrocytes in the optic nerve. In the eyes of dogs and monkeys, we found astrocytes and NIRG-like cells scattered across inner layers of the retina and within the optic nerve. We conclude that NIRG-like cells are present in the retinas of canines and non-human primates, whereas the retinas of mice and guinea pigs do not contain NIRG cells

    Draft genome sequence of Wickerhamomyces anomalus LBCM1105, isolated from cachaça fermentation

    Wickerhamomyces anomalus LBCM1105 is a yeast isolated from cachaça distillery fermentation vats, notable for exceptional glycerol consumption ability. We report its draft genome with 20.5x in-depth coverage and around 90% extension and completeness. It harbors the sequences of proteins involved in glycerol transport and metabolism. The authors gratefully acknowledge Laboratorio Nacional de Ciencia e Tecnologia do Bioetanol (CTBE) and the Centro Nacional de Pesquisa em Energia e Materiais (CNPEM) for support with the sequencing of LBCM1105. This work was supported by CAPES/Brazil (PNPD 2755/2011; PCF-PVE 021/2012), by CNPq (Brazil), processes 304815/2012 (research grant) and 305135/2015-5, and by AUXPE-PVES 1801/2012 (Process 23038.015294/2016-18) from Brazilian Government and by UFOP. C.L. is supported by the strategic program UID/BIA/04050/2013 [POCI-01-0145-FEDER-007569] funded by national funds through the FCT I.P. and by the ERDF through the COMPETE2020 - Programa Operacional de Competitividade e Internacionalizacao (POCI). DMRP is a fellow from the CNPq (Conselho Nacional de Desenvolvimento Cientifico e Tecnologico) - Brazil (310080/2018-5)

    Longer First Introns Are a General Property of Eukaryotic Gene Structure

    While many properties of eukaryotic gene structure are well characterized, differences in the form and function of introns that occur at different positions within a transcript are less well understood. In particular, the dynamics of intron length variation with respect to intron position has received relatively little attention. This study analyzes all available data on intron lengths in GenBank and finds a significant trend of increased length in first introns throughout a wide range of species. This trend was found to be even stronger when using high-confidence gene annotation data for three model organisms (Arabidopsis thaliana, Caenorhabditis elegans, and Drosophila melanogaster) which show that the first intron in the 5′ UTR is - on average - significantly longer than all downstream introns within a gene. A partial explanation for increased first intron length in A. thaliana is suggested by the increased frequency of certain motifs that are present in first introns. The phenomenon of longer first introns can potentially be used to improve gene prediction software and also to detect errors in existing gene annotations

    Characterisation of the Trichinella spiralis deubiquitinating enzyme, TsUCH37, an evolutionarily conserved proteasome interaction partner.

    Trichinella spiralis is a parasitic nematode that infects mammals indiscriminately. Although the biggest impact of trichinellosis is observed in developing countries, the parasite is found on all continents except Antarctica. In humans, Trichinella infection contributes globally to helminth related morbidity and disability adjusted life years. In animals, infection is implicated as a serious agricultural problem and drug treatment is largely ineffective. During chronic infection, larvae invade skeletal muscle cells, forming a nurse cell complex in which they become encysted. The nurse cell is a product of the severe disruption of the host cell homeostasis. Proteins of the Ub/proteasome pathway are highly conserved throughout evolution, and considering their importance in the regulation of cell homeostasis, provide interesting and novel therapeutic targets for various diseases. In order to target this system in parasites, pathogen proteins that play a role in this pathway must be identified. We report the identification of the first T. spiralis deubiquitinating enzyme, and show evidence that the function of this protein as a proteasome interaction partner has been evolutionarily conserved. We show that members of this enzyme family are important for T. spiralis survival and that the use of inhibitor compounds may help elucidate their role in infection

    A Cre-conditional MYCN-driven neuroblastoma mouse model as an improved tool for preclinical studies

    Neuroblastoma, a childhood cancer that originates from neural crest-derived cells, is the most common deadly solid tumor of infancy. Amplification of the MYCN oncogene, which occurs in approximately 20-25% of human neuroblastomas, is the most prominent genetic marker of high-stage disease. The availability of valid preclinical in vivo models is a prerequisite to develop novel targeted therapies. We here report on the generation of transgenic mice with Cre-conditional induction of MYCN in dopamine β-hydroxylase-expressing cells, termed LSL-MYCN;Dbh-iCre. These mice develop neuroblastic tumors with an incidence of >75%, regardless of strain background. Molecular profiling of tumors revealed upregulation of the MYCN-dependent miR-17-92 cluster as well as expression of neuroblastoma marker genes, including tyrosine hydroxylase and the neural cell adhesion molecule 1. Gene set enrichment analyses demonstrated significant correlation with MYC-associated expression patterns. Array comparative genome hybridization showed that chromosomal aberrations in LSL-MYCN;Dbh-iCre tumors were syntenic to those observed in human neuroblastomas. Treatment of a cell line established from a tumor derived from an LSL-MYCN;Dbh-iCre mouse with JQ1 or MLN8237 reduced cell viability and demonstrated oncogene addiction to MYCN. Here we report establishment of the first Cre-conditional human MYCN-driven mouse model for neuroblastoma that closely recapitulates the human disease with respect to tumor localization, histology, marker expression and genomic makeup. This mouse model is a valuable tool for further functional studies and to assess the effect of targeted therapies

    Primula vulgaris (primrose) genome assembly, annotation and gene expression, with comparative genomics on the heterostyly supergene

    Primula vulgaris (primrose) exhibits heterostyly: plants produce self-incompatible pin- or thrum-form flowers, with anthers and stigma at reciprocal heights. Darwin concluded that this arrangement promotes insect-mediated cross-pollination; later studies revealed control by a cluster of genes, or supergene, known as the S (Style length) locus. The P. vulgaris S locus is absent from pin plants and hemizygous in thrum plants (thrum-specific); mutation of S locus genes produces self-fertile homostyle flowers with anthers and stigma at equal heights. Here, we present a 411 Mb P. vulgaris genome assembly of a homozygous inbred long homostyle, representing ~87% of the genome. We annotate over 24,000 P. vulgaris genes, and reveal more genes up-regulated in thrum than pin flowers. We show reduced genomic read coverage across the S locus in other Primula species, including P. veris, where we define the conserved structure and expression of the S locus genes in thrum. Further analysis reveals the S locus has elevated repeat content (64%) compared to the wider genome (37%). Our studies suggest conservation of S locus genetic architecture in Primula, and provide a platform for identification and evolutionary analysis of the S locus and downstream targets that regulate heterostyly in diverse heterostylous species

    The multicellularity genes of dictyostelid social amoebas

    The evolution of multicellularity enabled specialization of cells, but required novel signalling mechanisms for regulating cell differentiation. Early multicellular organisms are mostly extinct and the origins of these mechanisms are unknown. Here using comparative genome and transcriptome analysis across eight uni- and multicellular amoebozoan genomes, we find that 80% of proteins essential for the development of multicellular Dictyostelia are already present in their unicellular relatives. This set is enriched in cytosolic and nuclear proteins, and protein kinases. The remaining 20%, unique to Dictyostelia, mostly consists of extracellularly exposed and secreted proteins, with roles in sensing and recognition, while several genes for synthesis of signals that induce cell-type specialization were acquired by lateral gene transfer. Across Dictyostelia, changes in gene expression correspond more strongly with phenotypic innovation than changes in protein functional domains. We conclude that the transition to multicellularity required novel signals and sensors rather than novel signal processing mechanisms

    Drivers of genetic diversity in secondary metabolic gene clusters within a fungal species

    Filamentous fungi produce a diverse array of secondary metabolites (SMs) critical for defense, virulence, and communication. The metabolic pathways that produce SMs are found in contiguous gene clusters in fungal genomes, an atypical arrangement for metabolic pathways in other eukaryotes. Comparative studies of filamentous fungal species have shown that SM gene clusters are often either highly divergent or uniquely present in one or a handful of species, hampering efforts to determine the genetic basis and evolutionary drivers of SM gene cluster divergence. Here, we examined SM variation in 66 cosmopolitan strains of a single species, the opportunistic human pathogen Aspergillus fumigatus. Investigation of genome-wide within-species variation revealed 5 general types of variation in SM gene clusters: nonfunctional gene polymorphisms; gene gain and loss polymorphisms; whole cluster gain and loss polymorphisms; allelic polymorphisms, in which different alleles corresponded to distinct, nonhomologous clusters; and location polymorphisms, in which a cluster was found to differ in its genomic location across strains. These polymorphisms affect the function of representative A. fumigatus SM gene clusters, such as those involved in the production of gliotoxin, fumigaclavine, and helvolic acid as well as the function of clusters with undefined products. In addition to enabling the identification of polymorphisms, the detection of which requires extensive genome-wide synteny conservation (e.g., mobile gene clusters and nonhomologous cluster alleles), our approach also implicated multiple underlying genetic drivers, including point mutations, recombination, and genomic deletion and insertion events as well as horizontal gene transfer from distant fungi. Finally, most of the variants that we uncover within A. fumigatus have been previously hypothesized to contribute to SM gene cluster diversity across entire fungal classes and phyla. We suggest that the drivers of genetic diversity operating within a fungal species shown here are sufficient to explain SM cluster macroevolutionary patterns. National Science Foundation (grant number DEB-1442113). Received by AR. U.S. National Library of Medicine training grant (grant number 2T15LM007450). Received by ALL. Conselho Nacional de Desenvolvimento Científico e Tecnológico. Northern Portugal Regional Operational Programme (grant number NORTE-01-0145-FEDER-000013). Received by FR. Fundação de Amparo à Pesquisa do Estado de São Paulo. Received by GHG. National Institutes of Health (grant number R01 AI065728-01). Received by NPK. National Science Foundation (grant number IOS-1401682). Received by JHW. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

    The genome of the seagrass Zostera marina reveals angiosperm adaptation to the sea

    Seagrasses colonized the sea(1) on at least three independent occasions to form the basis of one of the most productive and widespread coastal ecosystems on the planet(2). Here we report the genome of Zostera marina (L.), the first, to our knowledge, marine angiosperm to be fully sequenced. This reveals unique insights into the genomic losses and gains involved in achieving the structural and physiological adaptations required for its marine lifestyle, arguably the most severe habitat shift ever accomplished by flowering plants. Key angiosperm innovations that were lost include the entire repertoire of stomatal genes(3), genes involved in the synthesis of terpenoids and ethylene signalling, and genes for ultraviolet protection and phytochromes for far-red sensing. Seagrasses have also regained functions enabling them to adjust to full salinity. Their cell walls contain all of the polysaccharides typical of land plants, but also contain polyanionic, low-methylated pectins and sulfated galactans, a feature shared with the cell walls of all macroalgae(4) and that is important for ion homoeostasis, nutrient uptake and O2/CO2 exchange through leaf epidermal cells. The Z. marina genome resource will markedly advance a wide range of functional ecological studies from adaptation of marine ecosystems under climate warming(5,6), to unravelling the mechanisms of osmoregulation under high salinities that may further inform our understanding of the evolution of salt tolerance in crop plants(7)