133 research outputs found

    RNAcentral 2021: secondary structure integration, improved sequence search and new member databases.

    Get PDF
    RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences that provides a single access point to 44 RNA resources and >18 million ncRNA sequences from a wide range of organisms and RNA types. RNAcentral now also includes secondary (2D) structure information for >13 million sequences, making RNAcentral the world's largest RNA 2D structure database. The 2D diagrams are displayed using R2DT, a new 2D structure visualization method that uses consistent, reproducible and recognizable layouts for related RNAs. The sequence similarity search has been updated with a faster interface featuring facets for filtering search results by RNA type, organism, source database or any keyword. This sequence search tool is available as a reusable web component, and has been integrated into several RNAcentral member databases, including Rfam, miRBase and snoDB. To allow for a more fine-grained assignment of RNA types and subtypes, all RNAcentral sequences have been annotated with Sequence Ontology terms. The RNAcentral database continues to grow and provide a central data resource for the RNA community. RNAcentral is freely available at https://rnacentral.org

    RNAcentral 2021: secondary structure integration, improved sequence search and new member databases

    Get PDF
    RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences that provides a single access point to 44 RNA resources and >18 million ncRNA sequences from a wide range of organisms and RNA types. RNAcentral now also includes secondary (2D) structure information for >13 million sequences, making RNAcentral the world's largest RNA 2D structure database. The 2D diagrams are displayed using R2DT, a new 2D structure visualization method that uses consistent, reproducible and recognizable layouts for related RNAs. The sequence similarity search has been updated with a faster interface featuring facets for filtering search results by RNA type, organism, source database or any keyword. This sequence search tool is available as a reusable web component, and has been integrated into several RNAcentral member databases, including Rfam, miRBase and snoDB. To allow for a more fine-grained assignment of RNA types and subtypes, all RNAcentral sequences have been annotated with Sequence Ontology terms. The RNAcentral database continues to grow and provide a central data resource for the RNA community

    RNAcentral 2021: secondary structure integration, improved sequence search and new member databases

    Get PDF
    RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences that provides a single access point to 44 RNA resources and >18 million ncRNA sequences from a wide range of organisms and RNA types. RNAcentral now also includes secondary (2D) structure information for >13 million sequences, making RNAcentral the world’s largest RNA 2D structure database. The 2D diagrams are displayed using R2DT, a new 2D structure visualization method that uses consistent, reproducible and recognizable layouts for related RNAs. The sequence similarity search has been updated with a faster interface featuring facets for filtering search results by RNA type, organism, source database or any keyword. This sequence search tool is available as a reusable web component, and has been integrated into several RNAcentral member databases, including Rfam, miRBase and snoDB. To allow for a more fine-grained assignment of RNA types and subtypes, all RNAcentral sequences have been annotated with Sequence Ontology terms. The RNAcentral database continues to grow and provide a central data resource for the RNA community. RNAcentral is freely available at https://rnacentral.org

    RNAcentral : a hub of information for non-coding RNA sequences

    Get PDF
    RNAcentral is a comprehensive database of non-coding RNA (ncRNA) sequences, collating information on ncRNA sequences of all types from a broad range of organisms. We have recently added a new genome mapping pipeline that identifies genomic locations for ncRNA sequences in 296 species. We have also added several new types of functional annotations, such as tRNA secondary structures, Gene Ontology annotations, and miRNA-target interactions. A new quality control mechanism based on Rfam family assignments identifies potential contamination, incomplete sequences, and more. The RNAcentral database has become a vital component of many workflows in the RNA community, serving as both the primary source of sequence data for academic and commercial groups, as well as a source of stable accessions for the annotation of genomic and functional features. These examples are facilitated by an improved RNAcentral web interface, which features an updated genome browser, a new sequence feature viewer, and improved text search functionality. RNAcentral is freely available at https://rnacentral.org

    Multiple Genome Sequences of Lactobacillus pentosus Strains Isolated from Biofilms on the Skin of Fermented Green Table Olives

    Get PDF
    The draft genome sequences of five Lactobacillus pentosus strains isolated from biofilms on the skin of green table olives are presented here. These genome sequences will assist in revealing the potential probiotic properties of these strains, as the intake of fermented olives implicates the passage of millions of Lactobacillus spp. throughout a consumer’s gastrointestinal tract.This work was funded by the Junta de Andalucía through the P11-AGR-7051 project and by the Spanish Government through MICIIN projects AGL2013-48300-R/ALI (OliFilm project: www.olifilm.science.com.es) and BFU2016-77297-P

    Relative Information Gain: Shannon entropy-based measure of the relative structural conservation in RNA alignments

    Get PDF
    Structural characterization of RNAs is a dynamic field, offeringmanymodelling possibilities. RNA secondary structure models are usually characterized by an encoding that depicts structural information of the molecule through string representations or graphs. In this work, we provide a generalization of the BEAR encoding (a context-aware structural encoding we previously developed) by expanding the set of alignments used for the construction of substitution matrices and then applying it to secondary structure encodings ranging from fine-grained to more coarse-grained representations. We also introduce a re-interpretation of the Shannon Information applied on RNA alignments, proposing a new scoring metric, the Relative Information Gain (RIG). The RIG score is available for any position in an alignment, showing how different levels of detail encoded in the RNA representation can contribute differently to convey structural information. The approaches presented in this study can be used alongside state-ofthe-art tools to synergistically gain insights into the structural elements that RNAs and RNA families are composed of. This additional information could potentially contribute to their improvement or increase the degree of confidence in the secondary structure of families and any set of aligned RNAs

    Relative Information Gain: Shannon entropy-based measure of the relative structural conservation in RNA alignments

    Get PDF
    open7noAIRC [project [to MHC] (grant number IG 23539).Structural characterization of RNAs is a dynamic field, offering many modelling possibilities. RNA secondary structure models are usually characterized by an encoding that depicts structural information of the molecule through string representations or graphs. In this work, we provide a generalization of the BEAR encoding (a context-aware structural encoding we previously developed) by expanding the set of alignments used for the construction of substitution matrices and then applying it to secondary structure encodings ranging from fine-grained to more coarse-grained representations. We also introduce a re-interpretation of the Shannon Information applied on RNA alignments, proposing a new scoring metric, the Relative Information Gain (RIG). The RIG score is available for any position in an alignment, showing how different levels of detail encoded in the RNA representation can contribute differently to convey structural information. The approaches presented in this study can be used alongside state-of-the-art tools to synergistically gain insights into the structural elements that RNAs and RNA families are composed of. This additional information could potentially contribute to their improvement or increase the degree of confidence in the secondary structure of families and any set of aligned RNAs.openPietrosanto M.; Adinolfi M.; Guarracino A.; FerrĂš Fabrizio.; Ausiello G.; Vitale I.; Helmer-Citterich M.Pietrosanto M.; Adinolfi M.; Guarracino A.; FerrĂš Fabrizio.; Ausiello G.; Vitale I.; Helmer-Citterich M

    Computational strategies to combat COVID-19: useful tools to accelerate SARS-CoV-2 and coronavirus research

    Get PDF
    SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) is a novel virus of the family Coronaviridae. The virus causes the infectious disease COVID-19. The biology of coronaviruses has been studied for many years. However, bioinformatics tools designed explicitly for SARS-CoV-2 have only recently been developed as a rapid reaction to the need for fast detection, understanding and treatment of COVID-19. To control the ongoing COVID-19 pandemic, it is of utmost importance to get insight into the evolution and pathogenesis of the virus. In this review, we cover bioinformatics workflows and tools for the routine detection of SARS-CoV-2 infection, the reliable analysis of sequencing data, the tracking of the COVID-19 pandemic and evaluation of containment measures, the study of coronavirus evolution, the discovery of potential drug targets and development of therapeutic strategies. For each tool, we briefly describe its use case and how it advances research specifically for SARS-CoV-2. All tools are free to use and available online, either through web applications or public code repositories.Peer Reviewe

    CoDNaS-RNA: a database of Conformational Diversity in the Native State of RNA

    Get PDF
    Conformational changes in RNA native ensembles are central to fulfill many of their biological roles. Systematic knowledge of the extent and possible modulators of this conformational diversity is desirable to better understand the relationship between RNA dynamics and function.We have developed CoDNaS-RNA as the first database of conformational diversity in RNA molecules. Known RNA structures are retrieved and clustered to identify alternative conformers of each molecule. Pairwise structural comparisons within each cluster allows to measure the variability of the molecule. Additional data on structural features, molecular interactions and functional annotations are provided. CoDNaS-RNA is implemented as a public resource that can be of much interest for computational and bench scientists alike.Fil: GonzĂĄlez BuitrĂłn, MartĂ­n. Universidad Nacional de Quilmes. Departamento de Ciencia y TecnologĂ­a; Argentina. Consejo Nacional de Investigaciones CientĂ­ficas y TĂ©cnicas; ArgentinaFil: Romario Tunque Cahui, Ronaldo. Pontificia Universidad CatĂłlica de PerĂș; PerĂșFil: GarcĂ­a RĂ­os, Emilio. Pontificia Universidad CatĂłlica de PerĂș; PerĂșFil: Hirsh, Layla. Pontificia Universidad CatĂłlica de PerĂș; PerĂșFil: Fornasari, Maria Silvina. Universidad Nacional de Quilmes. Departamento de Ciencia y TecnologĂ­a; Argentina. Consejo Nacional de Investigaciones CientĂ­ficas y TĂ©cnicas; ArgentinaFil: Parisi, Gustavo Daniel. Universidad Nacional de Quilmes. Departamento de Ciencia y TecnologĂ­a; Argentina. Consejo Nacional de Investigaciones CientĂ­ficas y TĂ©cnicas; ArgentinaFil: Palopoli, NicolĂĄs. Universidad Nacional de Quilmes. Departamento de Ciencia y TecnologĂ­a; Argentina. Consejo Nacional de Investigaciones CientĂ­ficas y TĂ©cnicas; Argentin

    De Novo Transcriptome Assembly of Cucurbita Pepo L. Leaf Tissue Infested by Aphis Gossypii

    Get PDF
    Zucchini (Cucurbita pepo L.), extensively cultivated in temperate areas, belongs to the Cucurbitaceae family and it is a species with great economic value. One major threat related to zucchini cultivation is the damage imposed by the cotton/melon aphid Aphis gossypii Glover (Homoptera: Aphididae). We performed RNA-sequencing on cultivar "San Pasquale" leaves, uninfested and infested by A. gossypii, that were collected at three time points (24, 48, and 96 h post infestation). Then, we combined all high-quality reads for de novo assembly of the transcriptome. This resource was primarily established to be used as a reference for gene expression studies in order to investigate the transcriptome reprogramming of zucchini plants following aphid infestation. In addition, raw reads will be valuable for new experiments based on the latest bioinformatic tools and analytical approaches. The assembled transcripts will serve as an important reference for sequence-based studies and for primer design. Both datasets can be used to support/improve the prediction of protein-coding genes in the zucchini genome, which has been recently released into the public domain
    • 

    corecore