21 research outputs found

    Ontology-Enriched Specifications Enabling Findable, Accessible, Interoperable, and Reusable Marine Metagenomic Datasets in Cyberinfrastructure Systems

    Get PDF
    Marine microbial ecology requires the systematic comparison of biogeochemical and sequence data to analyze environmental influences on the distribution and variability of microbial communities. With ever-increasing quantities of metagenomic data, there is a growing need to make datasets Findable, Accessible, Interoperable, and Reusable (FAIR) across diverse ecosystems. FAIR data is essential to developing analytical frameworks that integrate microbiological, genomic, ecological, oceanographic, and computational methods. Although community standards defining the minimal metadata required to accompany sequence data exist, they haven’t been consistently used across projects, precluding interoperability. Moreover, these data are not machine-actionable or discoverable by cyberinfrastructure systems. By making ‘omic and physicochemical datasets FAIR to machine systems, we can enable sequence data discovery and reuse based on machine-readable descriptions of environments or physicochemical gradients. In this work, we developed a novel technical specification for dataset encapsulation for the FAIR reuse of marine metagenomic and physicochemical datasets within cyberinfrastructure systems. This includes using Frictionless Data Packages enriched with terminology from environmental and life-science ontologies to annotate measured variables, their units, and the measurement devices used. This approach was implemented in Planet Microbe, a cyberinfrastructure platform and marine metagenomic web-portal. Here, we discuss the data properties built into the specification to make global ocean datasets FAIR within the Planet Microbe portal. We additionally discuss the selection of, and contributions to marine-science ontologies used within the specification. Finally, we use the system to discover data by which to answer various biological questions about environments, physicochemical gradients, and microbial communities in meta-analyses. This work represents a future direction in marine metagenomic research by proposing a specification for FAIR dataset encapsulation that, if adopted within cyberinfrastructure systems, would automate the discovery, exchange, and re-use of data needed to answer broader reaching questions than originally intended. Copyright © 2021 Blumberg, Ponsero, Bomhoff, Wood-Charlson, DeLong and Hurwitz.Open access journalThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]

    Assignment of a gene (NEMI) for autosomal dominant nemaline myopathy to chromosome I

    No full text
    Nemaline myopathy (NEM) is a neuromuscular disorder characterized by the presence, in skeletal muscle, of nemaline rods composed at least in part of α-actinin. A candidate gene and linkage approach was used to localize the gene (NEM1) for an autosomal dominant form (MIM 161800) in one large kindred with 10 living affected family members. Markers on chromosome 19 that were linked to the central core disease gene, a marker at the complement 3 locus, and a marker on chromosome 1 at the α-actinin locus exclude these three candidate genes. The family was fully informative for APOA2, which is localized to 1q21-q23. NEM1 was assigned to chromosome 1 by close linkage for APOA2, which is localized to 1q21-q23. NEM1 was assigned to chromosome 1 by close linkage to APOA2, with a lod score of 3.8 at a recombination fraction of 0. Recombinants with NGFB (1p13) and AT3 (1q23-25.1) indicate that NEM1 lies between 1p13 and 1q25.1. In total, 47 loci were investigated on chromosomes 1, 2, 4, 5, 7–11, 14, 16, 17, and 19, with no indications of significant linkage other than to markers on chromosome 1

    The lymphocyte-epithelial-bacterial interface

    No full text
    corecore