74 research outputs found

    Artificial intelligence for ocean science data integration: current state, gaps, and way forward

    Automated Ontology Evaluation: Evaluating Coverage and Correctness using a Domain Corpus

    Can Large Language Models Augment a Biomedical Ontology with missing Concepts and Relations?

    Ontologies play a crucial role in organizing and representing knowledge. However, even current ontologies do not encompass all relevant concepts and relationships. Here, we explore the potential of large language models (LLMs) to expand an existing ontology in a semi-automated fashion. We demonstrate our approach on the biomedical ontology SNOMED-CT utilizing semantic relation types from the widely used UMLS semantic network. We propose a method that uses conversational interactions with an LLM to analyze clinical practice guidelines (CPGs) and detect the relationships among the new medical concepts that are not present in SNOMED-CT. Our initial experimentation with the conversational prompts yielded promising preliminary results given a manually generated gold standard, directing our future potential improvements. Comment: Presented as a short paper at the Knowledge Representation for Healthcare 2023 workshop.

    A design space for RDF data representations

    RDF triplestores' ability to store and query knowledge bases augmented with semantic annotations has attracted the attention of both research and industry. A multitude of systems offer varying data representation and indexing schemes. However, as recently shown for designing data structures, many design choices are biased by outdated considerations and may not result in the most efficient data representation for a given query workload. To overcome this limitation, we identify a novel three-dimensional design space. Within this design space, we map the trade-offs between different RDF data representations employed as part of an RDF triplestore and identify unexplored solutions. We complement the review with an empirical evaluation of ten standard SPARQL benchmarks to examine the prevalence of these access patterns in synthetic and real query workloads. We find some access patterns to be both prevalent in the workloads and under-supported by existing triplestores. This shows the capabilities of our model to be used by RDF store designers to reason about different design choices and allow a (possibly artificially intelligent) designer to evaluate the fit between a given system design and a query workload.

    Expression of an Androgenic Gland-Specific Insulin-Like Peptide during the Course of Prawn Sexual and Morphotypic Differentiation

    The crustacean male-specific androgenic gland (AG) regulates sexual differentiation. In the prawn Macrobrachium rosenbergii, silencing an AG-specific insulin-like encoding transcript (Mr-IAG) inhibited the development of male sexual characters, suggesting that Mr-IAG is a key androgenic hormone. We used recombinant pro-Mr-IAG peptide to generate antibodies that recognized the peptide in AG cells and extracts, as verified by mass spectrometry. We revealed the temporal expression pattern of Mr-IAG and studied its relevance to the timetable of sex differentiation processes in juveniles and after puberty. Mr-IAG was expressed from as early as 20 days after metamorphosis, prior to the appearance of external male sexual characters. Mr-IAG expression was lower in the less reproductively active orange-clawed males than in both the dominant blue-clawed males and the actively sneak mating small males. These results suggest a role for Mr-IAG both in the timing of male sexual differentiation and in regulating reproductive strategies.

    Assigning Diagnosis Codes Using Medication History

    Diagnosis assignment is the process of assigning disease codes to patients. Automatic diagnosis assignment has the potential to validate code assignments, correct erroneous codes, and register completion. Previous methods build on text-based techniques utilizing medical notes but are inapplicable in the absence of these notes. We propose using patients' medication data to assign diagnosis codes. We present a proof-of-concept study using medical data from an American dataset (MIMIC-III) and Danish nationwide registers to train a machine-learning-based model that predicts an extensive collection of diagnosis codes for multiple levels of aggregation over a disease hierarchy. We further suggest a specialized loss function designed to utilize the innate hierarchical nature of the disease hierarchy. We evaluate the proposed method on a subset of 567 disease codes. Moreover, we investigate the technique's generalizability and transferability by (1) training and testing models on the same subsets of disease codes over the two medical datasets and (2) training models on the American dataset while evaluating them on the Danish dataset, respectively. Results demonstrate the proposed method can correctly assign diagnosis codes on multiple levels of aggregation from the disease hierarchy over the American dataset with recall 70.0% and precision 69.48% for top-10 assigned codes, thereby being comparable to text-based techniques. Furthermore, the specialized loss function performs consistently better than the non-hierarchical state-of-the-art version. Moreover, results suggest the proposed method is language- and dataset-agnostic, with initial indications of transferability over subsets of disease codes.

    Completeness and Ambiguity of Schema Cover

    Given a schema and a set of concepts, representative of entities in the domain of discourse, schema cover defines correspondences between concepts and parts of the schema. Schema cover aims at interpreting the schema in terms of concepts, thus vastly simplifying the task of schema integration. In this work we investigate two properties of schema cover, namely completeness and ambiguity. The former measures the part of a schema that can be covered by a set of concepts and the latter examines the amount of overlap between concepts in a cover. To study the tradeoffs between completeness and ambiguity we define a cover model of which previous frameworks are special cases. We analyze the theoretical complexity of variations of the cover problem, some of which aim at maximizing completeness while others aim at minimizing ambiguity. We show that variants of the schema cover problem are hard problems in general and formulate an exhaustive search solution using integer linear programming. We then provide a thorough empirical analysis, using both real-world and simulated data sets, showing empirically that the integer linear programming solution scales well for large schemata. We also show that some instantiations of the general schema cover problem are more effective than others.

    A Sexual Shift Induced by Silencing of a Single Insulin-Like Gene in Crayfish: Ovarian Upregulation and Testicular Degeneration

    In sequential hermaphrodites, intersexuality occurs naturally, usually as a transition state during sexual re-differentiation processes. In crustaceans, male sexual differentiation is controlled by the male-specific androgenic gland (AG). An AG-specific insulin-like gene, previously identified in the red-claw crayfish Cherax quadricarinatus (designated Cq-IAG), was found in this study to be the prominent transcript in an AG cDNA subtractive library. In C. quadricarinatus, sexual plasticity is exhibited by intersex individuals in the form of an active male reproductive system and male secondary sex characters, along with a constantly arrested ovary. This intersexuality was exploited to follow changes caused by single gene silencing, accomplished via dsRNA injection. Cq-IAG silencing induced dramatic sex-related alterations, including male feature feminization, a reduction in sperm production, extensive testicular degeneration, expression of the vitellogenin gene, and accumulation of yolk proteins in the developing oocytes. Upon silencing of the gene, AG cells hypertrophied, possibly to compensate for low hormone levels, as reflected in the poor production of the insulin-like hormone (and revealed by immunohistochemistry). These results demonstrate both the functionality of Cq-IAG as an androgenic hormone-encoding gene and the dependence of male gonad viability on the Cq-IAG product. This study is the first to provide evidence that silencing an insulin-like gene in intersex C. quadricarinatus feminizes male-related phenotypes. These findings, moreover, contribute to the understanding of the regulation of sexual shifts, whether naturally occurring in sequential hermaphrodites or abnormally induced by endocrine disruptors found in the environment, and offer insight into an unusual gender-related link to the evolution of insulins.

    A RT-qPCR system using a degenerate probe for specific identification and differentiation of SARS-CoV-2 Omicron (B.1.1.529) variants of concern

    Fast surveillance strategies are needed to control the spread of new emerging SARS-CoV-2 variants and gain time for evaluation of their pathogenic potential. This was essential for the Omicron variant (B.1.1.529) that replaced the Delta variant (B.1.617.2) and is currently the dominant SARS-CoV-2 variant circulating worldwide. RT-qPCR strategies complement whole genome sequencing, especially in resource lean countries, but mutations in the targeting primer and probe sequences of new emerging variants can lead to a failure of the existing RT-qPCRs. Here, we introduced an RT-qPCR platform for detecting the Delta- and the Omicron variant simultaneously using a degenerate probe targeting the key ΔH69/V70 mutation in the spike protein. By inclusion of the L452R mutation into the RT-qPCR platform, we could detect not only the Delta and the Omicron variants, but also the Omicron sub-lineages BA.1, BA.2 and BA.4/BA.5. The RT-qPCR platform was validated in small- and large-scale. It can easily be incorporated for continued monitoring of Omicron sub-lineages, and offers a fast adaption strategy of existing RT-qPCRs to detect new emerging SARS-CoV-2 variants using degenerate probes.