11 research outputs found

    A Progressive Visual Analytics Tool for Incremental Experimental Evaluation

    Full text link
    This paper presents a visual tool, AVIATOR, that integrates the progressive visual analytics paradigm in the IR evaluation process. This tool serves to speed-up and facilitate the performance assessment of retrieval models enabling a result analysis through visual facilities. AVIATOR goes one step beyond the common "compute wait visualize" analytics paradigm, introducing a continuous evaluation mechanism that minimizes human and computational resource consumption

    A visual analytics tool for the incremental evaluation of search engines

    Get PDF
    In the field of information retrieval (IR), it is of fundamental importance the activity of evaluating the performance of an information retrieval system (IRS), by means of specific metrics. For this reason, we developed AVIATOR, a visual analytics tool capable of incrementally indexing a document collection in order to help IR experts to automatically obtain the values for the evaluation metrics, so that they can be compared dynamically as the indexing process proceeds

    Search, access, and explore life science nanopublications on the Web

    No full text
    Nanopublications are Resource Description Framework (RDF) graphs encoding scientific facts extracted from the literature and enriched with provenance and attribution information. There are millions of nanopublications currently available on the Web, especially in the life science domain. Nanopublications are thought to facilitate the discovery, exploration, and re-use of scientific facts. Nevertheless, they are still not widely used by scientists outside specific circles; they are hard to find and rarely cited. We believe this is due to the lack of services to seek, find and understand nanopublications\u2019 content. To this end, we present the NanoWeb application to seamlessly search, access, explore, and re-use the nanopublications publicly available on the Web. For the time being, NanoWeb focuses on the life science domain where the vastest amount of nanopublications are available. It is a unified access point to the world of nanopublications enabling search over graph data, direct connections to evidence papers, and scientific curated databases, and visual and intuitive exploration of the relation network created by the encoded scientific facts

    Building a large gene expression-cancer knowledge base with limited human annotations

    No full text
    Cancer prevention is one of the most pressing challenges that public health needs to face. In this regard, data-driven research is central to assist medical solutions targeting cancer. To fully harness the power of data-driven research, it is imperative to have well-organized machine-readable facts into a knowledge base (KB). Motivated by this urgent need, we introduce the Collaborative Oriented Relation Extraction (CORE) system for building KBs with limited manual annotations. CORE is based on the combination of distant supervision and active learning paradigms and offers a seamless, transparent, modular architecture equipped for large-scale processing. We focus on precision medicine and build the largest KB on ‘fine-grained’ gene expression–cancer associations—a key to complement and validate experimental data for cancer research. We show the robustness of CORE and discuss the usefulness of the provided KB

    Linking Historical Evidence to Digital Maps: The MICOLL Map

    No full text
    This paper introduces the MICOLL Map, a digital historical map which is currently under construction by the ERC-funded project MICOLL: Migrating Commercial Law and Language: Rethinking Lex Mercatoria (11th-17th Centuries). The eventual aim of the map is to display the changing routes by which goods and information circulated in the late Middle Ages and the early modern period, with an initial focus on Northern Italy, Southern Germany, and Trans-Alpine exchange. The paper will firstly survey existing digital historical mapping tools before explaining how the MICOLL Map aims to go beyond the current state of the art in a number of ways, chiefly through the promotion of source transparency which will enable the map to be used as a source by professional historians. The second half of the paper will outline the current technical solutions in place to achieve this

    Modelling digital health data: The ExaMode ontology for computational pathology

    Get PDF
    Computational pathology can significantly benefit from ontologies to standardize the employed nomenclature and help with knowledge extraction processes for high-quality annotated image datasets. The end goal is to reach a shared model for digital pathology to overcome data variability and integration problems. Indeed, data annotation in such a specific domain is still an unsolved challenge and datasets cannot be steadily reused in diverse contexts due to heterogeneity issues of the adopted labels, multilingualism, and different clinical practices. Material and methods: This paper presents the ExaMode ontology, modeling the histopathology process by considering 3 key cancer diseases (colon, cervical, and lung tumors) and celiac disease. The ExaMode ontology has been designed bottom-up in an iterative fashion with continuous feedback and validation from pathologists and clinicians. The ontology is organized into 5 semantic areas that defines an ontological template to model any disease of interest in histopathology. Results: The ExaMode ontology is currently being used as a common semantic layer in: (i) an entity linking tool for the automatic annotation of medical records; (ii) a web-based collaborative annotation tool for histopathology text reports; and (iii) a software platform for building holistic solutions integrating multimodal histopathology data. Discussion: The ontology ExaMode is a key means to store data in a graph database according to the RDF data model. The creation of an RDF dataset can help develop more accurate algorithms for image analysis, especially in the field of digital pathology. This approach allows for seamless data integration and a unified query access point, from which we can extract relevant clinical insights about the considered diseases using SPARQL queries
    corecore