51,785 research outputs found
Overview of ImageCLEF 2018: Challenges, Datasets and Evaluation
This paper presents an overview of the ImageCLEF 2018 evaluation campaign, an event that was organized as part of the CLEF (Conference and Labs of the Evaluation Forum) Labs 2018. ImageCLEF is an ongoing initiative (it started in 2003) that promotes the evaluation of technologies for annotation, indexing and retrieval with the aim of providing information access to collections of images in various usage scenarios and domains. In 2018, the 16th edition of ImageCLEF ran three main tasks and a pilot task: (1) a caption prediction task that aims at predicting the caption of a figure from the biomedical literature based only on the figure image; (2) a tuberculosis task that aims at detecting the tuberculosis type, severity and drug resistance from CT (Computed Tomography) volumes of the lung; (3) a LifeLog task (videos, images and other sources) about daily activities understanding and moment retrieval, and (4) a pilot task on visual question answering where systems are tasked with answering medical questions. The strong participation, with over 100 research groups registering and 31 submitting results for the tasks, shows an increasing interest in this benchmarking campaign
A Relation Extraction Approach for Clinical Decision Support
In this paper, we investigate how semantic relations between concepts
extracted from medical documents can be employed to improve the retrieval of
medical literature. Semantic relations explicitly represent relatedness between
concepts and carry high informative power that can be leveraged to improve the
effectiveness of retrieval functionalities of clinical decision support
systems. We present preliminary results and show how relations are able to
provide a sizable increase of the precision for several topics, albeit having
no impact on others. We then discuss some future directions to minimize the
impact of negative results while maximizing the impact of good results.Comment: 4 pages, 1 figure, DTMBio-KMH 2018, in conjunction with ACM 27th
Conference on Information and Knowledge Management (CIKM), October 22-26
2018, Lingotto, Turin, Ital
Ontology-Based MEDLINE Document Classification
An increasing and overwhelming amount of biomedical information is available in the research literature mainly in the form of free-text. Biologists need tools that automate their information search and deal with the high volume and ambiguity of free-text. Ontologies can help automatic information processing by providing standard concepts and information about the relationships between concepts. The Medical Subject Headings (MeSH) ontology is already available and used by MEDLINE indexers to annotate the conceptual content of biomedical articles. This paper presents a domain-independent method that uses the MeSH ontology inter-concept relationships to extend the existing MeSH-based representation of MEDLINE documents. The extension method is evaluated within a document triage task organized by the Genomics track of the 2005 Text REtrieval Conference (TREC). Our method for extending the representation of documents leads to an improvement of 17% over a non-extended baseline in terms of normalized utility, the metric defined for the task. The SVMlight software is used to classify documents
Issues in the Design of a Pilot Concept-Based Query Interface for the Neuroinformatics Information Framework
This paper describes a pilot query interface that has been constructed to help us explore a "concept-based" approach for searching the
Neuroscience Information Framework (NIF). The query interface is
concept-based in the sense that the search terms submitted through the
interface are selected from a standardized vocabulary of terms
(concepts) that are structured in the form of an ontology. The NIF
contains three primary resources: the NIF Resource Registry, the NIF
Document Archive, and the NIF Database Mediator. These NIF resources
are very different in their nature and therefore pose challenges when
designing a single interface from which searches can be automatically
launched against all three resources simultaneously. The paper first
discusses briefly several background issues involving the use of
standardized biomedical vocabularies in biomedical information
retrieval, and then presents a detailed example that illustrates how
the pilot concept-based query interface operates. The paper concludes
by discussing certain lessons learned in the development of the current
version of the interface
Improving average ranking precision in user searches for biomedical research datasets
Availability of research datasets is keystone for health and life science
study reproducibility and scientific progress. Due to the heterogeneity and
complexity of these data, a main challenge to be overcome by research data
management systems is to provide users with the best answers for their search
queries. In the context of the 2016 bioCADDIE Dataset Retrieval Challenge, we
investigate a novel ranking pipeline to improve the search of datasets used in
biomedical experiments. Our system comprises a query expansion model based on
word embeddings, a similarity measure algorithm that takes into consideration
the relevance of the query terms, and a dataset categorisation method that
boosts the rank of datasets matching query constraints. The system was
evaluated using a corpus with 800k datasets and 21 annotated user queries. Our
system provides competitive results when compared to the other challenge
participants. In the official run, it achieved the highest infAP among the
participants, being +22.3% higher than the median infAP of the participant's
best submissions. Overall, it is ranked at top 2 if an aggregated metric using
the best official measures per participant is considered. The query expansion
method showed positive impact on the system's performance increasing our
baseline up to +5.0% and +3.4% for the infAP and infNDCG metrics, respectively.
Our similarity measure algorithm seems to be robust, in particular compared to
Divergence From Randomness framework, having smaller performance variations
under different training conditions. Finally, the result categorization did not
have significant impact on the system's performance. We believe that our
solution could be used to enhance biomedical dataset management systems. In
particular, the use of data driven query expansion methods could be an
alternative to the complexity of biomedical terminologies
- …