Search CORE

6,887 research outputs found

Ontology-Based MEDLINE Document Classification

Author: J.T. Eppig
M.E. Funk
R. Rada
T. Joachims
T. Joachims
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

An increasing and overwhelming amount of biomedical information is available in the research literature mainly in the form of free-text. Biologists need tools that automate their information search and deal with the high volume and ambiguity of free-text. Ontologies can help automatic information processing by providing standard concepts and information about the relationships between concepts. The Medical Subject Headings (MeSH) ontology is already available and used by MEDLINE indexers to annotate the conceptual content of biomedical articles. This paper presents a domain-independent method that uses the MeSH ontology inter-concept relationships to extend the existing MeSH-based representation of MEDLINE documents. The extension method is evaluated within a document triage task organized by the Genomics track of the 2005 Text REtrieval Conference (TREC). Our method for extending the representation of documents leads to an improvement of 17% over a non-extended baseline in terms of normalized utility, the metric defined for the task. The SVMlight software is used to classify documents

CiteSeerX

Crossref

Irish Universities

DCU Online Research Access Service

On the use of clustering and the MeSH controlled vocabulary to improve MEDLINE abstract search

Author: Blott Stephen
Camous Fabrice
Gurrin Cathal
Jones Gareth J.F.
Publication venue
Publication date: 01/03/2005
Field of study

Databases of genomic documents contain substantial amounts of structured information in addition to the texts of titles and abstracts. Unstructured information retrieval techniques fail to take advantage of the structured information available. This paper describes a technique to improve upon traditional retrieval methods by clustering the retrieval result set into two distinct clusters using additional structural information. Our hypothesis is that the relevant documents are to be found in the tightest cluster of the two, as suggested by van Rijsbergen's cluster hypothesis. We present an experimental evaluation of these ideas based on the relevance judgments of the 2004 TREC workshop Genomics track, and the CLUTO software clustering package

Irish Universities

DCU Online Research Access Service

Report on the EHCR (Deliverable 26.1)

Author: Kalra D.
Publication venue: UCL (University College London)
Publication date: 01/01/2005
Field of study

The challenge of richly interpreting electronic health information, in order to populate EHR instances with suitable terms, to provide decision support in the care of individuals, to identify suitable patients for teaching or clinical trials recruitment, and to mine populations of records for public health or to discover new medical knowledge, all require that the heterogeneous clinical entry instances within EHR repositories can be systematically analysed and interpreted. Achieving this requires the combination and co-operation of many different health informatics tools and technologies, underpinned by shared representations of clinical concepts and inferencing formalisms. Much of this work is at the level of R&D, and is well represented across the Semantic Mining consortium. The challenge of WP26 is to build up a vision of the ways in which these historically independent threads of health informatics research can collaborate, and uncover the research challenges that are needed in order to deliver good demonstrations of semantically indexed and richly analysable EHRs. The partners have begun WP26 by acquiring a better knowledge of each other’s areas of endeavour, and are beginning to steer their research interests towards future areas of collaboration

UCL Discovery

Graph-Sparse LDA: A Topic Model with Structured Sparsity

Author: Adams Ryan
Doshi-Velez Finale
Wallace Byron
Publication venue
Publication date: 21/11/2014
Field of study

Originally designed to model text, topic modeling has become a powerful tool for uncovering latent structure in domains including medicine, finance, and vision. The goals for the model vary depending on the application: in some cases, the discovered topics may be used for prediction or some other downstream task. In other cases, the content of the topic itself may be of intrinsic scientific interest. Unfortunately, even using modern sparse techniques, the discovered topics are often difficult to interpret due to the high dimensionality of the underlying space. To improve topic interpretability, we introduce Graph-Sparse LDA, a hierarchical topic model that leverages knowledge of relationships between words (e.g., as encoded by an ontology). In our model, topics are summarized by a few latent concept-words from the underlying graph that explain the observed words. Graph-Sparse LDA recovers sparse, interpretable summaries on two real-world biomedical datasets while matching state-of-the-art prediction performance

arXiv.org e-Print Archive

CiteSeerX

Association for the Advancement of Artificial Intelligence: AAAI Publications

User experiments with the Eurovision cross-language image retrieval system

Author: Armitage
Ballesteros
Chen
Clough
Clough
Clough
Cox
Dunlop
Elworthy
Flank
Flank
Flank
Gollins
Gonzalo
Goodrum
Grefenstette
Guglielmo
Harmandas
Houghton
Hutchins
Jones
Oard
Oard
Peters
Qu
Reid
Resnik
Robertson
Sanderson
Sanderson
Schäuble
Smeaton
Smeaton
Smeulders
Sormunen
Srihari
Systran Ltd.
Voorhees
Publication venue: 'Wiley'
Publication date: 01/01/2006
Field of study

In this paper we present Eurovision, a text-based system for cross-language (CL) image retrieval. The system is evaluated by multilingual users for two search tasks with the system configured in English and five other languages. To our knowledge this is the first published set of user experiments for CL image retrieval. We show that: (1) it is possible to create a usable multilingual search engine using little knowledge of any language other than English, (2) categorizing images assists the user's search, and (3) there are differences in the way users search between the proposed search tasks. Based on the two search tasks and user feedback, we describe important aspects of any CL image retrieval system

Crossref

RMIT Research Repository

White Rose Research Online