Search CORE

69 research outputs found

Word sense disambiguation and information retrieval

Author: Sanderson M.
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/1994
Field of study

It has often been thought that word sense ambiguity is a cause of poor performance in Information Retrieval (IR) systems. The belief is that if ambiguous words can be correctly disambiguated, IR performance will increase. However, recent research into the application of a word sense disambiguator to an IR system failed to show any performance increase. From these results it has become clear that more basic research is needed to investigate the relationship between sense ambiguity, disambiguation, and IR. Using a technique that introduces additional sense ambiguity into a collection, this paper presents research that goes beyond previous work in this field to reveal the influence that ambiguity and disambiguation have on a probabilistic IR system. We conclude that word sense ambiguity is only problematic to an IR system when it is retrieving from very short queries. In addition we argue that if a word sense disambiguator is to be of any use to an IR system, the disambiguator must be able to resolve word senses to a high degree of accuracy

White Rose Research Online

Word sense disambiguation and information retrieval

Author: Sanderson M.
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/1914
Field of study

MIT Libraries Dome

White Rose Research Online

Document Ranking Method for High Precision Rate

Author: Jeon Mee-Sun
Park Se-Young
Publication venue: City University of Hong Kong
Publication date: 01/01/1995
Field of study

Waseda University Repository

Analysis of equivalence mapping for terminology services

Author: Macgregor G.
McCulloch E.
Publication venue: 'SAGE Publications'
Publication date: 01/01/2007
Field of study

This paper assesses the range of equivalence or mapping types required to facilitate interoperability in the context of a distributed terminology server. A detailed set of mapping types were examined, with a view to determining their validity for characterizing relationships between mappings from selected terminologies (AAT, LCSH, MeSH, and UNESCO) to the Dewey Decimal Classification (DDC) scheme. It was hypothesized that the detailed set of 19 match types proposed by Chaplan in 1995 is unnecessary in this context and that they could be reduced to a less detailed conceptually-based set. Results from an extensive mapping exercise support the main hypothesis and a generic suite of match types are proposed, although doubt remains over the current adequacy of the developing Simple Knowledge Organization System (SKOS) Core Mapping Vocabulary Specification (MVS) for inter-terminology mapping

CiteSeerX

E-LIS

University of Strathclyde Institutional Repository

Word sense discrimination in information retrieval: a spectral clustering-based approach

Author: Chifu Adrian-Gabriel
Hristea Florentina
Mothe Josiane
Popescu Marius
Publication venue: 'Elsevier BV'
Publication date: 01/07/2014
Field of study

International audienceWord sense ambiguity has been identified as a cause of poor precision in information retrieval (IR) systems. Word sense disambiguation and discrimination methods have been defined to help systems choose which documents should be retrieved in relation to an ambiguous query. However, the only approaches that show a genuine benefit for word sense discrimination or disambiguation in IR are generally supervised ones. In this paper we propose a new unsupervised method that uses word sense discrimination in IR. The method we develop is based on spectral clustering and reorders an initially retrieved document list by boosting documents that are semantically similar to the target query. For several TREC ad hoc collections we show that our method is useful in the case of queries which contain ambiguous terms. We are interested in improving the level of precision after 5, 10 and 30 retrieved documents (P@5, P@10, P@30) respectively. We show that precision can be improved by 8% above current state-of-the-art baselines. We also focus on poor performing queries

Crossref

Scientific Publications of the University of Toulouse II Le Mirail

Open Archive Toulouse Archive Ouverte

HAL Descartes

Implementation and Evaluation of an Indexing Model of Teaching and Learning Resources

Author: Guemmat Kamal EL
Lahmar El habib BEN
Lamrani El Khadir
Talea Mohamed
Publication venue: The Authors. Published by Elsevier Ltd.
Publication date: 02/06/2015
Field of study

AbstractWith the advent of teaching and learning resources (TLR), indexing becomes essential to ensure his identification, adaptation, reuse and sharing.Several Models indexing of TLR emerged.The problem is to go on a coherent model that would ensure interoperability between stakeholders in the learning domain (designer, developer, teacher resource center) and between systems that manage these resources.There are several standard of TLR indexing among them (DUBLIN CORE, LOM, SCORM, IMS-LD…) and we have the models to represent the semantic context of the content and have a combination of both.We designed our previous contributions Model entitled MIMTLR(El Guemmat et al., 2013a), which defines a Multi indexing model of Teaching and Learning Resources that aims to enhance the limited indexing LOM standard, with a semantic content indexing is based on ontology's. The purpose of this paper is to evaluate, implemented of MIMTLR and validate itby a programming languages, to ensure that it best meets the constraints provide by a powerful model indexing of TLR.We will present simulation and the advantages of this model that meets the needs identified in the development of teaching and learning resources that will be useful for those involved in information and communication technology for teaching and learning (ICTTL) especially E-Learning

Elsevier - Publisher Connector

Retrieving with good sense

Author: Sanderson M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2000
Field of study

Although always present in text, word sense ambiguity only recently became regarded as a problem to information retrieval which was potentially solvable. The growth of interest in word senses resulted from new directions taken in disambiguation research. This paper first outlines this research and surveys the resulting efforts in information retrieval. Although the majority of attempts to improve retrieval effectiveness were unsuccessful, much was learnt from the research. Most notably a notion of under what circumstance disambiguation may prove of use to retrieval

CiteSeerX

White Rose Research Online

Statistical versus symbolic parsing for captioned-information retrieval / Workshop on the Balancing Act, ACL-94, Las Cruces NM, July 1994

Author: Rowe Neil C.
Publication venue: Monterey, California. Naval Postgraduate School
Publication date: 01/07/1994
Field of study

Workshop on the Balancing Act, ACL-94, Las Cruces NM, July 1994We discuss implementation issues of MARIE-1, a mostly symbolic parser fully implemented, and MARIE-2, a more statistical parser partially implemented. They address a corpus of 100,000 picture captions. We argue that the mixed approach of MARIE-2 should be better for this corpus because its algorithms (not data) are simpler.This work was sponsored by DARPA as part of the I3 Project under AO 8939. Copyright is held by the ACL.This work was sponsored by DARPA as part of the I3 Project under AO 8939. Copyright is held by the ACL

Calhoun, Institutional Archive of the Naval Postgraduate School