Location of Repository

Interactive cross-language document selection

By D. Oard, J. Gonzalo, M. Sanderson, F. Lopez-Ostenero and J. Wang

Abstract

The problem of finding documents written in a language that the searcher cannot read is perhaps the most challenging application of cross-language information retrieval technology. In interactive applications, that task involves at least two steps: (1) the machine locates promising documents in a collection that is larger than the searcher could scan, and (2) the searcher recognizes documents relevant to their intended use from among those nominated by the machine. This article presents the results of experiments designed to explore three techniques for supporting interactive relevance assessment: (1) full machine translation, (2) rapid term-by-term translation, and (3) focused phrase translation. Machine translation was found to better support this task than term-by-term translation, and focused phrase translation further improved recall without an adverse effect on precision. The article concludes with an assessment of the strengths and weaknesses of the evaluation framework used in this study and some remarks on implications of these results for future evaluation campaigns.\u

Publisher: Springer
Year: 2004
OAI identifier: oai:eprints.whiterose.ac.uk:4560

Suggested articles

Preview


To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.