Exploiting Query Structure and Document Structure to Improve Document Retrieval Effectiveness

Apers, P.M.G.; Blok, H.E.; Hiemstra, D.; Mihajlovic, V.

Authors: P.M.G. Apers, H.E. Blok, D. Hiemstra, V. Mihajlovic
Publication date: 1 January 2006
Publisher: Centre for Telematics and Information Technology, University of Twente

Abstract

In this paper we present a systematic analysis of document retrieval using unstructured and structured queries within the score region algebra (SRA) structured retrieval framework. The behavior of different retrieval models, namely Boolean, tf.idf, GPX, language models, and Okapi, is tested using the transparent SRA framework in our three-level structured retrieval system called TIJAH. The retrieval models are implemented along four elementary retrieval aspects: element and term selection, element score computation, score combination, and score propagation. The analysis is based on numerous experiments evaluated on TREC and CLEF collections, using manually generated unstructured and structured queries. Unstructured queries range from short title queries to long title + description + narrative queries. For generating structured queries we exploit knowledge of the document structure and of the content used to semantically describe or classify documents. We show that such structured information can be utilized in retrieval engines to give more precise answers to user queries than when using unstructured queries.
