The Simplest Evaluation Measures for XML Information Retrieval that Could Possibly Work

Author: Hiemstra D.
Mihajlovic V.
Publication venue: University of Otago
Publication date: 01/01/2005

This paper reviews several evaluation measures developed for evaluating XML information retrieval (IR) systems. We argue that these measures, some of which are currently in use by the INitiative for the Evaluation of XML Retrieval (INEX), are complicated, hard to understand, and hard to explain to users of XML IR systems. To show the value of keeping things simple, we report alternative evaluation results of official evaluation runs submitted to INEX 2004 using simple metrics, and show its value for INEX

CiteSeerX

University of Twente Research Information

The State-of-the-arts in Focused Search

Author: Li Rongmei
Publication venue: University of Twente, Centre for Telematics and Information Technology
Publication date: 01/01/2009

The continuous influx of various text data on the Web requires search engines to improve their retrieval abilities for more specific information. The need for results relevant to a user’s topic of interest has gone beyond search for domain- or type-specific documents to more focused results (e.g. document fragments or answers to a query). The introduction of XML provides a format standard for data representation, storage, and exchange. It allows focused search to be carried out at different granularities of a structured document with XML markup. This report aims at reviewing the state-of-the-arts in focused search, particularly techniques for topic-specific document retrieval, passage retrieval, XML retrieval, and entity ranking. It concludes by highlighting open problems

University of Twente Research Information

Exploiting Query Structure and Document Structure to Improve Document Retrieval Effectiveness

Author: Apers P.M.G.
Blok H.E.
Hiemstra D.
Mihajlovic V.
Publication venue: Centre for Telematics and Information Technology, University of Twente
Publication date: 01/01/2006

In this paper we present a systematic analysis of document retrieval using unstructured and structured queries within the score region algebra (SRA) structured retrieval framework. The behavior of different retrieval models, namely Boolean, tf.idf, GPX, language models, and Okapi, is tested using the transparent SRA framework in our three-level structured retrieval system called TIJAH. The retrieval models are implemented along four elementary retrieval aspects: element and term selection, element score computation, score combination, and score propagation. The analysis is performed on numerous experiments evaluated on TREC and CLEF collections, using manually generated unstructured and structured queries. Unstructured queries range from short title queries to long title + description + narrative queries. For generating structured queries we exploit the knowledge of the document structure and the content used to semantically describe or classify documents. We show that such structured information can be utilized in retrieval engines to give more precise answers to user queries than when using unstructured queries

Radboud Repository

University of Twente Research Information

Queensland University of Technology at TREC 2005

Author: Geva Shlomo
King John
Lu Chengye
Sahama Tony
Woodley Alan
Publication venue: National Institute of Standards and Technology (NIST)
Publication date: 01/01/2005

The Information Retrieval and Web Intelligence (IR-WI) research group is a research team at the Faculty of Information Technology, QUT, Brisbane, Australia. The IR-WI group participated in the Terabyte and Robust tracks at TREC 2005, both for the first time. For the Robust track we applied our existing information retrieval system, originally designed for structured (XML) retrieval, to the domain of document retrieval. For the Terabyte track we experimented with an open source IR system, Zettair, and performed two types of experiments. First, we compared Zettair’s performance on a high-powered supercomputer with its performance on a distributed system across seven midrange personal computers. Second, we compared Zettair’s performance when a standard TREC title is used with its performance for a natural language query and for a query expanded with synonyms. We compare the systems in terms of both efficiency and retrieval performance. Our results indicate that the distributed system is faster than the supercomputer, while slightly decreasing retrieval performance, and that natural language queries also slightly decrease retrieval performance, while our query expansion technique significantly decreased performance

Queensland University of Technology ePrints Archive

Structural Features in XML Retrieval

Author: Ramirez Camps G. (Georgina)
Publication venue
Publication date: 02/11/2007

CWI's Institutional Repository

Seven years of INEX interactive retrieval experiments – lessons and challenges

Author: Nordlie Ragnar
Pharo Nils
Publication venue: Springer Verlag
Publication date: 01/01/2012

This paper summarizes a major effort in interactive search investigation, the INEX i-track, a collective effort run over a seven-year period. We present the experimental conditions, report some of the findings of the participating groups, and examine the challenges posed by this kind of collective experimental effort

NORA - Norwegian Open Research Archives

Open Digital Archive at Oslo and Akershus University College

Field-Weighted XML Retrieval Based on BM25.

Author: A. Theobald
C.L.A. Clarke
J. Kekäläinen
J.-N. Vittaut
P. Ogilvie
R.R. Larson
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2005

This is the first year of the Centre for Interactive Systems Research's participation in INEX. Based on a newly developed XML indexing and retrieval system built on Okapi, we extend Robertson’s field-weighted BM25F for document retrieval to an element-level retrieval function, BM25E. In this paper, we introduce this new function and our experimental method in detail, and then show how we tuned weights for our selected fields by using INEX 2004 topics and assessments. Based on the tuned models we submitted our runs for CO.Thorough and CO.FetchBrowse; the methods we propose show real promise. Existing problems and future work are also discussed

City Research Online

Crossref

The State-of-the-arts in Focused Search

Author
Publication venue: Centre for Telematics and Information Technology (CTIT)
Publication date: 08/07/2009

University of Twente Research Information

The effect of granularity and order in XML element retrieval

Author: Pharo Nils
Publication venue: Elsevier
Publication date: 01/01/2008

The article presents an analysis of the effect of granularity and order in an XML encoded collection of full text journal articles. 218 sessions of searchers performing simulated work tasks in the collection have been analysed. The results show that searchers prefer to use smaller sections of the article as their source of information. In interaction sessions during which articles are assessed, however, the articles are to a large degree evaluated as more important than their sections and subsections

NORA - Norwegian Open Research Archives

Open Digital Archive at Oslo and Akershus University College

Overview of the INEX 2008 Interactive Track

Author: Fachry Khairun Nisa
Nordlie Ragnar
Pharo Nils
Publication venue: Springer Verlag
Publication date: 01/01/2009

This paper presents the organization of the INEX 2008 interactive track. In this year’s iTrack we aimed at exploring the value of element retrieval for two different task types, fact-finding and research tasks. Two research groups collected data from 29 test persons, each performing two tasks. We describe the methods used for data collection and the tasks performed by the participants. A general result indicates that test persons were more satisfied when completing the research task than when completing the fact-finding task. In our experiment, test persons regarded the research task as easier, were more satisfied with the search results, and found more relevant information for the research tasks

NORA - Norwegian Open Research Archives

Open Digital Archive at Oslo and Akershus University College