5 research outputs found

The SPIRIT collection: an overview of a large web collection

Authors: Cacheda F., Hideo Joho, Mark Sanderson
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2004

A large scale collection of web pages has been essential for research in information retrieval and related areas. This paper provides an overview of a large web collection used in the SPIRIT project for the design and testing of spatially-aware retrieval systems. Several statistics are derived and presented to show the characteristics of the collection.

CiteSeerX

Crossref

White Rose Research Online

Wikipedia-Based Semantic Enhancements for Information Nugget Retrieval

Author: MacKinnon Ian
Publication venue: 'University of Waterloo'
Publication date: 01/01/2008

When the objective of an information retrieval task is to return a nugget rather than a document, query terms that exist in a document often will not be used in the most relevant nugget in the document for the query. In this thesis a new method of query expansion is proposed based on the Wikipedia link structure surrounding the most relevant articles selected either automatically or by human assessors for the query. Evaluated with the Nuggeteer automatic scoring software, which we show to have a high correlation with human assessor scores for the ciQA 2006 topics, an increase in the F-scores is found from the TREC Complex Interactive Question Answering task when integrating this expansion into an already high-performing baseline system. In addition, the method for finding synonyms using Wikipedia is evaluated using more common synonym detection tasks.

University of Waterloo's Institutional Repository

From Document Retrieval to Question Answering

Author: Monz C.
Publication venue: ILLC
Publication date: 01/01/2003

CiteSeerX

University of Twente Research Information

UvA-DARE

International Migration, Integration and Social Cohesion online publications

The theory of extended topic and its application in information retrieval

Author: Yin Ling
Publication date: 01/09/2012

University of Brighton Research Portal

The Impact of Corpus Size on Question Answering Performance

Authors: C. L. A. Clarke, E. L. Terra, G. V. Cormack, M. Laszlo, T. R. Lynam
Publication date: 01/01/2002

Using our question answering system, questions from the TREC 2001 evaluation were executed over a series of Web data collections, with the sizes of the collections increasing from 25 gigabytes up to nearly a terabyte.

CiteSeerX

Crossref