Using TREC for cross-comparison between classic IR and ontology-based search models at a Web scale

Abstract

The construction of standard datasets and benchmarks to evaluate ontology-based search approaches and to compare them against baseline IR models is a major open problem in the semantic technologies community. In this paper we propose a novel evaluation benchmark for ontology-based IR models based on an adaptation of the well-known Cranfield paradigm (Cleverdon, 1967) traditionally used by the IR community. The proposed benchmark comprises: 1) a text document collection, 2) a set of queries and their corresponding document relevance judgments, and 3) a set of ontologies and knowledge bases covering the query topics. The document collection and the set of queries and judgments are taken from one of the most widely used datasets in the IR community: the TREC Web track. As a use case, we apply the proposed benchmark to compare a real ontology-based search model (Fernandez et al., 2008) against the best IR systems of the TREC 9 and TREC 2001 competitions. The paper concludes with a detailed analysis of the strengths and weaknesses of the benchmark and a discussion of how it can be used to evaluate other ontology-based search systems.
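For concreteness, the sketch below illustrates the Cranfield-style evaluation the benchmark rests on: given TREC-format relevance judgments (qrels) and a system's ranked run file, it computes mean average precision (MAP), a standard measure for comparing systems on the same topic set. This is a minimal illustration, not the authors' implementation; the file names `qrels.trec9.txt` and `ontology_model.run` are hypothetical placeholders.

```python
from collections import defaultdict

def load_qrels(path):
    """Parse TREC qrels lines: 'topic iteration docno relevance'."""
    qrels = defaultdict(set)
    with open(path) as f:
        for line in f:
            topic, _, docno, rel = line.split()
            if int(rel) > 0:          # keep only documents judged relevant
                qrels[topic].add(docno)
    return qrels

def load_run(path):
    """Parse TREC run lines: 'topic Q0 docno rank score tag'."""
    run = defaultdict(list)
    with open(path) as f:
        for line in f:
            topic, _, docno, _, score, _ = line.split()
            run[topic].append((float(score), docno))
    # Rank each topic's documents by descending retrieval score.
    return {t: [d for _, d in sorted(docs, reverse=True)]
            for t, docs in run.items()}

def average_precision(ranked, relevant):
    """AP: mean of precision values at the ranks of relevant documents."""
    hits, total = 0, 0.0
    for i, docno in enumerate(ranked, start=1):
        if docno in relevant:
            hits += 1
            total += hits / i
    return total / len(relevant) if relevant else 0.0

def mean_average_precision(run, qrels):
    """MAP over all judged topics; unretrieved topics score 0."""
    aps = [average_precision(run.get(t, []), rel) for t, rel in qrels.items()]
    return sum(aps) / len(aps) if aps else 0.0

if __name__ == "__main__":
    qrels = load_qrels("qrels.trec9.txt")    # hypothetical file names
    run = load_run("ontology_model.run")
    print(f"MAP: {mean_average_precision(run, qrels):.4f}")
```

Because the judgments and topics are fixed, any system that produces a run file over the same collection, whether a classic IR baseline or an ontology-based model, can be scored and compared on equal terms.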
