Search CORE

15,937 research outputs found

Recommended from our members

Using TREC for cross-comparison between classic IR and ontology-based search models at a Web scale

Author: Castells Pablo
Fernandez Miriam
Lopez Vanessa
Motta Enrico
Sabou Marta
Uren Victoria
Vallet David
Publication venue
Publication date: 01/01/2009
Field of study

The construction of standard datasets and benchmarks to evaluate ontology-based search approaches and to compare then against baseline IR models is a major open problem in the semantic technologies community. In this paper we propose a novel evaluation benchmark for ontology-based IR models based on an adaptation of the well-known Cranfield paradigm (Cleverdon, 1967) traditionally used by the IR community. The proposed benchmark comprises: 1) a text document collection, 2) a set of queries and their corresponding document relevance judgments and 3) a set of ontologies and Knowledge Bases covering the query topics. The document collection and the set of queries and judgments are taken from one of the most widely used datasets in the IR community, the TREC Web track. As a use case example we apply the proposed benchmark to compare a real ontology-based search model (Fernandez, et al., 2008) against the best IR systems of TREC 9 and TREC 2001 competitions. A deep analysis of the strengths and weaknesses of this benchmark and a discussion of how it can be used to evaluate other ontology-based search systems is also included at the end of the paper

Open Research Online (The Open University)

Biblos-e Archivo

Term-Specific Eigenvector-Centrality in Multi-Relation Networks

Author: Bry François
Furche Tim
Kneißl Fabian
Weiand Klara
Publication venue: 'Inderscience Publishers'
Publication date: 01/01/2011
Field of study

Fuzzy matching and ranking are two information retrieval techniques widely used in web search. Their application to structured data, however, remains an open problem. This article investigates how eigenvector-centrality can be used for approximate matching in multi-relation graphs, that is, graphs where connections of many different types may exist. Based on an extension of the PageRank matrix, eigenvectors representing the distribution of a term after propagating term weights between related data items are computed. The result is an index which takes the document structure into account and can be used with standard document retrieval techniques. As the scheme takes the shape of an index transformation, all necessary calculations are performed during index tim

CiteSeerX

Crossref

Open Access LMU

Ontologies on the semantic web

Author: Ashburner
Berners-Lee
Berners-Lee
Bollobas
Borgida
Brachman
Brachman
Brooks
Buchanan
Burton-Jones
Bush
Cayzer
Chisholm
Copeland
Cost
Cruse
De Bruijn
Decker
Fensel
Fensel
Frege
Genesereth
Goble
Gruber
Gruber
Guha
Harré
Heery
Heflin
Hendler
Hendler
Horrocks
Horrocks
Kant
Kirk
Klein
Legg
Lenat
Lenat
Lenat
Lenat
Lindsay
Lowe
Lowe
Maedche
McCool
McGuinness
McIlraith
Minsky
Noy
Noy
Pease
Peirce
Peirce
Quillian
Quine
Rorty
Rozenberg
Schlick
Sicilia
Smith
Smith
Smith
Sowa
Sowa
Sowa
Weinberger
Weiss
Zalta
Publication venue: 'Wiley'
Publication date: 01/01/2007
Field of study

As an informational technology, the World Wide Web has enjoyed spectacular success. In just ten years it has transformed the way information is produced, stored, and shared in arenas as diverse as shopping, family photo albums, and high-level academic research. The “Semantic Web” was touted by its developers as equally revolutionary but has not yet achieved anything like the Web’s exponential uptake. This 17 000 word survey article explores why this might be so, from a perspective that bridges both philosophy and IT

Deakin Research Online

Crossref

Research Commons@Waikato

Spatial information retrieval and geographical ontologies: an overview of the SPIRIT project

Author: Jones C.
Purves R.
Ruas A.
Sanderson M.
Sester M.
van Kreveld M.
Weibel R.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2002
Field of study

A large proportion of the resources available on the world-wide web refer to information that may be regarded as geographically located. Thus most activities and enterprises take place in one or more places on the Earth's surface and there is a wealth of survey data, images, maps and reports that relate to specific places or regions. Despite the prevalence of geographical context, existing web search facilities are poorly adapted to help people find information that relates to a particular location. When the name of a place is typed into a typical search engine, web pages that include that name in their text will be retrieved, but it is likely that many resources that are also associated with the place may not be retrieved. Thus resources relating to places that are inside the specified place may not be found, nor may be places that are nearby or that are equivalent but referred to by another name. Specification of geographical context frequently requires the use of spatial relationships concerning distance or containment for example, yet such terminology cannot be understood by existing search engines. Here we provide a brief survey of existing facilities for geographical information retrieval on the web, before describing a set of tools and techniques that are being developed in the project SPIRIT : Spatially-Aware Information Retrieval on the Internet (funded by European Commission Framework V Project IST-2001-35047)

CiteSeerX

Crossref

Online Research @ Cardiff

White Rose Research Online

Utrecht University Repository

Requirements for Information Extraction for Knowledge Management

Author: Cimiano Philipp
Ciravegna Fabio
Domingue John
Handschuh Siegfried
Lavelli Alberto
Staab Steffen
Stevenson Mark
Publication venue
Publication date: 01/01/2003
Field of study

Knowledge Management (KM) systems inherently suffer from the knowledge acquisition bottleneck - the difficulty of modeling and formalizing knowledge relevant for specific domains. A potential solution to this problem is Information Extraction (IE) technology. However, IE was originally developed for database population and there is a mismatch between what is required to successfully perform KM and what current IE technology provides. In this paper we begin to address this issue by outlining requirements for IE based KM

Archivio della ricerca - Fondazione Bruno Kessler

Open Research Online (The Open University)