Search CORE

64 research outputs found

PFTijah: text search in an XML database system

Author: Flokstra J.
Hiemstra D.
Os R. van
Rode H.
Publication venue: Ecole Nationale Supérieure des Mines de Saint-Etienne
Publication date: 01/01/2006

This paper introduces the PFTijah system, a text search system that is integrated with an XML/XQuery database management system. We present examples of its use, we explain some of the system internals, and discuss plans for future work. PFTijah is part of the open source release of MonetDB/XQuery

CiteSeerX

University of Twente Research Information

A Database Approach to Content-based XML retrieval

Author: Hiemstra D.
Publication venue: European Research Consortium for Informatics and Mathematics (ERCIM)
Publication date: 01/01/2002

This paper describes a first prototype system for content-based retrieval from XML data. The system's design supports both XPath queries and complex information retrieval queries based on a language modelling approach to information retrieval. Evaluation using the INEX benchmark shows that it is beneficial if the system is biased to retrieve large XML fragments over small fragments

CiteSeerX

University of Twente Research Information

XML and Context: Structural Features Relevant to Search Tasks

Author: Ramirez Camps G. (Georgina)
Vries A.P. (Arjen) de
Publication venue: American College of Medical Physics (ACMP)
Publication date: 01/01/2005

We describe ongoing research into the relationship between search tasks and information retrieval strategies with respect to the use of structural information. We define a classification of search tasks in structured document collections and analyse the relevance of different structural features regarding each of these tasks for the INEX collection. The results presented show important differences in relevance of different features such as size and type of the components regarding informational and resource tasks

Multimedia search without visual analysis: the value of linguistic and contextual information

Author: Jong Franciska M.G. de
Vries Arjen P. de
Westerveld Thijs
Publication venue: IEEE Computer Society Press
Publication date: 01/01/2007

This paper addresses the focus of this special issue by analyzing the potential contribution of linguistic content and other non-image aspects to the processing of audiovisual data. It summarizes the various ways in which linguistic content analysis contributes to enhancing the semantic annotation of multimedia content, and, as a consequence, to improving the effectiveness of conceptual media access tools. A number of techniques are presented, including the time-alignment of textual resources, audio and speech processing, content reduction and reasoning tools, and the exploitation of surface features

CiteSeerX

University of Twente Research Information

Structural Features in XML Retrieval

Author: Ramirez Camps G. (Georgina)
Publication date: 02/11/2007

Structural features in content oriented XML retrieval

Author: Ramirez Camps G. (Georgina)
Vries A.P. (Arjen) de
Westerveld T.H.W. (Thijs)
Publication venue: ACM Press
Publication date: 01/01/2005

The structural features of XML components are an extra source of information that should be used in a content-oriented retrieval task on this type of document. In this paper we explore one of the structural features from the INEX collection that could be used in content-oriented search. We analyse the gain this knowledge could add to the performance of an information retrieval system and present a first approach to extracting this structural information from a relevance feedback process so that it can be used as priors in a language modelling framework

Vague element selection and query rewriting for XML retrieval

Author: Blok H.E.
Hiemstra Djoerd
Mihajlovic V.
Publication venue: Neslia Paniculata
Publication date: 01/01/2006

In this paper we present the extension of our prototype three-level database system (TIJAH) developed for structured information retrieval. The extension is aimed at modeling vague search on XML elements. All three levels (conceptual, logical, and physical) of the TIJAH system are enhanced to support vague search concepts. The vague search is implemented as vague selection of XML elements using XML element name expansion lists and rewriting techniques. We test the performance of retrieval models using automatically generated expansion lists and compare them with models that use manual ones. The goal is to find the best approach for structured information retrieval with vague structural constraints on element names expressed in the query.

CiteSeerX

University of Twente Research Information

Investigating the document structure as a source of evidence for multimedia fragment retrieval

Author: Boughanem Mohand
Pinel-Sauvagnat Karen
Torjmen-Khemakhem Mouna
Publication venue: Elsevier BV
Publication date: 01/11/2013

Multimedia objects can be retrieved using their context, which can be, for instance, the text surrounding them in documents. This text may be either near or far from the searched objects. Our goal in this paper is to study the impact, in terms of effectiveness, of text position relative to searched objects. The multimedia objects we consider are described in structured documents such as XML ones. The document structure is therefore exploited to provide this text position in documents. Although structural information has been shown to be an effective source of evidence in textual information retrieval, only a few works have investigated its value in multimedia retrieval. More precisely, the task we address in this paper is retrieving multimedia fragments (i.e. XML elements having at least one multimedia object). Our general approach is built on two steps: we first retrieve XML elements containing multimedia objects, and we then explore the surrounding information to retrieve relevant multimedia fragments. In both cases, we study the impact of the surrounding information using the document structure. Our work is carried out on images, but it can be extended to any other media, since the physical content of multimedia objects is not used. We conducted several experiments in the context of the Multimedia track of the INEX evaluation campaign. Results showed that structural evidence is of high interest for tuning the importance of textual context for multimedia retrieval. Moreover, the proposed approach outperforms state-of-the-art approaches