Search CORE

37 research outputs found

Efficient XML Keyword Search based on DAG-Compression

Author: Böttcher Stefan
Hartel Rita
Rabe Jonathan
Publication venue
Publication date: 26/11/2013
Field of study

In contrast to XML query languages as e.g. XPath which require knowledge on the query language as well as on the document structure, keyword search is open to anybody. As the size of XML sources grows rapidly, the need for efficient search indices on XML data that support keyword search increases. In this paper, we present an approach of XML keyword search which is based on the DAG of the XML data, where repeated substructures are considered only once, and therefore, have to be searched only once. As our performance evaluation shows, this DAG-based extension of the set intersection search algorithm[1], [2], can lead to search times that are on large documents more than twice as fast as the search times of the XML-based approach. Additionally, we utilize a smaller index, i.e., we consume less main memory to compute the results

arXiv.org e-Print Archive

CiteSeerX

Quasi-SLCA based Keyword Query Processing over Probabilistic XML Data

Author: Li Jianxin
Liu Chengfei
Yu Jeffrey Xu
Zhou Rui
Publication venue
Publication date: 10/01/2013
Field of study

The probabilistic threshold query is one of the most common queries in uncertain databases, where a result satisfying the query must be also with probability meeting the threshold requirement. In this paper, we investigate probabilistic threshold keyword queries (PrTKQ) over XML data, which is not studied before. We first introduce the notion of quasi-SLCA and use it to represent results for a PrTKQ with the consideration of possible world semantics. Then we design a probabilistic inverted (PI) index that can be used to quickly return the qualified answers and filter out the unqualified ones based on our proposed lower/upper bounds. After that, we propose two efficient and comparable algorithms: Baseline Algorithm and PI index-based Algorithm. To accelerate the performance of algorithms, we also utilize probability density function. An empirical study using real and synthetic data sets has verified the effectiveness and the efficiency of our approaches

arXiv.org e-Print Archive

Deakin Research Online

Adelaide Research & Scholarship

Swinburne Research Bank

Efficient search of general and-or keyword queries in XML data

Author: WANG XIANJUN
Publication venue
Publication date: 28/09/2007
Field of study

Master'sMASTER OF SCIENC

ScholarBank@NUS

No-But-Semantic-Match: Computing Semantically Matched XML Keyword Search Results

Author: Islam Md. Saiful
Liu Chengfei
Moser Irene
Naseriparsa Mehdi
Publication venue
Publication date: 06/03/2017
Field of study

Users are rarely familiar with the content of a data source they are querying, and therefore cannot avoid using keywords that do not exist in the data source. Traditional systems may respond with an empty result, causing dissatisfaction, while the data source in effect holds semantically related content. In this paper we study this no-but-semantic-match problem on XML keyword search and propose a solution which enables us to present the top-k semantically related results to the user. Our solution involves two steps: (a) extracting semantically related candidate queries from the original query and (b) processing candidate queries and retrieving the top-k semantically related results. Candidate queries are generated by replacement of non-mapped keywords with candidate keywords obtained from an ontological knowledge base. Candidate results are scored using their cohesiveness and their similarity to the original query. Since the number of queries to process can be large, with each result having to be analyzed, we propose pruning techniques to retrieve the top-

k

results efficiently. We develop two query processing algorithms based on our pruning techniques. Further, we exploit a property of the candidate queries to propose a technique for processing multiple queries in batch, which improves the performance substantially. Extensive experiments on two real datasets verify the effectiveness and efficiency of the proposed approaches.Comment: 24 pages, 21 figures, 6 tables, submitted to The VLDB Journal for possible publicatio

arXiv.org e-Print Archive

Federation ResearchOnline

Web Queries: From a Web of Data to a Semantic Web?

Author: Bry François
Furche Tim
Vossen Gottfried
Weiand Klara
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009
Field of study

Open Access LMU

Efficient and effective keyword search in XML database

Author: CHEN BO
Publication venue
Publication date: 26/05/2008
Field of study

Master'sMASTER OF SCIENC

ScholarBank@NUS