Search CORE

774 research outputs found

Combining Indexing Schemes to Accelerate Querying XML on Content and Structure

Author: Ramirez Camps G. (Georgina)
Vries A.P. (Arjen) de
Publication venue: C.T.I.T.
Publication date: 01/01/2004
Field of study

This paper presents the advantages of combining multiple document representation schemes for query processing of XML queries on content and structure. We show how extending the Text Region approach [2] with the main features of the Binary Relation approach developed in [8] leads to a considerable speed-up in the processing of the XPath location steps. We detail how, by using the combined scheme, we reduce the number of structural joins used to process the XPath steps, while simultaneously limiting the amount of memory usage. We discuss optimisation strategies enabled by the new `combined representation scheme'. Experiments comparing the efficiency of alternative query processing strategies on a subset of the queries used at INEX 2003 (the Initiative for the Evaluation of XML Retrieval [4]) demonstrate a favourable performance for the combined indexing scheme

CWI's Institutional Repository

Accelerating data retrieval steps in XML documents

Author: Shen Yun
Publication venue
Publication date: 01/01/2005
Field of study

Repository@Hull - Worktribe

A database approach to information retrieval:The remarkable relationship between language models and region models

Author: Hiemstra Djoerd
Mihajlovic V.
Publication venue: Centre for Telematics and Information Technology (CTIT)
Publication date: 01/01/2005
Field of study

In this report, we unify two quite distinct approaches to information retrieval: region models and language models. Region models were developed for structured document retrieval. They provide a well-defined behaviour as well as a simple query language that allows application developers to rapidly develop applications. Language models are particularly useful to reason about the ranking of search results, and for developing new ranking approaches. The unified model allows application developers to define complex language modeling approaches as logical queries on a textual database. We show a remarkable one-to-one relationship between region queries and the language models they represent for a wide variety of applications: simple ad-hoc search, cross-language retrieval, video retrieval, and web search

University of Twente Research Information

Meeting of the MINDS: an information retrieval research agenda

Author: Allan J.
Callan J.
Clarke C.L.A.
Dumais S.
Evans D.A.
Sanderson M.
Zhai C.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/12/2007
Field of study

Since its inception in the late 1950s, the field of Information Retrieval (IR) has developed tools that help people find, organize, and analyze information. The key early influences on the field are well-known. Among them are H. P. Luhn's pioneering work, the development of the vector space retrieval model by Salton and his students, Cleverdon's development of the Cranfield experimental methodology, Spärck Jones' development of idf, and a series of probabilistic retrieval models by Robertson and Croft. Until the development of the WorldWideWeb (Web), IR was of greatest interest to professional information analysts such as librarians, intelligence analysts, the legal community, and the pharmaceutical industry

White Rose Research Online

Searching and browsing Linked Data with SWSE: The Semantic Web Search Engine

Author: Aidan Hogan
Alani
Andreas Harth
Axel Polleres
Batsakis
Bechhofer
Berners-Lee
Bizer
Boldi
Bonatti
Brin
Broekstra
Caverlee
Chakrabarti
Chen
Cheng
Dietze
Diligenti
Ding
Dong
Ehrig
Elmagarmid
Erdös
Fagin
Fensel
Friendly
Glaser
Harth
Harth
Hatcher
He
Heydon
Hirai
Hitzler
Hogan
Huynh
Jürgen Umbrich
Kleinberg
Lee
Lopez
Meditskos
Najork
Neumann
Newcombe
Oren
Oren
Pant
Polleres
Sheila Kinsella
Stefan Decker
Stonebraker
ter Horst
Thelwall
Wei
Weiss
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

SIQXC: Schema Independent Queryable XML Compression for Smartphones

Author: Dinakenyane Otlhapile
Publication venue: 'University of Sheffield Conference Proceedings'
Publication date: 01/01/2014
Field of study

The explosive growth of XML use over the last decade has led to a lot of research on how to best store and access it. This growth has resulted in XML being described as a de facto standard for storage and exchange of data over the web. However, XML has high redundancy because of its self-‐ describing nature making it verbose. The verbose nature of XML poses a storage problem. This has led to much research devoted to XML compression. It has become of more interest since the use of resource constrained devices is also on the rise. These devices are limited in storage space, processing power and also have finite energy. Therefore, these devices cannot cope with storing and processing large XML documents. XML queryable compression methods could be a solution but none of them has a query processor that runs on such devices. Currently, wireless connections are used to alleviate the problem but they have adverse effects on the battery life. They are therefore not a sustainable solution. This thesis describes an attempt to address this problem by proposing a queryable compressor (SIQXC) with a query processor that runs in a resource constrained environment thereby lowering wireless connection dependency yet alleviating the storage problem. It applies a novel simple 2 tuple integer encoding system, clustering and gzip. SIQXC achieves an average compression ratio of 70% which is higher than most queryable XML compressors and also supports a wide range of XPATH operators making it competitive approach. It was tested through a practical implementation evaluated against the real data that is usually used for XML benchmarking. The evaluation covered the compression ratio, compression time and query evaluation accuracy and response time. SIQXC allows users to some extent locally store and manipulate the otherwise verbose XML on their Smartphones

White Rose E-theses Online

Edge Influence Computation in Dynamic Graphs

Author: Falkner Nickolas J. G.
Parkinson Simon
Qin Yongrui
Sheng Quan Z.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/03/2017
Field of study

Reachability queries are of great importance in many research and application areas, including general graph mining, social network analysis and so on. Many approaches have been proposed to compute whether there exists one path from one node to another node in a graph. Most of these approaches focus on static graphs, however in practice dynamic graphs are more common. In this paper, we focus on handling graph reachability queries in dynamic graphs. Specifically we investigate the influence of a given edge in the graph, aiming to study the overall reachability changes in the graph brought by the possible failure/deletion of the edge. To this end, we firstly develop an efficient update algorithm for handling edge deletions. We then define the edge influence concept and put forward a novel computation algorithm to accelerate the computation of edge influence. We evaluate our approach using several real world datasets. The experimental results show that our approach outperforms traditional approaches significantly

Crossref

University of Huddersfield Repository

Huddersfield Research Portal