Search CORE

45,791 research outputs found

Querying XML Documents Made Easy: Nearest Concept Queries

Author: Kersten M.L. (Martin)
Schmidt A.R.
Windhouwer M.A. (Menzo)
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2001
Field of study

Due to the ubiquity and popularity of XML, users often are in the following situation: they want to query XML documents which contain potentially interesting information but they are unaware of the mark-up structure that is used. For example, it is easy to guess the contents of an XML bibliography file whereas the mark-up depends on the methodological, cultural and personal background of the author(s). Nonetheless, it is this hierarchical structure that forms the basis of XML query languages. In this paper we exploit the tree structure of XML documents to equip users with a powerful tool, the meet operator, that lets them query databases with whose content they are familiar, but without requiring knowledge of tags and hierarchies. Our approach is based on computing the lowest common ancestor of nodes in the XML syntax tree: eg, given two strings, we are looking for nodes whose offspring contains these two strings. The novelty of this approach is that the result type is unknown at query formulation time and dependent on the database instance. If the two strings are an author's name and a year, mainly publications of the author in this year are returned. If the two strings are numbers the result mostly consists of publications that have the numbers as year or page numbers. Because the result type of a query is not specified by the user we refer to the lowest common ancestor as nearest concept We also present a running example taken from the bibliography domain, and demonstrate that the operator can be implemented efficiently

CWI's Institutional Repository

International Migration, Integration and Social Cohesion online publications

User Feedback in Probabilistic XML

Author: Keijzer A. de
Keulen M. van
Publication venue: Centre for Telematics and Information Technology, University of Twente
Publication date: 01/01/2007
Field of study

Data integration is a challenging problem in many application areas. Approaches mostly attempt to resolve semantic uncertainty and conflicts between information sources as part of the data integration process. In some application areas, this is impractical or even prohibitive, for example, in an ambient environment where devices on an ad hoc basis have to exchange information autonomously. We have proposed a probabilistic XML approach that allows data integration without user involvement by storing semantic uncertainty and conflicts in the integrated XML data. As a\ud consequence, the integrated information source represents\ud all possible appearances of objects in the real world, the\ud so-called possible worlds.\ud \ud In this paper, we show how user feedback on query results\ud can resolve semantic uncertainty and conflicts in the\ud integrated data. Hence, user involvement is effectively postponed to query time, when a user is already interacting actively with the system. The technique relates positive and\ud negative statements on query answers to the possible worlds\ud of the information source thereby either reinforcing, penalizing, or eliminating possible worlds. We show that after repeated user feedback, an integrated information source better resembles the real world and may converge towards a non-probabilistic information source

University of Twente Research Information

XGI: A Graphical Interface for XQuery Creation and XML Schema Visualization

Author: Li Xiang
Publication venue
Publication date: 01/12/2006
Field of study

XML (Extensible Markup Language) is used in many contexts of modern information technology to facilitate sharing of information between heterogeneous data sources and inter-platform applications. The prevalence of XML implementation in data storage and exchange necessitates a method to adequately query XML data. The World Wide Web Consortium (W3C) is proposing XQuery as the standard querying language for semistructured XML data. XQuery is designed for experienced database programmers, since its syntax and capabilities are analogous to the SQL relational query language. Therefore, the inherent complexity of formulating XQuery statements makes it an intimidating task for anyone, except an expert in the XQuery language, to construct queries. The development of XQuery Graphical Interface (XGI), a visual interface for creating XQuery in a graphical format, is motivated by the need to simplify the query formation for unskilled users and speed up the query construction for expert users. The implementation of XGI is mainly inspired by three existing systems: Query and Reporting Semistructured Data (QURSED), XQuery By Example (XQBE), and XBrain. A review of these systems and many other systems has helped us understand the benefits and drawbacks of various system design approaches, and has assisted us in identifying a set of features for XGI that will successfully reduce the complexity of creating queries in the XQuery language. XGI provides a web interface for users to explore their own XML source data schema, search for specific schema elements, and visually create queries in the XQuery language for the targeted XML data source. A validation of the XGI system has verified its ability to efficiently and accurately create queries for various XML data sources. From the validation, we have recognized some strengths and weaknesses of the XGI system compared to other systems. We also recommend several areas in which XGI can be improved

University of Washington Structural Informatics Group Publications

Compression vs Queryability - A Case Study

Author: Anantharaman Siva
Publication venue: Dagstuhl Seminar Proceedings. 08261 - Structure-Based Compression of Complex Massive Data
Publication date: 01/01/2008
Field of study

International audienceSome compromise on compression is known to be necessary, if the relative positions of the information stored by semi-structured documents are to remain accessible under queries. With this in view, we compare, on an example, the `query-friendliness' of XML documents, when compressed into straightline tree grammars which are either regular or context-free. The queries considered are in a limited fragment of XPath, corresponding to a type of patterns; each such query defines naturally a non-deterministic, bottom-up `query automaton' that runs just as well on a tree as on its compressed dag

Dagstuhl Research Online Publication Server

Constraint-based Query Distribution Framework for an Integrated Global Schema

Author: Iftikhar Nadeem
Malik Ahmad Kamran
Qadir Muhammad Abdul
Usman Muhammad
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2009
Field of study

Distributed heterogeneous data sources need to be queried uniformly using global schema. Query on global schema is reformulated so that it can be executed on local data sources. Constraints in global schema and mappings are used for source selection, query optimization,and querying partitioned and replicated data sources. The provided system is all XML-based which poses query in XML form, transforms, and integrates local results in an XML document. Contributions include the use of constraints in our existing global schema which help in source selection and query optimization, and a global query distribution framework for querying distributed heterogeneous data sources.Comment: The Proceedings of the 13th INMIC 2009), Dec. 14-15, 2009, Islamabad, Pakistan. Pages 1 - 6 Print ISBN: 978-1-4244-4872-2 INSPEC Accession Number: 11072575 Date of Current Version : 15 January 201

arXiv.org e-Print Archive

VBN

Reasoning & Querying – State of the Art

Author: Bry François
Furche Tim
Weiand Klara
Publication venue
Publication date: 31/08/2008
Field of study

Various query languages for Web and Semantic Web data, both for practical use and as an area of research in the scientific community, have emerged in recent years. At the same time, the broad adoption of the internet where keyword search is used in many applications, e.g. search engines, has familiarized casual users with using keyword queries to retrieve information on the internet. Unlike this easy-to-use querying, traditional query languages require knowledge of the language itself as well as of the data to be queried. Keyword-based query languages for XML and RDF bridge the gap between the two, aiming at enabling simple querying of semi-structured data, which is relevant e.g. in the context of the emerging Semantic Web. This article presents an overview of the field of keyword querying for XML and RDF

Open Access LMU