2,661 research outputs found
Taming Existence in RDF Querying
We introduce the recursive, rule-based RDF query language
RDFLog. RDFLog extends previous RDF query languages by arbitrary
quantifier alternation: blank nodes may occur in the scope of all, some,
or none of the universal variables of a rule. In addition RDFLog is aware
of important RDF features such as the distinction between blank nodes,
literals and URIs or the RDFS vocabulary. The semantics of RDFLog is
closed (every answer is an RDF graph), but lifts RDFās restrictions on
literal and blank node occurrences for intermediary data. We show how
to define a sound and complete operational semantics that can be implemented
using existing logic programming techniques. Using RDFLog
we classify previous approaches to RDF querying along their support for
blank node construction and show equivalence between languages with
full quantifier alternation and languages with only āā rules
Everything you always wanted to know about blank nodes (but were afraid to ask)
In this paper we thoroughly cover the issue of blank nodes, which have been defined in RDF as "existential variables". We
first introduce the theoretical precedent for existential blank nodes from first order logic and incomplete Information
in database theory. We then cover the different (and sometimes incompatible) treatment of blank nodes across the
W3C stack of RDF-related standards. We present an empirical survey of the blank nodes present in a large sample of
RDF data published on the Web (the BTC-2012 dataset), where we find that 25.7% of unique RDF terms are blank
nodes, that 44.9% of documents and 66.2% of domains featured use of at least one blank node, and that aside from
one Linked Data domain whose RDF data contains many "blank node cycles", the vast majority of blank nodes form
tree structures that are efficient to compute simple entailment over. With respect to the RDF-merge of the full data,
we show that 6.1% of blank-nodes are redundant under simple entailment. The vast majority of non-lean cases are
isomorphisms resulting from multiple blank nodes with no discriminating information being given within an RDF
document or documents being duplicated in multiple Web locations. Although simple entailment is NP-complete and
leanness-checking is coNP-complete, in computing this latter result, we demonstrate that in practice, real-world RDF
graphs are sufficiently "rich" in ground information for problematic cases to be avoided by non-naive algorithms
Updating OWL2 ontologies using pruned rulesets
Evolution in Semantic Web content produces difference files (deltas) that track changes between RDF versions. These changes may represent ontology modifications and be expressed in OWL. The deltas can be used to reduce the storage and bandwidth overhead involved in disseminating ontology updates. Minimising the delta size can be achieved by reasoning over the underlying knowledge base. OWL 2 is a development of the OWL 1 standard that incorporates new features to aid application development. Among the sub languages of OWL 2, OWL 2 RL/RDF provides an enriched rule set that extends the semantic capability of the OWL environment. This additional semantic content can be exploited in change detection approaches that strive to minimise the alterations to be made when ontologies are updated. The presence of blank nodes (i.e. nodes that are neither a URI nor a literal) in RDF collections provides a further challenge to ontology change detection because of the practical problems they introduce when comparing data structures before and after update. In the light of OWL 2 RL/RDF, this paper examines the potential for reducing the delta size by pruning the application of unnecessary rules from the reasoning process and using an approach to delta generation that produces the smallest number of updates. It also assesses the impact of alternative approaches to handling blank nodes during the change detection process in ontology structures. The results indicate that pruning the rule set is a potentially expensive process but has the benefit of reducing the joins when carrying out the subsequent inferencing
vSPARQL: A View Definition Language for the Semantic Web
Translational medicine applications would like to leverage the biological and biomedical ontologies, vocabularies, and data sets available on the semantic web. We present a general solution for RDF information set reuse inspired by database views. Our view definition language, vSPARQL, allows applications to specify the exact content that they are interested in and how that content should be restructured or modified. Applications can access relevant content by querying against these view definitions. We evaluate the expressivity of our approach by defining views for practical use cases and comparing our view definition language to existing query languages
Data Model and Query Constructs for Versatile Web Query Languages
As the Semantic Web is gaining momentum, the need for
truly versatile query languages becomes increasingly apparent. A Web
query language is called versatile if it can access in the same query program
data in different formats (e.g. XML and RDF). Most query languages
are not versatile: they have not been specifically designed to cope
with both worlds, providing a uniform language and common constructs
to query and transform data in various formats. Moreover, most of them
do not provide a flexible data model that is powerful enough to naturally
convey both Semantic Web data formats (especially RDF and
Topic Maps) and XML. This article highlights challenges related to the
data model and language constructs for querying both standard Web
and Semantic Web data with an emphasis on facilitating sophisticated
reasoning. It is shown that Xcerptās data model and querying constructs
are particularly well-suited for the Semantic Web, but that some adjustments
of the Xcerpt syntax allow for even more effective and natural
querying of RDF and Topic Maps
Four Lessons in Versatility or How Query Languages Adapt to the Web
Exposing not only human-centered information, but machine-processable data on the Web is one of the commonalities of recent Web trends. It has enabled a new kind of applications and businesses where the data is used in ways not foreseen by the data providers. Yet this exposition has fractured the Web into islands of data, each in different Web formats: Some providers choose XML, others RDF, again others JSON or OWL, for their data, even in similar domains. This fracturing stifles innovation as application builders have to cope not only with one Web stack (e.g., XML technology) but with several ones, each of considerable complexity. With Xcerpt we have developed a rule- and pattern based query language that aims to give shield application builders from much of this complexity: In a single query language XML and RDF data can be accessed, processed, combined, and re-published. Though the need for combined access to XML and RDF data has been recognized in previous work (including the W3Cās GRDDL), our approach differs in four main aspects: (1) We provide a single language (rather than two separate or embedded languages), thus minimizing the conceptual overhead of dealing with disparate data formats. (2) Both the declarative (logic-based) and the operational semantics are unified in that they apply for querying XML and RDF in the same way. (3) We show that the resulting query language can be implemented reusing traditional database technology, if desirable. Nevertheless, we also give a unified evaluation approach based on interval labelings of graphs that is at least as fast as existing approaches for tree-shaped XML data, yet provides linear time and space querying also for many RDF graphs. We believe that Web query languages are the right tool for declarative data access in Web applications and that Xcerpt is a significant step towards a more convenient, yet highly efficient data access in a āWeb of Dataā
RDF Querying
Reactive Web systems, Web services, and Web-based publish/
subscribe systems communicate events as XML messages, and in
many cases require composite event detection: it is not sufficient to react
to single event messages, but events have to be considered in relation to
other events that are received over time.
Emphasizing language design and formal semantics, we describe the
rule-based query language XChangeEQ for detecting composite events.
XChangeEQ is designed to completely cover and integrate the four complementary
querying dimensions: event data, event composition, temporal
relationships, and event accumulation. Semantics are provided as
model and fixpoint theories; while this is an established approach for rule
languages, it has not been applied for event queries before
Reasoning & Querying ā State of the Art
Various query languages for Web and Semantic Web data, both for practical use and as an area of research in the scientific community, have emerged in recent years. At the same time, the broad adoption of the internet where keyword search is used in many applications, e.g. search engines, has familiarized casual users with using keyword queries to retrieve information on the internet. Unlike this easy-to-use querying, traditional query languages require knowledge of the language itself as well as of the data to be queried. Keyword-based query languages for XML and RDF bridge the gap between the two, aiming at enabling simple querying of semi-structured data, which is relevant e.g. in the context of the emerging Semantic Web. This article presents an overview of the field of keyword querying for XML and RDF
- ā¦