20 research outputs found
Querying Linked Data: An Experimental Evaluation of State-of-the-Art Interfaces
The adoption of Semantic Web technologies, and in particular the Open Data
initiative, has contributed to the steady growth of the number of datasets and
triples accessible on the Web. Most commonly, queries over RDF data are
evaluated over SPARQL endpoints. Recently, however, alternatives such as TPF
have been proposed with the goal of shifting query processing load from the
server running the SPARQL endpoint towards the client that issued the query.
Although these interfaces have been evaluated against standard benchmarks and
testbeds that showed their benefits over previous work in general, a
fine-granular evaluation of what types of queries exploit the strengths of the
different available interfaces has never been done. In this paper, we present
the results of our in-depth evaluation of existing RDF interfaces. In addition,
we also examine the influence of the backend on the performance of these
interfaces. Using representative and diverse query loads based on the query log
of a public SPARQL endpoint, we stress test the different interfaces and
backends and identify their strengths and weaknesses.Comment: 18 pages, 14 figure
A Proposal for a Two-Way Journey on Validating Locations in Unstructured and Structured Data
The Web of Data has grown explosively over the past few years, and as with any dataset, there are bound to be invalid statements in the data, as well as gaps. Natural Language Processing (NLP) is gaining interest to fill gaps in data by transforming (unstructured) text into structured data. However, there is currently a fundamental mismatch in approaches between Linked Data and NLP as the latter is often based on statistical methods, and the former on explicitly modelling knowledge. However, these fields can strengthen each other by joining forces. In this position paper, we argue that using linked data to validate the output of an NLP system, and using textual data to validate Linked Open Data (LOD) cloud statements is a promising research avenue. We illustrate our proposal with a proof of concept on a corpus of historical travel stories
Star Pattern Fragments: Accessing Knowledge Graphs through Star Patterns
The Semantic Web offers access to a vast Web of interlinked information
accessible via SPARQL endpoints. Such endpoints offer a well-defined interface
to retrieve results for complex SPARQL queries. The computational load for
processing such SPARQL endpoints offer access to a vast amount of interlinked
information. While they offer a well-defined interface for efficiently
retrieving results for complex SPARQL queries, complex query loads can easily
overload or crash endpoints as all the computational load of answering the
queries resides entirely with the server hosting the endpoint. Recently
proposed interfaces, such as Triple Pattern Fragments, have therefore shifted
some of the query processing load from the server to the client at the expense
of increased network traffic in the case of non-selective triple patterns. This
paper therefore proposes Star Pattern Fragments (SPF), an RDF interface
enabling a better load balancing between server and client by decomposing
SPARQL queries into star-shaped subqueries, evaluating them on the server side.
Experiments using synthetic data (WatDiv), as well as real data (DBpedia), show
that SPF does not only significantly reduce network traffic, it is also up to
two orders of magnitude faster than the state-of-the-art interfaces under high
query load