Search CORE

1,444 research outputs found

FishMark: A Linked Data Application Benchmark

Author: Alkiviadous S.
Bail S.
Concalves R. S.
Garilao Cristina
Parsia B.
van Harmelen M.
Workman D.
Publication venue: CEUR
Publication date: 01/01/2012
Field of study

Abstract. FishBase is an important species data collection produced by the FishBase Information and Research Group Inc (FIN), a not-forprofit NGO with the aim of collecting comprehensive information (from the taxonomic to the ecological) about all the world’s finned fish species. FishBase is exposed as a MySQL backed website (supporting a range of canned, although complex queries) and serves over 33 million hits per month. FishDelish is a transformation of FishBase into LinkedData weighing in at 1.38 billion triples. We have ported a substantial number of FishBase SQL queries to FishDelish SPARQL query which form the basis of a new linked data application benchmark (using our derivative of the Berlin SPARQL Benchmark harness). We use this benchmarking framework to compare the performance of the native MySQL application, the Virtuoso RDF triple store, and the Quest OBDA system on a fishbase.org like application.

OceanRep

CiteSeerX

The University of Manchester - Institutional Repository

A pragmatic approach to semantic repositories benchmarking

Author: G. Kobilarov
J. Broekstra
L. Ma
M. Hausenblas
M. Mongiello
O. Erling
S. Auer
Y. Guo
Z. Ding
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

The aim of this paper is to benchmark various semantic repositories in order to evaluate their deployment in a commercial image retrieval and browsing application. We adopt a two-phase approach for evaluating the target semantic repositories: analytical parameters such as query language and reasoning support are used to select the pool of the target repositories, and practical parameters such as load and query response times are used to select the best match to application requirements. In addition to utilising a widely accepted benchmark for OWL repositories (UOBM), we also use a real-life dataset from the target application, which provides us with the opportunity of consolidating our findings. A distinctive advantage of this benchmarking study is that the essential requirements for the target system such as the semantic expressivity and data scalability are clearly defined, which allows us to claim contribution to the benchmarking methodology for this class of applications

CiteSeerX

Crossref

Nottingham Trent Institutional Repository (IRep)

Scaling out federated queries for life sciences data in production

Author: Constandt Hans
De Vocht Laurens
De Witte Dieter
Mannens Erik
Pattyn Filip
Verborgh Ruben
Publication venue
Publication date: 01/01/2016
Field of study

Ghent University Academic Bibliography

Geographica: A Benchmark for Geospatial RDF Stores

Author: Garbis George
Koubarakis Manolis
Kyzirakos Kostis
Publication venue
Publication date: 24/05/2013
Field of study

Geospatial extensions of SPARQL like GeoSPARQL and stSPARQL have recently been defined and corresponding geospatial RDF stores have been implemented. However, there is no widely used benchmark for evaluating geospatial RDF stores which takes into account recent advances to the state of the art in this area. In this paper, we develop a benchmark, called Geographica, which uses both real-world and synthetic data to test the offered functionality and the performance of some prominent geospatial RDF stores

arXiv.org e-Print Archive

CiteSeerX

An Empirical Study of Real-World SPARQL Queries

Author: Arias Mario
de la Fuente Pablo
Fernández Javier D.
Martínez-Prieto Miguel A.
Publication venue
Publication date: 01/01/2011
Field of study

Understanding how users tailor their SPARQL queries is crucial when designing query evaluation engines or fine-tuning RDF stores with performance in mind. In this paper we analyze 3 million real-world SPARQL queries extracted from logs of the DBPedia and SWDF public endpoints. We aim at finding which are the most used language elements both from syntactical and structural perspectives, paying special attention to triple patterns and joins, since they are indeed some of the most expensive SPARQL operations at evaluation phase. We have determined that most of the queries are simple and include few triple patterns and joins, being Subject-Subject, Subject-Object and Object-Object the most common join types. The graph patterns are usually star-shaped and despite triple pattern chains exist, they are generally short.Comment: 1st International Workshop on Usage Analysis and the Web of Data (USEWOD2011) in the 20th International World Wide Web Conference (WWW2011), Hyderabad, India, March 28th, 201

arXiv.org e-Print Archive

CiteSeerX

Enabling Fine-Grained HTTP Caching of SPARQL Query Results

Author: Gregory Todd Williams
Jesse Weaver
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Abstract. As SPARQL endpoints are increasingly used to serve linked data, their ability to scale becomes crucial. Although much work has been done to improve query evaluation, little has been done to take advantage of caching. Effective solutions for caching query results can improve scala-bility by reducing latency, network IO, and CPU overhead. We show that simple augmentation of the database indexes found in common SPARQL implementations can directly lead to effective caching at the HTTP pro-tocol level. Using tests from the Berlin SPARQL benchmark, we evaluate the potential of such caching to improve overall efficiency of SPARQL query evaluation.

CiteSeerX

Crossref