1,548 research outputs found

Predicting SPARQL Query Performance

Author: Gandon Fabien
Hasan Rakebul
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/05/2014

We address the problem of predicting SPARQL query performance. We use machine learning techniques to learn SPARQL query performance from previously executed queries. We show how to model SPARQL queries as feature vectors, and use k-nearest neighbors regression and Support Vector Machine with the nu-SVR kernel to accurately (R^2 value of 0.98526) predict SPARQL query execution time.
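The R^2 figure quoted in the abstract above is the coefficient of determination. As a reminder of what that number measures, here is a minimal sketch using the standard definition, R^2 = 1 - SS_res / SS_tot. The function name and inputs are illustrative assumptions and are not taken from the paper.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot.

    `actual` and `predicted` are equal-length sequences of numbers,
    e.g. measured and predicted query execution times.
    """
    if len(actual) != len(predicted) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_actual = sum(actual) / len(actual)
    # Residual sum of squares: error left over after prediction.
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    # Total sum of squares: spread of the actual values around their mean.
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all actual values are equal")
    return 1.0 - ss_res / ss_tot
```

A value close to 1 means the predictions explain nearly all of the variance in the observed execution times; a model that always predicts the mean scores 0.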

CiteSeerX

HAL-UNICE

INRIA a CCSD electronic archive server

HAL Descartes

A Machine Learning Approach to SPARQL Query Performance Prediction

Author: Gandon Fabien
Hasan Rakebul
Publication venue: HAL CCSD
Publication date: 11/08/2014

In this paper we address the problem of predicting SPARQL query performance. We use machine learning techniques to learn SPARQL query performance from previously executed queries. Traditional approaches for estimating SPARQL query cost are based on statistics about the underlying data. However, in many use-cases involving querying Linked Data, statistics about the underlying data are often missing. Our approach does not require any statistics about the underlying RDF data, which makes it ideal for the Linked Data scenario. We show how to model SPARQL queries as feature vectors, and use k-nearest neighbors regression and Support Vector Machine with the nu-SVR kernel to accurately predict SPARQL query execution time.
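The general idea in the abstract above, turning a query into a feature vector and predicting its execution time from similar past queries, can be sketched as follows. This is a minimal illustration only: the feature set (keyword counts plus the number of distinct variables), the names `query_features` and `knn_predict`, and the plain Euclidean distance are assumptions made for this example, not the features or models used by the authors.

```python
import math
import re

# Hypothetical feature set: the abstract does not list the paper's features.
KEYWORDS = ("SELECT", "OPTIONAL", "FILTER", "UNION", "DISTINCT", "LIMIT")


def query_features(query):
    """Map a SPARQL query string to a small numeric feature vector."""
    upper = query.upper()
    # One count per keyword, matched on word boundaries (rough: keywords
    # inside IRIs or string literals are counted too).
    counts = [len(re.findall(r"\b" + kw + r"\b", upper)) for kw in KEYWORDS]
    # Number of distinct variables such as ?s or ?name.
    n_vars = len(set(re.findall(r"\?\w+", query)))
    return counts + [n_vars]


def knn_predict(train, features, k=3):
    """k-nearest-neighbors regression.

    `train` is a list of (feature_vector, execution_time) pairs from
    previously executed queries; the prediction is the mean execution
    time of the k training vectors closest to `features`.
    """
    if not train:
        raise ValueError("training set must not be empty")
    nearest = sorted(train, key=lambda item: math.dist(item[0], features))[:k]
    return sum(time for _, time in nearest) / len(nearest)
```

In use, one would build `train` from a query log by pairing `query_features(q)` with the measured execution time of each query `q`, then call `knn_predict(train, query_features(new_query))`. Note that nothing here needs statistics about the underlying RDF data, which is the property the abstract emphasizes.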

CiteSeerX

HAL-UNICE

INRIA a CCSD electronic archive server

Using SPARQL – the practitioners’ viewpoint

Author: A Bonifati
M Ford
R Angles
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018

A number of studies have analyzed SPARQL log data to draw conclusions about how SPARQL is being used. To complement this work, a survey of SPARQL users has been undertaken. Whilst confirming some of the conclusions of the previous studies, the current work is able to provide additional insight into how users create SPARQL queries, the difficulties they encounter, and the features they would like to see included in the language. Based on this insight, a number of recommendations are presented to the community. These relate to predicting and avoiding computationally expensive queries; extensions to the language; and extending the search paradigm.

Crossref

Open Research Online (The Open University)

A pragmatic approach to semantic repositories benchmarking

Author: G. Kobilarov
J. Broekstra
L. Ma
M. Hausenblas
M. Mongiello
O. Erling
S. Auer
Y. Guo
Z. Ding
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010

The aim of this paper is to benchmark various semantic repositories in order to evaluate their deployment in a commercial image retrieval and browsing application. We adopt a two-phase approach for evaluating the target semantic repositories: analytical parameters such as query language and reasoning support are used to select the pool of the target repositories, and practical parameters such as load and query response times are used to select the best match to application requirements. In addition to utilising a widely accepted benchmark for OWL repositories (UOBM), we also use a real-life dataset from the target application, which provides us with the opportunity of consolidating our findings. A distinctive advantage of this benchmarking study is that the essential requirements for the target system such as the semantic expressivity and data scalability are clearly defined, which allows us to claim contribution to the benchmarking methodology for this class of applications.

CiteSeerX

Crossref

Nottingham Trent Institutional Repository (IRep)

Learning-based SPARQL query performance modeling and prediction

Author: A Rajaraman
A Smola
C Chang
DD Lee
G James
H Hotelling
I Jolliffe
J Li
J Pérez
Kerry Taylor
Lina Yao
NS Altman
Quan Z. Sheng
Wei Emma Zhang
X Wu
Yongrui Qin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/10/2017

One of the challenges of managing an RDF database is predicting the performance of SPARQL queries before they are executed. Performance characteristics, such as execution time and memory usage, can help data consumers identify unexpectedly long-running queries before they start and estimate the system workload for query scheduling. Extensive work addresses this performance prediction problem for traditional SQL queries, but it is not directly applicable to SPARQL queries. In this paper, we adopt machine learning techniques to predict the performance of SPARQL queries. Our work focuses on modeling the features of a SPARQL query as a vector representation. Our feature modeling method does not depend on knowledge of the underlying systems or the structure of the underlying data, but only on the nature of SPARQL queries. We then use these features to train prediction models. We propose a two-step prediction process and consider performance in both cold and warm stages. Evaluations are performed on real-world SPARQL queries, whose execution times range from milliseconds to hours. The results demonstrate that the proposed approach can effectively predict SPARQL query performance and outperforms state-of-the-art approaches.

Crossref

Adelaide Research & Scholarship

University of Huddersfield Repository

Huddersfield Research Portal

SE-KGE: A Location-Aware Knowledge Graph Embedding Model for Geographic Question Answering and Spatial Semantic Lifting

Author: Cai Ling
Janowicz Krzysztof
Lao Ni
Mai Gengchen
Regalia Blake
Shi Meilin
Yan Bo
Zhu Rui
Publication venue: 'Wiley'
Publication date: 25/04/2020

Learning knowledge graph (KG) embeddings is an emerging technique for a variety of downstream tasks such as summarization, link prediction, information retrieval, and question answering. However, most existing KG embedding models neglect space and, therefore, do not perform well when applied to (geo)spatial data and tasks. Of the models that do consider space, most rely primarily on some notion of distance. These models suffer from higher computational complexity during training while still losing information beyond the relative distance between entities. In this work, we propose a location-aware KG embedding model called SE-KGE. It directly encodes spatial information such as point coordinates or bounding boxes of geographic entities into the KG embedding space. The resulting model is capable of handling different types of spatial reasoning. We also construct a geographic knowledge graph as well as a set of geographic query-answer pairs called DBGeo to evaluate the performance of SE-KGE in comparison to multiple baselines. Evaluation results show that SE-KGE outperforms these baselines on the DBGeo dataset for the geographic logic query answering task. This demonstrates the effectiveness of our spatially-explicit model and the importance of considering the scale of different geographic entities. Finally, we introduce a novel downstream task called spatial semantic lifting, which links an arbitrary location in the study area to entities in the KG via some relations. Evaluation on DBGeo shows that our model outperforms the baseline by a substantial margin. Comment: Accepted to Transactions in GI

arXiv.org e-Print Archive

Crossref

Explore Bristol Research