P-LUPOSDATE: Using Precomputed Bloom Filters to Speed Up SPARQL Processing in the Cloud

Dennis Heinrich; Le Gruenwald; Marc Stelzner; Stefan Werner; Sven Groppe; Thomas Kiencke

Publication date: 1 January 2014
Publisher: RonPub

Abstract

Increasingly, data on the Web is stored in the form of Semantic Web data. Because of today's information overload, it is becoming very important to store and query these big datasets in a scalable way and hence in a distributed fashion. Cloud Computing offers such a distributed environment, with dynamic reallocation of computing and storage resources based on need. In this work we introduce a scalable distributed Semantic Web database in the Cloud. To reduce the number of unnecessary intermediate results early, we apply Bloom filters. Traditionally, Bloom filters are computed during query processing, which is a time-consuming task. Instead, we precompute the Bloom filters as far as possible and store them in the indices alongside the data. Experimental results with data sets of up to 1 billion triples show that our approach speeds up query processing significantly and sometimes even reduces the processing time to less than half.
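The idea of precomputing Bloom filters and keeping them in the index can be illustrated with a minimal Python sketch. This is not the P-LUPOSDATE implementation: the names `BloomFilter`, `build_index`, and `join_on_subject`, the in-memory dictionary index, and the filter size are all assumptions made for illustration. The sketch builds one filter per predicate at load time, then uses the stored filter at query time to discard rows that cannot find a join partner.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter backed by a Python int used as a bit array (illustrative only)."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, item):
        # Derive num_hashes bit positions by salting a SHA-256 hash with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def might_contain(self, item):
        # False means "definitely absent"; True means "possibly present".
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


def build_index(triples):
    """Load time: group triples by predicate and precompute a subject filter per predicate."""
    index = {}
    for s, p, o in triples:
        if p not in index:
            index[p] = ([], BloomFilter())
        rows, bloom = index[p]
        rows.append((s, o))
        bloom.add(s)
    return index


def join_on_subject(index, pred_a, pred_b):
    """Query time: evaluate { ?s pred_a ?x . ?s pred_b ?y } using the stored filter of pred_b."""
    rows_a, _ = index[pred_a]
    rows_b, bloom_b = index[pred_b]
    # Prune rows of pred_a early; no filter is built during the query.
    candidates = [(s, x) for s, x in rows_a if bloom_b.might_contain(s)]
    by_subject = {}
    for s, y in rows_b:
        by_subject.setdefault(s, []).append(y)
    # The exact lookup removes any false positives the filter let through.
    return [(s, x, y) for s, x in candidates for y in by_subject.get(s, [])]


if __name__ == "__main__":
    triples = [
        ("alice", "knows", "bob"),
        ("alice", "age", "30"),
        ("carol", "knows", "dave"),
        ("bob", "age", "25"),
    ]
    index = build_index(triples)
    print(join_on_subject(index, "knows", "age"))  # -> [('alice', 'bob', '30')]
```

Because a Bloom filter never yields false negatives, the pruning step cannot drop a valid result. It only shrinks the intermediate result that has to be processed, or, in a distributed setting, shipped between nodes.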
