46,641 research outputs found

Towards Analytics Aware Ontology Based Access to Static and Streaming Data (Extended Version)

Author: Brandt Sebastian
Horrocks Ian
Ioannidis Yannis
Kharlamov Evgeny
Kotidis Yannis
Lamparter Steffen
Mailis Theofilos
Möller Ralf
Neuenstadt Christian
Nikolaou Charalampos
Svingos Christoforos
Zheleznyakov Dmitriy
Özcep Özgür
Publication date: 01/01/2016

Real-time analytics that requires integration and aggregation of heterogeneous, distributed streaming and static data is a typical task in many industrial scenarios, such as the diagnostics of turbines at Siemens. The OBDA approach has great potential to facilitate such tasks; however, it has a number of limitations in dealing with analytics that restrict its use in important industrial applications. Based on our experience with Siemens, we argue that, to overcome those limitations, OBDA should be extended to become analytics, source, and cost aware. In this work we propose such an extension. In particular, we propose an ontology, mapping, and query language for OBDA in which aggregate and other analytical functions are first-class citizens. Moreover, we develop query optimisation techniques that allow analytical tasks over static and streaming data to be processed efficiently. We implement our approach in a system and evaluate it with Siemens turbine data.

arXiv.org e-Print Archive

Oxford University Research Archive

Window-based Streaming Graph Partitioning Algorithm

Author: Abdolrashidi A.
Bader D. A.
Gonzalez Joseph E.
Sajjad H. P.
Wang R.
Zaharia Matei
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2019

In recent years, the scale of graph datasets has increased to such a degree that a single machine cannot process large graphs efficiently. Efficient graph partitioning is therefore necessary for large graph applications. Traditional graph partitioning generally loads the whole graph into memory before partitioning; this is not only time consuming but also creates memory bottlenecks. These issues of memory limitation and high time complexity can be resolved using stream-based graph partitioning. A streaming graph partitioning algorithm reads each vertex once and assigns it to a partition accordingly; this is also called a one-pass algorithm. This paper proposes an efficient window-based streaming graph partitioning algorithm called WStream. The WStream algorithm is an edge-cut partitioning algorithm, which distributes the vertices among the partitions. Our results suggest that the WStream algorithm is able to partition large graph data efficiently while keeping the load balanced across partitions and communication to a minimum. Evaluation results with real workloads also demonstrate the effectiveness of the proposed algorithm, which achieves a significant reduction in load imbalance and edge-cut across a range of datasets.
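
The one-pass idea described in this abstract can be illustrated with a minimal sketch. This is not WStream: the abstract does not specify its window mechanism, so the code below uses a simple greedy heuristic instead, placing each vertex with the partition that already holds most of its neighbours and discounting nearly full partitions. The names `stream_partition`, `edge_cut`, and the `capacity` parameter are hypothetical and chosen for illustration only.

```python
def stream_partition(vertex_stream, num_parts, capacity):
    """One-pass greedy vertex partitioning (illustrative, not WStream).

    vertex_stream yields (vertex, neighbours) pairs; each vertex is seen
    exactly once and assigned immediately.
    """
    assignment = {}
    sizes = [0] * num_parts
    for vertex, neighbours in vertex_stream:
        best_part, best_score = None, -1.0
        for p in range(num_parts):
            if sizes[p] >= capacity:
                continue  # partition is full, keeps the load balanced
            # Count neighbours already placed in partition p.
            placed = sum(1 for n in neighbours if assignment.get(n) == p)
            # Discount the score as the partition fills up.
            score = placed * (1.0 - sizes[p] / capacity)
            # Prefer the higher score; on ties, the smaller partition.
            if score > best_score or (
                score == best_score and sizes[p] < sizes[best_part]
            ):
                best_part, best_score = p, score
        if best_part is None:
            raise ValueError("all partitions are full")
        assignment[vertex] = best_part
        sizes[best_part] += 1
    return assignment


def edge_cut(edges, assignment):
    """Number of edges whose endpoints lie in different partitions."""
    return sum(1 for u, v in edges if assignment[u] != assignment[v])


# Two triangles joined by the single edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4), (3, 5), (4, 5)]
adjacency = {v: [] for v in range(6)}
for u, v in edges:
    adjacency[u].append(v)
    adjacency[v].append(u)

parts = stream_partition(((v, adjacency[v]) for v in range(6)), 2, 3)
print(parts, edge_cut(edges, parts))
```

On this toy graph the heuristic keeps each triangle in its own partition, so only the bridging edge is cut. A window-based method would presumably buffer several vertices before deciding, which this sketch does not attempt.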

arXiv.org e-Print Archive

Crossref

University of Tasmania Open Access Repository

Continuous client-side query evaluation over dynamic linked data

Author: Colpaert Pieter
Mannens Erik
Taelman Ruben
Verborgh Ruben
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

Existing solutions for querying dynamic Linked Data sources extend the SPARQL language and require continuous server processing for each query. Traditional SPARQL endpoints already accept highly expressive queries, so extending these endpoints for time-sensitive queries increases the server cost even further. To make continuous querying over dynamic Linked Data more affordable, we extend the low-cost Triple Pattern Fragments (TPF) interface with support for time-sensitive queries. In this paper, we introduce the TPF Query Streamer, which allows clients to evaluate SPARQL queries with continuously updating results. Our experiments indicate that this extension significantly lowers the server complexity, at the expense of an increase in the execution time per query. We prove that by moving the complexity of continuously evaluating queries over dynamic Linked Data to the clients, and thus increasing bandwidth usage, the cost at the server side is significantly reduced. Our results show that this solution makes real-time querying more scalable for a large number of concurrent clients when compared to the alternatives.

Ghent University Academic Bibliography

Publishing LO(D)D: Linked Open (Dynamic) Data for Smart Sensing and Measuring Environments

Author: Corno Fulvio
Faisal Razzak
Publication venue: Elsevier
Publication date: 01/01/2012

The paper proposes a distributed framework that provides a systematic way to publish environment data that is updated continuously; such updates might be issued at specific time intervals or bound to some environment-specific event. The framework targets smart environments with networks of devices and sensors that interact with each other and with their respective environments to gather and generate data, and that wish to publish this data. This paper addresses the issues of supporting data publishers in maintaining up-to-date and machine-understandable representations, separating views (static or dynamic data), delivering up-to-date information to data consumers in real time, helping data consumers keep track of changes triggered from diverse environments, and keeping track of the evolution of the smart environment. The paper also describes a prototype implementation of the proposed architecture. A preliminary use case implementation over a real energy metering infrastructure is also provided to show the feasibility of the architecture.

Elsevier - Publisher Connector

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino