Search CORE

8 research outputs found

Optimization of Retrieval Algorithms on Large Scale Knowledge Graphs

Author: Dörpinghaus Jens
Stefan Andreas
Publication venue
Publication date: 10/02/2020
Field of study

Knowledge graphs have been shown to play an important role in recent knowledge mining and discovery, for example in the field of life sciences or bioinformatics. Although a lot of research has been done on the field of query optimization, query transformation and of course in storing and retrieving large scale knowledge graphs the field of algorithmic optimization is still a major challenge and a vital factor in using graph databases. Few researchers have addressed the problem of optimizing algorithms on large scale labeled property graphs. Here, we present two optimization approaches and compare them with a naive approach of directly querying the graph database. The aim of our work is to determine limiting factors of graph databases like Neo4j and we describe a novel solution to tackle these challenges. For this, we suggest a classification schema to differ between the complexity of a problem on a graph database. We evaluate our optimization approaches on a test system containing a knowledge graph derived biomedical publication data enriched with text mining data. This dense graph has more than 71M nodes and 850M relationships. The results are very encouraging and - depending on the problem - we were able to show a speedup of a factor between 44 and 3839

arXiv.org e-Print Archive

Model-driven engineering of an openCypher engine: using graph queries to compile graph queries

Author: A Silberschatz
B Bryla
D Koenig
D Varró
DS Kolovos
E Leblebici
G Bergmann
G Szárnyas
G Varró
G Varró
G Varró
H Bunke
H Garcia-Molina
I Robinson
J Pérez
JL Ambite
M Búr
MA Rodriguez
R Geiß
S Erdweg
T Arendt
Z Ujhelyi
Á Hegedüs
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Graph database systems are increasingly adapted for storing and processing heterogeneous network-like datasets. Many challenging applications with near real-time requirements - such as financial fraud detection, on-the-fly model validation and root cause analysis - can be formalised as graph problems and tackled with graph databases efficiently. However, as no standard graph query language has yet emerged, users are subjected to the possibility of vendor lock-in. The openCypher group aims to define an open specification for a declarative graph query language. However, creating an openCypher-compatible query engine requires significant research and engineering efforts. Meanwhile, model-driven language workbenches support the creation of domain-specific languages by providing high-level tools to create parsers, editors and compilers. In this paper, we present an approach to build a compiler and optimizer for openCypher using model-driven technologies, which allows developers to define declarative optimization rules

Crossref

Repository of the Academy's Library

Formalising openCypher Graph Queries in Relational Algebra

Author: C Krause
DS Kolovos
G Bergmann
G Szárnyas
H Garcia-Molina
J Pérez
L Libkin
R Elmasri
T Arendt
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

Graph database systems are increasingly adapted for storing and processing heterogeneous network-like datasets. However, due to the novelty of such systems, no standard data model or query language has yet emerged. Consequently, migrating datasets or applications even between related technologies often requires a large amount of manual work or ad-hoc solutions, thus subjecting the users to the possibility of vendor lock-in. To avoid this threat, vendors are working on supporting existing standard languages (e.g. SQL) or creating standardised languages. In this paper, we present a formal specification for openCypher, a high-level declarative graph query language with an ongoing standardisation effort. We introduce relational graph algebra, which extends relational operators by adapting graph-specific operators and define a mapping from core openCypher constructs to this algebra. We propose an algorithm that allows systematic compilation of openCypher queries.Comment: ADBIS conference (21st European Conference on Advances in Databases and Information Systems) The final publication is available at Springer via https://doi.org/10.1007/978-3-319-66917-5_1

arXiv.org e-Print Archive

Crossref

Repository of the Academy's Library

Evaluation of Optimization Strategies for Incremental Graph Queries

Author: Maginecz János
Szárnyas Gábor
Varró Dániel
Publication venue: Budapest University of Technology and Economics (BME)
Publication date: 01/01/2017
Field of study

The last decade brought considerable improvements in distributed storage and query technologies, known as NoSQL systems. These systems provide quick evaluation of simple retrieval operations and are able to answer certain complex queries in a scalable way, albeit not instantly. Providing scalability and quick response times at the same time for querying large data sets is still a challenging task. Evaluating complex graph queries is particularly difficult, as it requires lots of join, antijoin and filtering operations. This paper presents optimization techniques used in relational database systems and applies them on graph queries. We evaluate various query plans on multiple datasets and discuss the effect of different optimization techniques

Crossref

Repository of the Academy's Library

Periodica Polytechnica (Budapest University of Technology and Economics)

Evaluation of Optimization Strategies for Incremental Graph Queries

Author
Publication venue: 'Periodica Polytechnica Budapest University of Technology and Economics'
Publication date
Field of study

Crossref

An Algebra and Equivalences to Transform Graph Patterns in Neo4j

Author: Grossniklaus Michael
Hölsch Jürgen
Publication venue
Publication date: 01/01/2016
Field of study

Modern query optimizers of relational database systems embody more than three decades of research and practice in the area of data management and processing. Key advances in- clude algebraic query transformation, intelligent search space pruning, and modular optimizer architectures. Surprisingly, many of these contributions seem to have been overlooked in the emerging field of graph databases so far. In particular, we believe that query optimization based on a general graph algebra and its equivalences can greatly improve on the current state of the art. Although some graph algebras have already been proposed, they have often been developed in a context, in which a relational database system is used as a backend to process graph data. As a consequence, these algebras are typically tightly coupled to the relational algebra, making them unsuitable for native graph databases. While we support the approach of extending the relational algebra, we argue that graph-specific operations should be defined at a higher level, independent of the database backend. In this paper, we introduce such a general graph algebra and corresponding equivalences. We demonstrate how it can be used to optimize Cypher queries in the setting of the Neo4j native graph database.publishe

KOPS - The Institutional Repository of the University of Konstanz

CiteSeerX