Search CORE

10 research outputs found

Which NoSQL Database? A Performance Overview

Author: Abramova Veronika
Bernardino Jorge
Furtado Pedro
Publication venue: RonPub
Publication date: 01/01/2014
Field of study

NoSQL data stores are widely used to store and retrieve possibly large amounts of data, typically in a key-value format. There are many NoSQL types with different performances, and thus it is important to compare them in terms of performance and verify how the performance is related to the database type. In this paper, we evaluate five most popular NoSQL databases: Cassandra, HBase, MongoDB, OrientDB and Redis. We compare those databases in terms of query performance, based on reads and updates, taking into consideration the typical workloads, as represented by the Yahoo! Cloud Serving Benchmark. This comparison allows users to choose the most appropriate database according to the specific mechanisms and application needs

RonPub -- Research Online Publishing

Estudo Geral

LDBC Graphalytics: A Benchmark for Large-Scale Graph Analysis on Parallel and Distributed Platforms

Author: Alexandru Iosup
Arnau Prat-Pérez
Gabriel Tȃnase
Hassan Chafi
Michael Anderson
Mihai Capotȃ
Nai ⊕ Peter Boncz
Narayanan Sundaram
Ngai △ Stijn Heldens
Thomas Manhardt
Tim Hegeman
Wing Lung
Yinglong Xia
⊗ Lifeng
⊙ Ilie
Publication venue
Publication date: 06/03/2020
Field of study

ABSTRACT In this paper we introduce LDBC Graphalytics, a new industrial-grade benchmark for graph analysis platforms. It consists of six deterministic algorithms, standard datasets, synthetic dataset generators, and reference output, that enable the objective comparison of graph analysis platforms. Its test harness produces deep metrics that quantify multiple kinds of system scalability, such as horizontal/vertical and weak/strong, and of robustness, such as failures and performance variability. The benchmark comes with open-source software for generating data and monitoring performance. We describe and analyze six implementations of the benchmark (three from the community, three from the industry), providing insights into the strengths and weaknesses of the platforms. Key to our contribution, vendors perform the tuning and benchmarking of their platforms

CiteSeerX

The LDBC Graphalytics Benchmark

Author: Anderson Michael
Boncz Peter
Capotă Mihai
Chafi Hassan
Depner Siegfried
Hegeman Tim
Heldens Stijn
Iosup Alexandru
Manhardt Thomas
Musaafir Ahmed
Nai Lifeng
Ngai Wing Lung
Pérez Arnau Prat
Sundaram Narayanan
Szárnyas Gábor
Tănase Ilie Gabriel
Uta Alexandru
Xia Yinglong
Publication venue
Publication date: 15/02/2023
Field of study

In this document, we describe LDBC Graphalytics, an industrial-grade benchmark for graph analysis platforms. The main goal of Graphalytics is to enable the fair and objective comparison of graph analysis platforms. Due to the diversity of bottlenecks and performance issues such platforms need to address, Graphalytics consists of a set of selected deterministic algorithms for full-graph analysis, standard graph datasets, synthetic dataset generators, and reference output for validation purposes. Its test harness produces deep metrics that quantify multiple kinds of systems scalability, weak and strong, and robustness, such as failures and performance variability. The benchmark also balances comprehensiveness with runtime necessary to obtain the deep metrics. The benchmark comes with open-source software for generating performance data, for validating algorithm results, for monitoring and sharing performance data, and for obtaining the final benchmark result as a standard performance report

arXiv.org e-Print Archive

Katsaus NoSQL-tietokantojen suorituskykyyn

Author: Mertanen Kaisa
Publication venue
Publication date: 14/05/2019
Field of study

Tässä tutkielmassa käsitellään NoSQL-tietokantojen suorituskykyä. Työssä esitellään NoSQL-tietomallien keskeisimmät ominaisuudet ja vertaillaan NoSQL- ja relaatiotietomallien välisiä eroja. Lisäksi käsitellään tietokantojen hajautusmekanismeja sekä hajautettujen tietokantojen ominaisuuksia määrittelevää CAP-teoreemaa. Työssä perehdytään suorituskyvyn mittaamiseen ja sen mittareihin. Mittaustyökaluista esitellään tarkemmin avain-arvoparitietokantojen vertailuun kehitetty Yahoo! Cloud Serving Benchmark (YCSB). Työn tavoitteena on NoSQL-tietomallien ja niihin perustuvien tietokantojen suorituskyvyn vertailu aihetta käsittelevien tutkimusten, julkaisujen ja artikkeleiden pohjalta. Tutkimuksen tuloksena saatiin tietoa NoSQL-tietomallien ja -tietokantojen suorituskyvystä sekä suorituskykyyn vaikuttavista tekijöistä. Tutkimustuloksia voidaan käyttää apuna suorituskyvyltään käyttökohteeseensa parhaan mahdollisen tietokannan valinnassa

Trepo - Institutional Repository of Tampere University

Workload mix definition for benchmarking BPMN 2.0 Workflow Management Systems

Author: Skouradaki Marigianna
Publication venue
Publication date: 01/01/2017
Field of study

Nowadays, enterprises broadly use Workﬂow Management Systems (WfMSs) to design, deploy, execute, monitor and analyse their automated business processes. Through the years, WfMSs evolved into platforms that deliver complex service oriented applications. In this regard, they need to satisfy enterprise-grade performance requirements, such as dependability and scalability. With the ever-growing number of WfMSs that are currently available in the market, companies are called to choose which product is optimal for their requirements and business models. Benchmarking is an established practice used to compare alternative products and leverages the continuous improvement of technology by setting a clear target in measuring and assessing performance. In particular, for service oriented WfMSs there is not yet a widely accepted standard benchmark available, even if workﬂow modelling languages such as Web Services Business Process Execution Language (WS-BPEL) and Business Process Model and Notation 2.0 (BPMN 2.0) have been adopted as the de-facto standards. A possible explanation on this deﬁciency can be given by the inherent architectural complexity of WfMSs and the very large number of parameters aﬀecting their performance. However, the need for a standard benchmark for WfMSs is frequently aﬃrmed by the literature. The goal of the BenchFlow approach is to propose a framework towards the ﬁrst standard benchmark forassessing and comparing the performance of BPMN 2.0 WfMSs. To this end, the approach addresses a set of challenges spanning from logistic challenges, that are related to the collection of a representative set of usage scenarios,to technical challenges, that concern the speciﬁc characteristics of a WfMS. This work focuses on a subset of these challenges dealing with the definition of a representative set of process models and corresponding data that will be given as an input to the benchmark. This set of representative process models and corresponding data are referred to as the workload mix of the benchmark. More particularly, we ﬁrst prepare the theoretical background for deﬁning a representative workload mix. This is accomplished through identiﬁcation of the basic components of a workload model for WfMS benchmarks, as well as the investigation of the impact of the BPMN 2.0 language constructs to the WfMS’s performance, by means of introducing the ﬁrst BPMN 2.0 micro-benchmark. We proceed by collecting real-world process models for the identiﬁcation of a representative workload mix. Therefore, the collection is analysed with respect to its statistical characteristics and also with a novel algorithm that detects and extracts the reoccurring structural patterns of the collection.The extracted reoccurring structures are then used for generating synthetic process models that reﬂect the essence of the original collection.The introduced methods are brought together in a tool chain that supports the workload mix generation. As a ﬁnal step, we applied the proposed methods on a real-world case study, that bases on a collection of thousands of real-world process models and generates a representative workload mix to be used in a benchmark. The results show that the generated workload mix is successful in its application for stressing the WfMSs under test

Graph databases and their application to the Italian Business Register for efficient search of relationships among companies

Author: Sinico Luca
Publication venue
Publication date: 08/04/2022
Field of study

We studied and tested three of the major graph databases, and we compared them with a relational database. We worked on a dataset representing equity participations among companies, and we found out that the strong points of graph databases are: the purposely designed storage techniques; and their query languages. The main performance increments have been obtained when heavy graph situations are queried; for simpler situations and queries, a relational database performs equally wellope

Padua Thesis and Dissertation Archive

Graph database benchmarking on cloud environments with XGDBench

Author: B. Shao
B.F. Cooper
C. Bizer
C. Vicknair
D. Chakrabarti
D. Chakrabarti
D. Dominguez-Sal
D. Dominguez-Sal
D. Thakker
F. Holzschuher
F. Versaci
I. Robinson
J. Dongarra
J. Dudley
J. Leskovec
J. Wang
K. Huppler
K. Myunghwan
K. Rohloff
L. Ma
M. Ciglan
M. Dayarathna
M. Faloutsos
M. Morsey
M. Newmann
M. Sarwat
Miyuru Dayarathna
P. Charles
P. Cudré-Mauroux
P. Shannon
R. Angles
R. Murphy
R. Nambiar
S. Ekins
S. Sakr
T. Endo
Toyotaro Suzumura
Y. Guo
Z. Zhao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref