Search CORE

65 research outputs found

An introduction to Graph Data Management

Author: A Dries
A Gutiérrez
A Iosup
A Morari
A Poulovassilis
AD Zhu
AO Mendelzon
B Amann
B Elser
C Berge
C Vicknair
C Watters
C Weiss
CS Chang
D Conte
D Dominguez-Sal
D Theodoratos
DC Faye
DW Shipman
EF Codd
FW Tompa
G Malewicz
GM Kuper
H He
HS Kunii
IF Cruz
IF Cruz
J Hidders
J Paredaens
J Peckham
J. Hidders
Jonathan Hayes
K Zeng
L Kowalik
L Zou
M Atre
M Ciglan
M Consens
M Gemis
M Gyssens
M Han
M Levene
M Levene
M Levene
M Mainguenaud
M Schmidt
M Yannakakis
MA Bornea
MA Rodriguez
MA Rodriguez
Marc Andries
MP Consens
MP Consens
N Kiesel
N Roussopoulos
O Erling
P Barceló Baeza
P Buneman
P Yuan
Philippe Cudré-Mauroux
PPS Chen
PT Wood
PT Wood
R Agrawal
R Angles
R Angles
R Brijder
R Ronen
RH Güting
RS Xin
S Abiteboul
S Abiteboul
T Neumann
W Fan
W Kim
Y Guo
Y Low
Y Papakonstantinou
Y Tian
Y Zhao
YA Liu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 29/12/2017
Field of study

A graph database is a database where the data structures for the schema and/or instances are modeled as a (labeled)(directed) graph or generalizations of it, and where querying is expressed by graph-oriented operations and type constructors. In this article we present the basic notions of graph databases, give an historical overview of its main development, and study the main current systems that implement them

arXiv.org e-Print Archive

Crossref

Graph databases and their application to the Italian Business Register for efficient search of relationships among companies

Author: Sinico Luca
Publication venue
Publication date: 08/04/2022
Field of study

We studied and tested three of the major graph databases, and we compared them with a relational database. We worked on a dataset representing equity participations among companies, and we found out that the strong points of graph databases are: the purposely designed storage techniques; and their query languages. The main performance increments have been obtained when heavy graph situations are queried; for simpler situations and queries, a relational database performs equally wellope

Padua Thesis and Dissertation Archive

A design space for RDF data representations

Author: Hose Katja
Lissandrini Matteo
Pedersen Torben Bach
Sagi Tomer
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

RDF triplestores' ability to store and query knowledge bases augmented with semantic annotations has attracted the attention of both research and industry. A multitude of systems offer varying data representation and indexing schemes. However, as recently shown for designing data structures, many design choices are biased by outdated considerations and may not result in the most efficient data representation for a given query workload. To overcome this limitation, we identify a novel three-dimensional design space. Within this design space, we map the trade-offs between different RDF data representations employed as part of an RDF triplestore and identify unexplored solutions. We complement the review with an empirical evaluation of ten standard SPARQL benchmarks to examine the prevalence of these access patterns in synthetic and real query workloads. We find some access patterns, to be both prevalent in the workloads and under-supported by existing triplestores. This shows the capabilities of our model to be used by RDF store designers to reason about different design choices and allow a (possibly artificially intelligent) designer to evaluate the fit between a given system design and a query workload

Catalogo dei prodotti della ricerca

VBN

Data management in cloud environments: NoSQL and NewSQL data stores

Author: Capretz Miriam AM
Grolinger Katarina
Higashino Wilson A
Tiwari Abhinav
Publication venue: Scholarship@Western
Publication date: 01/01/2013
Field of study

: Advances in Web technology and the proliferation of mobile devices and sensors connected to the Internet have resulted in immense processing and storage requirements. Cloud computing has emerged as a paradigm that promises to meet these requirements. This work focuses on the storage aspect of cloud computing, specifically on data management in cloud environments. Traditional relational databases were designed in a different hardware and software era and are facing challenges in meeting the performance and scale requirements of Big Data. NoSQL and NewSQL data stores present themselves as alternatives that can handle huge volume of data. Because of the large number and diversity of existing NoSQL and NewSQL solutions, it is difficult to comprehend the domain and even more challenging to choose an appropriate solution for a specific task. Therefore, this paper reviews NoSQL and NewSQL solutions with the objective of: (1) providing a perspective in the field, (2) providing guidance to practitioners and researchers to choose the appropriate data store, and (3) identifying challenges and opportunities in the field. Specifically, the most prominent solutions are compared focusing on data models, querying, scaling, and security related capabilities. Features driving the ability to scale read requests and write requests, or scaling data storage are investigated, in particular partitioning, replication, consistency, and concurrency control. Furthermore, use cases and scenarios in which NoSQL and NewSQL data stores have been used are discussed and the suitability of various solutions for different sets of applications is examined. Consequently, this study has identified challenges in the field, including the immense diversity and inconsistency of terminologies, limited documentation, sparse comparison and benchmarking criteria, and nonexistence of standardized query languages

Scholarship@Western

Springer - Publisher Connector

Graph-based data integration for ensuring FAIR project management information

Author: Dusseldorp Niels
Publication venue
Publication date: 20/12/2022
Field of study

Pure OAI Repository

An exploratory study of a NoSQL database for a clinical data repository

Author: A Collins
ABM Moniruzzaman
AK Hamoud
C Costa
D Kunda
JS Einbinder
M Madison
M Shertil
V Abramova
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

The need to implement a distributed Clinical Data Repository (CDR) at a healthcare facility, rose in large part due to the high volume of data and the discrepancy of their sources. Over the years, Relational Database Management Systems (RDBMS) began to present difficulties in responding to the needs of various organizations when it comes to manipulating a large amount of data and to its scalability. Therefore, it was necessary to explore other techniques to choose the appropriate technology to build the CDR. In this way, NoSQL emerged as a new type of database that is quite useful to work with multiple and different types of data. In addition, NoSQL introduces a number of user-friendly features such as a distributed, scalable, elastic and also fault tolerant system. In this way, Oracle NoSQL Database was the NoSQL solution chosen to develop this case study, using the key-value storage. This article was motivated to propose a CDR architecture based on Oracle NoSQL Database functionalities. A one-single node database was deployed for better comprehension, in order to enhance their features for future implementation.The work has been supported by FCT – Fundação para a Ciência e Tecnologia within the Project Scope UID/CEC/00319/2019 and DSAIPA/DS/0084/2018

Universidade do Minho: RepositoriUM

Crossref

An automated materials and processes identification tool for material informatics using deep learning approach

Author: Junaida Sulaiman
Masuduzzaman Md
Miah M. Saef Ullah
Nur Ibrahim
Rajan Jose
Sarwar Talha
Publication venue: Elsevier Ltd
Publication date: 01/01/2023
Field of study

This article reports a tool that enables Materials Informatics, termed as MatRec, via a deep learning approach. The tool captures data, makes appropriate domain suggestions, extracts various entities such as materials and processes, and helps to establish entity-value relationships. This tool uses keyword extraction, a document similarity index to suggest relevant documents, and a deep learning approach employing Bi-LSTM for entity extraction. For example, materials and processes for electrical charge storage under an electric double layer capacitor (EDLC) mechanism are demonstrated herewith. A knowledge graph approach finds and visualizes different latent knowledge sets from the processed information. The MatRec received an F1 score of 9̃6% for entity extraction, 8̃3% for material-value relationship extraction, and 8̃7% for process-value relationship extraction, respectively. The proposed MatRec could be extended to solve material selection issues for various applications and could be an excellent tool for academia and industry

UMP Institutional Repository

Further with Knowledge Graphs:proceedings of the 17th International Conference on Semantic Systems, 6-9 September 2021, Amsterdam, The Netherlands

Author
Publication venue: 'IOS Press'
Publication date: 01/01/2021
Field of study

International Migration, Integration and Social Cohesion online publications