104,181 research outputs found

    Ontology-Based Data Access to Big Data

    Recent approaches to ontology-based data access (OBDA) have extended the focus from relational database systems to other types of backends, such as cluster frameworks, in order to cope with the four Vs associated with big data: volume, veracity, variety, and velocity (stream processing). The abstraction that an ontology provides is a benefit from the end-user point of view, but it represents a challenge for developers, because high-level queries must be transformed into queries executable on the backend level. In this paper, we discuss and evaluate an OBDA system that uses STARQL (Streaming and Temporal ontology Access with a Reasoning-based Query Language) as a high-level query language to access data stored in a SPARK cluster framework. The development of the STARQL-SPARK engine shows that there is a need to provide a homogeneous interface for accessing static, temporal, and streaming data, because cluster frameworks usually lack such an interface. The experimental evaluation shows that building a scalable OBDA system that runs with SPARK is more than plug-and-play, as one needs to know the data formats and the data organisation in the cluster framework quite well.
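The core idea the abstract describes — translating a high-level, ontology-level query into a query executable on the backend — can be sketched in a few lines. Everything below (concept names, table names, filters) is invented for illustration; the actual STARQL-SPARK engine is far more involved.

```python
# Toy sketch of OBDA query translation: an ontology-level concept is rewritten
# into a backend (Spark SQL-style) query via hypothetical mapping assertions.
MAPPINGS = {
    "TemperatureReading": ("sensor_data", "value", "kind = 'temp'"),
    "PressureReading": ("sensor_data", "value", "kind = 'pressure'"),
}

def rewrite(concept: str) -> str:
    """Rewrite an ontology-level concept query into a backend SQL string."""
    table, column, condition = MAPPINGS[concept]
    return f"SELECT {column} FROM {table} WHERE {condition}"

print(rewrite("TemperatureReading"))
# SELECT value FROM sensor_data WHERE kind = 'temp'
```

In a real engine the rewritten query would be submitted to the cluster framework rather than printed; this only shows the direction of the translation.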

    Using Ontologies for Semantic Data Integration

    While big data analytics is considered one of the most important paths to competitive advantage for today’s enterprises, data scientists spend a comparatively large amount of time in the data preparation and data integration phases of a big data project. This shows that data integration is still a major challenge in IT applications. Over the past two decades, the idea of using semantics for data integration has become increasingly crucial and has received much attention in the AI, database, web, and data mining communities. Here, we focus on a specific paradigm for semantic data integration, called Ontology-Based Data Access (OBDA). The goal of this paper is to provide an overview of OBDA, pointing out both the techniques that are at the basis of the paradigm and the main challenges that remain to be addressed.

    A Review of Accessing Big Data with Significant Ontologies

    Ontology-Based Data Access (OBDA) is a recently proposed approach that provides a conceptual view of relational data sources. It addresses the problem of direct access to big data by providing end-users with an ontology that mediates between users and sources, where the ontology is connected to the data via mappings. We introduce the languages used to represent the ontologies and the mapping assertions through which query answering over the sources is derived. Query answering is divided into two steps: (i) ontology rewriting, in which the query is rewritten with respect to the ontology into a new query; (ii) mapping rewriting, in which the query obtained in the previous step is reformulated over the data sources using the mapping assertions. In this survey, we aim to study earlier work by other researchers in the fields of ontology, mapping, and query answering over data sources.
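The two rewriting steps in the abstract can be illustrated with a toy example. The ontology axioms, mapping assertions, and query below are all invented; real OBDA rewriting operates on description-logic ontologies and conjunctive queries, not single concepts.

```python
# Step (i): ontology rewriting — expand a query atom using subclass axioms.
# Step (ii): mapping rewriting — reformulate each atom over the sources.
SUBCLASS_OF = {"Student": "Person", "Professor": "Person"}   # toy ontology
MAPPINGS = {                                                 # toy mapping assertions
    "Person": "SELECT name FROM people",
    "Student": "SELECT name FROM enrolment",
    "Professor": "SELECT name FROM staff WHERE role = 'prof'",
}

def ontology_rewrite(concept):
    """(i) Expand a query atom into all concepts that entail it."""
    return [concept] + [sub for sub, sup in SUBCLASS_OF.items() if sup == concept]

def mapping_rewrite(concepts):
    """(ii) Reformulate the expanded atoms over the data sources."""
    return " UNION ".join(MAPPINGS[c] for c in concepts)

atoms = ontology_rewrite("Person")   # ['Person', 'Student', 'Professor']
print(mapping_rewrite(atoms))
```

Asking for all `Person` instances thus also retrieves students and professors, even though no source table stores them under that name — which is exactly the point of the two-step rewriting.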

    Bridging the gap between the semantic web and big data: answering SPARQL queries over NoSQL databases

    Nowadays, the database field has become much more diverse, and as a result a variety of non-relational (NoSQL) databases have been created, including JSON-document databases and key-value stores, as well as extensible markup language (XML) and graph databases. With the emergence of this new generation of data services, some of the problems associated with big data have been resolved. However, in the haste to address the challenges of big data, NoSQL abandoned several core database features that make systems extremely efficient and functional, for instance the global view, which enables users to access data regardless of how it is logically structured or physically stored in its sources. In this article, we propose a method that allows us to query non-relational databases based on the ontology-based data access (OBDA) framework by delegating SPARQL Protocol and RDF Query Language (SPARQL) queries from the ontology to the NoSQL database. We applied the method to a popular database called Couchbase and discuss the results obtained.
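The delegation the abstract describes — answering a SPARQL-style pattern against a JSON-document store — can be sketched minimally. The documents and field names below are invented; a real system would translate to Couchbase's own query language rather than filter in application code.

```python
# Minimal sketch: a SPARQL-like triple pattern answered over JSON documents.
DOCS = [  # stand-in for a document database
    {"type": "Author", "name": "Ada", "country": "UK"},
    {"type": "Author", "name": "Lin", "country": "CN"},
    {"type": "Book", "title": "Graphs"},
]

def match(pattern):
    """Answer a pattern {field: value}; a value of None acts as a variable."""
    return [d for d in DOCS
            if all(d.get(k) == v for k, v in pattern.items() if v is not None)]

# "SELECT ?x WHERE { ?x a :Author }" becomes a filter on the type field:
authors = match({"type": "Author"})
print([d["name"] for d in authors])   # ['Ada', 'Lin']
```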

    Semantic technology for open data publishing

    After years of focus on technologies for big data storage and processing, many observers are pointing out that making sense of big data cannot be done without suitable tools for conceptualizing, preparing, and integrating data (see http://www.dbta.com/). Research in recent years has shown that taking into account the semantics of data is crucial for devising powerful data integration solutions. In this work we focus on a specific paradigm for semantic data integration, named "Ontology-Based Data Access" (OBDA), proposed in [1-4]. According to this paradigm, the client of the information system is freed from being aware of how data and processes are structured in concrete resources (databases, software programs, services, etc.), and interacts with the system by expressing her queries and goals in terms of a conceptual representation of the domain of interest, called an ontology. More precisely, a system realizing the vision of OBDA is constituted by three components: the ontology, whose goal is to provide a formal, clean, and high-level representation of the domain of interest, and which constitutes the component with which the clients of the system (both humans and software programs) interact; the data source layer, representing the existing data sources in the information system, which are managed by the processes and services operating on their data; and the mapping between the two layers, which is an explicit representation of the relationship between the data sources and the ontology, and is used to translate operations on the ontology (e.g., query answering) into concrete actions on the data sources.
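The three components can be sketched as plain data structures, with the mapping used to translate an ontology-level query into access to the sources. All names below are illustrative, not taken from the cited systems.

```python
ontology = {"Customer", "Order"}                    # conceptual vocabulary
sources = {                                         # data source layer
    "crm_clients": [{"cid": 1, "cname": "Acme"}],
    "shop_orders": [{"oid": 7, "cust": 1}],
}
mapping = {                                         # link between the two layers
    "Customer": ("crm_clients", {"cid": "id", "cname": "name"}),
    "Order": ("shop_orders", {"oid": "id", "cust": "customer"}),
}

def answer(concept):
    """Translate a query on an ontology concept into access to the sources."""
    table, rename = mapping[concept]
    return [{rename[k]: v for k, v in row.items()} for row in sources[table]]

print(answer("Customer"))   # [{'id': 1, 'name': 'Acme'}]
```

The client asks about `Customer` and never sees the `crm_clients` table or its column names — the mapping hides the physical layer, which is the separation the paragraph describes.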

    Challenges for the Multilingual Web of Data

    The Web has witnessed an enormous growth in the amount of semantic information published in recent years. This growth has been stimulated to a large extent by the emergence of Linked Data. Although this brings us a big step closer to the vision of a Semantic Web, it also raises new issues such as the need for dealing with information expressed in different natural languages. Indeed, although the Web of Data can contain any kind of information in any language, it still lacks explicit mechanisms to automatically reconcile such information when it is expressed in different languages. This leads to situations in which data expressed in a certain language is not easily accessible to speakers of other languages. The Web of Data shows the potential for being extended to a truly multilingual web, as vocabularies and data can be published in a language-independent fashion, while associated language-dependent (linguistic) information supporting access across languages can be stored separately. In this sense, the multilingual Web of Data can be realized in our view as a layer of services and resources on top of the existing Linked Data infrastructure adding i) linguistic information for data and vocabularies in different languages, ii) mappings between data with labels in different languages, and iii) services to dynamically access and traverse Linked Data across different languages. In this article we present this vision of a multilingual Web of Data. We discuss challenges that need to be addressed to make this vision come true and discuss the role that techniques such as ontology localization, ontology mapping, and cross-lingual ontology-based information access and presentation will play in achieving this. Further, we propose an initial architecture and describe a roadmap that can provide a basis for the implementation of this vision.
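Item ii) above — language-independent identifiers carrying language-tagged labels — can be sketched as follows. The identifiers and labels are invented for the example.

```python
# Language-tagged labels attached to language-independent identifiers,
# enabling cross-lingual lookup of the same resource.
LABELS = {
    "ex:City": {"en": "city", "es": "ciudad", "de": "Stadt"},
    "ex:River": {"en": "river", "es": "río", "de": "Fluss"},
}

def lookup(term, lang):
    """Resolve a natural-language term in a given language to its identifier."""
    for iri, labels in LABELS.items():
        if labels.get(lang) == term:
            return iri
    return None

print(lookup("ciudad", "es"))   # ex:City
```

A Spanish speaker and a German speaker thus reach the same language-independent resource from different surface terms, which is the reconciliation the article calls for.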

    A semantic big biodiversity data integration tool

    Our planet is facing huge effects of global climate change that threaten the survival of biodiversity. Biodiversity data exhibit the complex characteristics of Big data: high volume, variety, veracity, velocity, and value. The variety, or heterogeneity, of biodiversity data poses a highly challenging research problem, since the data exist in unstructured, semi-structured, and quasi-structured forms and are generated as XML, EML, Excel sheets, videos, images, or ontologies. In addition, available biodiversity data include trait measurements, species distributions, species morphology, genetic sequences, phylogenetic trees, spatial data, and ecological niches; data are collected and uploaded to bio-portals by citizen scientists, museum collections, ecological surveys, and environmental studies. These collections generate big data, which is an important current research topic. The first phase of the Big data analytics life cycle discovers whether the data are sufficient to perform the analytics process, and takes more time than any other phase. In addition, the Big biodiversity data management life cycle includes data integration as a main phase, affecting storage, indexing, and querying. In the data integration phase, we apply semantic data integration in order to combine data from different sources and consolidate them into valuable information, relying on semantic technologies. A number of research attempts at semantic big data integration have been made. For example, Ontology-Based Data Access (OBDA) has been proposed over relational schemas and NoSQL databases [1,2], since it provides a semantically conceptual schema over a data repository. Another example is the Semantic Extract Transform Load (ETL) framework [3], which integrates and publishes data from multiple sources as linked open data through semantic technologies. Moreover, a semantic MongoDB-based approach has been developed in which the stored data are represented as an OWL ontology.
    However, the need for semantic big data integration tools has become pressing because of the growth of biodiversity big data. In the current work, a semantic big data integration system is developed that handles the following features: 1) data heterogeneity, 2) NoSQL databases, 3) ontology-based integration, and 4) user interaction, where data integration components can be chosen. A proof of concept will be developed based on biodiversity data in various data formats. In addition, related ontologies from BioPortal will be used.
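The ontology-based integration step — lifting records from heterogeneous sources into a shared vocabulary — can be sketched as below. The per-source field names are invented; the target terms borrow from the Darwin Core vocabulary commonly used for biodiversity data, though the mapping itself is hypothetical.

```python
# Toy sketch: records from heterogeneous biodiversity sources are consolidated
# under shared ontology terms via per-source field mappings.
SOURCE_MAPS = {
    "museum_xml": {"taxon": "dwc:scientificName", "lat": "dwc:decimalLatitude"},
    "survey_csv": {"species": "dwc:scientificName", "latitude": "dwc:decimalLatitude"},
}

def integrate(source, record):
    """Lift a source-specific record into the shared ontology vocabulary."""
    fmap = SOURCE_MAPS[source]
    return {fmap[k]: v for k, v in record.items() if k in fmap}

a = integrate("museum_xml", {"taxon": "Lynx lynx", "lat": 47.1})
b = integrate("survey_csv", {"species": "Lynx lynx", "latitude": 47.1})
print(a == b)   # True: both sources consolidate to the same representation
```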

    PREDICAT: a semantic service-oriented platform for data interoperability and linking in earth observation and disaster prediction

    The increasing volume of data generated by earth observation programs such as Copernicus, NOAA, and NASA Earth Data is overwhelming. Although these programs are very costly, data usage remains limited due to a lack of interoperability and data linking. In fact, the exploitation of multi-source, heterogeneous data could be significantly improved in different domains, especially natural disaster prediction. To deal with this issue, we introduce the PREDICAT project, which aims at providing a semantic service-oriented platform to PREDIct natural CATastrophes. The PREDICAT platform considers (1) data access based on web service technology; (2) ontology-based interoperability for the environmental monitoring domain; (3) data integration and linking via big data techniques; (4) a prediction approach based on semantic machine learning mechanisms. The focus of this paper is to provide an overview of the PREDICAT platform architecture. A scenario explaining the operation of the platform is presented, based on data provided by our collaborators, including the international intergovernmental Sahara and Sahel Observatory (OSS).
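The four platform concerns can be read as stages of a pipeline, which a small sketch makes concrete. The stage names, data, and the toy prediction rule are all invented; the real platform uses web services, ontologies, and machine learning at each stage.

```python
# Illustrative staged pipeline mirroring the four PREDICAT concerns.
def access(source):
    """(1) Data access: stand-in for a web service call."""
    return [{"station": "OSS-1", "rain_mm": 82}]

def align(records):
    """(2) Ontology-based interoperability: map to shared terms."""
    return [{"ex:station": r["station"], "ex:rainfall": r["rain_mm"]}
            for r in records]

def link(records):
    """(3) Integration/linking: merging of multi-source data (elided here)."""
    return records

def predict(records):
    """(4) Prediction: a toy flood-risk rule over the integrated data."""
    return any(r["ex:rainfall"] > 50 for r in records)

print(predict(link(align(access("copernicus")))))   # True
```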

    Link Before You Share: Managing Privacy Policies through Blockchain

    With the advent of numerous online content providers, utilities, and applications, each with its own specific version of privacy policies and the associated overhead, it is becoming increasingly difficult for concerned users to manage and track the confidential information that they share with providers. Users give providers consent to gather and share their Personally Identifiable Information (PII). We have developed a novel framework to automatically track details about how a user's PII is stored, used, and shared by the provider. We have integrated our Data Privacy ontology with the properties of blockchain to develop an automated access control and audit mechanism that enforces users' data privacy policies when sharing their data with third parties. We have also validated this framework by implementing a working system, LinkShare. In this paper, we describe our framework in detail along with the LinkShare system. Our approach can be adopted by Big Data users to automatically apply their privacy policy to data operations and track the flow of that data across various stakeholders.
    Comment: 10 pages, 6 figures. Published in: 4th International Workshop on Privacy and Security of Big Data (PSBD 2017), in conjunction with the 2017 IEEE International Conference on Big Data (IEEE BigData 2017), December 14, 2017, Boston, MA, US.
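The core mechanism — policy entries made tamper-evident by hash chaining (the property blockchains provide) and consulted before sharing PII — can be sketched as follows. This is a toy, not the LinkShare implementation; the policy fields are invented.

```python
import hashlib
import json

def add_entry(chain, policy):
    """Append a policy entry linked to the previous entry's hash."""
    prev = chain[-1]["hash"] if chain else "0" * 64
    body = json.dumps(policy, sort_keys=True) + prev
    chain.append({"policy": policy, "prev": prev,
                  "hash": hashlib.sha256(body.encode()).hexdigest()})

def may_share(chain, field, party):
    """Enforce the most recent policy entry covering this field and party."""
    for entry in reversed(chain):
        p = entry["policy"]
        if p["field"] == field and p["party"] == party:
            return p["allow"]
    return False  # deny by default

chain = []
add_entry(chain, {"field": "email", "party": "advertiser", "allow": False})
add_entry(chain, {"field": "email", "party": "provider", "allow": True})
print(may_share(chain, "email", "advertiser"))   # False
```

Because each entry's hash covers the previous entry's hash, rewriting an older policy invalidates every later entry — that tamper evidence is what makes the audit trail trustworthy.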