
    Mapping languages analysis of comparative characteristics

    RDF generation processes are becoming more interoperable, reusable, and maintainable thanks to the increased use of mapping languages: languages that describe how to generate an RDF graph from (semi-)structured data. This gives rise to new mapping languages, each with different characteristics. However, it is not clear which mapping language is fit for a given task, so a comparative framework is needed. In this paper, we investigate a set of mapping languages that exhibit complementary characteristics, and we present an initial set of comparative characteristics based on the requirements put forward in the reference works of those mapping languages. Our initial investigation found 9 broad characteristics, classified into 3 categories. Formalizing and completing this set of characteristics requires further investigation and a joint effort of the community.
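    Mapping languages of the kind surveyed here declare source-to-RDF rules externally; the sketch below hard-codes the same kind of rules in Python with rdflib, only to make concrete what such a rule set expresses. The CSV columns and the example.org IRIs are illustrative assumptions, not taken from the paper.

```python
# Illustrative sketch only (not any specific mapping language such as RML):
# the "rules" that a mapping language would declare are hard-coded below.
import csv
import io

from rdflib import Graph, Literal, Namespace
from rdflib.namespace import FOAF, RDF

EX = Namespace("http://example.org/")            # hypothetical target namespace

CSV_DATA = "id,name\n1,Alice\n2,Bob\n"           # stand-in (semi-)structured source

g = Graph()
for row in csv.DictReader(io.StringIO(CSV_DATA)):
    subject = EX[f"person/{row['id']}"]          # subject template: ex:person/{id}
    g.add((subject, RDF.type, FOAF.Person))      # constant class assignment
    g.add((subject, FOAF.name, Literal(row["name"])))  # column-to-literal rule

print(g.serialize(format="turtle"))
```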

    Streamlining Knowledge Graph Construction with a façade: The SPARQL Anything project

    What should a data integration framework for knowledge engineers look like? Recent research on Knowledge Graph construction proposes the design of a façade, a notion borrowed from object-oriented software engineering. This idea is applied to SPARQL Anything, a system that allows querying heterogeneous resources as if they were RDF, in plain SPARQL 1.1, by overloading the SERVICE clause. SPARQL Anything supports a wide variety of file formats, from popular ones (CSV, JSON, XML, spreadsheets) to others not supported by alternative solutions (Markdown, YAML, DOCX, BibTeX). Features include querying Web APIs with high flexibility, parametrised queries, and chaining multiple transformations into complex pipelines. In this paper, we describe the design rationale and software architecture of the SPARQL Anything system. We provide references to an extensive set of reusable, real-world scenarios from various application domains. We report on the value to users of the founding assumptions of its design, compared to alternative solutions, through a community survey and a field report from industry. (15 pages)
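    A minimal sketch of the SERVICE-overloading idea described above. The query shape follows the Facade-X pattern as I understand it from the SPARQL Anything documentation; the x-sparql-anything: IRI, the xyz:/fx: prefixes, and the example.org JSON source are assumptions for illustration, not the verified syntax of any particular release.

```python
# The SERVICE IRI tells the engine which non-RDF resource to expose as RDF;
# everything else is plain SPARQL 1.1, so the query can be handed to whatever
# SPARQL Anything front end (CLI or endpoint) a given release provides.
QUERY = """
PREFIX xyz: <http://sparql.xyz/facade-x/data/>
PREFIX fx:  <http://sparql.xyz/facade-x/ns/>

SELECT ?name WHERE {
  SERVICE <x-sparql-anything:location=https://example.org/data.json> {
    ?record xyz:name ?name .
  }
}
"""

print(QUERY)
```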

    Knowledge Organization Systems (KOS) in the Semantic Web: A Multi-Dimensional Review

    Since the Simple Knowledge Organization System (SKOS) specification and its SKOS eXtension for Labels (SKOS-XL) became formal W3C recommendations in 2009, a significant number of conventional knowledge organization systems (KOS), including thesauri, classification schemes, name authorities, and lists of codes and terms produced before the arrival of the ontology wave, have joined the Semantic Web mainstream. This paper uses "LOD KOS" as an umbrella term to refer to all of the value vocabularies and lightweight ontologies within the Semantic Web framework. The paper provides an overview of what the LOD KOS movement has brought to various communities and users. These are not limited to the communities of value vocabulary constructors and providers, nor to the catalogers and indexers who have a long history of applying these vocabularies to their products. LOD dataset producers, LOD service providers, information architects and interface designers, and researchers in the sciences and humanities are also direct beneficiaries of LOD KOS. The paper examines a set of collected cases (experimental or in real applications) and aims to identify uses of LOD KOS in order to share practices and ideas among communities and users. Through the viewpoints of a number of different user groups, the functions of LOD KOS are examined from multiple dimensions. This paper focuses on LOD dataset producers, vocabulary producers, and researchers (as end-users of KOS). (31 pages, 12 figures; accepted for the International Journal on Digital Libraries)
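    To make the kind of value vocabulary under discussion concrete, the sketch below builds a two-concept SKOS fragment with rdflib; the example.org IRIs and labels are hypothetical and stand in for any LOD KOS vocabulary rather than reproducing one from the paper.

```python
# A tiny SKOS concept scheme: two concepts linked by skos:broader, as in a thesaurus.
from rdflib import Graph, Literal, Namespace
from rdflib.namespace import RDF, SKOS

VOC = Namespace("http://example.org/vocab/")     # hypothetical vocabulary namespace

g = Graph()
g.add((VOC.scheme, RDF.type, SKOS.ConceptScheme))

g.add((VOC.birds, RDF.type, SKOS.Concept))
g.add((VOC.birds, SKOS.prefLabel, Literal("Birds", lang="en")))
g.add((VOC.birds, SKOS.inScheme, VOC.scheme))

g.add((VOC.owls, RDF.type, SKOS.Concept))
g.add((VOC.owls, SKOS.prefLabel, Literal("Owls", lang="en")))
g.add((VOC.owls, SKOS.broader, VOC.birds))       # hierarchical link between concepts
g.add((VOC.owls, SKOS.inScheme, VOC.scheme))

print(g.serialize(format="turtle"))
```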

    A dataflow platform for applications based on Linked Data

    Modern software applications increasingly benefit from accessing the multifarious and heterogeneous Web of Data, thanks to the use of web APIs and Linked Data principles. In previous work, the authors proposed a platform for developing applications that consume Linked Data in a declarative and modular way. This paper describes in detail the functional language the platform gives access to, which is based on SPARQL (the standard query language for Linked Data) and on the dataflow paradigm. The language features interactive and meta-programming capabilities so that complex modules and applications can be developed. By adopting a declarative style, it favours the development of modules that can be reused in various specific execution contexts.
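    The platform's own language is not reproduced here; the Python sketch below only illustrates the dataflow idea the abstract describes, with each module wrapping a SPARQL query and a pipeline composing two such modules. The DBpedia endpoint and the queries are illustrative assumptions.

```python
# Dataflow-style composition of SPARQL-backed modules: bindings flow from one
# node to the next. This is a generic sketch, not the platform's language.
from SPARQLWrapper import JSON, SPARQLWrapper

ENDPOINT = "https://dbpedia.org/sparql"          # assumed public endpoint

def sparql_module(query_template):
    """Wrap a SPARQL query template as a reusable node: bindings in, bindings out."""
    def node(bindings):
        client = SPARQLWrapper(ENDPOINT)
        client.setReturnFormat(JSON)
        results = []
        for b in bindings:
            client.setQuery(query_template.format(**b))
            results.extend(client.query().convert()["results"]["bindings"])
        return results
    return node

find_city = sparql_module("""
    PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
    SELECT ?city WHERE {{ ?city rdfs:label "{label}"@en }} LIMIT 1
""")
find_sites = sparql_module("""
    PREFIX dbo: <http://dbpedia.org/ontology/>
    SELECT ?site WHERE {{ ?site dbo:location <{city}> }} LIMIT 5
""")

def pipeline(label):
    """Chain the two modules: label -> city IRI -> things located in that city."""
    cities = find_city([{"label": label}])
    return find_sites([{"city": c["city"]["value"]} for c in cities])

print(pipeline("Ghent"))
```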

    Parallel RDF generation from heterogeneous big data

    To unlock the value of increasingly available data in high volumes, we need flexible ways to integrate data across different sources. While semantic integration can be provided through RDF generation, current generators scale insufficiently in terms of volume because they are limited by memory constraints. Therefore, we developed the RMLStreamer, a generator that parallelizes the ingestion and mapping tasks of RDF generation across multiple instances. In this paper, we analyze which aspects are parallelizable and introduce an approach for parallel RDF generation. We describe how we implemented our proposed approach in the RMLStreamer and how its scaling behavior compares to other RDF generators. Through parallel ingestion, the RMLStreamer ingests data at a 50% faster rate than existing generators.
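    The RMLStreamer itself runs on a streaming framework and is not reproduced here; the sketch below only illustrates the underlying idea of splitting ingestion into partitions and running the record-to-triples mapping in parallel workers, assuming a simple line-based CSV source and hypothetical example.org IRIs.

```python
# Partitioned ingestion + parallel mapping of records to N-Triples lines.
import csv
import io
from multiprocessing import Pool

CSV_DATA = "id,name\n" + "\n".join(f"{i},person{i}" for i in range(10_000))

def map_partition(lines):
    """Mapping task: turn one partition of records into N-Triples lines."""
    triples = []
    for row in csv.DictReader(io.StringIO("id,name\n" + "\n".join(lines))):
        s = f"<http://example.org/person/{row['id']}>"
        triples.append(f"{s} <http://xmlns.com/foaf/0.1/name> \"{row['name']}\" .")
    return triples

def partitions(text, n):
    """Ingestion task: split the source into n roughly equal record partitions."""
    lines = text.splitlines()[1:]            # drop the header once
    size = max(1, len(lines) // n)
    return [lines[i:i + size] for i in range(0, len(lines), size)]

if __name__ == "__main__":
    with Pool(4) as pool:                    # mapping runs per partition, in parallel
        chunks = pool.map(map_partition, partitions(CSV_DATA, 4))
    print(sum(len(c) for c in chunks), "triples generated")
```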