Search CORE

6,362 research outputs found

Tracking Federated Queries in the Linked Data

Author: Desmontils Emmanuel
Molli Pascal
Nassopoulos Georges
Serrano-Alvarado Patricia
Publication venue
Publication date: 25/08/2015
Field of study

Federated query engines allow data consumers to execute queries over the federation of Linked Data (LD). However, as federated queries are decomposed into potentially thousands of subqueries distributed among SPARQL endpoints, data providers do not know federated queries, they only know subqueries they process. Consequently, unlike warehousing approaches, LD data providers have no access to secondary data. In this paper, we propose FETA (FEderated query TrAcking), a query tracking algorithm that infers Basic Graph Patterns (BGPs) processed by a federation from a shared log maintained by data providers. Concurrent execution of thousand subqueries generated by multiple federated query engines makes the query tracking process challenging and uncertain. Experiments with Anapsid show that FETA is able to extract BGPs which, even in a worst case scenario, contain BGPs of original queries

arXiv.org e-Print Archive

HAL Descartes

Representing Dataset Quality Metadata using Multi-Dimensional Views

Author: Auer Sören
Debattista Jeremy
Lange Christoph
Publication venue
Publication date: 01/01/2014
Field of study

Data quality is commonly defined as fitness for use. The problem of identifying quality of data is faced by many data consumers. Data publishers often do not have the means to identify quality problems in their data. To make the task for both stakeholders easier, we have developed the Dataset Quality Ontology (daQ). daQ is a core vocabulary for representing the results of quality benchmarking of a linked dataset. It represents quality metadata as multi-dimensional and statistical observations using the Data Cube vocabulary. Quality metadata are organised as a self-contained graph, which can, e.g., be embedded into linked open datasets. We discuss the design considerations, give examples for extending daQ by custom quality metrics, and present use cases such as analysing data versions, browsing datasets by quality, and link identification. We finally discuss how data cube visualisation tools enable data publishers and consumers to analyse better the quality of their data.Comment: Preprint of a paper submitted to the forthcoming SEMANTiCS 2014, 4-5 September 2014, Leipzig, German

arXiv.org e-Print Archive

Crossref

Fraunhofer-ePrints

Git4Voc: Git-based Versioning for Collaborative Vocabulary Development

Author: Auer Sören
Coskun Gökhan
Grangel-González Irlán
Halilaj Lavdim
Publication venue
Publication date: 11/01/2016
Field of study

Collaborative vocabulary development in the context of data integration is the process of finding consensus between the experts of the different systems and domains. The complexity of this process is increased with the number of involved people, the variety of the systems to be integrated and the dynamics of their domain. In this paper we advocate that the realization of a powerful version control system is the heart of the problem. Driven by this idea and the success of Git in the context of software development, we investigate the applicability of Git for collaborative vocabulary development. Even though vocabulary development and software development have much more similarities than differences there are still important differences. These need to be considered within the development of a successful versioning and collaboration system for vocabulary development. Therefore, this paper starts by presenting the challenges we were faced with during the creation of vocabularies collaboratively and discusses its distinction to software development. Based on these insights we propose Git4Voc which comprises guidelines how Git can be adopted to vocabulary development. Finally, we demonstrate how Git hooks can be implemented to go beyond the plain functionality of Git by realizing vocabulary-specific features like syntactic validation and semantic diffs

arXiv.org e-Print Archive

Crossref

Fraunhofer-ePrints

Towards a Knowledge Graph based Speech Interface

Author: Auer Sören
Kumar Ashwini Jaya
köhler Joachim
Schmidt Christoph
Publication venue
Publication date: 23/05/2017
Field of study

Applications which use human speech as an input require a speech interface with high recognition accuracy. The words or phrases in the recognised text are annotated with a machine-understandable meaning and linked to knowledge graphs for further processing by the target application. These semantic annotations of recognised words can be represented as a subject-predicate-object triples which collectively form a graph often referred to as a knowledge graph. This type of knowledge representation facilitates to use speech interfaces with any spoken input application, since the information is represented in logical, semantic form, retrieving and storing can be followed using any web standard query languages. In this work, we develop a methodology for linking speech input to knowledge graphs and study the impact of recognition errors in the overall process. We show that for a corpus with lower WER, the annotation and linking of entities to the DBpedia knowledge graph is considerable. DBpedia Spotlight, a tool to interlink text documents with the linked open data is used to link the speech recognition output to the DBpedia knowledge graph. Such a knowledge-based speech recognition interface is useful for applications such as question answering or spoken dialog systems.Comment: Under Review in International Workshop on Grounding Language Understanding, Satellite of Interspeech 201

arXiv.org e-Print Archive

Fraunhofer-ePrints

MultiFarm: A benchmark for multilingual ontology matching

Author: Andrei Tamilin
Christian Meilicke
Cássia Trojahn
Elena Montiel-Ponsoda
Euzenat
Euzenat
Fred Freitas
Fu
García-Castro
Giunchiglia
Heiner Stuckenschmidt
Jung
Neches
Niepert
Ondřej Šváb-Zamazal
Raúl García-Castro
Ryan Ribeiro de Azevedo
Shenghui Wang
Vojtěch Svátek
Wang
Willem Robert van Hage
Publication venue: Facultad de Informática (UPM)
Publication date: 01/01/2012
Field of study

In this paper we present the MultiFarm dataset, which has been designed as a benchmark for multilingual ontology matching. The MultiFarm dataset is composed of a set of ontologies translated in different languages and the corresponding alignments between these ontologies. It is based on the OntoFarm dataset, which has been used successfully for several years in the Ontology Alignment Evaluation Initiative (OAEI). By translating the ontologies of the OntoFarm dataset into eight different languages – Chinese, Czech, Dutch, French, German, Portuguese, Russian, and Spanish – we created a comprehensive set of realistic test cases. Based on these test cases, it is possible to evaluate and compare the performance of matching approaches with a special focus on multilingualism

VU Research Portal

Crossref

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

MAnnheim DOCument Server

Archivo Digital UPM

Recommended from our members

MetaMorphosis+ - A social network of educational Web resources based on semantic integration of services and data

Author: Dietze Stefan
Dovrolis Nikolas
Kaldoudi Eleni
Taibi Davide
Yu Hong Qing
Publication venue
Publication date: 01/01/2011
Field of study

Past research aiming at interoperability within the Technology Enhanced Learning (TEL) field has led to a fragmented landscape of competing metadata schemas and interface mechanisms. So far, Web-scale integration of resources is not facilitated, mainly due to the lack of take-up of shared principles, datasets and schemas. On the other hand, the Linked Data approach has emerged as the de facto standard for sharing data on the Web. We propose MetaMorphosis+, a social educational application which adopts a general approach to exploit existing TEL data on the Web by allowing its exposure as Linked Data and by taking into account automated enrichment and interlinking techniques to provide rich and well-interlinked data for the educational domain

Open Research Online (The Open University)