Search CORE

348 research outputs found

PicShark: mitigating metadata scarcity through large-scale P2P collaboration

Author: Aberer Karl
Budura Adriana
Cudré-Mauroux Philippe
Hauswirth Manfred
Publication venue
Publication date: 18/06/2018
Field of study

With the commoditization of digital devices, personal information and media sharing is becoming a key application on the pervasive Web. In such a context, data annotation rather than data production is the main bottleneck. Metadata scarcity represents a major obstacle preventing efficient information processing in large and heterogeneous communities. However, social communities also open the door to new possibilities for addressing local metadata scarcity by taking advantage of global collections of resources. We propose to tackle the lack of metadata in large-scale distributed systems through a collaborative process leveraging on both content and metadata. We develop a community-based and self-organizing system called PicShark in which information entropy—in terms of missing metadata—is gradually alleviated through decentralized instance and schema matching. Our approach focuses on semi-structured metadata and confines computationally expensive operations to the edge of the network, while keeping distributed operations as simple as possible to ensure scalability. PicShark builds on structured Peer-to-Peer networks for distributed look-up operations, but extends the application of self-organization principles to the propagation of metadata and the creation of schema mappings. We demonstrate the practical applicability of our method in an image sharing scenario and provide experimental evidences illustrating the validity of our approac

Irish Universities

RERO DOC Digital Library

Multimedia Markup Tools for OpenKnowledge

Author: Croitoru Madalina
Dasmahapatra Srinandan
Dupplaw David
Lewis Paul
Loizou Antonis
Tuffield Mischa
Xiao Liang
Publication venue
Publication date: 05/12/2007
Field of study

OpenKnowledge is a peer-to-peer system for sharing knowledge and is driven by interaction models that give the necessary context for mapping of ontological knowledge fragments necessary for the interaction to take place. The OpenKnowledge system is agnostic to any specific data formats that are used in the interactions, relying on ontology mapping techniques for shimming the messages. The potentially large search space for matching ontologies is reduced by the shared context of the interaction. In this paper we investigate what this means for multimedia data on the OpenKnowledge network by discussing how an existing application that provides multimedia annotation (the Semantic Logger) can be migrated into the OpenKnowledge domain

Southampton (e-Prints Soton)

XML-based approaches for the integration of heterogeneous bio-molecular data

Author: Berlanga-Llavori Rafael
Jiménez-Ruiz Ernesto
Manset David
Mesiti Marco
Perlasca Paolo
Sanz Ismael
Valentini Giorgio
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Background: The today's public database infrastructure spans a very large collection of heterogeneous biological data, opening new opportunities for molecular biology, bio-medical and bioinformatics research, but raising also new problems for their integration and computational processing. Results: In this paper we survey the most interesting and novel approaches for the representation, integration and management of different kinds of biological data by exploiting XML and the related recommendations and approaches. Moreover, we present new and interesting cutting edge approaches for the appropriate management of heterogeneous biological data represented through XML. Conclusion: XML has succeeded in the integration of heterogeneous biomolecular information, and has established itself as the syntactic glue for biological data sources. Nevertheless, a large variety of XML-based data formats have been proposed, thus resulting in a difficult effective integration of bioinformatics data schemes. The adoption of a few semantic-rich standard formats is urgent to achieve a seamless integration of the current biological resources. </p

CiteSeerX

City Research Online

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

AIR Universita degli studi di Milano

Springer - Publisher Connector

PubMed Central

Repositori Institucional de la Universitat Jaume I

Oxford University Research Archive

Bioinformatics service reconciliation by heterogeneous schema transformation

Author: Martin Nigel
Poulovassilis Alexandra
Zamboulis Lucas
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2007
Field of study

This paper focuses on the problem of bioinformatics service reconciliation in a generic and scalable manner so as to enhance interoperability in a highly evolving field. Using XML as a common representation format, but also supporting existing flat-file representation formats, we propose an approach for the scalable semi-automatic reconciliation of services, possibly invoked from within a scientific workflows tool. Service reconciliation may use the AutoMed heterogeneous data integration system as an intermediary service, or may use AutoMed to produce services that mediate between services. We discuss the application of our approach for the reconciliation of services in an example bioinformatics workflow. The main contribution of this research is an architecture for the scalable reconciliation of bioinformatics services

Birkbeck Institutional Research Online

GridVine: an Infrastructure for Peer Information Management

Author: Aberer Karl
Agarwal Suchit
Cudré-Mauroux Philippe
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 05/10/2007
Field of study

GridVine is a semantic overlay infrastructure based on a peer-to-peer (P2P) access structure. Built following the principle of data independence, it separates a logical layer — in which data, schemas, and schema mappings are managed — from a physical layer consisting of a structured P2P network supporting decentralized indexing, key load-balancing, and efficient routing. The system is decentralized, yet fosters semantic interoperability through pair-wise schema mappings and query reformulation. GridVine’s heterogeneous but semantically related information sources can be queried transparently using iterative query reformulation. The authors discuss a reference implementation of the system and several mechanisms for resolving queries collaboratively

Infoscience - École polytechnique fédérale de Lausanne

Viewpoints on emergent semantics

Author: Abdelmoty Alia I.
Catarci Tiziana
Damiani Ernesto
Illaramendi Arantxa
Jarrar Mustafa
Meersman Robert
Neuhold Erich J.
Parent Christine
Sattler Kai-Uwe
Scannapieco Monica
Spaccapietra Stefano
Spyns Peter
De Tre Guy
Publication venue
Publication date: 01/01/2006
Field of study

Authors include:Philippe Cudr´e-Mauroux, and Karl Aberer (editors), Alia I. Abdelmoty, Tiziana Catarci, Ernesto Damiani, Arantxa Illaramendi, Robert Meersman, Erich J. Neuhold, Christine Parent, Kai-Uwe Sattler, Monica Scannapieco, Stefano Spaccapietra, Peter Spyns, and Guy De Tr´eWe introduce a novel view on how to deal with the problems of semantic interoperability in distributed systems. This view is based on the concept of emergent semantics, which sees both the representation of semantics and the discovery of the proper interpretation of symbols as the result of a self-organizing process performed by distributed agents exchanging symbols and having utilities dependent on the proper interpretation of the symbols. This is a complex systems perspective on the problem of dealing with semantics. We highlight some of the distinctive features of our vision and point out preliminary examples of its applicatio

FADA - Birzeit University

Publishing Network for Geoscientific and Environmental Data

Lightweight Synchronization of Ontologies

Author: Sharma Arun
Publication venue: HAL CCSD
Publication date: 01/01/2006
Field of study

Master's thesis, RWTH, Aachen (DE) - sharma2006aThe semantic web is based on the idea of having formalized knowledge expressed on the web (in languages like RDF). However, we know that people do not like to strictly comply with some ontology and they would tend to add their own tags within existing ontology descriptions. This thesis addresses the issue of heterogeneity within the domain of photo annotation. It presents a peer-to-peer infrastructure and client software that enables users to provide ontology based photo annotations in a free manner (by using the most convenient vocabulary) and share them with other users in a peer-to-peer environment. Moreover, the thesis presents an ontology alignment based mediator service to translate queries among the peers

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Lightweight Synchronization of Ontologies

Author: Sharma Arun
Publication venue: HAL CCSD
Publication date: 01/01/2006
Field of study

Hal - Université Grenoble Alpes