Search CORE

51 research outputs found

Applying an ontology on data integration

Author: Brisaboa Nieves R.
Buccella Agustina
Cechich Alejandra
Publication venue
Publication date: 01/05/2003
Field of study

The term “Federated Databases” refers to the data integration of distributed, autonomous and heterogeneous databases. However, a federation can also include information systems, not only databases. When integrating data, several issues must be addressed. Here, we focus on the problem of heterogeneity, more specifically on semantic heterogeneity – that is problems related to semantically equivalent concepts or semantically related/unrelated concepts. In order to address this problem, we apply the idea of ontologies as a tool for data integration. In this paper, we explain this concept and we briefly describe a method for constructing an ontology by using a hybrid ontology approach.Eje: Bases de datosRed de Universidades con Carreras en Informática (RedUNCI

An ontology approach to data integration

Author: Buccella Agustina
Cechich Alejandra
Rodríguez Brisaboa Nieves
Publication venue
Publication date: 01/10/2003
Field of study

The term Federated Databases refers to the data integration of distributed, autonomous and heterogeneous databases. However, a federation can also include information systems, not only databases. At integrating data, several issues must be addressed. Here, we focus on the problem of heterogeneity, more specifically on semantic heterogeneity that is, problems rela ted to semantically equivalent concepts or semantically related/unrelated concepts. In order to address this problem, we apply the idea of ontologies as a tool for data integration. In this paper, we explain this concept and we briefly describe a method for constructing an ontology by using a hybrid ontology approach.Facultad de Informátic

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Servicio de Difusión de la Creación Intelectual

APPLYING CLASSIFICATION TECHNIQUES IN SEMANTIC INTEGRATION OF HETEROGENEOUS DATA SOURCES

Author: Ram Sudha
Zhao Huimin
Publication venue: AIS Electronic Library (AISeL)
Publication date: 31/12/2002
Field of study

AIS Electronic Library (AISeL)

Data Processing in Space, Time, and Semantics Dimensions

Author: Aleman-Meza Boanerges
Hakimpour Farshad
Perry Matthew
Sheth Amit P.
Publication venue: CORE Scholar
Publication date: 01/01/2006
Field of study

This work presents an experimental system for data processing in space, time and semantics dimensions using current Semantic Web technologies. The paper describes how we obtain geographic and event data from Internet sources and also how we integrate them into an RDF store. We briefly introduce a set of functionalities in space, time and semantics dimensions. These functionalities are implemented based on our existing technology for main-memory based RDF data processing developed in the LSDIS Lab. A number of these functionalities are exposed as REST Web services. We present two sample client side applications that are developed using a combination of our services with Google map service

Scholar Commons - Institutional Repository of the University of South Carolina

CORE

A Conditional Random Field for Discriminatively-trained Finite-state String Edit Distance

Author: Bellare Kedar
McCallum Andrew
Pereira Fernando
Publication venue: ScholarWorks@UMass Amherst
Publication date: 01/01/2005
Field of study

The need to measure sequence similarity arises in information extraction, object identity, data mining, biological sequence analysis, and other domains. This paper presents discriminative string-edit CRFs, a finitestate conditional random field model for edit sequences between strings. Conditional random fields have advantages over generative approaches to this problem, such as pair HMMs or the work of Ristad and Yianilos, because as conditionally-trained methods, they enable the use of complex, arbitrary actions and features of the input strings. As in generative models, the training data does not have to specify the edit sequences between the given string pairs. Unlike generative models, however, our model is trained on both positive and negative instances of string pairs. We present positive experimental results on several data sets

CiteSeerX

ScholarWorks@UMass Amherst

Semantic Integration of heterogeneous data sources in the MOMIS Data Transformation System

Author: Beneventano Domenico
Bergamaschi Sonia
Vincini Maurizio
Publication venue
Publication date: 01/01/2013
Field of study

In the last twenty years, many data integration systems following a classical wrapper/mediator architecture and providing a Global Virtual Schema (a.k.a. Global Virtual View - GVV) have been proposed by the research community. The main issues faced by these approaches range from system-level heterogeneities, to structural syntax level heterogeneities at the semantic level. Despite the research effort, all the approaches proposed require a lot of user intervention for customizing and managing the data integration and reconciliation tasks. In some cases, the effort and the complexity of the task is huge, since it requires the development of specific programming codes. Unfortunately, due to the specificity to be addressed, application codes and solutions are not frequently reusable in other domains. For this reason, the Lowell Report 2005 has provided the guideline for the definition of a public benchmark for information integration problem. The proposal, called THALIA (Test Harness for the Assessment of Legacy information Integration Approaches), focuses on how the data integration systems manage syntactic and semantic heterogeneities, which definitely are the greatest technical challenges in the field. We developed a Data Transformation System (DTS) that supports data transformation functions and produces query translation in order to push down to the sources the execution. Our DTS is based on MOMIS, a mediator-based data integration system that our research group is developing and supporting since 1999. In this paper, we show how the DTS is able to solve all the twelve queries of the THALIA benchmark by using a simple combination of declarative translation functions already available in the standard SQL language. We think that this is a remarkable result, mainly for two reasons: firstly to the best of our knowledge there is no system that has provided a complete answer to the benchmark, secondly, our queries does not require any overhead of new code

ZENODO

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

ARPHA OAI-PMH Endpoint

ARPHA Preprints

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia