Search CORE

3,089 research outputs found

Nonparametric Bayesian Modeling for Automated Database Schema Matching

Author: Ferragut Erik M.
Laska Jason
Publication venue
Publication date: 06/07/2015
Field of study

The problem of merging databases arises in many government and commercial applications. Schema matching, a common first step, identifies equivalent fields between databases. We introduce a schema matching framework that builds nonparametric Bayesian models for each field and compares them by computing the probability that a single model could have generated both fields. Our experiments show that our method is more accurate and faster than the existing instance-based matching algorithms in part because of the use of nonparametric Bayesian models

arXiv.org e-Print Archive

Crossref

Current State of Ontology Matching. A Survey of Ontology and Schema Matching

Author: Arvo Jukka
Knuutila Timo
Lehtonen Teijo
Mukkala Lauri
Publication venue: University of Turku, Technology Research Center
Publication date: 14/08/2015
Field of study

Ontology matching is an important task when data from multiple data sources is integrated. Problems of ontology matching have been studied widely in the researchliterature and many diﬀerent solutions and approaches have been proposed alsoin commercial software tools. In this survey, well-known approaches of ontologymatching, and its subtype schema matching, are reviewed and compared. The aimof this report is to summarize the knowledge about the state-of-the-art solutionsfrom the research literature, discuss how the methods work on diﬀerent application domains, and analyze pros and cons of diﬀerent open source and academic tools inthe commercial world.Siirretty Doriast

UTUPub

A Survey of Cognitive Theories to Support Data Integration

Author: Evermann Joerg
Lukyanenko Roman
Publication venue: AIS Electronic Library (AISeL)
Publication date: 06/08/2011
Field of study

Business intelligence applications are being increasingly used to facilitate managerial insight and maintain competitiveness.These applications rely on the availability of integrated data from multiple data sources, making database integration anincreasingly important task. A central step in the process of data integration is schema matching, the identification of similarelements in the two databases. While a number of approaches have been proposed, the majority of schema matchingtechniques are based on ad-hoc heuristics, instead of an established theoretical foundation. The absence of a theoreticalfoundation makes it difficult to explain and improve schema matching process. This research surveys current cognitivetheories of similarity and demonstrates their application to the problem of schema matching. Better integration techniqueswill benefit business intelligence applications and can thereby contribute to business value

AIS Electronic Library (AISeL)

User's and Administrator's Manual of AMGA Metadata Catalog v 2.4.0 (EMI-3)

Author: Hwang Soon Wook
Publication venue
Publication date: 27/05/2013
Field of study

User's and Administrator's Manual of AMGA Metadata Catalog v 2.4.0 (EMI-3

ZENODO

A Large Scale Dataset for the Evaluation of Ontology Matching Systems

Author: Avesani Paolo
Giunchiglia Fausto
Shvaiko Pavel
Yatskevich Mikalai
Publication venue
Publication date: 01/01/2008
Field of study

Recently, the number of ontology matching techniques and systems has increased significantly. This makes the issue of their evaluation and comparison more severe. One of the challenges of the ontology matching evaluation is in building large scale evaluation datasets. In fact, the number of possible correspondences between two ontologies grows quadratically with respect to the numbers of entities in these ontologies. This often makes the manual construction of the evaluation datasets demanding to the point of being infeasible for large scale matching tasks. In this paper we present an ontology matching evaluation dataset composed of thousands of matching tasks, called TaxME2. It was built semi-automatically out of the Google, Yahoo and Looksmart web directories. We evaluated TaxME2 by exploiting the results of almost two dozen of state of the art ontology matching systems. The experiments indicate that the dataset possesses the desired key properties, namely it is error-free, incremental, discriminative, monotonic, and hard for the state of the art ontology matching systems. The paper has been accepted for publication in "The Knowledge Engineering Review", Cambridge Universty Press (ISSN: 0269-8889, EISSN: 1469-8005)

CiteSeerX

Archivio della ricerca - Fondazione Bruno Kessler

Unitn-eprints Research