3,089 research outputs found

    Nonparametric Bayesian Modeling for Automated Database Schema Matching

    Full text link
    The problem of merging databases arises in many government and commercial applications. Schema matching, a common first step, identifies equivalent fields between databases. We introduce a schema matching framework that builds nonparametric Bayesian models for each field and compares them by computing the probability that a single model could have generated both fields. Our experiments show that our method is more accurate and faster than the existing instance-based matching algorithms in part because of the use of nonparametric Bayesian models

    Current State of Ontology Matching. A Survey of Ontology and Schema Matching

    Get PDF
    Ontology matching is an important task when data from multiple data sources is integrated. Problems of ontology matching have been studied widely in the researchliterature and many diļ¬€erent solutions and approaches have been proposed alsoin commercial software tools. In this survey, well-known approaches of ontologymatching, and its subtype schema matching, are reviewed and compared. The aimof this report is to summarize the knowledge about the state-of-the-art solutionsfrom the research literature, discuss how the methods work on diļ¬€erent application domains, and analyze pros and cons of diļ¬€erent open source and academic tools inthe commercial world.Siirretty Doriast

    A Survey of Cognitive Theories to Support Data Integration

    Get PDF
    Business intelligence applications are being increasingly used to facilitate managerial insight and maintain competitiveness.These applications rely on the availability of integrated data from multiple data sources, making database integration anincreasingly important task. A central step in the process of data integration is schema matching, the identification of similarelements in the two databases. While a number of approaches have been proposed, the majority of schema matchingtechniques are based on ad-hoc heuristics, instead of an established theoretical foundation. The absence of a theoreticalfoundation makes it difficult to explain and improve schema matching process. This research surveys current cognitivetheories of similarity and demonstrates their application to the problem of schema matching. Better integration techniqueswill benefit business intelligence applications and can thereby contribute to business value

    User's and Administrator's Manual of AMGA Metadata Catalog v 2.4.0 (EMI-3)

    Get PDF
    User's and Administrator's Manual of AMGA Metadata Catalog v 2.4.0 (EMI-3

    A Large Scale Dataset for the Evaluation of Ontology Matching Systems

    Get PDF
    Recently, the number of ontology matching techniques and systems has increased significantly. This makes the issue of their evaluation and comparison more severe. One of the challenges of the ontology matching evaluation is in building large scale evaluation datasets. In fact, the number of possible correspondences between two ontologies grows quadratically with respect to the numbers of entities in these ontologies. This often makes the manual construction of the evaluation datasets demanding to the point of being infeasible for large scale matching tasks. In this paper we present an ontology matching evaluation dataset composed of thousands of matching tasks, called TaxME2. It was built semi-automatically out of the Google, Yahoo and Looksmart web directories. We evaluated TaxME2 by exploiting the results of almost two dozen of state of the art ontology matching systems. The experiments indicate that the dataset possesses the desired key properties, namely it is error-free, incremental, discriminative, monotonic, and hard for the state of the art ontology matching systems. The paper has been accepted for publication in "The Knowledge Engineering Review", Cambridge Universty Press (ISSN: 0269-8889, EISSN: 1469-8005)
    • ā€¦
    corecore