Search CORE

10 research outputs found

Survey: Models and Prototypes of Schema Matching

Author: Mustofa Khabib
Sutanta Edhy
Wardoyo Retantyo
Winarko Edi
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/06/2016
Field of study

Schema matching is critical problem within many applications to integration of data/information, to achieve interoperability, and other cases caused by schematic heterogeneity. Schema matching evolved from manual way on a specific domain, leading to a new models and methods that are semi-automatic and more general, so it is able to effectively direct the user within generate a mapping among elements of two the schema or ontologies better. This paper is a summary of literature review on models and prototypes on schema matching within the last 25 years to describe the progress of and research chalenge and opportunities on a new models, methods, and/or prototypes

Institute of Advanced Engineering and Science

Automatisierte Umsetzung von komplexen XML-Schemaänderungen

Author: Hartung Michael
Publication venue
Publication date: 01/02/2019
Field of study

Dieser Beitrag untersucht die Frage, wie komplexe Änderungen bei der Evolution von XML-Schemas automatisiert unterstützt werden können. Hierzu werden mögliche Änderungen an XML-Schemas durch Evolutionsoperatoren beschrieben, klassifiziert und beurteilt. Im Speziellen wird die Verschiebung (Move) von Elementen innerhalb von XML-Schemas analysiert. Die automatisierte Generierung und Ausführung von Transformationsregeln zur Migration von Instanzda-ten und die Beurteilung möglicher Informationsverluste während einer Transformation wird allgemein, wie auch anhand von Publikationsdaten untersucht

Qucosa - Publikationsserver der Universität Leipzig

On view processing for a native XML DBMS

Author: CHEN TING
Publication venue
Publication date: 23/12/2004
Field of study

Master'sMASTER OF SCIENC

ScholarBank@NUS

Evaluierung von Clio zur Transformation von Metamodellen

Author: Motschiunigg Oliver
Publication venue
Publication date: 01/01/2008
Field of study

Clio ist ein Tool zur teilautomatischen Erzeugung von Schema Mappings und der anschließenden Transformation der Instanz eines Quellschemas in die Instanz eines Zielschemas. Ein Metamodell ist das Modell eines Modells und dient zur Beschreibung seiner Elemente und ihrer Beziehungen zueinander. Ecore ist eine Implementierung der Meta Object Facility, der standardisierten Sprache der Object Management Group (OMG) zur Beschreibung von Metamodellen. Diese Arbeit untersucht Clio in Anwendung auf Ecore-basierte Metamodelle. Es soll festgestellt werden, ob ein Einsatz von Clio zur Transformation dieser Metamodelle möglich und sinnvoll ist. Dabei wird die Bedienung Clios mit besonderem Augenmerk auf den notwendigen Input untersucht. Anschließend wird eine Methode entwickelt, um Metamodelle entsprechend umzuformen. Schließlich werden diese umgeformten Metamodelle verwendet, um sie mit Clio zu transformieren.Clio is a tool for the semi-automatic generation of schema mappings and the following transformation of an instance of a source schema into the instance of a target schema. A metamodel is the model of a model. It is used to describe model elements and their relationships to each other. Ecore is an implementation of the Meta Object Facility – the standardized language of the Object Management Group (OMG) for the description of metamodels. This thesis evaluates the application of Clio to Ecore-based metamodels. The goal is an evaluation of the pros and cons of using Clio as a tool for the transformation of Ecore-based metamodels. Therefore it is necessary to examine how to use Clio focusing on the required input. Subsequently, a method to translate metamodels is developed. Finally, Clio is used to transform these metamodels

OTHES

The Basics of Complex Correspondences and Functions and their Implementation and Semi-automatic Detection in COMA++

Author: Arnold Patrick
Publication venue
Publication date: 26/02/2018
Field of study

In der vorliegenden Masterarbeit wird erläutert, wie ein klassischer Schema Matcher erweitert wird, um Komplexe Korrespondenzen (many-to-many-Korrespondenzen) und allgemeine Funktionen zwischen zwei Schemata auszudrücken, sowie deren automatische Entdeckung als Erweiterung der herkömmlichen Entdeckung von (1:1)-Korrespondenzen. Der letzte Punkt widmet sich dabei einem Gebiet der Datenintegration, das bisher kaum untersucht wurde, und es werden Ansätze vorgestellt, die für viele Schema Matcher eine Bereicherung darstellen können. Zu diesem Zweckwerden im ersten Teil der Arbeit Komplexe Korrespondenzen und Funktionen im Bereich des Schema Mappings ausführlich vorgestellt

Qucosa - Publikationsserver der Universität Leipzig

Semantic Enrichment of Ontology Mappings

Author: Arnold Patrick
Publication venue
Publication date: 15/12/2015
Field of study

Schema and ontology matching play an important part in the field of data integration and semantic web. Given two heterogeneous data sources, meta data matching usually constitutes the first step in the data integration workflow, which refers to the analysis and comparison of two input resources like schemas or ontologies. The result is a list of correspondences between the two schemas or ontologies, which is often called mapping or alignment. Many tools and research approaches have been proposed to automatically determine those correspondences. However, most match tools do not provide any information about the relation type that holds between matching concepts, for the simple but important reason that most common match strategies are too simple and heuristic to allow any sophisticated relation type determination. Knowing the specific type holding between two concepts, e.g., whether they are in an equality, subsumption (is-a) or part-of relation, is very important for advanced data integration tasks, such as ontology merging or ontology evolution. It is also very important for mappings in the biological or biomedical domain, where is-a and part-of relations may exceed the number of equality correspondences by far. Such more expressive mappings allow much better integration results and have scarcely been in the focus of research so far. In this doctoral thesis, the determination of the correspondence types in a given mapping is the focus of interest, which is referred to as semantic mapping enrichment. We introduce and present the mapping enrichment tool STROMA, which obtains a pre-calculated schema or ontology mapping and for each correspondence determines a semantic relation type. In contrast to previous approaches, we will strongly focus on linguistic laws and linguistic insights. By and large, linguistics is the key for precise matching and for the determination of relation types. We will introduce various strategies that make use of these linguistic laws and are able to calculate the semantic type between two matching concepts. The observations and insights gained from this research go far beyond the field of mapping enrichment and can be also applied to schema and ontology matching in general. Since generic strategies have certain limits and may not be able to determine the relation type between more complex concepts, like a laptop and a personal computer, background knowledge plays an important role in this research as well. For example, a thesaurus can help to recognize that these two concepts are in an is-a relation. We will show how background knowledge can be effectively used in this instance, how it is possible to draw conclusions even if a concept is not contained in it, how the relation types in complex paths can be resolved and how time complexity can be reduced by a so-called bidirectional search. The developed techniques go far beyond the background knowledge exploitation of previous approaches, and are now part of the semantic repository SemRep, a flexible and extendable system that combines different lexicographic resources. Further on, we will show how additional lexicographic resources can be developed automatically by parsing Wikipedia articles. The proposed Wikipedia relation extraction approach yields some millions of additional relations, which constitute significant additional knowledge for mapping enrichment. The extracted relations were also added to SemRep, which thus became a comprehensive background knowledge resource. To augment the quality of the repository, different techniques were used to discover and delete irrelevant semantic relations. We could show in several experiments that STROMA obtains very good results w.r.t. relation type detection. In a comparative evaluation, it was able to achieve considerably better results than related applications. This corroborates the overall usefulness and strengths of the implemented strategies, which were developed with particular emphasis on the principles and laws of linguistics

Qucosa - Publikationsserver der Universität Leipzig

Mapping XML and Relational Schemas with Clio

Author: Felix Naumann Y
Howard Ho Y
Lucian Popa
Mauricio A. Hernández
Renée J. Miller
Yannis Velegrakis
Publication venue
Publication date: 01/01/2002
Field of study

Merging and coalescing data from multiple and diverse sources into different data formats continues to be an important problem in modern information systems. Schema Matching, the process of matching elements of a source schema with elements of a target schema, and Schema Mapping, the process of creating a query that maps between two disparate schemas, are at the heart of data integration systems. We demonstrate Clio, a semi-automatic schema mapping tool developed at the IBM Almaden Research Center. In this demonstration we showcase Clio’s mapping engine that allows mapping to and from relational and XML schemas, and takes advantage of data constraints in order to preserve data associations. The semantically correct and complete creation and interpretation of mappings is a highly nontrivial process. Curren

CiteSeerX

Dokumenten-Publikationsserver der Humboldt-Universität zu Berlin

Réconciliation sémantique des données et des services mis en œuvre au sein d'une situation collaborative

Author: BOISSEL-DALLIER Nicolas
PINGAUD Hervé
Publication venue: INPT, Toulouse
Publication date: 01/01/2012
Field of study

La collaboration entre organisations est l un des principaux enjeux de l écosystème industriel actuel. L établissement d une telle collaboration doit être réactive, afin de saisir les différentes opportunités, et flexibles, pour pouvoir s adapter aux changements dans la collaboration. Pour cela, ces collaborations doivent être supportées par un système d information (SI) dédié, en charge de fournir l interopérabilité entre les différents SI des partenaires et capable de gérer les spécificités de la collaboration. Le projet MISE (Mediation Information System Engineering) propose une approche dirigée par les modèles permettant à l utilisateur de concevoir un Système d Information de Médiation (SIM) adapté au support de cette collaboration. Deux étapes sont au coeur de la conception de ce SIM : la génération du processus métier collaboratif depuis une description de la situation (niveau abstrait) et sa transformation en un système exécutable (niveau concret). Ce manuscrit s intéresse à cette seconde phase et tente, à l aide de technologies basées sur la connaissance, de réconcilier ces modèles métiers avec les services techniques disponibles. Après une étude du besoin et des méthodes existantes d apport sémantique pour les différents niveaux d abstraction, nous faisons le choix de nous intéresser aux standards SAWSDL et WSMO-Lite au niveau des services et nous proposons un nouveau mécanisme d annotation sémantique au niveau des processus métier (appelé SABPMN), faute de standard reconnu. Les informations sémantiques ajoutées aux modèles sont ensuite exploitées lors de la transformation des processus métier en workflows exécutables proposée ici. Cette transformation se déroule alors en trois phases : (i) on recherche pour les différentes activités métier du processus le ou les service(s) qui répond(ent) au besoin métier exprimé à l aide de mécanismes de sélection et de composition de services ; (ii) on génère pour chaque service à invoquer la transformation de données nécessaire pour garantir une bonne communication avec les autres composants ; (iii) une fois ces informations validées par l utilisateur, on génère les fichiers nécessaires à l exécution de ce processus sur la plateforme collaborative. Les résultats de cette thèse s inscrivent aussi au sein du projet FUI ISTA3 (Interopérabilité de 3ème génération pour les Sous-Traitants de l Aéronautique) qui se propose d améliorer l interopérabilité de la chaine logistique des sous-traitants aéronautiques de l Aerospace Valley afin de faciliter la co-conception. Une implémentation des différents mécanismes proposés a été réalisée et est disponible sous la forme d un prototype fonctionnel open-source.Collaboration bewteen organisations is one of nowadays main stakes in industrial ecosystem. Establishment of such collaboration must be reactive, in order to take avantage of opportunities, and flexible, in order to adapt collaboration to context changes. In this view, such collaboration must be supported by a dedicated Information System (IS), responsible for ensuring interoperability between partner s IS and able to manage collaboration specificities. MISE project (Mediation Information System Engineering) provides a model-driven engineering approach dedicated to design a Mediation Information System (MIS) which supports this collaboration. Two steps are involved in the MIS design : generation of business processes from the description of the collaborative situation (abstract level) and transformation of these process models into an executable system (concrete level). This PhD thesis takes interest in the second level trying to match those business models with available technical services, thanks to knowledge based technologies. First, we studied our semantic needs and existing methods of semantic annotation for models from both business and technical levels. We chose SAWSDL and WSMOLite standards for service annotations whereas we provided a new semantic annotation mechanism for business processes (called SABPMN), in the absence of existing standard. Added semantic information is then used during the business processes to executable workflows transformation. This transformation is performed in three steps : (i) for each activity involved in business processes we search for technical services which fit our business needs thanks to our service selection and composition mechanisms ; (ii) we generate for each selected service the required data transformation to ensure correct communication with other components ; (iii) once this information validated by user, we generate technical files expected by the collaborative platform to execute those processes. Those results are in line with the FUI ISTA3 project (3rd generation of Interoperability for Aeronautics Sub-contracTors) which focuses on improving supply chain interoperability for aeronautics sub-contractors of Aerospace Valley in order to facilitate co-design. All proposed transformation and matchmaking mecanisms are implemented as open-source functional prototypes.TOULOUSE-INP (315552154) / SudocSudocFranceF

OpenGrey Repository

Réconciliation sémantique des données et des services mis en œuvre au sein d’une situation collaborative

Author: Boissel-Dallier Nicolas
Publication venue: INPT
Publication date: 20/11/2012
Field of study

La collaboration entre organisations est l’un des principaux enjeux de l’écosystème industriel actuel. L’établissement d’une telle collaboration doit être réactive, afin de saisir les différentes opportunités, et flexibles, pour pouvoir s’adapter aux changements dans la collaboration. Pour cela, ces collaborations doivent être supportées par un système d’information (SI) dédié, en charge de fournir l’interopérabilité entre les différents SI des partenaires et capable de gérer les spécificités de la collaboration. Le projet MISE (Mediation Information System Engineering) propose une approche dirigée par les modèles permettant à l’utilisateur de concevoir un Système d’Information de Médiation (SIM) adapté au support de cette collaboration. Deux étapes sont au coeur de la conception de ce SIM : la génération du processus métier collaboratif depuis une description de la situation (niveau abstrait) et sa transformation en un système exécutable (niveau concret). Ce manuscrit s’intéresse à cette seconde phase et tente, à l’aide de technologies basées sur la connaissance, de réconcilier ces modèles métiers avec les services techniques disponibles. Après une étude du besoin et des méthodes existantes d’apport sémantique pour les différents niveaux d’abstraction, nous faisons le choix de nous intéresser aux standards SAWSDL et WSMO-Lite au niveau des services et nous proposons un nouveau mécanisme d’annotation sémantique au niveau des processus métier (appelé SABPMN), faute de standard reconnu. Les informations sémantiques ajoutées aux modèles sont ensuite exploitées lors de la transformation des processus métier en workflows exécutables proposée ici. Cette transformation se déroule alors en trois phases : (i) on recherche pour les différentes activités métier du processus le ou les service(s) qui répond(ent) au besoin métier exprimé à l’aide de mécanismes de sélection et de composition de services ; (ii) on génère pour chaque service à invoquer la transformation de données nécessaire pour garantir une bonne communication avec les autres composants ; (iii) une fois ces informations validées par l’utilisateur, on génère les fichiers nécessaires à l’exécution de ce processus sur la plateforme collaborative. Les résultats de cette thèse s’inscrivent aussi au sein du projet FUI ISTA3 (Interopérabilité de 3ème génération pour les Sous-Traitants de l’Aéronautique) qui se propose d’améliorer l’interopérabilité de la chaine logistique des sous-traitants aéronautiques de l’Aerospace Valley afin de faciliter la co-conception. Une implémentation des différents mécanismes proposés a été réalisée et est disponible sous la forme d’un prototype fonctionnel open-source. ABSTRACT : Collaboration bewteen organisations is one of nowadays main stakes in industrial ecosystem. Establishment of such collaboration must be reactive, in order to take avantage of opportunities, and flexible, in order to adapt collaboration to context changes. In this view, such collaboration must be supported by a dedicated Information System (IS), responsible for ensuring interoperability between partner’s IS and able to manage collaboration specificities. MISE project (Mediation Information System Engineering) provides a model-driven engineering approach dedicated to design a Mediation Information System (MIS) which supports this collaboration. Two steps are involved in the MIS design : generation of business processes from the description of the collaborative situation (abstract level) and transformation of these process models into an executable system (concrete level). This PhD thesis takes interest in the second level trying to match those business models with available technical services, thanks to knowledge based technologies. First, we studied our semantic needs and existing methods of semantic annotation for models from both business and technical levels. We chose SAWSDL and WSMOLite standards for service annotations whereas we provided a new semantic annotation mechanism for business processes (called SABPMN), in the absence of existing standard. Added semantic information is then used during the business processes to executable workflows transformation. This transformation is performed in three steps : (i) for each activity involved in business processes we search for technical services which fit our business needs thanks to our service selection and composition mechanisms ; (ii) we generate for each selected service the required data transformation to ensure correct communication with other components ; (iii) once this information validated by user, we generate technical files expected by the collaborative platform to execute those processes. Those results are in line with the FUI ISTA3 project (3rd generation of Interoperability for Aeronautics Sub-contracTors) which focuses on improving supply chain interoperability for aeronautics sub-contractors of Aerospace Valley in order to facilitate co-design. All proposed transformation and matchmaking mecanisms are implemented as open-source functional prototypes

Thèses en Ligne

Open Archive Toulouse Archive Ouverte

Institut National Polytechnique de Toulouse (Theses)

Methods for Matching of Linked Open Social Science Data

Author: Zapilko Benjamin
Publication venue
Publication date: 01/01/2014
Field of study

In recent years, the concept of Linked Open Data (LOD), has gained popularity and acceptance across various communities and domains. Science politics and organizations claim that the potential of semantic technologies and data exposed in this manner may support and enhance research processes and infrastructures providing research information and services. In this thesis, we investigate whether these expectations can be met in the domain of the social sciences. In particular, we analyse and develop methods for matching social scientific data that is published as Linked Data, which we introduce as Linked Open Social Science Data. Based on expert interviews and a prototype application, we investigate the current consumption of LOD in the social sciences and its requirements. Following these insights, we first focus on the complete publication of Linked Open Social Science Data by extending and developing domain-specific ontologies for representing research communities, research data and thesauri. In the second part, methods for matching Linked Open Social Science Data are developed that address particular patterns and characteristics of the data typically used in social research. The results of this work contribute towards enabling a meaningful application of Linked Data in a scientific domain

MAnnheim DOCument Server