
    Ontology-Based Data Access and Integration

    An ontology-based data integration (OBDI) system is an information management system consisting of three components: an ontology, a set of data sources, and the mapping between the two. The ontology is a conceptual, formal description of the domain of interest to a given organization (or a community of users), expressed in terms of relevant concepts, attributes of concepts, relationships between concepts, and logical assertions characterizing the domain knowledge. The data sources are the repositories accessible by the organization where data concerning the domain are stored. In the general case, such repositories are numerous and heterogeneous, each one managed and maintained independently of the others. The mapping is a precise specification of the correspondence between the data contained in the data sources and the elements of the ontology. The main purpose of an OBDI system is to allow information consumers to query the data using the elements of the ontology as predicates. In the special case where the organization manages a single data source, the term ontology-based data access (OBDA) system is used.
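
    As a rough illustration of these three components, the following Python sketch models a toy OBDI system; the ontology concept, the source contents, and the mapping functions are all invented for the example and do not come from the paper.

        # Toy OBDI system: ontology vocabulary, data sources, and a mapping.
        # All concept, source, and field names here are illustrative.

        # 1. Ontology: one concept with its attributes (a real system
        #    would use an OWL/RDFS ontology, not a dict).
        ONTOLOGY = {"Employee": ["name", "department"]}

        # 2. Data sources: heterogeneous, independently maintained stores.
        hr_db = [("Ada Lovelace", "R&D"), ("Alan Turing", "R&D")]
        payroll_csv = [{"emp": "Grace Hopper", "dept": "Engineering"}]

        # 3. Mapping: how each source populates the ontology concept.
        def map_hr(row):
            return {"name": row[0], "department": row[1]}

        def map_payroll(rec):
            return {"name": rec["emp"], "department": rec["dept"]}

        MAPPINGS = {"Employee": [(hr_db, map_hr), (payroll_csv, map_payroll)]}

        def query(concept):
            """Answer an ontology-level query: all instances of a concept,
            regardless of which source physically holds the data."""
            for source, mapper in MAPPINGS[concept]:
                for item in source:
                    yield mapper(item)

        for employee in query("Employee"):
            print(employee)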

    Using Ontologies for Semantic Data Integration

    While big data analytics is considered one of the most important paths to competitive advantage for today's enterprises, data scientists spend a comparatively large amount of time in the data preparation and data integration phases of a big data project. This shows that data integration is still a major challenge in IT applications. Over the past two decades, the idea of using semantics for data integration has become increasingly crucial and has received much attention in the AI, database, web, and data mining communities. Here, we focus on a specific paradigm for semantic data integration, called Ontology-Based Data Access (OBDA). The goal of this paper is to provide an overview of OBDA, pointing out both the techniques that are at the basis of the paradigm and the main challenges that remain to be addressed.
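
    One technique at the basis of OBDA is query rewriting: a query expressed over ontology terms is expanded using the ontology's axioms and then translated, through the mappings, into queries over the underlying sources. The Python sketch below illustrates the idea with a single subclass axiom; the class names, mappings, and SQL are invented for the example, and real systems rewrite into the sources' native query languages with full ontology reasoning.

        # Sketch of OBDA-style query rewriting (illustrative only): a query
        # over an ontology class is expanded using subclass axioms, then each
        # class in the expansion is replaced by the SQL its mapping assigns.

        # Ontology axiom: every Manager is an Employee (Manager is a
        # subclass of Employee).
        SUBCLASSES = {"Employee": ["Manager"]}

        # Mappings: each ontology class corresponds to a query over a source.
        MAPPINGS = {
            "Employee": "SELECT name FROM staff",
            "Manager":  "SELECT name FROM mgmt_roster",
        }

        def rewrite(target_class):
            """Rewrite an ontology-level query into a union of source queries."""
            classes = [target_class] + SUBCLASSES.get(target_class, [])
            return "\nUNION\n".join(MAPPINGS[c] for c in classes)

        print(rewrite("Employee"))
        # SELECT name FROM staff
        # UNION
        # SELECT name FROM mgmt_roster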

    Lightweight Data Integration Frameworks for Clinical Research

    Research data from a single clinical study are often spread across multiple applications and systems. We present a reusable, lightweight, secure framework for automatically integrating and querying study data from heterogeneous sources in order to answer routine, operational questions for researchers.
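
    The abstract does not detail the framework itself, but the kind of routine operational question it targets can be illustrated with a generic sketch; the sources, field names, and join key below are hypothetical.

        # Generic sketch: two applications hold pieces of one study's data,
        # joined on a shared participant identifier (all data invented).
        enrollment = [{"pid": "P01", "site": "A"}, {"pid": "P02", "site": "B"}]
        lab_system = [{"patient": "P01", "hgb": 13.2}]

        def integrate():
            """Join records from both sources on the participant ID."""
            labs = {r["patient"]: r for r in lab_system}
            for e in enrollment:
                yield {**e, "labs": labs.get(e["pid"])}

        # A routine operational question: who is enrolled but missing labs?
        missing = [r["pid"] for r in integrate() if r["labs"] is None]
        print("Participants missing labs:", missing)  # ['P02']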

    Taming Data Explosion in Probabilistic Information Integration

    Data integration has been a challenging problem for decades. In an ambient environment, where many autonomous devices have their own information sources and network connectivity is ad hoc and peer-to-peer, it becomes a serious bottleneck. To enable devices to exchange information without user interaction at data integration time and without extensive semantic annotations, a probabilistic approach seems promising: it teaches the device how to cope with the uncertainty that occurs during data integration. Unfortunately, without any world knowledge, almost everything becomes uncertain, so maintaining all possibilities produces huge integrated information sources. In this paper, we claim that a few very simple and generic rules provide enough world knowledge to drastically reduce the amount of uncertainty and thus tame the data explosion to a manageable size.
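
    To make the claim concrete, the sketch below enumerates the possible worlds arising from two conflicting sources and shows how a single generic rule prunes them; the data and the rule are invented for the example.

        from itertools import product

        # Two devices report conflicting attribute values for the same
        # person; without world knowledge, every combination of candidate
        # values is a possible world that must be maintained.
        candidates = {
            "birth_year": [1985, 2150],        # one value is impossible
            "country":    ["NL", "Netherlands"],
        }

        worlds = list(product(*candidates.values()))
        print(len(worlds), "worlds before pruning")   # 4

        # One simple, generic rule halves the space in this example.
        def plausible(world):
            birth_year, _ = world
            return birth_year <= 2024    # nobody is born in the future

        pruned = [w for w in worlds if plausible(w)]
        print(len(pruned), "worlds after pruning")    # 2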

    UK utility data integration: overcoming schematic heterogeneity

    In this paper we discuss the syntactic, semantic and schematic issues which inhibit the integration of utility data in the UK. We then focus on the techniques employed within the VISTA project to overcome schematic heterogeneity, using a Global Schema based architecture. Although automated approaches to Global Schema definition were attempted, the heterogeneities of the sector proved too great, so a manual approach was employed. The techniques used to define this schema, and subsequently to map source utility data models to it, are discussed in detail. To ensure a coherent integrated model, sub-domain and cross-domain validation issues are then highlighted. Finally, the proposed framework and data flow for schematic integration are introduced.
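
    As a generic illustration of the manual mapping approach (the VISTA schema and the real utility data models are not reproduced here; all field names are invented), the sketch below maps two heterogeneous source models into one global schema and applies a simple validation check.

        # Hand-written mapping rules from each utility's source model into
        # a shared global schema, plus a validation step before acceptance.
        GLOBAL_FIELDS = ("asset_id", "utility", "depth_m", "material")

        def map_water(rec):      # water company's source model
            return {"asset_id": rec["PipeRef"], "utility": "water",
                    "depth_m": rec["CoverDepth"], "material": rec["Mat"]}

        def map_electric(rec):   # electricity company's source model
            return {"asset_id": rec["CABLE_ID"], "utility": "electricity",
                    "depth_m": rec["DEPTH_MM"] / 1000.0, "material": "n/a"}

        def validate(rec):
            """Sanity check on a mapped record before integration."""
            return set(rec) == set(GLOBAL_FIELDS) and rec["depth_m"] >= 0

        integrated = [
            map_water({"PipeRef": "W1", "CoverDepth": 0.9, "Mat": "PE"}),
            map_electric({"CABLE_ID": "E7", "DEPTH_MM": 450}),
        ]
        print([r for r in integrated if validate(r)])

    Note how the mapping also absorbs unit heterogeneity (millimetres versus metres), one form of the schematic differences the paper describes.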

    Integration of environmental data in BIM tool & linked building data

    Environmental assessment is critical to ensuring building sustainability. To enhance the sustainability of a building, the actors involved should be able to access and share not only information about the building but also data about its products, and especially their environmental assessments. Among the several approaches that have been proposed to achieve this, semantic web technologies stand out for their capacity to share data and enhance interoperability between highly heterogeneous systems. This paper presents the implementation of a method in which semantic web technologies, and particularly Linked Data, are combined with Building Information Modelling (BIM) tools to foster building sustainability by introducing products, together with their environmental assessments, into building data during the modelling phase. Based on Linked Building Data (LBD) vocabularies and environmental data, several ontologies have been generated in order to make both available as Resource Description Framework (RDF) graphs. A database access plugin has been developed and installed in a BIM tool. In that way, the LBD generated from the BIM tool contains, for each product, a reference to its environmental assessment, which is stored in a triplestore.
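
    A minimal sketch of the RDF-generation step, written with the rdflib Python library: BOT is the Linked Building Data topology vocabulary, but the element, product, assessment, and property IRIs are placeholders, not the ontologies actually generated in the paper.

        from rdflib import Graph, Namespace, Literal
        from rdflib.namespace import RDF

        BOT = Namespace("https://w3id.org/bot#")   # Building Topology Ontology
        EX = Namespace("http://example.org/")      # placeholder namespace

        g = Graph()
        wall = EX["wall-01"]
        product = EX["insulation-panel-42"]
        assessment = EX["epd-9876"]                # environmental declaration

        # Link a building element to a product, and the product to its
        # environmental assessment held in the triplestore.
        g.add((wall, RDF.type, BOT.Element))
        g.add((wall, EX.hasProduct, product))
        g.add((product, EX.hasEnvironmentalAssessment, assessment))
        g.add((assessment, EX.gwpKgCO2e, Literal(12.4)))

        print(g.serialize(format="turtle"))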

    Enhanced Data Integration for LabVIEW Laboratory Systems

    Integrating data is a basic concern in many accredited laboratories that perform a large variety of measurements. However, current working practice in engineering faculties does not focus much on this aspect. To address this challenge, we developed an educational platform that supports characterization of acquisition ensembles, generation of Web pages for lessons, and transformation of measured data into, and storage in, a common format. As we generally had to develop an individual parser for each instrument, we also added the possibility to integrate the LabVIEW workbench, which is often used for rapid development of applications in electrical engineering and automatic control. This paper describes how we configure the platform for specific equipment, i.e. how we model it, how we create the learning material, and how we integrate the results into a central database. It also presents a case study on collecting data from a LabVIEW-based thermocouple acquisition system, used by students in a laboratory course on measurement technologies and transducers.
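
    As a rough sketch of the parse-transform-store pipeline described above (the instrument output format, the common record shape, and the database schema are all invented here), the following Python code parses thermocouple readings into a shared format and stores them in a central SQLite table.

        import sqlite3

        def parse_thermocouple(line):
            """Parse one line of instrument output, e.g. '12:00:01,23.4',
            into the platform's common measurement format."""
            timestamp, value = line.strip().split(",")
            return {"instrument": "thermocouple", "t": timestamp,
                    "value": float(value), "unit": "degC"}

        raw = ["12:00:01,23.4", "12:00:02,23.6"]
        measurements = [parse_thermocouple(line) for line in raw]

        # Store every instrument's output in one shared central table.
        db = sqlite3.connect(":memory:")
        db.execute("CREATE TABLE measurement(instrument, t, value, unit)")
        db.executemany(
            "INSERT INTO measurement VALUES (:instrument, :t, :value, :unit)",
            measurements)
        print(db.execute("SELECT * FROM measurement").fetchall())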