432 research outputs found

    A Survey of the State of Dataspaces

    Get PDF
    Published in International Journal of Computer and Information Technology.This paper presents a survey of the state of dataspaces. With dataspaces becoming the modern technique of systems integration, the achievement of complete dataspace development is a critical issue. This has led to the design and implementation of dataspace systems using various approaches. Dataspaces are data integration approaches that target for data coexistence in the spatial domain. Unlike traditional data integration techniques, they do not require up front semantic integration of data. In this paper, we outline and compare the properties and implementations of dataspaces including the approaches of optimizing dataspace development. We finally present actual dataspace development recommendations to provide a global overview of this significant research topic.This paper presents a survey of the state of dataspaces . With dataspaces becoming the modern technique of systems integration, the ach ievement of complete dataspace development is a critical issue. This has led to the design and implementation of dataspace systems using various approaches. Dataspaces are data integration approaches that target for data coexistence in the spatial domain. Unlike traditional data integration techniques, they do not require up front semantic integration of data. In this paper, we outline and compare the properties and implementations of dataspaces including the approaches of optimizing dataspace development. We finally present actual dataspace development recommendations to provide a global overview of this significant research topic

    Cost-based Optimization of Multistore Query Plans

    Get PDF
    Multistores are data management systems that enable query processing across different and heterogeneous databases; besides the distribution of data, complexity factors like schema heterogeneity and data replication must be resolved through integration and data fusion activities. Our multistore solution relies on a dataspace to provide the user with an integrated view of the available data and enables the formulation and execution of GPSJ queries. In this paper, we propose a technique to optimize the execution of GPSJ queries by formulating and evaluating different execution plans on the multistore. In particular, we outline different strategies to carry out joins and data fusion by relying on different schema representations; then, a self-learning black-box cost model is used to estimate execution times and select the most efficient plan. The experiments assess the effectiveness of the cost model in choosing the best execution plan for the given queries and exploit multiple multistore benchmarks to investigate the factors that influence the performance of different plans

    In-House Digital Libraries for Law Firms

    Get PDF
    In the current legal research and knowledge management environment, digital libraries and in-house knowledge management systems are not only a growing field but also are a necessity for a law firm’s continued survival. For a law firm to compete in the present world, a knowledge management system, which maintains a usable and searchable database of the firms work product and shared knowledge, is an absolute must to a collaborative work environment and client relations. This paper will focus on creating and maintaining such knowledge management databases, i.e. digital libraries, in law firms. First, this paper will define digital libraries. Second, it will discuss the history of digital libraries in general. Third it will discuss the current trends in knowledge management of law firms. Fourth, it will shift into a discussion of various database platform software options and what is necessary for selecting one, with a focus on selecting a vendor maintained software versus on open-source one. Fifth the paper will look at licensing and copyright concerns. Sixth, it will discuss metadata principles. Seventh, this paper will look at the problem of updating information. Lastly, this paper will discuss a land use database created using the principles studied for this paper

    Architecture Design Options for Federated Data Spaces

    Get PDF
    The massive growth of data and the increasing potential of data analytics in industrial production fuel the emergence of data spaces and corresponding platforms that realize data ecosystems and enable data-driven sustainability applications. To leverage their benefits of demand-driven and scalable data integration, the stakeholders of emerging data space initiatives must make informed decisions about their data space support platforms (DSSPs). This study proposes a conceptual framework based on federated architectures and by considering existing endeavors of data infrastructures. Based on existing literature about data ecosystem resources and an explorative single case study of an industrial data space with sustainability-focused applications, we elaborate on the key design options of data, services, and computing infrastructures. The resulting conceptual framework guides design decisions for DSSPs. The framework captures not only the resources involved but also the operational concepts of federated services and shared services to introduce governance mechanisms and sustainability policies

    The role of linked data and the semantic web in building operation

    Get PDF
    Effective Decision Support Systems (DSS) for building service managers require adequate performance data from many building data silos in order to deliver a complete view of building performance. Current performance analysis techniques tend to focus on a limited number of data sources, such as BMS measured data (temperature, humidity, C02), excluding a wealth of other data sources increasingly available in the modern building, including weather data, occupant feedback, mobile sensors & feedback systems, schedule information, equipment usage information. This paper investigates the potential for using Linked Data and Semantic Web technologies to improve interoperability across AEC domains, overcoming many of the roadblocks hindering information transfer currently

    Data Spaces

    Get PDF
    This open access book aims to educate data space designers to understand what is required to create a successful data space. It explores cutting-edge theory, technologies, methodologies, and best practices for data spaces for both industrial and personal data and provides the reader with a basis for understanding the design, deployment, and future directions of data spaces. The book captures the early lessons and experience in creating data spaces. It arranges these contributions into three parts covering design, deployment, and future directions respectively. The first part explores the design space of data spaces. The single chapters detail the organisational design for data spaces, data platforms, data governance federated learning, personal data sharing, data marketplaces, and hybrid artificial intelligence for data spaces. The second part describes the use of data spaces within real-world deployments. Its chapters are co-authored with industry experts and include case studies of data spaces in sectors including industry 4.0, food safety, FinTech, health care, and energy. The third and final part details future directions for data spaces, including challenges and opportunities for common European data spaces and privacy-preserving techniques for trustworthy data sharing. The book is of interest to two primary audiences: first, researchers interested in data management and data sharing, and second, practitioners and industry experts engaged in data-driven systems where the sharing and exchange of data within an ecosystem are critical

    Data Spaces

    Get PDF
    This open access book aims to educate data space designers to understand what is required to create a successful data space. It explores cutting-edge theory, technologies, methodologies, and best practices for data spaces for both industrial and personal data and provides the reader with a basis for understanding the design, deployment, and future directions of data spaces. The book captures the early lessons and experience in creating data spaces. It arranges these contributions into three parts covering design, deployment, and future directions respectively. The first part explores the design space of data spaces. The single chapters detail the organisational design for data spaces, data platforms, data governance federated learning, personal data sharing, data marketplaces, and hybrid artificial intelligence for data spaces. The second part describes the use of data spaces within real-world deployments. Its chapters are co-authored with industry experts and include case studies of data spaces in sectors including industry 4.0, food safety, FinTech, health care, and energy. The third and final part details future directions for data spaces, including challenges and opportunities for common European data spaces and privacy-preserving techniques for trustworthy data sharing. The book is of interest to two primary audiences: first, researchers interested in data management and data sharing, and second, practitioners and industry experts engaged in data-driven systems where the sharing and exchange of data within an ecosystem are critical

    Opening Urban Mirror Worlds: Possibilities for Participation in Digital Urban Dataspaces

    Get PDF

    Operating System Support for Redundant Multithreading

    Get PDF
    Failing hardware is a fact and trends in microprocessor design indicate that the fraction of hardware suffering from permanent and transient faults will continue to increase in future chip generations. Researchers proposed various solutions to this issue with different downsides: Specialized hardware components make hardware more expensive in production and consume additional energy at runtime. Fault-tolerant algorithms and libraries enforce specific programming models on the developer. Compiler-based fault tolerance requires the source code for all applications to be available for recompilation. In this thesis I present ASTEROID, an operating system architecture that integrates applications with different reliability needs. ASTEROID is built on top of the L4/Fiasco.OC microkernel and extends the system with Romain, an operating system service that transparently replicates user applications. Romain supports single- and multi-threaded applications without requiring access to the application's source code. Romain replicates applications and their resources completely and thereby does not rely on hardware extensions, such as ECC-protected memory. In my thesis I describe how to efficiently implement replication as a form of redundant multithreading in software. I develop mechanisms to manage replica resources and to make multi-threaded programs behave deterministically for replication. I furthermore present an approach to handle applications that use shared-memory channels with other programs. My evaluation shows that Romain provides 100% error detection and more than 99.6% error correction for single-bit flips in memory and general-purpose registers. At the same time, Romain's execution time overhead is below 14% for single-threaded applications running in triple-modular redundant mode. The last part of my thesis acknowledges that software-implemented fault tolerance methods often rely on the correct functioning of a certain set of hardware and software components, the Reliable Computing Base (RCB). I introduce the concept of the RCB and discuss what constitutes the RCB of the ASTEROID system and other fault tolerance mechanisms. Thereafter I show three case studies that evaluate approaches to protecting RCB components and thereby aim to achieve a software stack that is fully protected against hardware errors
    • …
    corecore