3,615 research outputs found

    Incremental View Maintenance For Collection Programming

    In the context of incremental view maintenance (IVM), delta query derivation is an essential technique for speeding up the processing of large, dynamic datasets. The goal is to generate delta queries that, given a small change in the input, can update the materialized view more efficiently than via recomputation. In this work we propose the first solution for the efficient incrementalization of positive nested relational calculus (NRC+) on bags (with integer multiplicities). More precisely, we model the cost of NRC+ operators and classify queries as efficiently incrementalizable if their delta has a strictly lower cost than full re-evaluation. Then, we identify IncNRC+, a large fragment of NRC+ that is efficiently incrementalizable, and we provide a semantics-preserving translation that takes any NRC+ query to a collection of IncNRC+ queries. Furthermore, we prove that incremental maintenance for NRC+ is within the complexity class NC0, and we showcase how recursive IVM, a technique that has provided significant speedups over traditional IVM in the case of flat queries [25], can also be applied to IncNRC+. Comment: 24 pages (12 pages plus appendix
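
    The delta-query idea in the abstract above can be illustrated with a minimal sketch. It is not the paper's NRC+ translation: it assumes flat bags represented as Python dicts from items to integer multiplicities (negative values model deletions), and the helper names `bag_add`, `join`, and `delta_join_wrt_r` are hypothetical. Because a join is linear in each argument, the delta of `join(r, s)` for a change `delta_r` is simply `join(delta_r, s)`, which touches only the changed tuples.

```python
from collections import defaultdict


def bag_add(a, b):
    """Bag union with integer multiplicities; entries that cancel to 0 are dropped."""
    out = defaultdict(int)
    for bag in (a, b):
        for item, mult in bag.items():
            out[item] += mult
    return {item: mult for item, mult in out.items() if mult != 0}


def join(r, s):
    """Join two bags of (key, value) pairs on key; multiplicities multiply."""
    out = defaultdict(int)
    for (k1, v1), m1 in r.items():
        for (k2, v2), m2 in s.items():
            if k1 == k2:
                out[(k1, v1, v2)] += m1 * m2
    return {item: mult for item, mult in out.items() if mult != 0}


def delta_join_wrt_r(delta_r, s):
    """Delta query for join(r, s) under a change delta_r to r (s unchanged)."""
    return join(delta_r, s)


# Base bags and the materialized view.
r = {("a", 1): 1, ("b", 2): 2}
s = {("a", "x"): 1, ("b", "y"): 1}
view = join(r, s)

# A small update: insert ("a", 3) once, delete one copy of ("b", 2).
delta_r = {("a", 3): 1, ("b", 2): -1}

# Incremental maintenance: evaluate the delta query and merge it into the view.
view_incremental = bag_add(view, delta_join_wrt_r(delta_r, s))

# Full recomputation, for comparison only.
view_recomputed = join(bag_add(r, delta_r), s)

assert view_incremental == view_recomputed
```

    The delta query here scans only `delta_r` against `s`, whereas recomputation scans all of the updated `r`; the abstract's cost model makes this kind of comparison precise for nested collections.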

    Special Libraries, May-June 1977

    Volume 68, Issue 5-6 https://scholarworks.sjsu.edu/sla_sl_1977/1004/thumbnail.jp

    Grids and the Virtual Observatory

    We consider several projects from astronomy that benefit from the Grid paradigm and associated technology, many of which involve either massive datasets or the federation of multiple datasets. We cover image computation (mosaicking, multi-wavelength images, and synoptic surveys); database computation (representation through XML, data mining, and visualization); and semantic interoperability (publishing, ontologies, directories, and service descriptions).

    Technical Report: CSVM Ecosystem

    The CSVM format is derived from the CSV format and allows the storage of tabular-like data with a limited but extensible amount of metadata. This approach could help computer scientists because all the information needed to use the data later is included in the CSVM file. It is particularly well suited to handling RAW data in many scientific fields and to serving as a canonical format. Experience with CSVM has shown that it greatly facilitates data management independently of databases, data exchange, the integration of RAW data into dataflows or calculation pipelines, and the search for best practices in RAW data management. The efficiency of this format is closely related to its plasticity: a generic frame is given for all kinds of data, and the CSVM parsers do not interpret data types. That task is left to the application layer, so the same format and the same parser code can be used for many purposes. This document presents some implementations of the CSVM format made over ten years in different laboratories. Some programming examples are also shown: a Python toolkit for using, manipulating, and querying the format is available. A first specification of this format (CSVM-1) is now defined, as well as some derivatives such as CSVM dictionaries used for data interchange. CSVM is an Open Format and could be used as a support for Open Data and long-term conservation of RAW or unpublished data. Comment: 31 pages including 2p of Anne

    Geoscience after IT: Part L. Adjusting the emerging information system to new technology

    Coherent development depends on following widely used standards that respect our vast legacy of existing entries in the geoscience record. Middleware ensures that we see a coherent view from our desktops of diverse sources of information. Developments specific to managing the written word, map content, and structured data come together in shared metadata linking topics and information types.

    Office of Space Terrestrial Applications (OSTA)/Applications Data Service (ADS) data systems standards

    Standards needed to interconnect applications data service pilots for data sharing were identified. Current pilot methodologies are assessed. Recommendations for future work are made. A preliminary set of requirements for guidelines and standards for catalogues, directories, and dictionaries was identified. The user was considered to be a scientist at a terminal. Existing and emerging national and international telecommunication standards were adopted where possible in view of new and unproven standards

    Digital twin in industrial applications: how model-based systems engineering (MBSE) and asset administration shell (AAS) complement each other

    In the development, production and usage of cyber-physical systems, the number of stakeholders, involved interfaces and volatile environmental conditions is constantly rising. In addition, use cases require more consideration of the entire system life cycle. This significantly increases the administration effort and forms a barrier to the digital transformation of industrial companies. While model-based systems engineering (MBSE) addresses internal challenges within product development and the Asset Administration Shell (AAS) addresses vendor-independent information exchange and interoperability, both approaches need to be coupled to address today’s challenges. In this publication, typical tasks within product development are discussed: “search the right information”, “integrate the right information” and “provide the right information”. It is shown how they are approached today, without the alignment of MBSE and AAS, what technological concepts exist to address the challenges, and how the tasks are realized by an alignment of MBSE and AAS.