3,615 research outputs found
Incremental View Maintenance For Collection Programming
In the context of incremental view maintenance (IVM), delta query derivation
is an essential technique for speeding up the processing of large, dynamic
datasets. The goal is to generate delta queries that, given a small change in
the input, can update the materialized view more efficiently than via
recomputation. In this work we propose the first solution for the efficient
incrementalization of positive nested relational calculus (NRC+) on bags (with
integer multiplicities). More precisely, we model the cost of NRC+ operators
and classify queries as efficiently incrementalizable if their delta has a
strictly lower cost than full re-evaluation. Then, we identify IncNRC+; a large
fragment of NRC+ that is efficiently incrementalizable and we provide a
semantics-preserving translation that takes any NRC+ query to a collection of
IncNRC+ queries. Furthermore, we prove that incremental maintenance for NRC+ is
within the complexity class NC0 and we showcase how recursive IVM, a technique
that has provided significant speedups over traditional IVM in the case of flat
queries [25], can also be applied to IncNRC+.Comment: 24 pages (12 pages plus appendix
Special Libraries, May-June 1977
Volume 68, Issue 5-6https://scholarworks.sjsu.edu/sla_sl_1977/1004/thumbnail.jp
Grids and the Virtual Observatory
We consider several projects from astronomy that benefit from the Grid paradigm and
associated technology, many of which involve either massive datasets or the federation
of multiple datasets. We cover image computation (mosaicking, multi-wavelength
images, and synoptic surveys); database computation (representation through XML,
data mining, and visualization); and semantic interoperability (publishing, ontologies,
directories, and service descriptions)
Technical Report: CSVM Ecosystem
The CSVM format is derived from CSV format and allows the storage of tabular
like data with a limited but extensible amount of metadata. This approach could
help computer scientists because all information needed to uses subsequently
the data is included in the CSVM file and is particularly well suited for
handling RAW data in a lot of scientific fields and to be used as a canonical
format. The use of CSVM has shown that it greatly facilitates: the data
management independently of using databases; the data exchange; the integration
of RAW data in dataflows or calculation pipes; the search for best practices in
RAW data management. The efficiency of this format is closely related to its
plasticity: a generic frame is given for all kind of data and the CSVM parsers
don't make any interpretation of data types. This task is done by the
application layer, so it is possible to use same format and same parser codes
for a lot of purposes. In this document some implementation of CSVM format for
ten years and in different laboratories are presented. Some programming
examples are also shown: a Python toolkit for using the format, manipulating
and querying is available. A first specification of this format (CSVM-1) is now
defined, as well as some derivatives such as CSVM dictionaries used for data
interchange. CSVM is an Open Format and could be used as a support for Open
Data and long term conservation of RAW or unpublished data.Comment: 31 pages including 2p of Anne
Geoscience after IT: Part L. Adjusting the emerging information system to new technology
Coherent development depends on following widely used standards that respect our vast legacy of existing entries in the geoscience record. Middleware ensures that we see a coherent view from our desktops of diverse sources of information. Developments specific to managing the written word, map content, and structured data come together in shared metadata linking topics and information types
Office of Space Terrestrial Applications (OSTA)/Applications Data Service (ADS) data systems standards
Standards needed to interconnect applications data service pilots for data sharing were identified. Current pilot methodologies are assessed. Recommendations for future work are made. A preliminary set of requirements for guidelines and standards for catalogues, directories, and dictionaries was identified. The user was considered to be a scientist at a terminal. Existing and emerging national and international telecommunication standards were adopted where possible in view of new and unproven standards
Digital twin in industrial applications: how model-based systems engineering (MBSE) and asset administration shell (AAS) complement each other
In the development, production and usage of cyber-physical systems, the number of stakeholders, involved interfaces and volatile environmental conditions is constantly rising. In addition, use cases require more consideration of the entire system life cycle. This significantly increases the administration effort and forms a barrier for the digital transformation of industrial companies. While model-based systems engineering (MBSE) addresses internal challenges within the product development and Asset Administration Shell (AAS) addresses vendor independent information exchange and interoperability, both approaches need to be coupled to address today’s challenges. In this publication typical tasks within product development are discussed: “search the right information”, “integrate the right information” and “provide the right information”. It is shown how they were approached today, without the alignment of MBSE and AAS, what technological concepts exists to address the challenges and how the tasks are realized by an alignment of MBSE and AAS
- …