61,091 research outputs found
Design Of Specialized Storage for Heterogeneous Project Data
This article describes an approach for designing a storage system that will allow persistence of large number of heterogeneous entities, used to describe software development process in various process supporting tools (bug trackers, repos- itories, wiki engines, etc). Goal of this work is to extract core semantics essences from entities like issues, code commits, builds in continuous integration systems and others and persist them together in one indexable, integrated and searchable systems disregarding from which tool each of them came from. Taking into high level of variety in data shapes and forms, requirement to perform complicated cross-entities queries and potentially high volumes of data - designing a storage for such systems turns into a challenging task
Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources
Apache Calcite is a foundational software framework that provides query
processing, optimization, and query language support to many popular
open-source data processing systems such as Apache Hive, Apache Storm, Apache
Flink, Druid, and MapD. Calcite's architecture consists of a modular and
extensible query optimizer with hundreds of built-in optimization rules, a
query processor capable of processing a variety of query languages, an adapter
architecture designed for extensibility, and support for heterogeneous data
models and stores (relational, semi-structured, streaming, and geospatial).
This flexible, embeddable, and extensible architecture is what makes Calcite an
attractive choice for adoption in big-data frameworks. It is an active project
that continues to introduce support for the new types of data sources, query
languages, and approaches to query processing and optimization.Comment: SIGMOD'1
Checkpointing as a Service in Heterogeneous Cloud Environments
A non-invasive, cloud-agnostic approach is demonstrated for extending
existing cloud platforms to include checkpoint-restart capability. Most cloud
platforms currently rely on each application to provide its own fault
tolerance. A uniform mechanism within the cloud itself serves two purposes: (a)
direct support for long-running jobs, which would otherwise require a custom
fault-tolerant mechanism for each application; and (b) the administrative
capability to manage an over-subscribed cloud by temporarily swapping out jobs
when higher priority jobs arrive. An advantage of this uniform approach is that
it also supports parallel and distributed computations, over both TCP and
InfiniBand, thus allowing traditional HPC applications to take advantage of an
existing cloud infrastructure. Additionally, an integrated health-monitoring
mechanism detects when long-running jobs either fail or incur exceptionally low
performance, perhaps due to resource starvation, and proactively suspends the
job. The cloud-agnostic feature is demonstrated by applying the implementation
to two very different cloud platforms: Snooze and OpenStack. The use of a
cloud-agnostic architecture also enables, for the first time, migration of
applications from one cloud platform to another.Comment: 20 pages, 11 figures, appears in CCGrid, 201
Designing interoperable museum information systems
Museum collections are characterized by heterogeneity, since they usually host a plethora of objects of categories, while each of them requires different description policies and metadata standards. Moreover the museum records, which keep the history and evolution of the hosted collections, request proactive curation in order to preserve this rich and diverse information. In this paper,
the architecture of an innovative museum information system, as well as its implementation details is presented. In particular the requirements and the system architecture are presented along with the problems that were encountered. The main directions of the system design are (a) to increase interoperability levels and therefore assist proactive curation and (b) to enhance navigation by the usage of handheld devices. The first direction is satisfied by the design of a rich metadata schema based on the CIDOC/CRM standard. The second direction is fulfilled by the implementation of a module, which integrates the museum database with a subsystem appropriate to support user navigation into the museum floors and rooms. The module is expressed as a navigation functionality, which is accessed through handheld devices and peripherals, such as PDAs and RFID tags. The proposed system is functional and operates into the Solomos Museum, situated in Zakynthos island, Greece
Digitalization and Innovation
Developments in digital technology offer new opportunities to design new products and services. However, creating such digitalized products and services often creates new problems and challenges to firms that are trying to innovate. In this essay, we analyze the impact of digitalization of products and services on innovations. In particular, we argue that digitalization of products will lead to an emergence of new layered product architecture. The layered architecture is characterized by its generative design rules that connect loosely coupled heterogeneous layers. It is pregnant with the potential of unbounded innovations. The new product architecture will require organizations to adopt a new organizing logic of innovation that we dubbed as doubly distributed innovation network. Based on this analysis, we propose five key issues that future researchers need to explore.innovation, innovation, product architecture, design rules
- âŠ