
    Transferring Pareto Frontiers across Heterogeneous Hardware Environments

    Configurable software systems provide options that affect functional and non-functional system properties. A selection of options forms a system configuration with corresponding property values. Configurable software systems are now used ubiquitously by software developers, system administrators, and end users due to their inherent adaptability. However, this flexibility comes at a cost: the configuration space explodes exponentially as options are added, making exhaustive system analysis impossible. Nevertheless, many use cases require a deep understanding of a configurable system's behavior, especially if the system is redeployed across heterogeneous hardware. This leads to the following research questions: (1) Is it possible to transfer information about configurable software properties across a collection of heterogeneous hardware? (2) Is it possible to transfer information about the relative optimality of configurations across heterogeneous hardware? We address the first question by proposing an approach for transferring performance prediction models of configurable systems across heterogeneous hardware. The approach builds a predictor model to approximate performance on a source hardware environment, and a separate transferrer model to map the approximated performance values to a destination hardware environment. Experiments on three configurable software systems across 23 heterogeneous hardware environments demonstrated high accuracy (less than 10% error) for all studied combinations. We address the second question by proposing an approach for approximating and transferring Pareto frontiers of optimal configurations (based on multiple system properties) across heterogeneous hardware. This approach builds an individual predictor and transferrer for each analyzed system property. Using the trained models, we can build an approximated Pareto frontier on a source hardware environment and transfer this frontier to a destination environment. Experiments on five configurable systems across 34 heterogeneous hardware environments demonstrated the feasibility of the approach and showed that the accuracy of a transferred frontier depends mainly on the approximation rather than on the transferring process, while being linearly proportional to the training sample size.
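    The predictor/transferrer pipeline lends itself to a compact sketch. The following is a minimal illustration, assuming NumPy and scikit-learn; the data, model choices, and variable names are invented for demonstration and are not the paper's exact setup.

```python
# Minimal sketch of a predictor/transferrer pipeline (illustrative only).
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression

# X: binary option selections (configurations); y_src: measured performance
# on the source hardware; y_dst: measurements of the same configurations on
# the destination hardware (synthetic here, with a simulated hardware shift).
rng = np.random.default_rng(0)
X = rng.integers(0, 2, size=(200, 10)).astype(float)
y_src = X @ rng.uniform(1, 5, 10) + rng.normal(0, 0.1, 200)
y_dst = 1.7 * y_src + 3.0 + rng.normal(0, 0.1, 200)

# 1) Predictor: configuration -> performance on the source hardware.
predictor = RandomForestRegressor(n_estimators=50, random_state=0)
predictor.fit(X[:150], y_src[:150])

# 2) Transferrer: source performance -> destination performance,
#    trained on a small paired sample (here 20 configurations).
transferrer = LinearRegression()
transferrer.fit(y_src[:20].reshape(-1, 1), y_dst[:20])

# Transfer predicted performance of unseen configurations.
y_hat_src = predictor.predict(X[150:])
y_hat_dst = transferrer.predict(y_hat_src.reshape(-1, 1))
print("mean abs. error on destination:", np.mean(np.abs(y_hat_dst - y_dst[150:])))
```

    For the Pareto-frontier variant, one such predictor/transferrer pair would be trained per system property, and the transferred frontier computed from the per-property transferred values.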

    CARMA: Context-Aware Runtime Reconfiguration for Energy-Efficient Sensor Fusion

    Autonomous systems (AS) can adapt and change their behavior in response to unanticipated events; they include aerial drones, autonomous vehicles, and ground/aquatic robots. AS require a wide array of sensors, deep-learning models, and powerful hardware platforms to perceive and safely operate in real time. However, in many contexts, some sensing modalities degrade perception while increasing the system's overall energy consumption. Since AS are often energy-constrained edge devices, energy-efficient sensor fusion methods have been proposed; existing methods, however, either fail to adapt to changing scenario conditions or fail to optimize energy efficiency system-wide. We propose CARMA: a context-aware sensor fusion approach that uses context to dynamically reconfigure the computation flow on a Field-Programmable Gate Array (FPGA) at runtime. By clock-gating unused sensors and model sub-components, CARMA significantly reduces the energy used by a multi-sensory object detector without compromising performance. We use a Deep-learning Processor Unit (DPU) based reconfiguration approach to minimize the latency of model reconfiguration. We evaluate multiple context-identification strategies, propose a novel system-wide energy-performance joint optimization, and evaluate scenario-specific perception performance. Across challenging real-world sensing contexts, CARMA outperforms state-of-the-art methods with up to 1.3x speedup and 73% lower energy consumption.
    Comment: Accepted to be published in the 2023 ACM/IEEE International Symposium on Low Power Electronics and Design (ISLPED 2023).
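    The core idea, a context-driven lookup that gates inactive components, can be sketched in a few lines. The contexts, configurations, and gating action below are hypothetical placeholders, not CARMA's actual FPGA interface.

```python
# Illustrative sketch of a context-aware reconfiguration loop; all names
# (contexts, sensors, branches) are invented for demonstration.
from dataclasses import dataclass

@dataclass(frozen=True)
class Config:
    sensors: frozenset   # sensors kept active
    branches: frozenset  # model sub-components kept active

# Context -> configuration table, e.g. learned offline from an
# energy/performance joint optimization.
CONFIGS = {
    "day_clear": Config(frozenset({"camera"}), frozenset({"rgb_head"})),
    "night":     Config(frozenset({"radar", "camera"}),
                        frozenset({"rgb_head", "radar_head"})),
    "fog":       Config(frozenset({"radar"}), frozenset({"radar_head"})),
}

def apply_config(cfg, all_sensors, all_branches):
    """Gate (here: just report) every component not in the config."""
    for s in sorted(all_sensors - cfg.sensors):
        print(f"gating sensor {s}")
    for b in sorted(all_branches - cfg.branches):
        print(f"gating branch {b}")

def runtime_loop(context_stream):
    all_sensors = {"camera", "radar"}
    all_branches = {"rgb_head", "radar_head"}
    current = None
    for ctx in context_stream:
        cfg = CONFIGS.get(ctx)
        if cfg is not None and cfg != current:  # reconfigure only on change
            apply_config(cfg, all_sensors, all_branches)
            current = cfg

runtime_loop(["day_clear", "day_clear", "fog", "night"])
```

    Reconfiguring only when the identified context changes keeps reconfiguration latency off the common path, which is the role the DPU-based approach plays in the paper.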

    System Abstractions for Scalable Application Development at the Edge

    Recent years have witnessed an explosive growth of Internet of Things (IoT) devices, which collect or generate huge amounts of data. Given diverse device capabilities and application requirements, data processing takes place across a range of settings, from on-device to a nearby edge server/cloud and a remote cloud. Consequently, edge-cloud coordination has been studied extensively from the perspectives of job placement, scheduling, and joint optimization. Typical approaches focus on performance optimization for individual applications. This not only requires domain knowledge of the applications but also leads to application-specific solutions; application development and deployment over diverse scenarios thus incur repetitive manual effort. There are two overarching challenges in providing system-level support for application development at the edge. First, there is inherent heterogeneity at the device hardware level. Execution settings range from a small cluster serving as an edge cloud to on-device inference on embedded devices, differing in hardware capability and programming environments. Further, application performance requirements vary significantly, making it even more difficult to map different applications to already heterogeneous hardware. Second, there are trends toward incorporating edge and cloud and multi-modal data. Together, these add further dimensions to the design space and increase complexity significantly. In this thesis, we propose a novel framework to simplify application development and deployment over a continuum of edge to cloud. Our framework provides key connections between different dimensions of design considerations, corresponding to the application abstraction, the data abstraction, and the resource management abstraction, respectively. First, our framework masks hardware heterogeneity with abstract resource types through containerization, and abstracts application processing pipelines into generic flow graphs. It also supports a notion of degradable computing for application scenarios at the edge that are driven by multimodal sensory input. Next, as video analytics is the killer app of edge computing, we include a generic data management service between video query systems and a video store to organize video data at the edge. We propose a video data unit abstraction based on a notion of distance between objects in the video, quantifying the semantic similarity among video data. Last, considering concurrent application execution, our framework supports multi-application offloading with device-centric control, with a userspace scheduler service that wraps over the operating system scheduler.
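    The combination of a generic flow graph with degradable computing can be pictured as stages that each carry a full and a cheaper fallback implementation, selected against a resource budget. The following sketch is an invented simplification of that idea; the stage names, cost model, and greedy policy are assumptions, not the thesis's design.

```python
# Hand-wavy sketch of a flow graph with degradable stages (illustrative).
from dataclasses import dataclass
from typing import Any, Callable

@dataclass
class Stage:
    name: str
    full: Callable[[Any], Any]      # preferred implementation
    degraded: Callable[[Any], Any]  # cheaper fallback
    cost_full: float                # abstract resource cost
    cost_degraded: float

def run_pipeline(stages, data, budget):
    """Execute stages in order, degrading greedily when the budget is tight."""
    for st in stages:
        if budget >= st.cost_full:
            data, budget = st.full(data), budget - st.cost_full
        else:
            data, budget = st.degraded(data), budget - st.cost_degraded
    return data

pipeline = [
    Stage("decode", lambda x: x,              lambda x: x,            1.0, 1.0),
    Stage("detect", lambda x: f"hq({x})",     lambda x: f"lq({x})",   5.0, 2.0),
    Stage("track",  lambda x: f"track({x})",  lambda x: f"skip({x})", 3.0, 0.5),
]
print(run_pipeline(pipeline, "frame", budget=6.0))
```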

    Autonomy and Intelligence in the Computing Continuum: Challenges, Enablers, and Future Directions for Orchestration

    Future AI applications require performance, reliability, and privacy that existing, cloud-dependent system architectures cannot provide. In this article, we study orchestration in the device-edge-cloud continuum and focus on AI for edge, that is, the AI methods used in resource orchestration. We claim that to support the constantly growing requirements of intelligent applications in the device-edge-cloud computing continuum, resource orchestration needs to embrace edge AI and emphasize local autonomy and intelligence. To justify the claim, we provide a general definition of continuum orchestration and examine how current and emerging orchestration paradigms suit the computing continuum. We describe major emerging research themes that may affect future orchestration and provide an early vision of an orchestration paradigm that embraces those themes. Finally, we survey current key edge AI methods and consider how they may contribute to fulfilling the vision of future continuum orchestration.
    Comment: 50 pages, 8 figures (revised content in all sections, added figures and a new section).

    Toward Energy Autonomy in Heterogeneous Modular Plant-Inspired Robots through Artificial Evolution

    Contemporary robots perform energy-intensive tasks, e.g., manipulation and locomotion, making the development of energy-autonomous robots challenging. Since plants are the primary energy producers in natural ecosystems, we took plants as a source of inspiration for designing our robotics platform. This led us to investigate energy autonomy in robots through the use of solar panels. As plants move slowly compared to other large terrestrial organisms, plant-inspired robots may enable robotic applications, such as long-term monitoring and exploration, where energy consumption must be minimized. Since it is difficult to manually design robotic systems that achieve full energy autonomy, we utilize evolutionary algorithms to automate the design and evaluation of energy-harvesting robots. We demonstrate how artificial evolution can lead to the design and control of a modular plant-like robot. Robotic phenotypes were acquired by implementing an evolutionary algorithm, a generative encoding, and modular building blocks in a simulation environment. The generative encoding is based on a context-sensitive Lindenmayer system (L-system), and the evolutionary algorithm optimizes compositions of heterogeneous modular building blocks in the simulation environment. Phenotypes evolved in simulation are in turn transferred to a physical robot platform. The platform consists of five types of modules: (1) a base module, (2) a cube module, (3) servo modules, and (4, 5) two types of solar panel modules used to harvest energy. The control system for the platform is initially evolved in the simulation environment and afterward transferred to an actual physical robot. A set of experiments shows the relationship between energy cost and the amount of light tracking that evolved in simulation. The reconfigurable modular robots are ultimately used to harvest light, with the possibility of being reconfigured based on the needs of the designer, the types of usable modules, and/or the optimal configuration derived from the simulation environment. Long-term energy autonomy has not been tested on this robotics platform; however, we believe the platform can serve as a stepping stone toward full energy autonomy in modular robots.
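    To make the generative encoding concrete, here is a toy L-system expansion that maps grammar symbols to module types. For brevity the sketch is context-free, whereas the abstract describes a context-sensitive variant; the rules and symbol-to-module mapping are invented, and in the actual work an evolutionary algorithm would mutate and select such rule sets.

```python
# Toy L-system expansion in the spirit of the generative encoding above.
# Symbols map to module types: B=base, S=servo, C=cube, P=solar panel.
RULES = {
    "B": ["B", "S", "P"],  # base spawns a servo and a solar panel
    "S": ["S", "C"],       # servo grows a cube
    "C": ["C"],            # cube is terminal
    "P": ["P"],            # solar panel is terminal
}

def expand(axiom, iterations):
    """Rewrite every symbol in parallel for a fixed number of iterations."""
    symbols = list(axiom)
    for _ in range(iterations):
        symbols = [out for s in symbols for out in RULES.get(s, [s])]
    return symbols

# The resulting symbol string is interpreted as a modular robot morphology.
print(expand("B", 3))
```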

    An energy-aware scheduling approach for resource-intensive jobs using smart mobile devices as resource providers

    The ever-growing adoption of smart mobile devices is a worldwide phenomenon that positions smartphones and tablets as primary devices for communication and Internet access. In addition, the computing capabilities of such devices, often underutilized by their owners, are continuously improving. Today, smart mobile devices have multi-core CPUs, several gigabytes of RAM, and the ability to communicate through several wireless networking technologies. These facts caught the attention of researchers, who have proposed to leverage the aggregated computing capabilities of smart mobile devices for running resource-intensive software. However, this idea is conditioned by key features, called singularities in the context of this thesis, that characterize resource provision with smart mobile devices: the ability of devices to change location (user mobility), the shared or non-dedicated nature of the provided resources (lack of ownership), and the limited operation time imposed by a finite energy source (exhaustible resources). Existing proposals materializing this idea differ in the combinations of singularities they target and in the way they address each singularity, which makes them suitable for distinct goals and resource exploitation opportunities. The latter are represented by real-life situations where resources provided by groups of smart mobile devices can be exploited, each characterized by a social context and a networking support used to link and coordinate devices. The behavior of people in a given social context configures a particular availability level of resources, while the underlying networking support imposes restrictions on how information flows, computational tasks are distributed, and results are collected. The networking support constitutes one fundamental difference between proposals, mainly because each option (i.e., ad hoc and infrastructure-based) has its own application scenarios. Aside from the singularities addressed and the networking support utilized, the weakest point of most proposals is their practical applicability: the performance achieved relies heavily on the accuracy with which task information, including execution time and/or energy required for execution, is provided to the resource allocator. The expanded usage of wireless communication infrastructure in public and private buildings, e.g., shopping malls, offices, and university campuses, constitutes a networking support that can naturally be reused for leveraging the computational capabilities of smart mobile devices. In this context, this thesis contributes an easy-to-implement scheduling approach for running CPU-bound applications on a cluster of smart mobile devices. The approach is aware of the finite nature of smart mobile device energy, and it does not depend on task information to operate. Instead, it allocates computational resources to incoming tasks using a node ranking-based strategy. The ranking weights nodes by combining static and dynamic parameters, including benchmark results, battery level, and number of queued tasks, among others. This node ranking-based task assignment, or first allocation phase, is complemented by a re-balancing phase using job stealing techniques. The second allocation phase counteracts the load imbalance provoked by the non-dedicated nature of smart mobile device CPU usage, i.e., the effect of owner interaction, task heterogeneity, and the lack of up-to-date, accurate estimations of remaining energy.
The scheduling approach is evaluated through in-vitro simulation. A novel simulator, which exploits energy consumption profiles of real smart mobile devices as well as fluctuating CPU usage built upon empirical models derived from real user interaction data, is another major contribution. Tests that validate the simulation tool are provided, and the approach is evaluated in scenarios that vary the composition and characteristics of nodes and tasks, including different task arrival rates, task requirements, and levels of node resource utilization.
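    The first allocation phase reduces to scoring candidate nodes and assigning each incoming task to the best-ranked one. The sketch below illustrates this; the attributes and weights are invented stand-ins, not the thesis's exact ranking formula.

```python
# Minimal sketch of node ranking-based task assignment (illustrative).
from dataclasses import dataclass

@dataclass
class Node:
    name: str
    benchmark: float  # static: normalized benchmark score, higher is faster
    battery: float    # dynamic: remaining charge in [0, 1]
    queued: int       # dynamic: tasks already assigned

def rank(node, w_bench=0.5, w_batt=0.3, w_queue=0.2):
    # Reward fast, well-charged nodes; penalize long queues.
    return (w_bench * node.benchmark
            + w_batt * node.battery
            - w_queue * node.queued)

def assign(task, nodes):
    best = max(nodes, key=rank)
    best.queued += 1
    return best

nodes = [Node("phone-a", 0.9, 0.4, 0), Node("tablet-b", 0.6, 0.9, 0)]
for t in range(5):
    print(f"task {t} -> {assign(t, nodes).name}")
```

    The second phase would sit on top of this: an idle node steals queued tasks from an overloaded one, compensating for ranking errors caused by owner interaction and stale battery estimates.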

    Robust Platelet Logistics Planning in Disaster Relief Operations Under Uncertainty: a Coordinated Approach

    Resource sharing, as a coordination mechanism, can mitigate disruptions in supply and changes in demand. It is particularly crucial for platelets because they have a short lifespan and must be transferred and allocated within a limited time to prevent waste or shortages. Thus, a coordinated model comprising a mixed vertical-horizontal structure for the logistics of platelets is proposed for disaster relief operations in the response phase. The aim of this research is to reduce the wastage and shortage of platelets, given their critical role in wound healing. We present a bi-objective location-allocation robust possibilistic programming model for designing a two-layer coordinated organization strategy for multi-type blood-derived platelets under demand uncertainty. Computational results, derived using a heuristic ε-constraint algorithm, are reported and discussed to show the applicability of the proposed model. The experimental results indicate that surpluses and shortages of platelets declined remarkably following the introduction of a coordinated disaster relief operation.
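    The ε-constraint method itself is simple to illustrate: optimize one objective while bounding the other, then sweep the bound to trace a Pareto front. The toy bi-objective LP below is an invented stand-in for the paper's shortage/wastage trade-off and uses SciPy's LP solver.

```python
# Toy sketch of the epsilon-constraint method on a bi-objective LP.
import numpy as np
from scipy.optimize import linprog

f1 = np.array([1.0, 2.0])  # first objective (e.g. a shortage proxy)
f2 = np.array([3.0, 1.0])  # second objective (e.g. a wastage proxy)

pareto = []
for eps in [6.0, 8.0, 10.0, 12.0]:
    # min f1 . x  s.t.  x1 + x2 >= 4  and  f2 . x <= eps,  x >= 0
    res = linprog(c=f1,
                  A_ub=np.array([[-1.0, -1.0], f2]),
                  b_ub=np.array([-4.0, eps]),
                  bounds=[(0, None), (0, None)])
    if res.success:
        pareto.append((res.fun, f2 @ res.x))

for p1, p2 in pareto:
    print(f"f1={p1:.2f}, f2={p2:.2f}")
```

    The "heuristic" qualifier in the paper refers to how the ε values and subproblems are handled at scale; the sweep shown here is only the textbook core of the method.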

    Accelerating Event Stream Processing in On- and Offline Systems

    Due to a growing number of data producers and their ever-increasing data volume, the ability to ingest, analyze, and store potentially never-ending streams of data is a mission-critical task in today's data processing landscape. A widespread form of data streams are event streams, which consist of continuously arriving notifications about real-world phenomena. For example, a temperature sensor naturally generates an event stream by periodically measuring the temperature and reporting it, along with the measurement time, whenever it changes substantially from the previous measurement. In this thesis, we consider two kinds of event stream processing: online and offline. Online refers to processing events solely in main memory as soon as they arrive, while offline means processing event data previously persisted to non-volatile storage. Both modes are supported by widely used scale-out general-purpose stream processing engines (SPEs) like Apache Flink or Spark Streaming. However, such engines suffer from two significant deficiencies that severely limit their processing performance. First, for offline processing, they load the entire stream from non-volatile secondary storage and replay all data items into the associated online engine in order of their original arrival. While this naturally ensures unified query semantics for on- and offline processing, the cost of reading the entire stream from non-volatile storage quickly dominates the overall processing cost. Second, modern SPEs focus on scaling out computations across the nodes of a cluster but use only a fraction of the available resources of individual nodes. This thesis tackles these problems with three different approaches. First, we present novel techniques for the offline processing of two important query types (windowed aggregation and sequential pattern matching). Our methods utilize well-understood indexing techniques to reduce the total amount of data read from non-volatile storage, and we show that this improves overall query runtime significantly. In particular, this thesis develops the first index-based algorithms for pattern queries expressed with the Match_Recognize clause, a new and powerful language feature of SQL that has received little attention so far. Second, we show how to maximize the resource utilization of single nodes by exploiting the capabilities of modern hardware. To this end, we develop a prototypical shared-memory CPU-GPU-enabled event processing system. The system provides implementations of all major event processing operators (filtering, windowed aggregation, windowed join, and sequential pattern matching). Our experiments reveal that, regarding resource utilization and processing throughput, such a hardware-enabled system is superior to hardware-agnostic general-purpose engines. Finally, we present TPStream, a new operator for pattern matching over temporal intervals. TPStream achieves low processing latency and, in contrast to sequential pattern matching, is easily parallelizable even for unpartitioned input streams. This results in maximized resource utilization, especially for modern CPUs with multiple cores.
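    The index-based offline idea can be illustrated with a simple scheme: precomputed per-block partial aggregates let a windowed query touch only the blocks it overlaps instead of replaying the whole persisted stream. The block size, sum aggregate, and data below are illustrative choices, not the thesis's actual index structure.

```python
# Sketch of index-assisted offline windowed aggregation (illustrative).
import bisect

BLOCK = 60  # seconds covered by one index block

def build_index(events):
    """events: time-sorted list of (timestamp, value). Per-block sums."""
    index = {}
    for ts, v in events:
        index[ts // BLOCK] = index.get(ts // BLOCK, 0.0) + v
    return index

def windowed_sum(events, index, start, end):
    """Sum values in [start, end): whole blocks come from the index,
    ragged edges are scanned directly."""
    total = 0.0
    first_full = -(-start // BLOCK)  # first fully covered block (ceil)
    last_full = end // BLOCK         # one past the last fully covered block
    if first_full >= last_full:
        edges = [(start, end)]       # window covers no full block: scan it
    else:
        for b in range(first_full, last_full):
            total += index.get(b, 0.0)
        edges = [(start, first_full * BLOCK), (last_full * BLOCK, end)]
    ts_list = [ts for ts, _ in events]
    for lo, hi in edges:
        i = bisect.bisect_left(ts_list, lo)
        while i < len(events) and events[i][0] < hi:
            total += events[i][1]
            i += 1
    return total

events = [(t, 1.0) for t in range(0, 300, 5)]  # one event every 5 seconds
idx = build_index(events)
print(windowed_sum(events, idx, 30, 250))      # 44 events in the window
```

    The same pattern, consulting a coarse-grained index before touching raw data, is what makes index-based evaluation of Match_Recognize-style pattern queries profitable: only blocks that can possibly contain a match are read.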