Quantifying the latency benefits of near-edge and in-network FPGA acceleration

Author: Andrew Putnam
Fahmy Suhaib A
Kwon Hyoukjun
Neugebauer Rolf
Noa
Shreejith Shanker
Vipin Kizheppatt
Wenxiao
Yi Shanhe
Zhao
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 27/04/2020

Transmitting data to cloud datacenters in distributed IoT applications introduces significant communication latency, but is often the only feasible solution when source nodes are computationally limited. To address latency concerns, cloudlets, in-network computing, and more capable edge nodes are all being explored as ways of moving processing capability towards the edge of the network. Hardware acceleration using Field Programmable Gate Arrays (FPGAs) is also seeing increased interest due to reduced computation latency and improved efficiency. This paper evaluates the implications of these offloading approaches using a case study neural network based image classification application, quantifying both the computation and communication latency resulting from different platform choices. We consider communication latency including the ingestion of packets for processing on the target platform, showing that this varies significantly with the choice of platform. We demonstrate that emerging in-network accelerator approaches offer much improved and predictable performance as well as better scaling to support multiple data sources.

Crossref

Warwick Research Archives Portal Repository

Orchestration and reconfiguration control architecture

Author: Felber Clemens
Jiao Xianjun
Kazaz Tarik
Kotzsch Vincent
Liu Wei
Moerman Ingrid
Paisana Francisco
Pollin Sofie
Seskar Ivan
Vermeulen Tom
Publication date: 01/01/2017

Ghent University Academic Bibliography

Semi-dense SLAM on an FPGA SoC

Author: Boikos K
Bouganis C-S
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 29/08/2016

Deploying advanced Simultaneous Localisation and Mapping, or SLAM, algorithms in autonomous low-power robotics will enable emerging new applications which require an accurate and information-rich reconstruction of the environment. This has not been achieved so far because accuracy and dense 3D reconstruction come with a high computational complexity. This paper discusses custom hardware design on a novel platform for embedded SLAM, an FPGA-SoC, combining an embedded CPU and programmable logic on the same chip. The use of programmable logic, tightly integrated with an efficient multicore embedded CPU, stands to provide an effective solution to this problem. In this work an average framerate of more than 4 frames/second for a resolution of 320×240 has been achieved with an estimated power of less than 1 Watt for the custom hardware. In comparison to the software-only version, running on a dual-core ARM processor, an acceleration of 2× has been achieved for LSD-SLAM, without any compromise in the quality of the result.

Crossref

Spiral - Imperial College Digital Repository

Design of Embedded Augmented Reality Systems

Author: Ferrández J. M.
Garrigós J.
Martínez J. J.
Toledo J.
Toledo-Moreo R.
Publication venue: 'IntechOpen'
Publication date: 01/01/2010

IntechOpen

Design and management of image processing pipelines within CPS : Acquired experience towards the end of the FitOptiVis ECSEL Project

Cyber-Physical Systems (CPSs) are dynamic and reactive systems interacting with processes, the environment and, sometimes, humans. They are often distributed, with sensors and actuators, and are characterized by being smart, adaptive, and predictive, and by reacting in real time. Indeed, image- and video-processing pipelines are a prime source of environmental information for such systems, allowing them to make better decisions according to what they see. Therefore, in FitOptiVis, we are developing novel methods and tools to integrate complex image- and video-processing pipelines. FitOptiVis aims to deliver a reference architecture for describing and optimizing quality and resource management for imaging and video pipelines in CPSs at both design time and run time. The architecture is concretized in low-power, high-performance, smart components, and in methods and tools for combined design-time and run-time multi-objective optimization and adaptation within system and environment constraints. Peer reviewed

Trepo - Institutional Repository of Tampere University