Search CORE

7,721 research outputs found

Approximate FPGA-based LSTMs under Computation Time Constraints

Author: Bouganis Christos-Savvas
Kouris Alexandros
Rizakis Michalis
Venieris Stylianos I.
Publication venue
Publication date: 05/01/2018
Field of study

Recurrent Neural Networks and in particular Long Short-Term Memory (LSTM) networks have demonstrated state-of-the-art accuracy in several emerging Artificial Intelligence tasks. However, the models are becoming increasingly demanding in terms of computational and memory load. Emerging latency-sensitive applications including mobile robots and autonomous vehicles often operate under stringent computation time constraints. In this paper, we address the challenge of deploying computationally demanding LSTMs at a constrained time budget by introducing an approximate computing scheme that combines iterative low-rank compression and pruning, along with a novel FPGA-based LSTM architecture. Combined in an end-to-end framework, the approximation method's parameters are optimised and the architecture is configured to address the problem of high-performance LSTM execution in time-constrained applications. Quantitative evaluation on a real-life image captioning application indicates that the proposed methods required up to 6.5x less time to achieve the same application-level accuracy compared to a baseline method, while achieving an average of 25x higher accuracy under the same computation time constraints.Comment: Accepted at the 14th International Symposium in Applied Reconfigurable Computing (ARC) 201

arXiv.org e-Print Archive

Crossref

Spiral - Imperial College Digital Repository

Building Blocks for Control System Software

Author: Broenink J.F.
Hilderink G.H.
Publication venue: University of Twente, The Netherlands
Publication date: 01/01/2001
Field of study

Software implementation of control laws for industrial systems seem straightforward, but is not. The computer code stemming from the control laws is mostly not more than 10 to 30% of the total. A building-block approach for embedded control system development is advocated to enable a fast and efficient software design process.\ud We have developed the CTJ library, Communicating Threads for Java¿,\ud resulting in fundamental elements for creating building blocks to implement communication using channels. Due to the simulate-ability, our building block method is suitable for a concurrent engineering design approach. Furthermore, via a stepwise refinement process, using verification by simulation, the implementation trajectory can be done efficiently

University of Twente Research Information

Test exploration and validation using transaction level models

Author: Di Carlo Stefano
Imhof M.E
Khaligh R.S
Kochte M.A
Prinetto Paolo Ernesto
Radetzki M.
Wunderlich H.-J
Zollen C.G
Publication venue: IEEE Computer Society
Publication date: 01/01/2009
Field of study

The complexity of the test infrastructure and test strategies in systems-on-chip approaches the complexity of the functional design space. This paper presents test design space exploration and validation of test strategies and schedules using transaction level models (TLMs). Since many aspects of testing involve the transfer of a significant amount of test stimuli and responses, the communication-centric view of TLMs suits this purpose exceptionally wel

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

Channel Estimation And Multiuser Detection In Asynchronous Satellite Communications

Author: Bouallegue Ridha
Chaouech Helmi
Publication venue: 'Academy and Industry Research Collaboration Center (AIRCC)'
Publication date: 01/01/2010
Field of study

In this paper, we propose a new method of channel estimation for asynchronous additive white Gaussian noise channels in satellite communications. This method is based on signals correlation and multiuser interference cancellation which adopts a successive structure. Propagation delays and signals amplitudes are jointly estimated in order to be used for data detection at the receiver. As, a multiuser detector, a single stage successive interference cancellation (SIC) architecture is analyzed and integrated to the channel estimation technique and the whole system is evaluated. The satellite access method adopted is the direct sequence code division multiple access (DS CDMA) one. To evaluate the channel estimation and the detection technique, we have simulated a satellite uplink with an asynchronous multiuser access.Comment: 14 pages, 9 figure

arXiv.org e-Print Archive

Crossref

Cycle Accurate Simulation Model Generation for SoC Prototyping

Author: Fraboulet Antoine
Risset Tanguy
Scherrer Antoine
Publication venue: HAL CCSD
Publication date: 01/05/2004
Field of study

RR 2004-18, ENS-Lyon, 24 pagesWe present new results concerning the integration of high level designed ips into a complete System on Chip. We first introduce a new compu- tation model that can be used for cycle accurate simulation of register transfer level synthesized hardware. Then we provide simulation of a SoC integrating a data-flow ip synthesized with MMAlpha and the So- cLib cycle accurate simulation environment. This integration also vali- dates an efficient generic interface mechanism for data-flow ips

HAL-ENS-LYON

INRIA a CCSD electronic archive server

Hal-Diderot