To process sensor data in the Internet of Things (IoT), embedded deep
learning for one-dimensional data is an important technique. In the past, CNNs
were frequently used because they are simple to optimise for specialised embedded
hardware such as FPGAs. This work proposes a novel LSTM cell optimisation aimed
at energy-efficient inference on end devices. Using traffic speed
prediction as a case study, a vanilla LSTM model with the optimised LSTM cell
achieves 17534 inferences per second while consuming only 3.8 µJ per
inference on the FPGA XC7S15 from the Spartan-7 family. It achieves at least
5.4× higher throughput and 1.37× better energy efficiency than
existing approaches.

Comment: 12 pages, 7 figures
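For reference, the vanilla LSTM cell that the paper's optimisation targets can be sketched as follows. This is a minimal NumPy illustration of the standard LSTM forward pass, not the paper's optimised FPGA implementation; all variable names and the example sizes are assumptions.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_cell(x, h_prev, c_prev, W, U, b):
    """One step of a vanilla LSTM cell.

    W: (4H, D) input weights, U: (4H, H) recurrent weights, b: (4H,) bias.
    Gate order in the stacked matrices: input, forget, candidate, output.
    """
    H = h_prev.shape[0]
    z = W @ x + U @ h_prev + b       # all four gate pre-activations at once
    i = sigmoid(z[0:H])              # input gate
    f = sigmoid(z[H:2*H])            # forget gate
    g = np.tanh(z[2*H:3*H])          # candidate cell state
    o = sigmoid(z[3*H:4*H])          # output gate
    c = f * c_prev + i * g           # new cell state
    h = o * np.tanh(c)               # new hidden state
    return h, c

# Example: one input feature (e.g. a speed reading), hidden size 8
rng = np.random.default_rng(0)
D, H = 1, 8
h, c = lstm_cell(rng.standard_normal(D), np.zeros(H), np.zeros(H),
                 rng.standard_normal((4 * H, D)),
                 rng.standard_normal((4 * H, H)),
                 np.zeros(4 * H))
```

The gate non-linearities (sigmoid, tanh) and the elementwise products dominate the cost per step, which is why cell-level optimisations of this kind matter for energy-constrained inference.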