    Optimizing reconfigurable recurrent neural networks

    This paper proposes a novel latency-hiding hardware architecture based on column-wise matrix-vector multiplication to eliminate data dependency, improving the throughput of RNN-based systems. In addition, a flexible checkerboard tiling strategy is introduced to accommodate large weight matrices while supporting both element-based and vector-based parallelism. These optimizations improve the exploitation of the available parallelism, increasing run-time hardware utilization and boosting inference throughput. Furthermore, a quantization scheme with fine-tuning is proposed to maintain high accuracy. Evaluation results show that the proposed architecture enhances performance and energy efficiency with little accuracy loss: it achieves 1.05 to 3.35 times better performance and 1.22 to 3.92 times better hardware utilization than a state-of-the-art FPGA-based LSTM design, demonstrating that our approach contributes to high-performance FPGA-based LSTM systems.
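    The abstract's key idea is computing the matrix-vector product column by column rather than row by row, so that each partial update depends on only a single input element and accumulation can overlap with the arrival of the recurrent input. The sketch below is only an illustration of that general column-wise scheme in plain Python/NumPy; the function name and shapes are assumptions for demonstration and do not reproduce the paper's hardware design.

```python
import numpy as np

def column_wise_mv(W, x):
    """Column-wise matrix-vector product.

    Each partial update y += W[:, j] * x[j] uses only one input element,
    so accumulation can begin as soon as x[j] is available instead of
    waiting for the full vector, which is what allows latency hiding in
    a pipelined hardware implementation.
    """
    y = np.zeros(W.shape[0], dtype=W.dtype)
    for j in range(W.shape[1]):
        y += W[:, j] * x[j]  # independent partial sums per column
    return y

# Sanity check against the standard row-wise product
W = np.random.rand(4, 3).astype(np.float32)
x = np.random.rand(3).astype(np.float32)
assert np.allclose(column_wise_mv(W, x), W @ x)
```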