Chipmunk: A Systolically Scalable 0.9 mm${}^2$, 3.08 Gop/s/mW @ 1.2 mW
  Accelerator for Near-Sensor Recurrent Neural Network Inference

Benini, Luca; Cavigelli, Lukas; Conti, Francesco; Paulin, Gianna; Susmelj, Igor

research

Chipmunk: A Systolically Scalable 0.9 mm ${}^2$ , 3.08 Gop/s/mW @ 1.2 mW Accelerator for Near-Sensor Recurrent Neural Network Inference

Authors: Luca Benini
Lukas Cavigelli
Francesco Conti
Gianna Paulin
Igor Susmelj
Publication date: 1 January 2018
Publisher
Doi

Abstract

Recurrent neural networks (RNNs) are state-of-the-art in voice awareness/understanding and speech recognition. On-device computation of RNNs on low-power mobile and wearable devices would be key to applications such as zero-latency voice-based human-machine interfaces. Here we present Chipmunk, a small (<1 mm

{}^2

) hardware accelerator for Long-Short Term Memory RNNs in UMC 65 nm technology capable to operate at a measured peak efficiency up to 3.08 Gop/s/mW at 1.24 mW peak power. To implement big RNN models without incurring in huge memory transfer overhead, multiple Chipmunk engines can cooperate to form a single systolic array. In this way, the Chipmunk architecture in a 75 tiles configuration can achieve real-time phoneme extraction on a demanding RNN topology proposed by Graves et al., consuming less than 13 mW of average power

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Crossref

info:doi/10.1109%2Fcicc.2018.8...

Last time updated on 10/08/2021

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

oai:cris.unibo.it:11585/652926

Last time updated on 03/09/2019

Chipmunk: A Systolically Scalable 0.9 mm2{}^22, 3.08 Gop/s/mW @ 1.2 mW Accelerator for Near-Sensor Recurrent Neural Network Inference