An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks

Akin, Berkin; Laudon, James; Narayanaswami, Ravi; Seshadri, Kiran; Yazdanbakhsh, Amir

An Evaluation of Edge TPU Accelerators for Convolutional Neural Networks

Authors: Berkin Akin
James Laudon
Ravi Narayanaswami
Kiran Seshadri
Amir Yazdanbakhsh
Publication date: 20 February 2021
Publisher

Abstract

Edge TPUs are a domain of accelerators for low-power, edge devices and are widely used in various Google products such as Coral and Pixel devices. In this paper, we first discuss the major microarchitectural details of Edge TPUs. Then, we extensively evaluate three classes of Edge TPUs, covering different computing ecosystems, that are either currently deployed in Google products or are the product pipeline, across 423K unique convolutional neural networks. Building upon this extensive study, we discuss critical and interpretable microarchitectural insights about the studied classes of Edge TPUs. Mainly, we discuss how Edge TPU accelerators perform across convolutional neural networks with different structures. Finally, we present our ongoing efforts in developing high-accuracy learned machine learning models to estimate the major performance metrics of accelerators such as latency and energy consumption. These learned models enable significantly faster (in the order of milliseconds) evaluations of accelerators as an alternative to time-consuming cycle-accurate simulators and establish an exciting opportunity for rapid hard-ware/software co-design.Comment: 11 pages, 15 figures, submitted to ISCA 202

Similar works

Full text

Available Versions

arXiv.org e-Print Archive

oai:arXiv.org:2102.10423

Last time updated on 02/03/2021