On the Resilience of RTL NN Accelerators: Fault Characterization and
  Mitigation

Cristal, Adrian; Salami, Behzad; Unsal, Osman

research

On the Resilience of RTL NN Accelerators: Fault Characterization and Mitigation

Authors: Adrian Cristal
Behzad Salami
Osman Unsal
Publication date: 14 June 2018
Publisher
Doi

Abstract

Machine Learning (ML) is making a strong resurgence in tune with the massive generation of unstructured data which in turn requires massive computational resources. Due to the inherently compute- and power-intensive structure of Neural Networks (NNs), hardware accelerators emerge as a promising solution. However, with technology node scaling below 10nm, hardware accelerators become more susceptible to faults, which in turn can impact the NN accuracy. In this paper, we study the resilience aspects of Register-Transfer Level (RTL) model of NN accelerators, in particular, fault characterization and mitigation. By following a High-Level Synthesis (HLS) approach, first, we characterize the vulnerability of various components of RTL NN. We observed that the severity of faults depends on both i) application-level specifications, i.e., NN data (inputs, weights, or intermediate), NN layers, and NN activation functions, and ii) architectural-level specifications, i.e., data representation model and the parallelism degree of the underlying accelerator. Second, motivated by characterization results, we present a low-overhead fault mitigation technique that can efficiently correct bit flips, by 47.3% better than state-of-the-art methods.Comment: 8 pages, 6 figure

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

UPCommons. Portal del coneixement obert de la UPC

oai:upcommons.upc.edu:2117/130...

Last time updated on 18/04/2019

UPCommons

oai:upcommons.upc.edu:2117/130...

Last time updated on 17/04/2020

Crossref

Last time updated on 10/08/2021