Search CORE

9,021 research outputs found

Reliable Linear, Sesquilinear and Bijective Operations On Integer Data Streams Via Numerical Entanglement

Author: Anam Mohammad Ashraful
Andreopoulos Yiannis
Publication venue
Publication date: 16/04/2016
Field of study

A new technique is proposed for fault-tolerant linear, sesquilinear and bijective (LSB) operations on

M

integer data streams (

M\geq3

), such as: scaling, additions/subtractions, inner or outer vector products, permutations and convolutions. In the proposed method, the

M

input integer data streams are linearly superimposed to form

M

numerically-entangled integer data streams that are stored in-place of the original inputs. A series of LSB operations can then be performed directly using these entangled data streams. The results are extracted from the

M

entangled output streams by additions and arithmetic shifts. Any soft errors affecting any single disentangled output stream are guaranteed to be detectable via a specific post-computation reliability check. In addition, when utilizing a separate processor core for each of the

M

streams, the proposed approach can recover all outputs after any single fail-stop failure. Importantly, unlike algorithm-based fault tolerance (ABFT) methods, the number of operations required for the entanglement, extraction and validation of the results is linearly related to the number of the inputs and does not depend on the complexity of the performed LSB operations. We have validated our proposal in an Intel processor (Haswell architecture with AVX2 support) via fast Fourier transforms, circular convolutions, and matrix multiplication operations. Our analysis and experiments reveal that the proposed approach incurs between

0.03\%

7\%

reduction in processing throughput for a wide variety of LSB operations. This overhead is 5 to 1000 times smaller than that of the equivalent ABFT method that uses a checksum stream. Thus, our proposal can be used in fault-generating processor hardware or safety-critical applications, where high reliability is required without the cost of ABFT or modular redundancy.Comment: to appear in IEEE Trans. on Signal Processing, 201

arXiv.org e-Print Archive

UCL Discovery

FINN: A Framework for Fast, Scalable Binarized Neural Network Inference

Author: Blott Michaela
Fraser Nicholas J.
Gambardella Giulio
Jahre Magnus
Leong Philip
Umuroglu Yaman
Vissers Kees
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/12/2016
Field of study

Research has shown that convolutional neural networks contain significant redundancy, and high classification accuracy can be obtained even when weights and activations are reduced from floating point to binary values. In this paper, we present FINN, a framework for building fast and flexible FPGA accelerators using a flexible heterogeneous streaming architecture. By utilizing a novel set of optimizations that enable efficient mapping of binarized neural networks to hardware, we implement fully connected, convolutional and pooling layers, with per-layer compute resources being tailored to user-provided throughput requirements. On a ZC706 embedded FPGA platform drawing less than 25 W total system power, we demonstrate up to 12.3 million image classifications per second with 0.31 {\mu}s latency on the MNIST dataset with 95.8% accuracy, and 21906 image classifications per second with 283 {\mu}s latency on the CIFAR-10 and SVHN datasets with respectively 80.1% and 94.9% accuracy. To the best of our knowledge, ours are the fastest classification rates reported to date on these benchmarks.Comment: To appear in the 25th International Symposium on Field-Programmable Gate Arrays, February 201

arXiv.org e-Print Archive

Crossref

NORA - Norwegian Open Research Archives