Exact and heuristic allocation of multi-kernel applications to multi-FPGA platforms

Casu, Mario R.; Cortadella, Jordi; Lavagno, Luciano; Lazarescu, Mihai T.; Shan, Junnan

research

Exact and heuristic allocation of multi-kernel applications to multi-FPGA platforms

Authors: Mario R. Casu
Jordi Cortadella
Luciano Lavagno
Mihai T. Lazarescu
Junnan Shan
Publication date: 1 January 2019
Publisher: 'Association for Computing Machinery (ACM)'
Doi

Abstract

FPGA-based accelerators demonstrated high energy efficiency compared to GPUs and CPUs. However, single FPGA designs may not achieve sufficient task parallelism. In this work, we optimize the mapping of high-performance multi-kernel applications, like Convolutional Neural Networks, to multi-FPGA platforms. First, we formulate the system level optimization problem, choosing within a huge design space the parallelism and number of compute units for each kernel in the pipeline. Then we solve it using a combination of Geometric Programming, producing the optimum performance solution given resource and DRAM bandwidth constraints, and a heuristic allocator of the compute units on the FPGA cluster.Peer ReviewedPostprint (author's final draft