CoreTSAR: Task Scheduling for Accelerator-aware Runtimes

de Supinski, Bronis R.; Feng, Wu-chun; Rountree, Barry; Scogland, Thomas R. W.

research

CoreTSAR: Task Scheduling for Accelerator-aware Runtimes

Authors: Bronis R. de Supinski
Wu-chun Feng
Barry Rountree
Thomas R. W. Scogland
Publication date: 1 January 2012
Publisher

Abstract

Heterogeneous supercomputers that incorporate computational accelerators such as GPUs are increasingly popular due to their high peak performance, energy efficiency and comparatively low cost. Unfortunately, the programming models and frameworks designed to extract performance from all computational units still lack the flexibility of their CPU-only counterparts. Accelerated OpenMP improves this situation by supporting natural migration of OpenMP code from CPUs to a GPU. However, these implementations currently lose one of OpenMP’s best features, its flexibility: typical OpenMP applications can run on any number of CPUs. GPU implementations do not transparently employ multiple GPUs on a node or a mix of GPUs and CPUs. To address these shortcomings, we present CoreTSAR, our runtime library for dynamically scheduling tasks across heterogeneous resources, and propose straightforward extensions that incorporate this functionality into Accelerated OpenMP. We show that our approach can provide nearly linear speedup to four GPUs over only using CPUs or one GPU while increasing the overall flexibility of Accelerated OpenMP

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Computer Science Technical Reports @Virginia Tech

oai:vtcstechreports.OAI2:1212

Last time updated on 21/06/2013