A domain-specific approach to heterogeneous parallelism
Abstract
Exploiting heterogeneous parallel hardware currently requires mapping application code to multiple disparate programming mod-els. Unfortunately, general-purpose programming models available today can yield high performance but are too low-level to be acces-sible to the average programmer. We propose leveraging domain-specific languages (DSLs) to map high-level application code to heterogeneous devices. To demonstrate the potential of this ap-proach we present OptiML, a DSL for machine learning. OptiML programs are implicitly parallel and can achieve high performance on heterogeneous hardware with no modification required to the source code. For such a DSL-based approach to be tractable at large scales, better tools are required for DSL authors to simplify language creation and parallelization. To address this concern, we introduce Delite, a system designed specifically for DSLs that is both a framework for creating an implicitly parallel DSL as well as a dynamic runtime providing automated targeting to heteroge-neous parallel hardware. We show that OptiML running on Delite achieves single-threaded, parallel, and GPU performance superior to explicitly parallelized MATLAB code in nearly all cases- text
- Categories and Subject Descriptors D.1.3 [Programming Tech- niques
- Concurrent Programming – Parallel programming
- D.3.4 [Programming Languages
- Processors – Code generation
- Opti- mization
- Run-time environments General Terms Languages
- Performance Keywords Parallel Programming
- Domain-Specific Languages
- Dynamic Optimizations