Search CORE

5 research outputs found

最先端高性能計算システムにおける第一原理電子動力学シミュレーションのコデザイン

Author: 廣川祐太
Publication venue
Publication date: 01/01/2018
Field of study

筑波大学 (University of Tsukuba)201

Tsukuba Repository

A Strategy for Automatic Performance Tuning of Stencil Computations on GPUs

Author: Joseph D. Garvey
Tarek S. Abdelrahman
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2018
Field of study

We propose and evaluate a novel strategy for tuning the performance of a class of stencil computations on Graphics Processing Units. The strategy uses a machine learning model to predict the optimal way to load data from memory followed by a heuristic that divides other optimizations into groups and exhaustively explores one group at a time. We use a set of 104 synthetic OpenCL stencil benchmarks that are representative of many real stencil computations. We first demonstrate the need for auto-tuning by showing that the optimization space is sufficiently complex that simple approaches to determining a high-performing configuration fail. We then demonstrate the effectiveness of our approach on NVIDIA and AMD GPUs. Relative to a random sampling of the space, we find configurations that are 12%/32% faster on the NVIDIA/AMD platform in 71% and 4% less time, respectively. Relative to an expert search, we achieve 5% and 9% better performance on the two platforms in 89% and 76% less time. We also evaluate our strategy for different stencil computational intensities, varying array sizes and shapes, and in combination with expert search

University of Toronto Research Repository

Crossref

Directory of Open Access Journals

A Strategy for Automatic Performance Tuning of Stencil Computations on GPUs

Author: Abdelrahman Tarek S.
Garvey Joseph D.
Publication venue
Publication date: 01/01/2018
Field of study

University of Toronto Research Repository

Directory of Open Access Journals