Search CORE

2 research outputs found

High-performance tensor contractions for GPUs

Author: Abdelfattah A.
Baboulin M.
Dobrev V.
Dongarra J.
Earl C.
Falcou J.
Haidar A.
Karlin I.
Kolev Tz
Masliah I.
Tomov S.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2016
Field of study

AbstractWe present a computational framework for high-performance tensor contractions on GPUs. High-performance is difficult to obtain using existing libraries, especially for many independent contractions where each contraction is very small, e.g., sub-vector/warp in size. However, using our framework to batch contractions plus application-specifics, we demonstrate close to peak performance results. In particular, to accelerate large scale tensor-formulated high-order finite element method (FEM) simulations, which is the main focus and motivation for this work, we represent contractions as tensor index reordering plus matrix-matrix multiplications (GEMMs). This is a key factor to achieve algorithmically many-fold acceleration (vs. not using it) due to possible reuse of data loaded in fast memory. In addition to using this context knowledge, we design tensor data-structures, tensor algebra interfaces, and new tensor contraction algorithms and implementations to achieve 90+% of a theoretically derived peak on GPUs. On a K40c GPU for contractions resulting in GEMMs on square matrices of size 8 for example, we are 2.8× faster than CUBLAS, and 8.5× faster than MKL on 16 cores of Intel Xeon E5-2670 (Sandy Bridge) 2.60GHz CPUs. Finally, we apply autotuning and code generation techniques to simplify tuning and provide an architecture-aware, user-friendly interface

Elsevier - Publisher Connector

Crossref

The University of Manchester - Institutional Repository

Multigrid methods with space–time concurrency

Author: A Brandt
AJ Christlieb
BL Buzbee
C Lubich
D Sheen
E Lelarasmee
G Horton
G Horton
G Horton
G Horton
H Sterck De
J Nievergelt
J. B. Schroder
JL Lions
M Emmett
M Ries
MJ Gander
MJ Gander
MJ Gander
ML Minion
P Chartier
R. D. Falgout
RD Falgout
RW Hockney
RW Hockney
S Friedhoff
S Vandewalle
S Vandewalle
S. Friedhoff
S. P. MacLachlan
S. Vandewalle
SF Ashby
T Weinzierl
Tz. V. Kolev
W Hackbusch
WL Miranker
Y Maday
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref