Search CORE

3 research outputs found

Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware

Author: El Zein Ahmed
Rendell Alistair
Publication venue: 'Wiley'
Publication date: 24/02/2016
Field of study

The CUDA model for graphics processing units (GPUs) presents the programmer with a plethora of different programming options. These includes different memory types, different memory access methods and different data types. Identifying which options to use and when is a non-trivial exercise. This paper explores the effect of these different options on the performance of a routine that evaluates sparse matrix-vector products (SpMV) across three different generations of NVIDIA GPU hardware. A process for analysing performance and selecting the subset of implementations that perform best is proposed. The potential for mapping sparse matrix attributes to optimal CUDA SpMV implementations is discussed

The Australian National University

Generating optimal CUDA sparse matrix-vector product implementations for evolving GPU hardware

Author: Armstrong
Baskaran
Bell
Buttari
Davis
Im
Lee
NVIDIA
Sengupta
Williams
Zein
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

GPUにおける拡張精度浮動小数点演算を用いた線形計算の研究

Author: Mukunoki Daichi
椋木大地
Publication venue
Publication date: 01/01/2013
Field of study

筑波大学 (University of Tsukuba)201

Tsukuba Repository