
    Contract-Based General-Purpose GPU Programming

    Using GPUs as general-purpose processors has revolutionized parallel computing by offering, for a large and growing set of algorithms, massive data-parallelization on desktop machines. An obstacle to widespread adoption, however, is the difficulty of programming them and the low-level control of the hardware required to achieve good performance. This paper presents a programming library, SafeGPU, that aims to strike a balance between programmer productivity and performance by making GPU data-parallel operations accessible from within a classical object-oriented programming language. The solution is integrated with the design-by-contract approach, which increases confidence in functional program correctness by embedding executable program specifications into the program text. We show that our library leads to modular and maintainable code that is accessible to GPGPU non-experts, while providing performance that is comparable with hand-written CUDA code. Furthermore, runtime contract checking turns out to be feasible, as the contracts can be executed on the GPU.
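
    The contracts in SafeGPU are written in the host object-oriented language (Eiffel, in the original work) rather than in CUDA; the CUDA sketch below is only an illustration of the abstract's closing claim that contracts can be executed on the GPU, using a device-side assert() as the executable pre- and postcondition. The kernel, its data, and the specific conditions are invented for the example.

        // Illustrative sketch: a device-side assert() plays the role of an
        // executable contract checked on the GPU itself.
        #include <assert.h>
        #include <cstdio>

        __global__ void scale(const float *in, float *out, int n, float factor)
        {
            int i = blockIdx.x * blockDim.x + threadIdx.x;
            if (i < n) {
                assert(in[i] >= 0.0f);                    // precondition: inputs are non-negative
                out[i] = in[i] * factor;
                assert(factor < 0.0f || out[i] >= 0.0f);  // postcondition: sign is preserved
            }
        }

        int main()
        {
            const int n = 1 << 20;
            float *in, *out;
            cudaMallocManaged(&in, n * sizeof(float));
            cudaMallocManaged(&out, n * sizeof(float));
            for (int i = 0; i < n; ++i) in[i] = 1.0f;
            scale<<<(n + 255) / 256, 256>>>(in, out, n, 2.0f);
            cudaDeviceSynchronize();                      // a failed device assert surfaces here
            printf("out[0] = %f\n", out[0]);
            cudaFree(in);
            cudaFree(out);
            return 0;
        }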

    Exposing errors related to weak memory in GPU applications

    We present the systematic design of a testing environment that uses stressing and fuzzing to reveal errors in GPU applications that arise due to weak memory effects. We evaluate our approach on seven GPUs spanning three NVIDIA architectures, across ten CUDA applications that use fine-grained concurrency. Our results show that applications that rarely or never exhibit errors related to weak memory when executed natively can readily exhibit these errors when executed in our testing environment. Our testing environment also provides a means to help identify the root causes of such errors, and automatically suggests how to insert fences that harden an application against weak memory bugs. To understand the cost of GPU fences, we benchmark applications with fences provided by the hardening strategy as well as a more conservative, sound fencing strategy.
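
    The errors in question arise in idioms like the message-passing pattern sketched below (a minimal example of mine, not one of the paper's benchmarks): a producer block publishes data behind a flag, and under the GPU's weak memory model the consumer can observe the flag before the data unless fences are inserted. The two __threadfence() calls illustrate the kind of hardening fix such a tool would suggest.

        // Minimal message-passing sketch; assumes both blocks are resident at once.
        #include <cstdio>

        __device__ int data = 0;
        __device__ volatile int flag = 0;

        __global__ void producer_consumer(int *result)
        {
            if (blockIdx.x == 0) {            // producer block
                data = 42;
                __threadfence();              // order the data write before the flag write
                flag = 1;
            } else {                          // consumer block
                while (flag == 0) { }         // spin until the flag becomes visible
                __threadfence();              // order the flag read before the data read
                *result = data;               // without the fences this may read 0
            }
        }

        int main()
        {
            int *result;
            cudaMallocManaged(&result, sizeof(int));
            *result = -1;
            producer_consumer<<<2, 1>>>(result);  // two single-thread blocks
            cudaDeviceSynchronize();
            printf("consumer read %d\n", *result);
            cudaFree(result);
            return 0;
        }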

    Correct and efficient accelerator programming

    This report documents the program and the outcomes of Dagstuhl Seminar 13142 “Correct and Efficient Accelerator Programming”. The aim of this Dagstuhl seminar was to bring together researchers from various sub-disciplines of computer science to brainstorm and discuss the theoretical foundations, design, and implementation of techniques and tools for correct and efficient accelerator programming.

    Engineering a static verification tool for GPU kernels

    We report on practical experiences over the last 2.5 years related to the engineering of GPUVerify, a static verification tool for OpenCL and CUDA GPU kernels, plotting the progress of GPUVerify from a prototype to a fully functional and relatively efficient analysis tool. Our hope is that this experience report will serve the verification community by helping to inform future tooling efforts.

    GPURepair: Automated Repair of GPU Kernels

    This paper presents a tool for repairing errors caused by data races and barrier divergence in GPU kernels written in CUDA or OpenCL. Our novel extension to prior work can also remove barriers that are deemed unnecessary for correctness. We implement these ideas in our tool called GPURepair, which uses GPUVerify as the verification oracle for GPU kernels. We also extend GPUVerify to support CUDA Cooperative Groups, allowing GPURepair to perform inter-block synchronization for CUDA kernels. To the best of our knowledge, GPURepair is the only tool that can propose a fix for intra-block data races and barrier divergence errors for both CUDA and OpenCL kernels, and the only tool that fixes inter-block data races for CUDA kernels. We perform extensive experiments on about 750 kernels and provide a comparison with prior work. We demonstrate the superiority of GPURepair through its capability to fix more kernels and its unique ability to remove redundant barriers and handle inter-block data races. (Comment: 19 pages, 1 algorithm, 3 figures; 22nd International Conference on Verification, Model Checking, and Abstract Interpretation, VMCAI 2021.)
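
    As an illustration of the intra-block races GPURepair targets (the kernel below is invented, not drawn from the paper's benchmark suite), consider a shared-memory shift in which thread i reads the cell written by thread i + 1; the marked __syncthreads() is exactly the kind of barrier such a repair tool would insert.

        #include <cstdio>

        __global__ void shift_left(int *out)
        {
            __shared__ int buf[256];
            int i = threadIdx.x;
            buf[i] = i;
            __syncthreads();   // the kind of barrier a repair tool inserts: without it,
                               // the read of buf[i + 1] below races with the write above
            out[i] = (i + 1 < blockDim.x) ? buf[i + 1] : buf[0];
        }

        int main()
        {
            int *out;
            cudaMallocManaged(&out, 256 * sizeof(int));
            shift_left<<<1, 256>>>(out);
            cudaDeviceSynchronize();
            printf("out[0] = %d, out[255] = %d\n", out[0], out[255]);
            cudaFree(out);
            return 0;
        }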

    Interleaving and lock-step semantics for analysis and verification of GPU kernels

    Graphics Processing Units (GPUs) from leading vendors employ predicated (or guarded) execution to eliminate branching and increase performance. Similarly, a recent GPU verification technique uses predication to reduce verification of GPU kernels (the massively parallel programs that run on GPUs) to verification of a sequential program. Prior work on the formal semantics of lock-step predicated execution for kernels focused on structured programs, where control is organised using if- and while-statements. We provide lock-step execution semantics for GPU kernels that are represented by arbitrary reducible control flow graphs. We present a traditional interleaving semantics and a novel lock-step semantics based on predication, and show that for terminating kernels either both semantics compute identical results or both behave erroneously. This allows the existing method of reducing GPU kernel verification to the verification of a sequential, lock-step program to be applied to GPU kernels with arbitrary reducible control flow. We have implemented the method in the GPUVerify tool, and present an evaluation using a set of 163 open-source and commercial GPU kernels. Among these kernels, 42 exhibit unstructured control flow, which our novel lock-step predication technique can handle fully automatically. This generality comes at a modest price: verification across our benchmark set was on average 2.25 times slower than using an existing approach that specifically targets structured kernels.
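
    The following sketch (an illustration of mine, not the paper's formalisation) conveys the intuition behind lock-step predicated execution: a branched kernel is flattened so that every thread evaluates both branch bodies under a guard predicate, and the guard selects which result takes effect, so all threads follow the same instruction stream.

        #include <cstdio>

        // Original kernel: threads diverge on a data-dependent branch.
        __global__ void branched(float *x, int n)
        {
            int i = blockIdx.x * blockDim.x + threadIdx.x;
            if (i < n) {
                if (x[i] < 0.0f) x[i] = -x[i];
                else             x[i] = 2.0f * x[i];
            }
        }

        // Hand-predicated form: both branch bodies are evaluated under guards,
        // and the guard selects the result, mimicking lock-step execution.
        __global__ void predicated(float *x, int n)
        {
            int i = blockIdx.x * blockDim.x + threadIdx.x;
            bool active = (i < n);
            bool p      = active && (x[i] < 0.0f);       // guard for the then-branch
            float neg   = active ? -x[i]       : 0.0f;   // then-branch body
            float dbl   = active ? 2.0f * x[i] : 0.0f;   // else-branch body
            if (active) x[i] = p ? neg : dbl;
        }

        int main()
        {
            const int n = 8;
            float *x;
            cudaMallocManaged(&x, n * sizeof(float));
            for (int i = 0; i < n; ++i) x[i] = (i % 2 == 0) ? -(float)i : (float)i;
            predicated<<<1, n>>>(x, n);
            cudaDeviceSynchronize();
            for (int i = 0; i < n; ++i) printf("%.1f ", x[i]);
            printf("\n");
            cudaFree(x);
            return 0;
        }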

    Symbolic crosschecking of data-parallel floating-point code
