1 research outputs found
Exploring Memory Persistency Models for GPUs
Given its high integration density, high speed, byte addressability, and low
standby power, non-volatile or persistent memory is expected to
supplement/replace DRAM as main memory. Through persistency programming models
(which define durability ordering of stores) and durable transaction
constructs, the programmer can provide recoverable data structure (RDS) which
allows programs to recover to a consistent state after a failure. While
persistency models have been well studied for CPUs, they have been neglected
for graphics processing units (GPUs). Considering the importance of GPUs as a
dominant accelerator for high performance computing, we investigate persistency
models for GPUs.
GPU applications exhibit substantial differences with CPUs applications,
hence in this paper we adapt, re-architect, and optimize CPU persistency models
for GPUs. We design a pragma-based compiler scheme to express persistency
models for GPUs. We identify that the thread hierarchy in GPUs offers intuitive
scopes to form epochs and durable transactions. We find that undo logging
produces significant performance overheads. We propose to use idempotency
analysis to reduce both logging frequency and the size of logs. Through both
real-system and simulation evaluations, we show low overheads of our proposed
architecture support.Comment: 18 pages, 16 figure