12 research outputs found

Implications of Shallower Memory Controller Transaction Queues in Scalable Memory Systems

Author: A Marowka
C Bunse
D Wang
Kuan-Ching Li
Mario D. Marino
MCF Chang
NL Binkert
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/07/2015

Scalable memory systems provide scalable bandwidth to meet the growing core-count demands of multicore and embedded-system processors. In these systems, as memory controllers (MCs) are scaled, memory traffic per MC is reduced, so transaction queues become shallower. As a consequence, there is an opportunity to explore transaction queue utilization and its impact on energy utilization. In this paper, we propose to evaluate the performance and energy-per-bit impact of reducing transaction queue sizes along with the MCs of these systems. Experimental results show that reducing the number of entries by 50% does not affect bandwidth or energy-per-bit levels, whereas with an aggressive reduction of about 90%, bandwidth is similarly reduced and energy-per-bit utilization becomes significantly higher.

Crossref

Leeds Beckett Repository

Efficient Probabilistic Model Checking on General Purpose Graphics Processors

Author: A. Bell
A. Marowka
A. Valmari
C. Baier
C.P. Inggs
F. Lerda
G. Ciardo
G.J. Holzmann
H. Hansson
J. Barnat
J.C. Philips
M. Kwiatkowska
M.Z. Kwiatkowska
S. Edelkamp
S.C. Allmaier
T. Herman
U. Stern
W.J. Stewart
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2009

Crossref

Parallelism in Ada: Status and Prospects

Author: A. Marowka
M. Frigo
S. Michell
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014

Crossref

BSP2OMP: A Compiler for Translating BSP Programs to OpenMP

Author: Ami Marowka
Bell C.
Berlin K.
Bircsak J.
Bisseling R.H.
Bull J.M.
Cantonnet F.
Cappello F.
Chen W.
Hill J.M.D.
Mark Bull J.
Marowka A.
Mendelson A.
Merlin J.
Skillicorn D.B.
Sutter H.
Publication venue: 'Informa UK Limited'

Crossref

On parallel software engineering education using Python

Author: A Marowka
DH Woo
H Esmaeilzadeh
SH Fuller
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Reformulation of the performance portability metric

Author: Carvalho M
Dreuning H
Hey T
Lecarme O
Marowka A
Pennycook SJ
Publication venue: 'Wiley'

Crossref

Performance analysis of GPU programming models using the Roofline Scaling Trajectories

Author: A Ilic
A Marowka
L Adhianto
R Xu
S Cook
S Williams
SS Shende
Publication venue: eScholarship, University of California
Publication date: 01/01/2020

Performance analysis is a daunting job, especially for rapidly evolving accelerator technologies. The Roofline Scaling Trajectories technique aims at diagnosing various performance bottlenecks for GPU programming models through visually intuitive Roofline plots. In this work, we introduce the use of Roofline Scaling Trajectories to capture major performance bottlenecks on NVIDIA Volta GPU architectures, such as warp efficiency, occupancy, and locality. Using this analysis technique, we explain the performance characteristics of the NAS Parallel Benchmarks (NPB) written with two programming models, CUDA and OpenACC. We present the influence of the programming model on the performance and scaling characteristics. We also leverage the insights of the Roofline Scaling Trajectory analysis to tune some of the NAS Parallel Benchmarks, achieving up to 2