7 research outputs found

From Physics Model to Results: An Optimizing Framework for Cross-Architecture Code Generation

Author: Blazewicz Marek
Brandt Steven R.
Ciznicki Milosz
Hinder Ian
Kierzynka Michal
Koppelman David M.
Löffler Frank
Schnetter Erik
Tao Jian
Publication venue: 'IOS Press'
Publication date: 01/01/2013

Starting from a high-level problem description in terms of partial differential equations using abstract tensor notation, the Chemora framework discretizes, optimizes, and generates complete high performance codes for a wide range of compute architectures. Chemora extends the capabilities of Cactus, facilitating the usage of large-scale CPU/GPU systems in an efficient manner for complex applications, without low-level code tuning. Chemora achieves parallelism through MPI and multi-threading, combining OpenMP and CUDA. Optimizations include high-level code transformations, efficient loop traversal strategies, dynamically selected data and instruction cache usage strategies, and JIT compilation of GPU code tailored to the problem characteristics. The discretization is based on higher-order finite differences on multi-block domains. Chemora's capabilities are demonstrated by simulations of black hole collisions. This problem provides an acid test of the framework, as the Einstein equations contain hundreds of variables and thousands of terms. Comment: 18 pages, 4 figures, accepted for publication in Scientific Programmin

arXiv.org e-Print Archive

CiteSeerX

Directory of Open Access Journals

Louisiana State University

MPG.PuRe

Energy aware scheduling model and online heuristics for stencil codes on heterogeneous computing architectures

Author: AA Chandio
AD Pereira
CE Shannon
G Terzopoulos
I Holyer
IM Bomze
J Mei
J Treibig
Jan Weglarz
K Bilal
K Datta
K Kurowski
KA Rojek
Krzysztof Kurowski
M Blazewicz
M Ciznicki
Milosz Ciznicki
S Sellappa
S Williams
VG Vizing
Z Wang
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Real-time implementation of moving object detection in UAV videos using GPUs

Author: A Asaduzzaman
AA Huqqani
Deepak Jaiswal
GD Hager
J Liu
JW Tang
M Ciznicki
M Garland
M Teutsch
MG Sánchez
P Chen
P Kumar
Praveen Kumar
R Samet
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Early Performance Assessment of the ThunderX2 Processor for Lattice Based Simulations

Author: AD Pereira
B Joó
C Bonati
D Yokoyama
E Calore
G Oyarzun
K Fürlinger
L Biferale
L Gwennap
M Ciznicki
S Williams
V Stegailov
YJ Lo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020

This paper presents an early performance assessment of the ThunderX2, the most recent Arm-based multi-core processor designed for HPC applications. We use as benchmarks well-known stencil-based LBM and LQCD algorithms, widely used to study fluid flows and the interaction properties of elementary particles, respectively. We run benchmark kernels derived from OpenMP production codes, measure performance as a function of the number of threads, and evaluate the impact of different choices for data layout. We then analyze our results in the framework of the roofline model, and compare with the performance measured on mainstream Intel Skylake processors. We find that these Arm-based processors reach levels of performance competitive with those of other state-of-the-art options.

Crossref

Archivio istituzionale della ricerca - Università di Ferrara

REPro.JPEG: a new image compression approach based on reduction/expansion image and JPEG compression for dermatological medical images

Author: Abdmouleh MK
Ali Khalfallah
Amri H
Ciznicki M
Hedi Amri
Jean-Christophe Lapayre
Kassab R
Med-Salim Bouhlel
Sanchez Santana MA
Shih FY
Shin DK
Publication venue: 'Informa UK Limited'

Crossref

Unleashing the performance of ccNUMA multiprocessor architectures in heterogeneous stencil computations

Author: A Eltablawy
A Lastovetsky
A Strugarek
D Culler
J Guo
Kamil Halbiniak
L Szustak
Lukasz Szustak
M Ciznicki
Ondřej Jakl
P Smolarkiewicz
Roman Wyrzykowski
X Cao
Publication venue: 'Springer Science and Business Media LLC'

Crossref

A novel 3-level energy heterogeneity clustering protocol with hybrid routing for a concentric circular wireless sensor network

Author: A. Chithra
G Smaragdakis
H González
K Romer
K Vivek
L Qing
M Saeidmanesh
Manju Bala
Milosz Ciznicki
R Saravanakumar
R. Shantha Selva Kumari
S Faisal
S Mo
S Sirsikar
SP Singh
Vrinda Gupta
WR Heinzelman
Publication venue: 'Springer Science and Business Media LLC'

Crossref