Search CORE

26 research outputs found

Performance Evaluation of Adaptive Routing on Dragonfly-based Production Systems

Author: Chunduri Sudheer
Ghadar Yasaman
Groves Taylor
Harms Kevin
Mendygral Peter
Weiland Michele
Zarins Justs
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 28/06/2021
Field of study

10 Years Later: Cloud Computing is Closing the Performance Gap

Author: Buluc Aydin
Culler David
Ellis Marquita
Guidi Giulia
Yelick Katherine
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2021
Field of study

Can cloud computing infrastructures provide HPC-competitive performance for scientific applications broadly? Despite prolific related literature, this question remains open. Answers are crucial for designing future systems and democratizing high-performance computing. We present a multi-level approach to investigate the performance gap between HPC and cloud computing, isolating different variables that contribute to this gap. Our experiments are divided into (i) hardware and system microbenchmarks and (ii) user application proxies. The results show that today's high-end cloud computing can deliver HPC-competitive performance not only for computationally intensive applications but also for memory- and communication-intensive applications - at least at modest scales - thanks to the high-speed memory systems and interconnects and dedicated batch scheduling now available on some cloud platforms

arXiv.org e-Print Archive

eScholarship - University of California

Modelling fracture in heterogeneous materials on HPC systems using a hybrid MPI/Fortran coarray multi-scale CAFE framework

Author: Cebamanos Luis
Margetts Lee
Shterenlikht Anton
Publication venue
Publication date: 01/01/2018
Field of study

A 3D multi-scale cellular automata finite element (CAFE) framework for modelling fracture in heterogeneous materials is described. The framework is implemented in a hybrid MPI/Fortran coarray code for efficient parallel execution on HPC platforms. Two open source BSD licensed libraries developed by the authors in modern Fortran were used: CGPACK, implementing cellular automata (CA) using Fortran coarrays, and ParaFEM, implementing finite elements (FE) using MPI. The framework implements a two-way concurrent hierarchical information exchange between the structural level (FE) and the microstructure (CA). MPI to coarrays interface and data structures are described. The CAFE framework is used to predict transgranular cleavage propagation in a polycrystalline iron round bar under tension. Novel results enabled by this CAFE framework include simulation of progressive cleavage propagation through individual grains and across grain boundaries, and emergence of a macro-crack from merging of cracks on preferentially oriented cleavage planes in individual crystals. Nearly ideal strong scaling up to at least tens of thousands of cores was demonstrated by CGPACK and by ParaFEM in isolation in prior work on Cray XE6. Cray XC30 and XC40 platforms and CrayPAT profiling were used in this work. Initially the strong scaling limit of hybrid CGPACK/ParaFEM CAFE model was 2000 cores. After replacing all-to-all communication patterns with the nearest neighbour algorithms the strong scaling limit on Cray XC30 was increased to 7000 cores. TAU profiling on non-Cray systems identified deficiencies in Intel Fortran 16 optimisation of remote coarray operations. Finally, coarray synchronisation challenges and opportunities for thread parallelisation in CA are discussed

The University of Manchester - Institutional Repository

Explore Bristol Research

Understanding the Scalability of Molecular Simulation Using Empirical Performance Modeling

Author: A Knüpfer
A Vogel
C Iwainsky
CW Glass
H Berendsen
K Singh
S Deublein
S Plimpton
Publication venue
Publication date: 05/12/2019
Field of study

The final authenticated publication is available online at https://doi.org/10.1007/978-3-030-17872-7_8.Molecular dynamics (MD) simulation allows for the study of static and dynamic properties of molecular ensembles at various molecular scales, from monatomics to macromolecules such as proteins and nucleic acids. It has applications in biology, materials science, biochemistry, and biophysics. Recent developments in simulation techniques spurred the emergence of the computational molecular engineering (CME) field, which focuses specifically on the needs of industrial users in engineering. Within CME, the simulation code ms2 allows users to calculate thermodynamic properties of bulk fluids. It is a parallel code that aims to scale the temporal range of the simulation while keeping the execution time minimal. In this paper, we use empirical performance modeling to study the impact of simulation parameters on the execution time. Our approach is a systematic workflow that can be used as a blue-print in other fields that aim to scale their simulation codes. We show that the generated models can help users better understand how to scale the simulation with minimal increase in execution time.BMBF, 01IH16008D, Verbundprojekt: TaLPas - Task-basierte Lastverteilung und Auto-Tuning in der PartikelsimulationDFG, 323299120, ExtraPeak - Automatische Leistungsmodellierung von HPC-Anwendungen mit multiplen Modellparameter

DepositOnce

Crossref

Auto-tuning compiler options for HPC

Author: Jones Jessica
Publication venue
Publication date: 04/09/2019
Field of study

OPUS

GPCNeT: Designing a Benchmark Suite for Inducing and Measuring Contention in HPC Networks

Author: Austin Brian
Balma Jacob
Chunduri Sudheer
Groves Taylor
Kandalla Krishna
Kumaran Kalyan
Lockwood Glenn
Machinery Assoc Comp
Mendygral Peter
Parker Scott
Warren Steven
Wichmann Nathan
Wright Nicholas
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Crossref

eScholarship - University of California