28 research outputs found

    Load-Balanced Bottleneck Objectives in Process Mapping

    We propose a new problem formulation for graph partitioning that is tailored to the needs of time-critical simulations on modern heterogeneous supercomputers.
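    As a rough sketch of what a load-balanced bottleneck objective typically looks like in process mapping (a generic illustration under assumed standard notation, not necessarily this paper's exact formulation): instead of minimizing the usual sum of communication costs over all edges, one minimizes the worst per-processor cost, subject to a balance constraint on computational load.

        % Generic sketch (assumed notation): omega(u,v) is the communication
        % volume between tasks u and v; d(p,q) the distance between processors
        % p and q; c(v) the computational load of task v; pi : V -> P the
        % mapping; epsilon the allowed load imbalance.
        \begin{align*}
          \min_{\pi} \quad & \max_{p \in P} \sum_{\substack{(u,v) \in E \\ \pi(u) = p}} \omega(u,v) \, d\bigl(\pi(u), \pi(v)\bigr) \\
          \text{s.t.} \quad & \sum_{v \in V :\, \pi(v) = p} c(v) \,\le\, (1 + \epsilon) \, \frac{\sum_{v \in V} c(v)}{|P|} \qquad \forall\, p \in P
        \end{align*}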

    Netloc: a Tool for Topology-Aware Process Mapping

    Interconnection networks in parallel platforms can comprise thousands of nodes and hundreds of switches. The communication cost between tasks of a parallel application varies significantly with their actual location in such platforms. Topology-aware process mapping consists of matching the application's communication pattern to the network topology, reducing communication cost by placing related tasks close together on the hardware. We show that our Netloc tool, which gathers network topology in a generic way, can be combined with the state-of-the-art Scotch partitioner to compute topology-aware MPI process placements. Our experiments with a stencil application on a fat-tree machine show that we significantly improve the runtime in the vast majority of cases.
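    To make the idea concrete, below is a toy Python sketch of matching a communication pattern to a network topology. This is not the Netloc or Scotch API; the line topology, the communication volumes, and the exhaustive search are illustrative assumptions only. The cost of a placement is communication volume times hop distance, summed over task pairs, and a placement that puts heavily communicating tasks on nearby nodes beats the naive one.

        # Toy sketch only: not the Netloc/Scotch API, just the core idea of
        # topology-aware placement in miniature.
        from collections import deque
        from itertools import permutations

        def hop_distances(topology, source):
            """BFS hop counts from `source` in the network topology graph."""
            dist = {source: 0}
            queue = deque([source])
            while queue:
                node = queue.popleft()
                for neighbor in topology[node]:
                    if neighbor not in dist:
                        dist[neighbor] = dist[node] + 1
                        queue.append(neighbor)
            return dist

        def comm_cost(mapping, comm, dist):
            """Sum of volume x hop distance over all communicating task pairs."""
            return sum(vol * dist[mapping[a]][mapping[b]]
                       for (a, b), vol in comm.items())

        # A tiny 4-node line network, 0 - 1 - 2 - 3 (hypothetical topology).
        topology = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
        dist = {n: hop_distances(topology, n) for n in topology}

        # Application communication pattern: (task, task) -> volume (made up).
        comm = {(0, 2): 10, (1, 3): 10, (0, 1): 1}

        # Exhaustive search is fine at this scale; real tools such as Scotch
        # use partitioning heuristics instead.
        identity = {t: t for t in range(4)}
        best = min((dict(enumerate(p)) for p in permutations(range(4))),
                   key=lambda m: comm_cost(m, comm, dist))
        print("identity placement cost:", comm_cost(identity, comm, dist))  # 41
        print("best placement cost:", comm_cost(best, comm, dist))          # 21

    In the workflow the abstract describes, Netloc plays the role of supplying the topology graph, and Scotch plays the role of the search, using partitioning heuristics rather than enumeration.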

    Fast and high quality topology-aware task mapping

    Considering the large number of processors and the size of the interconnection networks on exascale-capable supercomputers, mapping the concurrently executable, communicating tasks of an application is a complex problem that must be handled with care. For parallel applications, communication overhead can be a significant bottleneck to scalability. Topology-aware task-mapping methods, which map tasks to processors (i.e., cores) by exploiting the underlying network information, are very effective at avoiding, or at least mitigating, this limitation. We propose novel, efficient, and effective task-mapping algorithms employing a graph model. Our experiments show that the methods are faster than existing approaches proposed for the same task and that, on 4096 processors, the algorithms reduce communication hops and link contention by 16% and 32%, respectively, on average. In addition, they reduce the average execution time of a parallel SpMV kernel and a communication-only application by 9% and 14%, respectively.
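    To give a flavor of greedy, graph-model-based mapping heuristics (a common pattern in this literature, sketched under assumptions: this is not the specific algorithm the paper proposes, and the 2x2 mesh, volumes, and greedy_map helper below are made up), one can seed with the most heavily communicating task, then repeatedly place the unmapped task that communicates most with already-mapped tasks onto the free core closest to its mapped neighbors.

        # Hedged sketch of a greedy graph-based mapping heuristic; illustrative
        # only, not the paper's algorithm.
        def greedy_map(comm, dist, n_tasks, cores):
            """comm: (task, task) -> volume; dist[p][q]: hops between cores."""
            adj = {t: {} for t in range(n_tasks)}  # symmetric weighted adjacency
            for (a, b), vol in comm.items():
                adj[a][b] = adj[a].get(b, 0) + vol
                adj[b][a] = adj[b].get(a, 0) + vol
            mapping, free = {}, set(cores)
            # Seed with the task that has the largest total communication volume.
            seed = max(range(n_tasks), key=lambda t: sum(adj[t].values()))
            mapping[seed] = free.pop()
            while len(mapping) < n_tasks:
                # Unmapped task communicating most with already-mapped tasks.
                task = max((t for t in range(n_tasks) if t not in mapping),
                           key=lambda t: sum(v for u, v in adj[t].items()
                                             if u in mapping))
                # Free core minimizing cost to the task's mapped neighbors.
                core = min(free, key=lambda c: sum(v * dist[c][mapping[u]]
                                                   for u, v in adj[task].items()
                                                   if u in mapping))
                mapping[task] = core
                free.remove(core)
            return mapping

        # Cores 0..3 on a 2x2 mesh with Manhattan hop distances (made up).
        coords = {0: (0, 0), 1: (0, 1), 2: (1, 0), 3: (1, 1)}
        dist = {p: {q: abs(xp - xq) + abs(yp - yq)
                    for q, (xq, yq) in coords.items()}
                for p, (xp, yp) in coords.items()}
        comm = {(0, 1): 8, (1, 2): 8, (2, 3): 1, (0, 3): 1}
        print(greedy_map(comm, dist, 4, coords))

    Each placement is a local decision, which is what keeps such greedy schemes fast relative to global search; production implementations add smarter data structures and refinement passes.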