Extreme-scale parallel computing systems will have tens of thousands of optionally accelerator-equipped nodes with hundreds of cores each, as well as deep memory hierarchies and complex interconnect topologies. Such Exascale systems will provide hardware parallelism at multiple levels and will be energy constrained. Their extreme scale and the rapidly deteriorating reliability of their hardware components mean that Exascale systems will exhibit low mean-time-between-failure values. Furthermore, existing programming models already require heroic programming and optimisation efforts to achieve high efficiency on current supercomputers. Invariably, these efforts are platform-specific and non-portable. In this paper we explore the shortcomings of existing programming models and runtime systems for large-scale computing systems. We then propose and discuss important features of programming paradigms and runtime systems for large-scale computing systems, with a special focus on data-intensive applications and resilience. Finally, we discuss code sustainability issues and propose several software metrics that are of paramount importance for code development for large-scale computing systems.

Astsatryan, Hrachya

Da Costa, Georges

Fahringer, Thomas

Grasso, Ivan

Hristov, Atanas

Karatza, Helen D.

Lastovetsky, Alexey

Marozzo, Fabrizio

Petcu, Dana

Rico-Gallego, Juan-Antonio

Stavrinides, Georgios L.

Talia, Domenico

Trunfio, Paolo

English

Open Archive Toulouse Archive Ouverte

Exascale machines require new programming paradigms and runtimes

Open Access Repository

Scientific Publications of the University of Toulouse II Le Mirail

https://www.openaccessrepository.it/record/51976/files/fulltext.pdf
