12 research outputs found

    Enhancing speed and scalability of the ParFlow simulation code

    Regional hydrology studies are often supported by high-resolution simulations of subsurface flow that require expensive and extensive computations. Efficient usage of the latest high-performance parallel computing systems therefore becomes a necessity. The simulation software ParFlow has been demonstrated to meet this requirement, showing excellent solver scalability for up to 16,384 processes. In the present work we show that the code requires further enhancements in order to fully take advantage of current petascale machines. We identify ParFlow's parallelization of the computational mesh as a central bottleneck. We propose to reorganize this subsystem using the fast mesh-partition algorithms provided by the parallel adaptive mesh refinement library p4est. We realize this in a minimally invasive manner by modifying selected parts of the code to reinterpret the existing mesh data structures. We evaluate the scaling performance of the modified version of ParFlow, demonstrating good weak and strong scaling on up to 458k cores of the Juqueen supercomputer, and test an example application at large scale.
    Comment: The final publication is available at link.springer.co
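The reorganization rests on space-filling-curve partitioning as provided by p4est. As a rough illustration of the underlying idea (a hypothetical toy sketch in Python, not ParFlow or p4est code), the cells of a uniform 2D grid can be ordered along the Morton (Z-order) curve and the curve cut into one nearly equal segment per process:

```python
def morton_index(x, y, bits=16):
    """Interleave the bits of (x, y) to get the cell's position
    on the Morton (Z-order) space-filling curve."""
    z = 0
    for i in range(bits):
        z |= ((x >> i) & 1) << (2 * i)       # x bits at even positions
        z |= ((y >> i) & 1) << (2 * i + 1)   # y bits at odd positions
    return z

def partition(nx, ny, nprocs):
    """Assign each cell of an nx-by-ny grid to a process by cutting
    the Morton curve into nprocs nearly equal segments."""
    cells = sorted((morton_index(x, y), (x, y))
                   for x in range(nx) for y in range(ny))
    n = len(cells)
    owner = {}
    for rank in range(nprocs):
        lo = rank * n // nprocs
        hi = (rank + 1) * n // nprocs
        for _, cell in cells[lo:hi]:
            owner[cell] = rank
    return owner

owner = partition(4, 4, 4)   # 16 cells over 4 ranks -> 4 cells each
```

Because the curve preserves spatial locality, each curve segment tends to map to a compact set of cells, which keeps the communication surface between processes small.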

    Strong scaling for numerical weather prediction at petascale with the atmospheric model NUMA

    Numerical weather prediction (NWP) has proven to be computationally challenging due to its inherent multiscale nature. Currently, the highest resolution NWP models use a horizontal resolution of approximately 15 km. At this resolution, many important processes in the atmosphere are not resolved, which inevitably introduces errors. In order to increase the resolution of NWP models, highly scalable atmospheric models are needed. The Non-hydrostatic Unified Model of the Atmosphere (NUMA), developed by the authors at the Naval Postgraduate School, was designed to achieve this purpose. NUMA is used by the Naval Research Laboratory, Monterey, as the engine inside its next-generation weather prediction system NEPTUNE. NUMA solves the fully compressible Navier-Stokes equations by means of high-order Galerkin methods (both spectral element and discontinuous Galerkin methods can be used). Mesh generation is done using the p4est library. NUMA is capable of running middle and upper atmosphere simulations since it does not make use of the shallow-atmosphere approximation. This paper presents the performance analysis and optimization of the spectral element version of NUMA. The performance at different optimization stages is analyzed using hardware counters with the help of the Hardware Performance Monitor Toolkit as well as the PAPI library. Machine-independent optimization is compared to machine-specific optimization using BG/Q vector intrinsics. By using vector intrinsics, the main computations reach 1.2 PFlops on the entire machine Mira. The paper also presents scalability studies for two idealized test cases that are relevant for NWP applications. The atmospheric model NUMA delivers an excellent strong scaling efficiency of 99% on the entire supercomputer Mira using a mesh with 1.8 billion grid points.
    This allows us to run a global forecast of a baroclinic wave test case at 3 km uniform horizontal resolution and double precision within the time frame required for operational weather prediction.
    Financial support for the work presented in this paper was provided by the Office of Naval Research through Program Element PE-0602435N, the Air Force Office of Scientific Research through the Computational Mathematics program, and the National Science Foundation (Division of Mathematical Sciences) through program element 121760. AM, MK, and SM are grateful to the National Research Council of the National Academies. Approved for public release; distribution is unlimited.
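The quoted 99% strong scaling efficiency follows the standard definition: achieved speedup divided by the ideal linear speedup at a fixed problem size. A small Python helper makes this concrete (the numbers below are illustrative, not measurements from the paper):

```python
def strong_scaling_efficiency(t_base, p_base, t, p):
    """Strong scaling: fixed problem size, growing process count.
    Efficiency = achieved speedup / ideal linear speedup."""
    speedup = t_base / t          # how much faster the run actually got
    ideal = p / p_base            # how much faster it would ideally get
    return speedup / ideal

# illustrative: quadrupling the processes cuts runtime by ~3.96x
eff = strong_scaling_efficiency(100.0, 1024, 25.25, 4096)  # ~0.99
```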

    Lattice-Boltzmann simulations on multiple GPUs

    This thesis concerns the implementation of the lattice-Boltzmann method on multiple graphics cards. The lattice-Boltzmann method is a well-known and popular method for computing hydrodynamic simulations. Space is discretized by a lattice, and the computation operates on populations, described by a distribution function, that move across the lattice. The work builds on the software package ESPResSo, which is extended by the presented implementation, and on the library p4est, which is used to create and manage an octree-based grid. The grid created by p4est is additionally refined into patches, which are then processed in parallel on the graphics cards by CUDA code. The presented implementation uses the Message Passing Interface and is designed to run on large machines and to achieve good scaling.
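As a rough sketch of the numerics involved (a self-contained toy in pure Python, not the ESPResSo/CUDA implementation described above), one time step of the D2Q9 lattice-Boltzmann method with BGK collision looks as follows:

```python
# D2Q9 lattice: nine discrete velocities and their weights
C = [(0, 0), (1, 0), (0, 1), (-1, 0), (0, -1),
     (1, 1), (-1, 1), (-1, -1), (1, -1)]
W = [4/9] + [1/9] * 4 + [1/36] * 4

def equilibrium(rho, ux, uy):
    """Second-order expansion of the Maxwell-Boltzmann equilibrium."""
    usq = ux * ux + uy * uy
    feq = []
    for (cx, cy), w in zip(C, W):
        cu = cx * ux + cy * uy
        feq.append(w * rho * (1 + 3 * cu + 4.5 * cu * cu - 1.5 * usq))
    return feq

def step(f, nx, ny, tau=1.0):
    """One BGK collision + streaming step on a periodic nx*ny grid.
    f[i][x][y] holds the population moving with velocity C[i]."""
    # collision: relax each cell's populations toward equilibrium
    for x in range(nx):
        for y in range(ny):
            rho = sum(f[i][x][y] for i in range(9))
            ux = sum(C[i][0] * f[i][x][y] for i in range(9)) / rho
            uy = sum(C[i][1] * f[i][x][y] for i in range(9)) / rho
            feq = equilibrium(rho, ux, uy)
            for i in range(9):
                f[i][x][y] += (feq[i] - f[i][x][y]) / tau
    # streaming: move populations along their lattice velocities
    g = [[[0.0] * ny for _ in range(nx)] for _ in range(9)]
    for i, (cx, cy) in enumerate(C):
        for x in range(nx):
            for y in range(ny):
                g[i][(x + cx) % nx][(y + cy) % ny] = f[i][x][y]
    return g
```

On GPUs, the loops over lattice sites are what the CUDA kernels parallelize, typically one thread per cell.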

    Strong scaling for numerical weather prediction at petascale with the atmospheric model NUMA

    The article of record as published may be found at https://doi.org/10.1177/1094342018763966
    Numerical weather prediction (NWP) has proven to be computationally challenging due to its inherent multiscale nature. Currently, the highest resolution global NWP models use a horizontal resolution of 9 km. At this resolution, many important processes in the atmosphere are not resolved. Needless to say, this introduces errors. In order to increase the resolution of NWP models, highly scalable atmospheric models are needed. The non-hydrostatic unified model of the atmosphere (NUMA), developed by the authors at the Naval Postgraduate School, was designed to achieve this purpose. NUMA is used by the Naval Research Laboratory, Monterey as the engine inside its next generation weather prediction system NEPTUNE. NUMA solves the fully compressible Navier–Stokes equations by means of high-order Galerkin methods (both spectral element as well as discontinuous Galerkin methods can be used). NUMA is capable of running middle and upper atmosphere simulations since it does not make use of the shallow-atmosphere approximation. This article presents the performance analysis and optimization of the spectral element version of NUMA. The performance at different optimization stages is analyzed using a theoretical performance model as well as measurements via hardware counters. Machine-independent optimization is compared to machine-specific optimization using Blue Gene (BG)/Q vector intrinsics. The best portable version of the main computations was found to be about two times slower than the best non-portable version. By using vector intrinsics, the main computations reach 1.2 PFlops on the entire IBM Blue Gene supercomputer Mira (12% of the theoretical peak performance). The article also presents scalability studies for two idealized test cases that are relevant for NWP applications.
    The atmospheric model NUMA delivers an excellent strong scaling efficiency of 99% on the entire supercomputer Mira using a mesh with 1.8 billion grid points. This allows running a global forecast of a baroclinic wave test case at a 3-km uniform horizontal resolution and double precision within the time frame required for operational weather prediction.
    This work was supported by the Office of Naval Research (PE-0602435 N), the Air Force Office of Scientific Research (Computational Mathematics program), and the National Science Foundation (Division of Mathematical Sciences; 121670). This research used resources of the Argonne Leadership Computing Facility, which is a DOE Office of Science User Facility supported under Contract DE-AC02-06CH11357.

    Coupling of particle simulation and lattice Boltzmann background flow on adaptive grids

    The lattice-Boltzmann method as well as classical molecular dynamics are established and widely used methods for the simulation and study of soft matter. Molecular dynamics is a computer simulation technique on microscopic scales that solves the multi-body kinetic equations of the involved particles. The lattice-Boltzmann method describes the hydrodynamic interactions of fluids, gases, or other soft matter on a coarser scale. Many applications, however, are multi-scale problems and require a coupling of both methods. A basic concept for short-ranged interactions in molecular dynamics is the linked cells algorithm, which runs in O(N) time for homogeneously distributed particles. Spatially adaptive methods for the lattice-Boltzmann scheme are used in order to reduce costly scaling effects on runtime and memory for large-scale simulations. As the basis for this work, the highly flexible simulation software ESPResSo is used and extended. The adaptive lattice-Boltzmann scheme implemented in ESPResSo uses a domain decomposition with tree-based grids along the space-filling Morton curve using the p4est software library. However, coupling the regular particle simulation with the adaptive lattice-Boltzmann method on highly parallel computer architectures is a challenging issue that raises several problems. In this work, an approach for the domain decomposition of the linked cells algorithm based on space-filling curves and the p4est library is presented. In general, the grids for molecular dynamics and fluid simulations are not equal. Thus, strategies to distribute differently refined grids on parallel processes are explained, including a parallel algorithm to construct the finest common tree using p4est. Furthermore, a method for interpolation and extrapolation on adaptively refined grids, which is needed for the viscous coupling of particles with the fluid, is discussed.
    The ESPResSo simulation software is augmented by the developed methods for particle-fluid coupling as well as the Morton curve based domain decompositions in a minimally invasive manner. The original ESPResSo implementation for regular particle and fluid simulations is used as a reference for the developed algorithms.
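The linked cells algorithm mentioned above achieves O(N) complexity by binning particles into cells no smaller than the interaction cutoff, so only particles in the same or adjacent cells need to be tested. Below is a toy serial Python version for a cubic periodic box (the ESPResSo implementation is of course parallel and considerably more elaborate):

```python
def dist2(a, b, box):
    """Squared minimum-image distance in a cubic periodic box."""
    s = 0.0
    for u, v in zip(a, b):
        d = u - v
        d -= box * round(d / box)   # wrap to the nearest periodic image
        s += d * d
    return s

def linked_cells_pairs(positions, box, cutoff):
    """O(N) short-range pair search: bin particles into cells of edge
    >= cutoff, then test only same-cell and adjacent-cell particles."""
    ncell = max(1, int(box // cutoff))
    size = box / ncell
    cells = {}
    for idx, (x, y, z) in enumerate(positions):
        key = (int(x / size) % ncell, int(y / size) % ncell,
               int(z / size) % ncell)
        cells.setdefault(key, []).append(idx)
    pairs = set()
    for (cx, cy, cz), members in cells.items():
        for dx in (-1, 0, 1):
            for dy in (-1, 0, 1):
                for dz in (-1, 0, 1):
                    nb = ((cx + dx) % ncell, (cy + dy) % ncell,
                          (cz + dz) % ncell)
                    for i in members:
                        for j in cells.get(nb, ()):
                            if i < j and dist2(positions[i], positions[j],
                                               box) <= cutoff ** 2:
                                pairs.add((i, j))
    return pairs
```

Each particle is tested only against the bounded population of its 27 neighboring cells, which is what makes the cost linear in N for homogeneous distributions.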

    Large-Scale Simulations of Complex Turbulent Flows: Modulation of Turbulent Boundary Layer Separation and Optimization of Discontinuous Galerkin Methods for Next-Generation HPC Platforms

    The separation of spatially evolving turbulent boundary layer flow near regions of adverse pressure gradients has been the subject of numerous studies in the context of flow control. Although many studies have demonstrated the efficacy of passive flow control devices, such as vortex generators (VGs), in reducing the size of the separated region, the interactions between the salient flow structures produced by the VG and those of the separated flow are not fully understood. Here, wall-resolved large-eddy simulation of a model problem of flow over a backward-facing ramp is studied, with a submerged, wall-mounted cube serving as a canonical VG. In particular, the turbulent transport that results in the modulation of the separated flow over the ramp is investigated by varying the size and location of the VG and the spanwise spacing between multiple VGs, which in turn are expected to modify the interactions between the VG-induced flow structures and those of the separated region. The horseshoe vortices produced by the cube entrain the freestream turbulent flow towards the plane of symmetry. These localized regions of high vorticity correspond to turbulent kinetic energy production regions, which effectively transfer energy from the freestream to the near-wall regions.
    Numerical simulations indicate that: (i) the gradients and the fluctuations scale with the size of the cube and thus lead to more effective modulation for large cubes; (ii) for a given cube height, the upstream position of the cube affects the behavior of the horseshoe vortex---when placed too close to the leading edge, the horseshoe vortex is not sufficiently strong to affect the large-scale structures of the separated region, and when placed too far, the dispersed core of the streamwise vortex is unable to modulate the flow over the ramp; (iii) if the spanwise spacing between neighboring VGs is too small, the counter-rotating vortices are not sufficiently strong to affect the large-scale structures of the separated region, and if the spacing is too large, the flow modulation is similar to that of an isolated VG. Turbulent boundary layer flows are inherently multiscale, and numerical simulations of such systems often require high spatial and temporal resolution to capture the unsteady flow dynamics accurately. While innovations in computer hardware and distributed computing have enabled advances in the modeling of such large-scale systems, computations of many practical problems of interest remain infeasible, even on the largest supercomputers. The need for high accuracy and the evolving heterogeneous architecture of next-generation high-performance computing centers have spurred interest in the development of high-order methods. While the new class of recovery-assisted discontinuous Galerkin (RADG) methods can provide arbitrarily high orders of accuracy, the large number of degrees of freedom increases the costs associated with the arithmetic operations performed and the amount of data transferred on-node. The purpose of the second part of this thesis is to explore optimization strategies to improve the parallel efficiency of RADG.
    A cache data-tiling strategy is investigated for polynomial orders 1 through 6, which enhances the arithmetic intensity of RADG to make better use of on-node floating-point capability. In addition, a power-aware compute framework is suggested by analyzing the power-performance trade-offs when changing from double- to single-precision floating-point types---energy savings of 5 W per node are observed---which suggests that a transprecision framework will likely offer a better power-performance balance on modern HPC platforms.
    PhD, Mechanical Engineering, University of Michigan, Horace H. Rackham School of Graduate Studies
    http://deepblue.lib.umich.edu/bitstream/2027.42/163206/1/suyashtn_1.pd
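The cache data-tiling idea can be illustrated with the classic blocked matrix multiply (a generic Python sketch, not the actual RADG kernels): processing small tiles keeps the working set resident in cache, so each byte loaded from memory participates in more floating-point operations, i.e. the arithmetic intensity rises.

```python
def matmul_tiled(A, B, n, tile=4):
    """Blocked (tiled) n x n matrix multiply. Looping over tile x tile
    blocks reuses each loaded block many times before eviction, which
    is the mechanism behind cache data-tiling."""
    C = [[0.0] * n for _ in range(n)]
    for ii in range(0, n, tile):
        for kk in range(0, n, tile):
            for jj in range(0, n, tile):
                # multiply one pair of blocks; all operands fit in cache
                for i in range(ii, min(ii + tile, n)):
                    for k in range(kk, min(kk + tile, n)):
                        a = A[i][k]
                        for j in range(jj, min(jj + tile, n)):
                            C[i][j] += a * B[k][j]
    return C
```

In Python the reordering changes nothing observable, but in a compiled kernel the same loop structure is what turns a memory-bound triple loop into one limited by floating-point throughput.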

    Scalable Algorithms for Parallel Tree-based Adaptive Mesh Refinement with General Element Types

    In this thesis, we develop, discuss and implement algorithms for scalable parallel tree-based adaptive mesh refinement (AMR) using space-filling curves (SFCs). We create an AMR software that works independently of the element type used, such as lines, triangles, tetrahedra, quadrilaterals, hexahedra, and prisms. For triangular and tetrahedral elements (simplices) with red-refinement (1:4 in 2D, 1:8 in 3D), we develop a new SFC, the tetrahedral Morton space-filling curve (TM-SFC). Its construction is similar to the Morton index for quadrilaterals/hexahedra, as it is also based on bitwise interleaving the coordinates of a certain vertex of the simplex, the anchor node. Additionally, we interleave with a new piece of information, the so-called type. For these simplices, we develop element-local algorithms such as constructing the parent, children, or face-neighbors of a simplex, and show that most of them are constant-time operations independent of the refinement level. With SFC-based partitioning it is possible that the mesh elements partitioned to one process do not form a face-connected domain. We prove the following upper bounds for the number of face-connected components of segments of the TM-SFC: with a maximum refinement level of L, the number of face-connected components is bounded by 2(L − 1) in 2D and 2L + 1 in 3D. Additionally, we perform a numerical investigation of the distribution of lengths of SFC segments. Furthermore, we develop a new approach to partition and repartition a coarse (input) mesh among the processes. Compared to previous methods, it optimizes for fine-mesh load balance and reduces the parallel communication of coarse mesh data. We discuss the coarse mesh repartitioning algorithm and demonstrate that our method repartitions a coarse mesh of 371e9 trees on 917,504 processes (405,000 trees per process) on the Juqueen supercomputer in 1.2 seconds.
    We develop an AMR concept that works independently of the element type, achieving this independence by strictly distinguishing between functions that operate on the whole mesh (high-level) and functions that locally operate on a single element or a small set of elements (low-level). We discuss a new approach to generate and manage ghost elements that fits into our element-type independent approach. We define and describe the necessary low-level algorithms. Our main idea is the computation of tree-to-tree face-neighbors of an element via the explicit construction of the element's face as a lower-dimensional element. In order to optimize the runtime of this method, we enhance the algorithm with a top-down search method from Isaac, Burstedde, Wilcox, and Ghattas, and demonstrate how it speeds up the computation by factors of 10 to 20, achieving runtimes comparable to state-of-the-art implementations with fixed element types. With the ghost algorithm we build a straightforward ripple version of the 2:1 balance algorithm. This is not an optimized version, but it serves as a feasibility study for our element-type independent approach. We implement all algorithms that we develop in this thesis in the new AMR library t8code. Our modular approach allows us to reuse existing software, which we demonstrate by using the library p4est for quadrilateral and hexahedral elements. In a concurrent Bachelor's thesis by David Knapp (INS, Bonn), the necessary low-level algorithms for prisms were developed. With t8code we demonstrate that we can create, adapt, (re-)partition, and balance meshes, as well as create and manage a ghost layer. In various tests we show excellent strong and weak scaling behavior of our algorithms on up to 917,504 parallel processes on the Juqueen and Mira supercomputers using up to 858e9 mesh elements. We conclude this thesis by demonstrating how an application can be coupled with the AMR routines.
    We implement a finite-volume based advection solver using t8code and show applications with triangular, quadrilateral, tetrahedral, and hexahedral elements, as well as 2D and 3D hybrid meshes, the latter consisting of tetrahedra, hexahedra, and prisms. Overall, we develop and demonstrate a new simplicial SFC and create a fast and scalable tree-based AMR software that offers a flexibility and generality that was previously unavailable.
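The constant-time element-local operations that tree-based AMR relies on can be sketched for the simple quadrilateral (1:4 refinement) case, using p4est-style integer anchor coordinates on a 2^max_level grid (a simplified illustration; the actual t8code and p4est APIs differ, and the thesis's TM-SFC additionally carries a type for simplices):

```python
def children(x, y, level, max_level):
    """Constant-time construction of the four children of a quadrant
    under 1:4 (red) refinement: each child anchor is the parent anchor
    shifted by half the parent's edge length."""
    h = 1 << (max_level - level - 1)   # child edge length in integer coords
    return [(x, y, level + 1),
            (x + h, y, level + 1),
            (x, y + h, level + 1),
            (x + h, y + h, level + 1)]

def parent(x, y, level, max_level):
    """Constant-time parent: mask off the anchor bits below the
    parent's refinement level."""
    h = 1 << (max_level - level)       # this element's edge length
    return (x & ~(2 * h - 1), y & ~(2 * h - 1), level - 1)
```

Because both operations are pure bit arithmetic on the anchor coordinates, their cost is independent of the refinement level, which is the property the thesis establishes for its simplicial algorithms as well.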