Search CORE

22 research outputs found

Software for Exascale Computing - SPPEXA 2016-2019

Author
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

This open access book summarizes the research done and results obtained in the second funding phase of the Priority Program 1648 "Software for Exascale Computing" (SPPEXA) of the German Research Foundation (DFG) presented at the SPPEXA Symposium in Dresden during October 21-23, 2019. In that respect, it both represents a continuation of Vol. 113 in Springer’s series Lecture Notes in Computational Science and Engineering, the corresponding report of SPPEXA’s first funding phase, and provides an overview of SPPEXA’s contributions towards exascale computing in today's sumpercomputer technology. The individual chapters address one or more of the research directions (1) computational algorithms, (2) system software, (3) application software, (4) data management and exploration, (5) programming, and (6) software tools. The book has an interdisciplinary appeal: scholars from computational sub-fields in computer science, mathematics, physics, or engineering will find it of particular interest

OAPEN Library

Recommended from our members

A scalable hp-adaptive finite element software with applications in fiber optics

Author: Henneking Stefan Klaus Wilhelm
Publication venue
Publication date: 05/07/2021
Field of study

In this dissertation, we present a scalable parallel version of hp3D—a finite element (FE) software for analysis and discretization of complex three-dimensional multiphysics applications. The developed software supports hybrid MPI/OpenMP parallelism for large-scale computation on modern manycore architectures. The focus of the effort lies on the development and optimization of the parallel software infrastructure underlying all distributed computation. We discuss the challenges of designing efficient data structures for isotropic and anisotropic hp-adaptive meshes with tetrahedral, hexahedral, prismatic, and pyramidal elements supporting discretization of the exact sequence energy spaces. While the code supports standard Galerkin methods, special emphasis is given to systems arising from discretization with the discontinuous Petrov–Galerkin (DPG) method. The method guarantees discrete stability by employing locally optimal test functions, and it has a built-in error indicator which we exploit to guide mesh adaptivity. In addition to interfacing with third-party packages for various tasks, we have developed our own tools including a parallel nested dissection solver suitable for scalable FE computation of waveguide geometries. We present weak-scaling results with up to 24576 CPU cores and numerical simulations with more than one billion degrees of freedom. The new software capabilities enable solution of challenging wave propagation problems with important applications in acoustics, elastodynamics, and electromagnetics. These applications are difficult to solve in the high-frequency regime because the FE discretization suffers from significant numerical pollution errors that increase with the wavenumber. It is critical to control these errors to obtain a stable and accurate method. We study the pollution effect for waveguide problems with more than 8000 wavelengths in the context of robust DPG FE discretizations for the time-harmonic Maxwell equations. We also discuss adaptive refinement strategies for multi-mode fiber waveguides where the propagating transverse modes must be resolved sufficiently. Our study shows the applicability of the DPG error indicator to this class of problems. Finally, we present both modeling and computational advancements to a unique three-dimensional DPG FE model for the simulation of laser amplification in a fiber amplifier. Fiber laser amplifiers are of interest in communication technology, medical applications, military defense capabilities, and various other fields. Silica fiber amplifiers can achieve high-power operation with great efficiency. At high optical intensities, multi-mode amplifiers suffer from undesired thermal coupling effects which pose a major obstacle in power-scaling of such devices. Our nonlinear 3D vectorial model is based on the time-harmonic Maxwell equations, and it incorporates both amplification via an active dopant and thermal effects via coupling with the heat equation. The model supports co-, counter-, and bi-directional pumping configurations, as well as inhomogeneous and anisotropic material properties. The high-fidelity simulation comes at the cost of a high-order FE discretization with many degrees of freedom per wavelength. To make the computation more feasible, we have developed a novel longitudinal model rescaling, using artificial material parameters with the goal of preserving certain quantities of interest. Numerical tests demonstrate the applicability and utility of this scaled model in the simulation of an ytterbium-doped, step-index fiber amplifier that experiences laser amplification and heating.Computational Science, Engineering, and Mathematic

Texas ScholarWorks

A Distributed and Parallel Asynchronous Unite and Conquer Method to Solve Large Scale Non-Hermitian Linear Systems

Author: Petiton Serge
WU Xinzhe
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 28/01/2018
Field of study

International audienceParallel Krylov Subspace Methods are commonly used for solving large-scale sparse linear systems. Facing the development of extreme scale platforms, the minimization of synchronous global communication becomes critical to obtain good efficiency and scal-ability. This paper highlights a recent development of a hybrid (unite and conquer) method, which combines three computation algorithms together with asynchronous communication to accelerate the resolution of non-Hermitian linear systems and to improve its fault tolerance and reusability. Experimentation shows that our method has an up to 5× speedup and better scalability than the conventional methods for the resolution on hierarchical clusters with hundreds of nodes

INRIA a CCSD electronic archive server

43rd International Symposium on Mathematical Foundations of Computer Science: MFCS 2018, August 27-31, 2018, Liverpool, United Kingdom

Author: International Symposium on Mathematical Foundations of Computer Science <43. 2018, Liverpool>
Publication venue: Schloss Dagstuhl - Leibniz-Zentrum für Informatik GmbH, Dagstuhl Publishing
Publication date: 01/08/2018
Field of study

Digitale Bibliothek Thüringen

Annual Research Report 2002

Author: Weierstrass-Institut für Angewandte Analysis und Stochastik (Berlin)
Publication venue
Publication date: 01/01/2002
Field of study

Publications Server of the Weierstrass Institute for Applied Analysis and Stochastics

Naval Postgraduate School Academic Catalog - September 2022

Author
Publication venue: Monterey, California; Naval Postgraduate School
Publication date: 01/09/2022
Field of study

Calhoun, Institutional Archive of the Naval Postgraduate School

Naval Postgraduate School Academic Catalog - January 2021

Author
Publication venue: Monterey, California; Naval Postgraduate School
Publication date: 01/01/2021
Field of study

Calhoun, Institutional Archive of the Naval Postgraduate School

A Distributed and Parallel Asynchronous Unite and Conquer Method to Solve Large Scale Non-Hermitian Linear Systems with Multiple Right-hand Sides

Author: Petiton Serge
Wu Xinzhe
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

International audienceMany problems in the field of science and engineering often require to solve simultaneously large-scale non-Hermitian sparse linear systems with multiple right-hand sides (RHSs). Efficiently solving such problems on extreme-scale platforms requires the minimization of global communications, reduction of synchronization points and promotion of asynchronous communications. We develop an extension of the Unite and Conquer GMRES/LS-ERAM (UCGLE) method [1] by combining it with Block GMRES method to solve non-Hermitian linear systems with multiple RHSs. UCGLE is a hybrid method consisting of three computing algorithms with asynchronous communication that allow the use of approximate eigenvalues to accelerate to solve of linear systems and to improve their fault tolerance. In this paper, the variant of UCGLE with novel components and manager engine implementations is introduced. This engine is capable of allocating multiple Block GMRES at the same time, each Block GMRES solving the linear systems with a subset of RHSs and accelerating the convergence using the eigenvalues approximated by other eigensolvers. Dividing the entire linear system with multiple RHSs into subsets and solving them simultaneously with different allocated linear solvers allow localizing calculations, reducing global communication, and improving parallel performance. Meanwhile, the asynchronous preconditioning using eigenvalues is able to speed up the convergence and improve the fault tolerance and reusability. Numerical experiments using different test matrices on supercomputer ROMEO indicate that the proposed method achieves a substantial decrease in both computation time and iterative steps with good scaling performance

INRIA a CCSD electronic archive server