Search CORE

8 research outputs found

Implementation and analysis of a parallel vertex-centered finite element segmental refinement multigrid solver

Author: Henneking Stefan
Publication venue: Georgia Institute of Technology
Publication date: 27/05/2016
Field of study

In a parallel vertex-centered finite element multigrid solver, segmental refinement can be used to avoid all inter-process communication on the fine grids. While domain decomposition methods generally require coupled subdomain processing for the numerical solution to a nonlinear elliptic boundary value problem, segmental refinement exploits that subdomains are almost decoupled with respect to high-frequency error components. This allows to perform multigrid with fully decoupled subdomains on the fine grids, which was proposed as a sequential low-storage algorithm by Brandt in the 1970s, and as a parallel algorithm by Brandt and Diskin in 1994. Adams published the first numerical results from a multilevel segmental refinement solver in 2014, confirming the asymptotic exactness of the scheme for a cell-centered finite volume implementation. We continue Brandt’s and Adams’ research by experimentally investigating the scheme’s accuracy with a vertex-centered finite element segmental refinement solver. We confirm that full multigrid accuracy can be preserved for a few segmental refinement levels, although we observe a different dependency on the segmental refinement parameter space. We show that various strategies for the grid transfers between the finest conventional multigrid level and the segmental refinement subdomains affect the solver accuracy. Scaling results are reported for a Cray XC30 with up to 4096 cores.M.S

Scholarly Materials And Research @ Georgia Tech

An Anisotropic $hp$ -Adaptation Framework for Ultraweak Discontinuous Petrov-Galerkin Formulations

Author: Chakraborty Ankit
Demkowicz Leszek
Henneking Stefan
Publication venue
Publication date: 01/09/2023
Field of study

In this article, we present a three-dimensional anisotropic

hp

-mesh refinement strategy for ultraweak discontinuous Petrov--Galerkin (DPG) formulations with optimal test functions. The refinement strategy utilizes the built-in residual-based error estimator accompanying the DPG discretization. The refinement strategy is a two-step process: (a) use the built-in error estimator to mark and isotropically

hp

-refine elements of the (coarse) mesh to generate a finer mesh; (b) use the reference solution on the finer mesh to compute optimal

h

- and

p

-refinements of the selected elements in the coarse mesh. The process is repeated with coarse and fine mesh being generated in every adaptation cycle, until a prescribed error tolerance is achieved. We demonstrate the performance of the proposed refinement strategy using several numerical examples on hexahedral meshes

arXiv.org e-Print Archive

Stability Analysis for Electromagnetic Waveguides. Part 1: Acoustic and Homogeneous Electromagnetic Waveguides

Author: Demkowicz Leszek
Henneking Stefan
Melenk Jens Markus
Publication venue
Publication date: 10/07/2023
Field of study

In a time-harmonic setting, we show for heterogeneous acoustic and homogeneous electromagnetic wavesguides stability estimates with the stability constant depending linearly on the length

L

of the waveguide. These stability estimates are used for the analysis of the (ideal) ultraweak (UW) variant of the Discontinuous Petrov Galerkin (DPG) method. For this UW DPG, we show that the stability deterioration with

L

can be countered by suitably scaling the test norm of the method. We present the ``full envelope approximation'', a UW DPG method based on non-polynomial ansatz functions that allows for treating long waveguides

arXiv.org e-Print Archive

Recommended from our members

A scalable hp-adaptive finite element software with applications in fiber optics

Author: Henneking Stefan Klaus Wilhelm
Publication venue
Publication date: 05/07/2021
Field of study

In this dissertation, we present a scalable parallel version of hp3D—a finite element (FE) software for analysis and discretization of complex three-dimensional multiphysics applications. The developed software supports hybrid MPI/OpenMP parallelism for large-scale computation on modern manycore architectures. The focus of the effort lies on the development and optimization of the parallel software infrastructure underlying all distributed computation. We discuss the challenges of designing efficient data structures for isotropic and anisotropic hp-adaptive meshes with tetrahedral, hexahedral, prismatic, and pyramidal elements supporting discretization of the exact sequence energy spaces. While the code supports standard Galerkin methods, special emphasis is given to systems arising from discretization with the discontinuous Petrov–Galerkin (DPG) method. The method guarantees discrete stability by employing locally optimal test functions, and it has a built-in error indicator which we exploit to guide mesh adaptivity. In addition to interfacing with third-party packages for various tasks, we have developed our own tools including a parallel nested dissection solver suitable for scalable FE computation of waveguide geometries. We present weak-scaling results with up to 24576 CPU cores and numerical simulations with more than one billion degrees of freedom. The new software capabilities enable solution of challenging wave propagation problems with important applications in acoustics, elastodynamics, and electromagnetics. These applications are difficult to solve in the high-frequency regime because the FE discretization suffers from significant numerical pollution errors that increase with the wavenumber. It is critical to control these errors to obtain a stable and accurate method. We study the pollution effect for waveguide problems with more than 8000 wavelengths in the context of robust DPG FE discretizations for the time-harmonic Maxwell equations. We also discuss adaptive refinement strategies for multi-mode fiber waveguides where the propagating transverse modes must be resolved sufficiently. Our study shows the applicability of the DPG error indicator to this class of problems. Finally, we present both modeling and computational advancements to a unique three-dimensional DPG FE model for the simulation of laser amplification in a fiber amplifier. Fiber laser amplifiers are of interest in communication technology, medical applications, military defense capabilities, and various other fields. Silica fiber amplifiers can achieve high-power operation with great efficiency. At high optical intensities, multi-mode amplifiers suffer from undesired thermal coupling effects which pose a major obstacle in power-scaling of such devices. Our nonlinear 3D vectorial model is based on the time-harmonic Maxwell equations, and it incorporates both amplification via an active dopant and thermal effects via coupling with the heat equation. The model supports co-, counter-, and bi-directional pumping configurations, as well as inhomogeneous and anisotropic material properties. The high-fidelity simulation comes at the cost of a high-order FE discretization with many degrees of freedom per wavelength. To make the computation more feasible, we have developed a novel longitudinal model rescaling, using artificial material parameters with the goal of preserving certain quantities of interest. Numerical tests demonstrate the applicability and utility of this scaled model in the simulation of an ytterbium-doped, step-index fiber amplifier that experiences laser amplification and heating.Computational Science, Engineering, and Mathematic

Texas ScholarWorks

Scalable DPG Multigrid Solver for Helmholtz Problems: A Study on Convergence

Author: Badger Jacob
Demkowicz Leszek
Henneking Stefan
Petrides Socratis
Publication venue
Publication date: 04/05/2023
Field of study

This paper presents a scalable multigrid preconditioner targeting large-scale systems arising from discontinuous Petrov-Galerkin (DPG) discretizations of high-frequency wave operators. This work is built on previously developed multigrid preconditioning techniques of Petrides and Demkowicz (Comput. Math. Appl. 87 (2021) pp. 12-26) and extends the convergence results from

\mathcal{O}(10^7)

degrees of freedom (DOFs) to

\mathcal{O}(10^9)

DOFs using a new scalable parallel MPI/OpenMP implementation. Novel contributions of this paper include an alternative definition of coarse-grid systems based on restriction of fine-grid operators, yielding superior convergence results. In the uniform refinement setting, a detailed convergence study is provided, demonstrating h and p robust convergence and linear dependence with respect to the wave frequency. The paper concludes with numerical results on hp-adaptive simulations including a large-scale seismic modeling benchmark problem with high material contrast

arXiv.org e-Print Archive

Recommended from our members

Multi-GPU FFT Matvec for Inverse Problems Involving Shift-Invariant Systems

Author: Fernando Milinda
Ghattas Omar
Henneking Stefan
Venkat Sreeram
Publication venue
Publication date: 28/09/2023
Field of study

Hessian-based algorithms for the solution to inverse problems typically require many actions of the Hessian matrix on a vector (matvecs). A direct approach is often computationally intractable for problems with high-dimensional parameter fields or expensive-to-evaluate forward models. For systems that exhibit shift-invariance (e.g. autonomous systems) structure in their discretized form, the discretized linear parameter-to-observable (p2o) maps are block Toeplitz matrices. Moreover, considering causality for time-invariant systems the p2o map and its adjoint are lower- and upper-triangular block Toeplitz, respectively. By exploiting this structure, Hessian matrices for these types of systems can be compactly represented and Hessian matvecs can be efficiently computed through scalable multi-GPU FFT matvecs. Compact representation follows directly from the definition of Block Toeplitz matrices. Fast matrix-vector multiplication is achieved by embedding the block Toeplitz matrix within a block circulant matrix which is diagonalized by the Discrete Fourier Transform. The matrix-vector product then becomes an element-wise vector operation in Fourier space. Furthermore, the action of the adjoint p2o map corresponds to simply applying the complex conjugate in Fourier space, eliminating the need to separately store the Fourier-transformed forward and adjoint map. Exploiting the triangular block Toeplitz structure in this way yields memory savings proportional to the number of time steps Nt and a computational speedup of O(Nt/ log(Nt)). In the context of explicit methods that are suitable for GPU-based computation, the number of time steps is typically very large due to the CFL condition, making the savings of the algorithm substantial. We develop a multi-GPU FFT matvec code for Hessians corresponding to block Toeplitz p2o matrices utilizing the cuFFT and NCCL libraries. Our implementation achieves 75-90% of the maximum memory bandwidth on NVIDIA A100 80GB GPUs for all custom GPU kernels — which correspond to memory-bound operations. We also show strong and weak multi-GPU scaling on the Frontera RTX nodes with up to 81 GPUs.Texas Advanced Computing Center (TACC

Texas ScholarWorks

Elements:Software A Scalable Open-Source hp-Adaptive FE Software for Complex Multiphysics Applications

Author: Ankit Chakraborty (17041194)
Jacob Badger (17041265)
Leszek Demkowicz (17041260)
Stefan Henneking (17041264)
Publication venue
Publication date: 25/09/2023
Field of study

Computer models can be used to augment, inform, and even replace expensive experimental measurements in science and engineering. However, complex models of engineering applications can quickly exceed computational capability, driving the need for advanced simulation tools. Applications in high-frequency wave simulation--such as submarine sonar (acoustics), fiber optics (electromagnetics), and structural analysis (elastodynamics)--pose a significant challenge for large-scale simulation. This project advances computational modeling capabilities through the development, documentation, and dissemination of a leading-edge simulation software. The effort builds on decades-long research and code development by the investigators and their project collaborators. Distributed as open-source, the software is accessible to the broader scientific community, thereby contributing to fundamental research and education for computer modeling in science and engineering. Furthermore, the project expands the national workforce by training young computational mathematicians at the graduate and postdoctoral levels. The project results are disseminated through conference presentations, workshops and seminars, as well as publications in scientific journals.The hp3D software leverages hybrid MPI/OpenMP parallelism to run efficiently on NSF extreme-scale computing facilities and interfaces with state-of-the-art third-party scientific libraries. In addition to publishing the hp3D code and documentation, this project focuses on the development of a scalable multigrid (MG) solver based on the pre-asymptotically stable discontinuous Petrov-Galerkin (DPG) finite element method. This DPG-MG solver represents a significant advancement in solver technology as 1) the first robust, scalable solver for problems with highly-indefinite operators, such as high-frequency wave propagation; and 2) the first multigrid solver with support for fully anisotropic hp-adaptive hybrid meshes and a reliable built-in error indicator. Serial implementations of the DPG-MG solver have demonstrated near-linear scaling with respect to degrees of freedom in both time and memory; its parallel implementation significantly expands scientific compute capabilities and enables solution of currently intractable problems in 3D wave simulation.</p

FigShare