Search CORE

1,507 research outputs found

Solution of partial differential equations on vector and parallel computers

Author: Ortega J. M.
Voigt R. G.
Publication venue
Publication date
Field of study

The present status of numerical methods for partial differential equations on vector and parallel computers was reviewed. The relevant aspects of these computers are discussed and a brief review of their development is included, with particular attention paid to those characteristics that influence algorithm selection. Both direct and iterative methods are given for elliptic equations as well as explicit and implicit methods for initial boundary value problems. The intent is to point out attractive methods as well as areas where this class of computer architecture cannot be fully utilized because of either hardware restrictions or the lack of adequate algorithms. Application areas utilizing these computers are briefly discussed

NASA Technical Reports Server

Simulation of Laser Propagation in a Plasma with a Frequency Wave Equation

Author: Ballereau
Benamou
Berenger
Dhillon
Dorr
Doumic
Erlangga
F. Nataf
Greenbaum
Heller
Horn
Hüller
Jourdren
Merle
Meurant
Parlett
Plessix
R. Sentis
Rossi
Rutishauser
S. Desroziers
Saad
Sentis
Sentis
Sulem
Publication venue: 'Elsevier BV'
Publication date: 02/05/2007
Field of study

The aim of this work is to perform numerical simulations of the propagation of a laser in a plasma. At each time step, one has to solve a Helmholtz equation in a domain which consists in some hundreds of millions of cells. To solve this huge linear system, one uses a iterative Krylov method with a preconditioning by a separable matrix. The corresponding linear system is solved with a block cyclic reduction method. Some enlightments on the parallel implementation are also given. Lastly, numerical results are presented including some features concerning the scalability of the numerical method on a parallel architecture

arXiv.org e-Print Archive

Crossref

HAL-CEA

Hal-Diderot

A Direct Elliptic Solver Based on Hierarchically Low-rank Schur Complements

Author: A. Aminfar
B.L. Buzbee
I. Ibragimov
J. Xia
J. Xia
J. Xia
L. Grasedyck
P. Amestoy
P. Swarztrauber
P.G. Schmitz
P.G. Schmitz
R.W. Hockney
S. Ambikasaran
S. Chandrasekaran
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/04/2016
Field of study

A parallel fast direct solver for rank-compressible block tridiagonal linear systems is presented. Algorithmic synergies between Cyclic Reduction and Hierarchical matrix arithmetic operations result in a solver with

O(N \log^2 N)

arithmetic complexity and

O(N \log N)

memory footprint. We provide a baseline for performance and applicability by comparing with well known implementations of the

\mathcal{H}

-LU factorization and algebraic multigrid with a parallel implementation that leverages the concurrency features of the method. Numerical experiments reveal that this method is comparable with other fast direct solvers based on Hierarchical Matrices such as

\mathcal{H}

-LU and that it can tackle problems where algebraic multigrid fails to converge

arXiv.org e-Print Archive

Crossref

Parallel accelerated cyclic reduction preconditioner for three-dimensional elliptic PDEs with variable coefficients

Author: Chávez Gustavo
Keyes David
Turkiyyah George
Zampini Stefano
Publication venue: 'Elsevier BV'
Publication date: 23/12/2017
Field of study

We present a robust and scalable preconditioner for the solution of large-scale linear systems that arise from the discretization of elliptic PDEs amenable to rank compression. The preconditioner is based on hierarchical low-rank approximations and the cyclic reduction method. The setup and application phases of the preconditioner achieve log-linear complexity in memory footprint and number of operations, and numerical experiments exhibit good weak and strong scalability at large processor counts in a distributed memory environment. Numerical experiments with linear systems that feature symmetry and nonsymmetry, definiteness and indefiniteness, constant and variable coefficients demonstrate the preconditioner applicability and robustness. Furthermore, it is possible to control the number of iterations via the accuracy threshold of the hierarchical matrix approximations and their arithmetic operations, and the tuning of the admissibility condition parameter. Together, these parameters allow for optimization of the memory requirements and performance of the preconditioner.Comment: 24 pages, Elsevier Journal of Computational and Applied Mathematics, Dec 201

arXiv.org e-Print Archive

eScholarship - University of California

Approximate and Incomplete Factorizations

Author: Chan T.F.
Vorst H.A. van der
Publication venue
Publication date: 01/01/1997
Field of study

In this chapter, we give a brief overview of a particular class of preconditioners known as incomplete factorizations. They can be thought of as approximating the exact LU factorization of a given matrix A (e.g. computed via Gaussian elimination) by disallowing certain ll-ins. As opposed to other PDE-based preconditioners such as multigrid and domain decomposition, this class of preconditioners are primarily algebraic in nature and can in principle be applied to any sparse matrices. When applied to PDE problems, they are usually not optimal in the sense that the condition number of the preconditioned system will grow as the mesh size h is reduced, although usually at a slower rate than for the unpreconditioned system. On the other hand, they are often quite robust with respect to other more algebraic features of the problem such as rough and anisotropic coecients and strong convection terms. We will describe the basic ILU and (modied) MILU preconditioners. Then we will review brie y several variants: more lls, relaxed ILU, shifted ILU, ILQ, as well as block and multilevel variants. We will also touch on a related class of approximate factorization methods which arise more directly from approximating a partial dierential operator by a product of simpler operators. Finally, we will discuss parallelization aspects, including re-ordering, series expansion and domain decomposition techniques. Generally, this class of preconditioner does not possess a high degree of parallelism in its original form. Re-ordering and approximations by truncating certain series expansion will increase the parallelism, but usually with a deterioration in convergence rate. Domain decomposition oers a compromise

Utrecht University Repository

Constraint interface preconditioning for topology optimization problems

Author: Kocvara Michal
Loghin Daniel
Turner James
Publication venue
Publication date: 15/10/2015
Field of study

The discretization of constrained nonlinear optimization problems arising in the field of topology optimization yields algebraic systems which are challenging to solve in practice, due to pathological ill-conditioning, strong nonlinearity and size. In this work we propose a methodology which brings together existing fast algorithms, namely, interior-point for the optimization problem and a novel substructuring domain decomposition method for the ensuing large-scale linear systems. The main contribution is the choice of interface preconditioner which allows for the acceleration of the domain decomposition method, leading to performance independent of problem size.Comment: To be published in SIAM J. Sci. Com

arXiv.org e-Print Archive

University of Birmingham Research Portal