A weakly stable algorithm for general Toeplitz systems
We show that a fast algorithm for the QR factorization of a Toeplitz or
Hankel matrix A is weakly stable in the sense that R^T.R is close to A^T.A.
Thus, when the algorithm is used to solve the semi-normal equations R^T.Rx =
A^Tb, we obtain a weakly stable method for the solution of a nonsingular
Toeplitz or Hankel linear system Ax = b. The algorithm also applies to the
solution of the full-rank Toeplitz or Hankel least squares problem.
Comment: 17 pages. An old Technical Report with postscript added. For further
details, see http://wwwmaths.anu.edu.au/~brent/pub/pub143.htm
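As an illustration of the semi-normal equations R^T.Rx = A^Tb mentioned in this abstract, here is a minimal sketch in Python. It is not the fast Toeplitz QR algorithm of the paper: it assumes NumPy's generic dense QR as a stand-in for computing R, and the helper names `toeplitz` and `solve_seminormal` are hypothetical.

```python
import numpy as np

def toeplitz(first_col, first_row):
    """Dense Toeplitz matrix from its first column and first row."""
    c = np.asarray(first_col, dtype=float)
    r = np.asarray(first_row, dtype=float)
    n, m = len(c), len(r)
    i, j = np.indices((n, m))
    diff = i - j  # entry (i, j) depends only on i - j
    lower = c[np.clip(diff, 0, n - 1)]
    upper = r[np.clip(-diff, 0, m - 1)]
    return np.where(diff >= 0, lower, upper)

def solve_seminormal(A, b):
    """Solve R^T R x = A^T b using only the triangular factor R of A."""
    # Assumption: generic Householder QR, not the fast structured algorithm.
    R = np.linalg.qr(A, mode="r")
    y = np.linalg.solve(R.T, A.T @ b)  # lower-triangular system R^T y = A^T b
    return np.linalg.solve(R, y)       # upper-triangular system R x = y

A = toeplitz([4.0, -1.0, 0.5, 0.1], [4.0, 1.0, 0.5, 0.25])
x_true = np.array([1.0, -2.0, 3.0, 0.5])
x = solve_seminormal(A, A @ x_true)
```

The orthogonal factor Q is never formed or used, which is what makes the semi-normal equations a natural fit for algorithms that produce only R.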
Multigrid waveform relaxation for the time-fractional heat equation
In this work, we propose an efficient and robust multigrid method for solving
the time-fractional heat equation. Due to the nonlocal property of fractional
differential operators, numerical methods usually generate systems of equations
for which the coefficient matrix is dense. Therefore, the design of efficient
solvers for the numerical simulation of these problems is a difficult task. We
develop a parallel-in-time multigrid algorithm based on the waveform relaxation
approach, whose application to time-fractional problems seems very natural due
to the fact that the fractional derivative at each spatial point depends on the
values of the function at this point at all earlier times. Exploiting the
Toeplitz-like structure of the coefficient matrix, the proposed multigrid
waveform relaxation method has a computational cost of $O(N M \log(M))$
operations, where $M$ is the number of time steps and $N$ is the number of
spatial grid points. A semi-algebraic mode analysis is also developed to
theoretically confirm the good results obtained. Several numerical experiments,
including examples with non-smooth solutions and a nonlinear problem with
applications in porous media, are presented.
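One reason Toeplitz structure lowers the cost is that a product of an M x M Toeplitz matrix with a vector can be computed with FFTs in O(M log M) operations rather than O(M^2). The sketch below shows only that building block, through the standard circulant embedding; it is not the multigrid waveform relaxation method itself, and the function name `toeplitz_matvec` is hypothetical.

```python
import numpy as np

def toeplitz_matvec(first_col, first_row, x):
    """Compute T @ x in O(M log M) for an M x M Toeplitz matrix T."""
    c = np.asarray(first_col, dtype=float)
    r = np.asarray(first_row, dtype=float)
    x = np.asarray(x, dtype=float)
    m = len(x)
    # First column of a 2M x 2M circulant matrix whose top-left block is T.
    circ = np.concatenate([c, [0.0], r[:0:-1]])
    x_pad = np.concatenate([x, np.zeros(m)])
    # A circulant product is a circular convolution, diagonalized by the FFT.
    y = np.fft.irfft(np.fft.rfft(circ) * np.fft.rfft(x_pad), n=2 * m)
    return y[:m]

# Lower-triangular Toeplitz example: each time level depends on all earlier ones.
col = np.array([1.0, 0.5, 0.25, 0.125, 0.0625])
row = np.array([1.0, 0.0, 0.0, 0.0, 0.0])
result = toeplitz_matvec(col, row, np.arange(1.0, 6.0))
```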
A simple parallel prefix algorithm for compact finite-difference schemes
A compact scheme is a discretization scheme that is advantageous in obtaining highly accurate solutions. However, the systems resulting from compact schemes are tridiagonal systems that are difficult to solve efficiently on parallel computers. Exploiting their almost symmetric Toeplitz structure, a parallel algorithm, simple parallel prefix (SPP), is proposed. The SPP algorithm requires less memory than the conventional LU decomposition and is highly efficient on parallel machines. It consists of a prefix communication pattern and AXPY operations. Both the computation and the communication can be truncated without degrading the accuracy when the system is diagonally dominant. A formal accuracy study was conducted to provide a simple truncation formula. Experimental results were measured on a MasPar MP-1 SIMD machine and on a Cray 2 vector machine. They show that the simple parallel prefix algorithm is well suited to the compact scheme on high-performance computers.
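For context, the conventional sequential LU baseline for a tridiagonal Toeplitz system can be sketched as follows. This is the Thomas algorithm, not the SPP algorithm, whose details the abstract does not give; the function name `thomas_toeplitz` and the coefficients (1, 4, 1) are illustrative assumptions. The forward sweep is inherently sequential, and for a diagonally dominant system the pivots settle quickly to a constant, which gives some intuition for why truncation can be harmless in that regime.

```python
import numpy as np

def thomas_toeplitz(a, b, c, rhs):
    """Solve a tridiagonal Toeplitz system by sequential LU (no pivoting).

    a, b, c are the constant sub-diagonal, diagonal and super-diagonal values.
    Returns the solution and the sequence of LU pivots.
    """
    d = np.asarray(rhs, dtype=float).copy()
    n = len(d)
    piv = np.empty(n)
    piv[0] = b
    # Forward elimination: each step depends on the previous one.
    for i in range(1, n):
        w = a / piv[i - 1]
        piv[i] = b - w * c
        d[i] -= w * d[i - 1]
    # Back substitution.
    x = np.empty(n)
    x[-1] = d[-1] / piv[-1]
    for i in range(n - 2, -1, -1):
        x[i] = (d[i] - c * x[i + 1]) / piv[i]
    return x, piv

n = 8
dense = 4.0 * np.eye(n) + np.eye(n, k=1) + np.eye(n, k=-1)
x_ref = np.arange(1.0, n + 1.0)
x_sol, pivots = thomas_toeplitz(1.0, 4.0, 1.0, dense @ x_ref)
```

With these coefficients the pivots satisfy d = 4 - 1/d in the limit, so they approach 2 + sqrt(3) within a few steps.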