Search CORE

868 research outputs found

An efficient multi-core implementation of a novel HSS-structured multifrontal solver using randomized sampling

Author: Ghysels Pieter
Li Xiaoye S.
Napov Artem
Rouet Francois-Henry
Williams Samuel
Publication venue
Publication date: 25/02/2015
Field of study

We present a sparse linear system solver that is based on a multifrontal variant of Gaussian elimination, and exploits low-rank approximation of the resulting dense frontal matrices. We use hierarchically semiseparable (HSS) matrices, which have low-rank off-diagonal blocks, to approximate the frontal matrices. For HSS matrix construction, a randomized sampling algorithm is used together with interpolative decompositions. The combination of the randomized compression with a fast ULV HSS factorization leads to a solver with lower computational complexity than the standard multifrontal method for many applications, resulting in speedups up to 7 fold for problems in our test suite. The implementation targets many-core systems by using task parallelism with dynamic runtime scheduling. Numerical experiments show performance improvements over state-of-the-art sparse direct solvers. The implementation achieves high performance and good scalability on a range of modern shared memory parallel systems, including the Intel Xeon Phi (MIC). The code is part of a software package called STRUMPACK -- STRUctured Matrices PACKage, which also has a distributed memory component for dense rank-structured matrices

arXiv.org e-Print Archive

eScholarship - University of California

DI-fusion

A fast semi-direct least squares algorithm for hierarchically block separable matrices

Author: Greengard Leslie
Ho Kenneth L.
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 01/01/2014
Field of study

We present a fast algorithm for linear least squares problems governed by hierarchically block separable (HBS) matrices. Such matrices are generally dense but data-sparse and can describe many important operators including those derived from asymptotically smooth radial kernels that are not too oscillatory. The algorithm is based on a recursive skeletonization procedure that exposes this sparsity and solves the dense least squares problem as a larger, equality-constrained, sparse one. It relies on a sparse QR factorization coupled with iterative weighted least squares methods. In essence, our scheme consists of a direct component, comprised of matrix compression and factorization, followed by an iterative component to enforce certain equality constraints. At most two iterations are typically required for problems that are not too ill-conditioned. For an

M \times N

HBS matrix with

M \geq N

having bounded off-diagonal block rank, the algorithm has optimal

\mathcal{O} (M + N)

complexity. If the rank increases with the spatial dimension as is common for operators that are singular at the origin, then this becomes

\mathcal{O} (M + N)

in 1D,

\mathcal{O} (M + N^{3/2})

in 2D, and

\mathcal{O} (M + N^{2})

in 3D. We illustrate the performance of the method on both over- and underdetermined systems in a variety of settings, with an emphasis on radial basis function approximation and efficient updating and downdating.Comment: 24 pages, 8 figures, 6 tables; to appear in SIAM J. Matrix Anal. App

arXiv.org e-Print Archive

CiteSeerX

LSMR: An iterative algorithm for sparse least-squares problems

Author: Ayrancı
Ballester
Bokar
Coelho
Correia
Farmer
Fei Wang
Fiveland
Groetsch
Hansen
Hansen
Huang
Jian-hua Yan
Kohse-Höinghaus
Li
Li
Liu
Liu
Liu
Liu
Lou
McCormick
Modest
Namjoo
Paige
Park
Qun-xing Huang
Sielschott
Siewert
Snelling
Wang
Yong Chi
Zhang
Zhou
Zhou
Özisik
Publication venue: 'Elsevier BV'
Publication date: 10/06/2011
Field of study

An iterative method LSMR is presented for solving linear systems

Ax=b

and least-squares problem \min \norm{Ax-b}_2, with

A

being sparse or a fast linear operator. LSMR is based on the Golub-Kahan bidiagonalization process. It is analytically equivalent to the MINRES method applied to the normal equation A\T Ax = A\T b, so that the quantities \norm{A\T r_k} are monotonically decreasing (where

r_k = b - Ax_k

is the residual for the current iterate

x_k

). In practice we observe that \norm{r_k} also decreases monotonically. Compared to LSQR, for which only \norm{r_k} is monotonic, it is safer to terminate LSMR early. Improvements for the new iterative method in the presence of extra available memory are also explored.Comment: 21 page

arXiv.org e-Print Archive

Crossref