Search CORE

100 research outputs found

Block pivoting implementation of a symmetric Toeplitz solver

Author: Alonso-Jordá Pedro
Dolz Zaragozá Manuel Francisco
Vidal Maciá Antonio Manuel
Publication venue: 'Elsevier BV'
Publication date: 01/05/2014
Field of study

Toeplitz matrices are characterized by a special structure that can be exploited in order to obtain fast linear system solvers. These solvers are difficult to parallelize due to their low computational cost and their closely coupled data operations. We propose to transform the Toeplitz system matrix into a Cauchy-like matrix since the latter can be divided into two independent matrices of half the size of the system matrix and each one of these smaller arising matrices can be factorized efficiently in multicore computers. We use OpenMP and store data in memory by blocks in consecutive positions yielding a simple and efficient algorithm. In addition, by exploiting the fact that diagonal pivoting does not destroy the special structure of Cauchy-like matrices, we introduce a local diagonal pivoting technique which improves the accuracy of the solution and the stability of the algorithm.This work was partially supported by the Spanish Ministerio de Ciencia e Innovacion (Project TIN2008-06570-C04-02 and TEC2009-13741), Vicerrectorado de Investigacion de la Universidad Politecnica de Valencia through PAID-05-10 (ref. 2705), and Generalitat Valenciana through project PROMETEO/2009/2013.Alonso-Jordá, P.; Dolz Zaragozá, MF.; Vidal Maciá, AM. (2014). Block pivoting implementation of a symmetric Toeplitz solver. Journal of Parallel and Distributed Computing. 74(5):2392-2399. https://doi.org/10.1016/j.jpdc.2014.02.003S2392239974

RiuNet

Computing the Exponential of Large Block-Triangular Block-Toeplitz Matrices Encountered in Fluid Queues

Author: Bini D. A.
Dendievel S.
Latouche G.
Meini B.
Publication venue
Publication date: 01/01/2015
Field of study

The Erlangian approximation of Markovian fluid queues leads to the problem of computing the matrix exponential of a subgenerator having a block-triangular, block-Toeplitz structure. To this end, we propose some algorithms which exploit the Toeplitz structure and the properties of generators. Such algorithms allow to compute the exponential of very large matrices, which would otherwise be untreatable with standard methods. We also prove interesting decay properties of the exponential of a generator having a block-triangular, block-Toeplitz structure

arXiv.org e-Print Archive

Ghent University Academic Bibliography

Archivio della Ricerca - Università di Pisa

DI-fusion

Accelerating Cosmic Microwave Background map-making procedure through preconditioning

Author: M. Szydlarski
L. Grigori
R. Stompor
Armitage
Cantalupo
Challinor
Chapman
de Bernardis
de Gasperis
De Gersem
Doré
Erhel
Górski
Gutknecht
Hanany
Harrison
Havé
Hestenes
Keihänen
Kharchenko
Morgan
Nicolaides
Oh
Parks
Patanchon
Planck Collaboration VIII
Risler
Saad
Stompor
Stompor
Sutton
Tang
Tegmark
Tegmark
van der Sluis
Wandelt
Wright
Publication venue: 'EDP Sciences'
Publication date: 01/01/2010
Field of study

Estimation of the sky signal from sequences of time ordered data is one of the key steps in Cosmic Microwave Background (CMB) data analysis, commonly referred to as the map-making problem. Some of the most popular and general methods proposed for this problem involve solving generalised least squares (GLS) equations with non-diagonal noise weights given by a block-diagonal matrix with Toeplitz blocks. In this work we study new map-making solvers potentially suitable for applications to the largest anticipated data sets. They are based on iterative conjugate gradient (CG) approaches enhanced with novel, parallel, two-level preconditioners. We apply the proposed solvers to examples of simulated non-polarised and polarised CMB observations, and a set of idealised scanning strategies with sky coverage ranging from nearly a full sky down to small sky patches. We discuss in detail their implementation for massively parallel computational platforms and their performance for a broad range of parameters characterising the simulated data sets. We find that our best new solver can outperform carefully-optimised standard solvers used today by a factor of as much as 5 in terms of the convergence rate and a factor of up to

4

in terms of the time to solution, and to do so without significantly increasing the memory consumption and the volume of inter-processor communication. The performance of the new algorithms is also found to be more stable and robust, and less dependent on specific characteristics of the analysed data set. We therefore conclude that the proposed approaches are well suited to address successfully challenges posed by new and forthcoming CMB data sets.Comment: 19 pages // Final version submitted to A&

HAL-CentraleSupelec

IR@CGCRI - Central Glass and Ceramic Research Institute (CSIR)

HAL - Lille 3

HAL-CEA

Hal-Diderot

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

arXiv.org e-Print Archive

HAL-IN2P3

Crossref

Archivio istituzionale della Ricerca - Università degli Studi di Parma

INRIA a CCSD electronic archive server

Swinburne Research Bank

HAL-OBSPM

Accelerating Cosmic Microwave Background map-making procedure through preconditioning

Author: Grigori Laura
Stompor Radek
Szydlarski Mikolaj
Publication venue: 'EDP Sciences'
Publication date: 26/11/2014
Field of study

4

HAL-CentraleSupelec

HAL - Lille 3

EDP Sciences OAI-PMH repository (1.2.0)

HAL-CEA

Hal-Diderot

arXiv.org e-Print Archive

CiteSeerX

HAL-IN2P3

INRIA a CCSD electronic archive server

HAL-INSU

HAL-OBSPM

HAL-Rennes 1

A three domain covariance framework for EEG/MEG data

Author: Bijma Fetsje
de Gunst Mathisca
de Munck Jan
Roś Beata
Publication venue
Publication date: 09/10/2014
Field of study

In this paper we introduce a covariance framework for the analysis of EEG and MEG data that takes into account observed temporal stationarity on small time scales and trial-to-trial variations. We formulate a model for the covariance matrix, which is a Kronecker product of three components that correspond to space, time and epochs/trials, and consider maximum likelihood estimation of the unknown parameter values. An iterative algorithm that finds approximations of the maximum likelihood estimates is proposed. We perform a simulation study to assess the performance of the estimator and investigate the influence of different assumptions about the covariance factors on the estimated covariance matrix and on its components. Apart from that, we illustrate our method on real EEG and MEG data sets. The proposed covariance model is applicable in a variety of cases where spontaneous EEG or MEG acts as source of noise and realistic noise covariance estimates are needed for accurate dipole localization, such as in evoked activity studies, or where the properties of spontaneous EEG or MEG are themselves the topic of interest, such as in combined EEG/fMRI experiments in which the correlation between EEG and fMRI signals is investigated.Comment: 25 pages, 8 figures, 1 tabl

arXiv.org e-Print Archive

VU Research Portal

A Direct Elliptic Solver Based on Hierarchically Low-rank Schur Complements

Author: A. Aminfar
B.L. Buzbee
I. Ibragimov
J. Xia
J. Xia
J. Xia
L. Grasedyck
P. Amestoy
P. Swarztrauber
P.G. Schmitz
P.G. Schmitz
R.W. Hockney
S. Ambikasaran
S. Chandrasekaran
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/04/2016
Field of study

A parallel fast direct solver for rank-compressible block tridiagonal linear systems is presented. Algorithmic synergies between Cyclic Reduction and Hierarchical matrix arithmetic operations result in a solver with

O(N \log^2 N)

arithmetic complexity and

O(N \log N)

memory footprint. We provide a baseline for performance and applicability by comparing with well known implementations of the

\mathcal{H}

-LU factorization and algebraic multigrid with a parallel implementation that leverages the concurrency features of the method. Numerical experiments reveal that this method is comparable with other fast direct solvers based on Hierarchical Matrices such as

\mathcal{H}

-LU and that it can tackle problems where algebraic multigrid fails to converge

arXiv.org e-Print Archive

Crossref

An Efficient Numerical Technique for Solving the Inverse Gravity Problem of Finding a Lateral Density

Author: A. Kosivets Rostislav
E. Misilov Vladimir
N. Akimova Elena
S. Martyshko Peter
Publication venue: Arab Journals Platform
Publication date: 01/09/2016
Field of study

The main goal of our paper is to construct a technique for the gravity inversion problem of finding a variable density in a horizontal layer on the basis of gravitational data. This technique consists of two steps: extracting the gravitational field and solving the linear integral equation of the density. After discretization and approximation of integral operator, this problem is reduced to solving large systems of linear algebraic equations. To solve these systems, we proposed a memory-efficient algorithm based on the iterative method of minimal residuals. The idea of memory optimization is based on exploiting the block-Toeplitz structure of coefficients matrix. The algorithms were parallelized and implemented using the Uran and UrFU supercomputers. A model problem with synthetic gravitational data was solved

Arab Journals Platform

Designing LU-QR hybrid solvers for performance and stability

Author: Bradley Lowery
Jack Dongarra
Julien Herrmann
Julien Langou
Mathieu Faverge
Yves Robert
Publication venue
Publication date: 21/01/2014
Field of study

Abstract—This paper introduces hybrid LU-QR algorithms for solving dense linear systems of the form Ax = b. Throughout a matrix factorization, these algorithms dynamically alternate LU with local pivoting and QR elimination steps, based upon some robustness criterion. LU elimination steps can be very efficiently parallelized, and are twice as cheap in terms of floatingpoint operations, as QR steps. However, LU steps are not necessarily stable, while QR steps are always stable. The hybrid algorithms execute a QR step when a robustness criterion detects some risk for instability, and they execute an LU step otherwise. Ideally, the choice between LU and QR steps must have a small computational overhead and must provide a satisfactory level of stability with as few QR steps as possible. In this paper, we introduce several robustness criteria and we establish upper bounds on the growth factor of the norm of the updated matrix incurred by each of these criteria. In addition, we describe the implementation of the hybrid algorithms through an extension of the PaRSEC software to allow for dynamic choices during execution. Finally, we analyze both stability and performance results compared to state-of-the-art linear solvers on parallel distributed multicore platforms. I

arXiv.org e-Print Archive

HAL-ENS-LYON

CiteSeerX

Crossref

INRIA a CCSD electronic archive server

Hal-Diderot