Search CORE

738 research outputs found

Accelerating Cosmic Microwave Background map-making procedure through preconditioning

Author: M. Szydlarski
L. Grigori
R. Stompor
Armitage
Cantalupo
Challinor
Chapman
de Bernardis
de Gasperis
De Gersem
Doré
Erhel
Górski
Gutknecht
Hanany
Harrison
Havé
Hestenes
Keihänen
Kharchenko
Morgan
Nicolaides
Oh
Parks
Patanchon
Planck Collaboration VIII
Risler
Saad
Stompor
Stompor
Sutton
Tang
Tegmark
Tegmark
van der Sluis
Wandelt
Wright
Publication venue: 'EDP Sciences'
Publication date: 01/01/2010
Field of study

Estimation of the sky signal from sequences of time ordered data is one of the key steps in Cosmic Microwave Background (CMB) data analysis, commonly referred to as the map-making problem. Some of the most popular and general methods proposed for this problem involve solving generalised least squares (GLS) equations with non-diagonal noise weights given by a block-diagonal matrix with Toeplitz blocks. In this work we study new map-making solvers potentially suitable for applications to the largest anticipated data sets. They are based on iterative conjugate gradient (CG) approaches enhanced with novel, parallel, two-level preconditioners. We apply the proposed solvers to examples of simulated non-polarised and polarised CMB observations, and a set of idealised scanning strategies with sky coverage ranging from nearly a full sky down to small sky patches. We discuss in detail their implementation for massively parallel computational platforms and their performance for a broad range of parameters characterising the simulated data sets. We find that our best new solver can outperform carefully-optimised standard solvers used today by a factor of as much as 5 in terms of the convergence rate and a factor of up to

4

in terms of the time to solution, and to do so without significantly increasing the memory consumption and the volume of inter-processor communication. The performance of the new algorithms is also found to be more stable and robust, and less dependent on specific characteristics of the analysed data set. We therefore conclude that the proposed approaches are well suited to address successfully challenges posed by new and forthcoming CMB data sets.Comment: 19 pages // Final version submitted to A&

HAL-CentraleSupelec

IR@CGCRI - Central Glass and Ceramic Research Institute (CSIR)

HAL - Lille 3

HAL-CEA

Hal-Diderot

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

arXiv.org e-Print Archive

HAL-IN2P3

Crossref

Archivio istituzionale della Ricerca - Università degli Studi di Parma

INRIA a CCSD electronic archive server

Swinburne Research Bank

HAL-OBSPM

Accelerating Cosmic Microwave Background map-making procedure through preconditioning

Author: Grigori Laura
Stompor Radek
Szydlarski Mikolaj
Publication venue: 'EDP Sciences'
Publication date: 26/11/2014
Field of study

4

HAL-CentraleSupelec

HAL - Lille 3

EDP Sciences OAI-PMH repository (1.2.0)

HAL-CEA

Hal-Diderot

arXiv.org e-Print Archive

CiteSeerX

HAL-IN2P3

INRIA a CCSD electronic archive server

HAL-INSU

HAL-OBSPM

HAL-Rennes 1

A bibliography on parallel and vector numerical algorithms

Author: Ortega J. M.
Voigt R. G.
Publication venue
Publication date
Field of study

This is a bibliography of numerical methods. It also includes a number of other references on machine architecture, programming language, and other topics of interest to scientific computing. Certain conference proceedings and anthologies which have been published in book form are listed also

NASA Technical Reports Server

Matrix-free GPU implementation of a preconditioned conjugate gradient solver for anisotropic elliptic PDEs

Author: Guo Xu
Mueller Eike
Scheichl Robert
Shi Sinan
Publication venue
Publication date: 01/01/2013
Field of study

Many problems in geophysical and atmospheric modelling require the fast solution of elliptic partial differential equations (PDEs) in "flat" three dimensional geometries. In particular, an anisotropic elliptic PDE for the pressure correction has to be solved at every time step in the dynamical core of many numerical weather prediction models, and equations of a very similar structure arise in global ocean models, subsurface flow simulations and gas and oil reservoir modelling. The elliptic solve is often the bottleneck of the forecast, and an algorithmically optimal method has to be used and implemented efficiently. Graphics Processing Units have been shown to be highly efficient for a wide range of applications in scientific computing, and recently iterative solvers have been parallelised on these architectures. We describe the GPU implementation and optimisation of a Preconditioned Conjugate Gradient (PCG) algorithm for the solution of a three dimensional anisotropic elliptic PDE for the pressure correction in NWP. Our implementation exploits the strong vertical anisotropy of the elliptic operator in the construction of a suitable preconditioner. As the algorithm is memory bound, performance can be improved significantly by reducing the amount of global memory access. We achieve this by using a matrix-free implementation which does not require explicit storage of the matrix and instead recalculates the local stencil. Global memory access can also be reduced by rewriting the algorithm using loop fusion and we show that this further reduces the runtime on the GPU. We demonstrate the performance of our matrix-free GPU code by comparing it to a sequential CPU implementation and to a matrix-explicit GPU code which uses existing libraries. The absolute performance of the algorithm for different problem sizes is quantified in terms of floating point throughput and global memory bandwidth.Comment: 18 pages, 7 figure

arXiv.org e-Print Archive

CiteSeerX

OPUS