Search CORE

1,372 research outputs found

2HOT: An Improved Parallel Hashed Oct-Tree N-Body Algorithm for Cosmological Simulation

Author: Warren Michael S.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 16/10/2013
Field of study

We report on improvements made over the past two decades to our adaptive treecode N-body method (HOT). A mathematical and computational approach to the cosmological N-body problem is described, with performance and scalability measured up to 256k (

2^{18}

) processors. We present error analysis and scientific application results from a series of more than ten 69 billion (

4096^3

) particle cosmological simulations, accounting for

4 \times 10^{20}

floating point operations. These results include the first simulations using the new constraints on the standard model of cosmology from the Planck satellite. Our simulations set a new standard for accuracy and scientific throughput, while meeting or exceeding the computational efficiency of the latest generation of hybrid TreePM N-body methods.Comment: 12 pages, 8 figures, 77 references; To appear in Proceedings of SC '1

arXiv.org e-Print Archive

CiteSeerX

Directory of Open Access Journals

A SVD accelerated kernel-independent fast multipole method and its application to BEM

Author: Cao Yanchuang
Rong Junjie
Wen Lihua
Publication venue: 'WITPRESS LTD.'
Publication date: 11/03/2013
Field of study

The kernel-independent fast multipole method (KIFMM) proposed in [1] is of almost linear complexity. In the original KIFMM the time-consuming M2L translations are accelerated by FFT. However, when more equivalent points are used to achieve higher accuracy, the efficiency of the FFT approach tends to be lower because more auxiliary volume grid points have to be added. In this paper, all the translations of the KIFMM are accelerated by using the singular value decomposition (SVD) based on the low-rank property of the translating matrices. The acceleration of M2L is realized by first transforming the associated translating matrices into more compact form, and then using low-rank approximations. By using the transform matrices for M2L, the orders of the translating matrices in upward and downward passes are also reduced. The improved KIFMM is then applied to accelerate BEM. The performance of the proposed algorithms are demonstrated by three examples. Numerical results show that, compared with the original KIFMM, the present method can reduce about 40% of the iterating time and 25% of the memory requirement.Comment: 19 pages, 4 figure

arXiv.org e-Print Archive

Crossref