Search CORE

1 research outputs found

Computational advances in gravitational microlensing: a comparison of CPU, GPU, and parallel, large data codes

Author: Abajas
Alcock
Anguita
Aubourg
B.R. Barsdell
Barnes
Bate
Belleman
Bond
C.J. Fluke
Calchi Novati
Chang
Chartas
Dai
Eigenbrod
Floyd
Ford
Fournier
G.F. Lewis
Garsden
Gould
Gould
H. Garsden
Harris
Irwin
Kayser
Keeton
Kochanek
Lewis
Lewis
Mediavilla
Mediavilla
Morgan
N.F. Bate
Owens
Poindexter
Pooley
Pooley
Rauch
Schive
Schneider
Schneider
Schneider
Sumi
Thompson
Udalski
Vanderriest
Wambsganss
Wambsganss
Wambsganss
Witt
Witt
Wyithe
Wyithe
Wyrzykowski
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010
Field of study

To assess how future progress in gravitational microlensing computation at high optical depth will rely on both hardware and software solutions, we compare a direct inverse ray-shooting code implemented on a graphics processing unit (GPU) with both a widely-used hierarchical tree code on a single-core CPU, and a recent implementation of a parallel tree code suitable for a CPU-based cluster supercomputer. We examine the accuracy of the tree codes through comparison with a direct code over a much wider range of parameter space than has been feasible before. We demonstrate that all three codes present comparable accuracy, and choice of approach depends on considerations relating to the scale and nature of the microlensing problem under investigation. On current hardware, there is little difference in the processing speed of the single-core CPU tree code and the GPU direct code, however the recent plateau in single-core CPU speeds means the existing tree code is no longer able to take advantage of Moore's law-like increases in processing speed. Instead, we anticipate a rapid increase in GPU capabilities in the next few years, which is advantageous to the direct code. We suggest that progress in other areas of astrophysical computation may benefit from a transition to GPUs through the use of "brute force" algorithms, rather than attempting to port the current best solution directly to a GPU language -- for certain classes of problems, the simple implementation on GPUs may already be no worse than an optimised single-core CPU version.Comment: 11 pages, 4 figures, accepted for publication in New Astronom

arXiv.org e-Print Archive

Crossref

Swinburne Research Bank