87,031 research outputs found
Parallel Algorithm for Solving Kepler's Equation on Graphics Processing Units: Application to Analysis of Doppler Exoplanet Searches
[Abridged] We present the results of a highly parallel Kepler equation solver
using the Graphics Processing Unit (GPU) on a commercial nVidia GeForce 280GTX
and the "Compute Unified Device Architecture" programming environment. We apply
this to evaluate a goodness-of-fit statistic (e.g., chi^2) for Doppler
observations of stars potentially harboring multiple planetary companions
(assuming negligible planet-planet interactions). We tested multiple
implementations using single precision, double precision, pairs of single
precision, and mixed precision arithmetic. We find that the vast majority of
computations can be performed using single precision arithmetic, with selective
use of compensated summation for increased precision. However, standard single
precision is not adequate for calculating the mean anomaly from the time of
observation and orbital period when evaluating the goodness-of-fit for real
planetary systems and observational data sets. Using all double precision, our
GPU code outperforms a similar code using a modern CPU by a factor of over 60.
Using mixed-precision, our GPU code provides a speed-up factor of over 600,
when evaluating N_sys > 1024 models planetary systems each containing N_pl = 4
planets and assuming N_obs = 256 observations of each system. We conclude that
modern GPUs also offer a powerful tool for repeatedly evaluating Kepler's
equation and a goodness-of-fit statistic for orbital models when presented with
a large parameter space.Comment: 19 pages, to appear in New Astronom
Performance Evaluation of an Extrapolation Method for Ordinary Differential Equations with Error-free Transformation
The application of error-free transformation (EFT) is recently being
developed to solve ill-conditioned problems. It can reduce the number of
arithmetic operations required, compared with multiple precision arithmetic,
and also be applied by using functions supported by a well-tuned BLAS library.
In this paper, we propose the application of EFT to explicit extrapolation
methods to solve initial value problems of ordinary differential equations.
Consequently, our implemented routines can be effective for large-sized linear
ODE and small-sized nonlinear ODE, especially in the case when harmonic
sequence is used
Interval Arithmetic Using SSE-2
We present an implementation of double precision interval arithmetic using the single-instruction-multiple-data SSE-2 instruction and register set extensions. The implementation is part of a package for exact real arithmetic, which defines the interval arithmetic variation that must be used: incorrect operations such as division by zero cause exceptions, loose evaluation of the operations is in effect, and performance is more important than tightness of the produced bounds. The SSE2 extensions are suitable for the job, because they can be used to operate on a pair of double precision numbers and include separate rounding mode control and detection of the exceptional conditions. The paper describes the ideas we use to fit interval arithmetic to this set of instructions, shows a performance comparison with other freely available interval arithmetic packages, and discusses possible very simple hardware extensions that can significantly increase the performance of interval arithmetic
A Modified Staggered Correction Arithmetic with Enhanced Accuracy and Very Wide Exponent Range
A so called staggered precision arithmetic is a special kind of
a multiple precision arithmetic based on the underlying
floating point data format (typically IEEE double format)
and fast floating point operations as well as exact dot product computations.
Due to floating point limitations it is not an arbitrary precision arithmetic.
However, it typically allows computations using several hundred mantissa digits.
A set of new modified staggered arithmetics for real and
complex data as well as for real interval and
complex interval data with very wide exponent range is presented.
Some applications show
the increased accuracy of computed results compared to ordinary staggered
interval computations. The very wide exponent range of the new arithmetic
operations allows computations far beyond the IEEE data formats.
The new arithmetics would be extremly fast, if an exact dot product was
available in hardware (the fused accumulate and add instruction is only
one step in this direction)
- …