24,294 research outputs found
Parallel processing and non-uniform grids in global air quality modeling
A large-scale global air quality model, running efficiently on a single vector processor, is enhanced to make more realistic and more long-term simulations feasible. Two strategies are combined: non-uniform grids and parallel processing. The communication through the hierarchy of non-uniform grids interferes with the inter-processor communication. We discuss load balance in the decomposition of the domain, I/O, and inter-processor communication. A model shows that the communication overhead for both techniques is very low, whence non-uniform grids allow for large speed-ups and high speed-up can be expected from parallelization. The implementation is in progress, and results of experiments will be reported elsewhere
Achieving High Speed CFD simulations: Optimization, Parallelization, and FPGA Acceleration for the unstructured DLR TAU Code
Today, large scale parallel simulations are fundamental tools to handle complex problems. The number of processors in current computation platforms has been recently increased and therefore it is necessary to optimize the application performance and to enhance the scalability of massively-parallel systems. In addition, new heterogeneous architectures, combining conventional processors with specific hardware, like FPGAs, to accelerate the most time consuming functions are considered as a strong alternative to boost the performance.
In this paper, the performance of the DLR TAU code is analyzed and optimized. The improvement of the code efficiency is addressed through three key activities: Optimization, parallelization and hardware acceleration. At first, a profiling analysis of the most time-consuming processes of the Reynolds Averaged Navier Stokes flow solver on a three-dimensional unstructured mesh is performed. Then, a study of the code scalability with new partitioning algorithms are tested to show the most suitable partitioning algorithms for the selected applications. Finally, a feasibility study on the application of FPGAs and GPUs for the hardware acceleration of CFD simulations is presented
Regularization of static self-forces
Various regularization methods have been used to compute the self-force
acting on a static particle in a static, curved spacetime. Many of these are
based on Hadamard's two-point function in three dimensions. On the other hand,
the regularization method that enjoys the best justification is that of
Detweiler and Whiting, which is based on a four-dimensional Green's function.
We establish the connection between these methods and find that they are all
equivalent, in the sense that they all lead to the same static self-force. For
general static spacetimes, we compute local expansions of the Green's functions
on which the various regularization methods are based. We find that these agree
up to a certain high order, and conjecture that they might be equal to all
orders. We show that this equivalence is exact in the case of ultrastatic
spacetimes. Finally, our computations are exploited to provide regularization
parameters for a static particle in a general static and spherically-symmetric
spacetime.Comment: 23 pages, no figure
Numerical non-LTE 3D radiative transfer using a multigrid method
3D non-LTE radiative transfer problems are computationally demanding, and
this sets limits on the size of the problems that can be solved. So far
Multilevel Accelerated Lambda Iteration (MALI) has been to the method of choice
to perform high-resolution computations in multidimensional problems. The
disadvantage of MALI is that its computing time scales as ,
with the number of grid points. When the grid gets finer, the computational
cost increases quadratically. We aim to develop a 3D non-LTE radiative transfer
code that is more efficient than MALI. We implement a non-linear multigrid,
fast approximation storage scheme, into the existing Multi3D radiative transfer
code. We verify our multigrid implementation by comparing with MALI
computations. We show that multigrid can be employed in realistic problems with
snapshots from 3D radiative-MHD simulations as input atmospheres. With
multigrid, we obtain a factor 3.3-4.5 speedup compared to MALI. With
full-multigrid the speed-up increases to a factor 6. The speedup is expected to
increase for input atmospheres with more grid points and finer grid spacing.
Solving 3D non-LTE radiative transfer problems using non-linear multigrid
methods can be applied to realistic atmospheres with a substantial speed-up.Comment: Accepted for publication by A&
- …