24,294 research outputs found

    Parallel processing and non-uniform grids in global air quality modeling

    Get PDF
    A large-scale global air quality model, running efficiently on a single vector processor, is enhanced to make more realistic and more long-term simulations feasible. Two strategies are combined: non-uniform grids and parallel processing. The communication through the hierarchy of non-uniform grids interferes with the inter-processor communication. We discuss load balance in the decomposition of the domain, I/O, and inter-processor communication. A model shows that the communication overhead for both techniques is very low, whence non-uniform grids allow for large speed-ups and high speed-up can be expected from parallelization. The implementation is in progress, and results of experiments will be reported elsewhere

    Achieving High Speed CFD simulations: Optimization, Parallelization, and FPGA Acceleration for the unstructured DLR TAU Code

    Get PDF
    Today, large scale parallel simulations are fundamental tools to handle complex problems. The number of processors in current computation platforms has been recently increased and therefore it is necessary to optimize the application performance and to enhance the scalability of massively-parallel systems. In addition, new heterogeneous architectures, combining conventional processors with specific hardware, like FPGAs, to accelerate the most time consuming functions are considered as a strong alternative to boost the performance. In this paper, the performance of the DLR TAU code is analyzed and optimized. The improvement of the code efficiency is addressed through three key activities: Optimization, parallelization and hardware acceleration. At first, a profiling analysis of the most time-consuming processes of the Reynolds Averaged Navier Stokes flow solver on a three-dimensional unstructured mesh is performed. Then, a study of the code scalability with new partitioning algorithms are tested to show the most suitable partitioning algorithms for the selected applications. Finally, a feasibility study on the application of FPGAs and GPUs for the hardware acceleration of CFD simulations is presented

    Regularization of static self-forces

    Full text link
    Various regularization methods have been used to compute the self-force acting on a static particle in a static, curved spacetime. Many of these are based on Hadamard's two-point function in three dimensions. On the other hand, the regularization method that enjoys the best justification is that of Detweiler and Whiting, which is based on a four-dimensional Green's function. We establish the connection between these methods and find that they are all equivalent, in the sense that they all lead to the same static self-force. For general static spacetimes, we compute local expansions of the Green's functions on which the various regularization methods are based. We find that these agree up to a certain high order, and conjecture that they might be equal to all orders. We show that this equivalence is exact in the case of ultrastatic spacetimes. Finally, our computations are exploited to provide regularization parameters for a static particle in a general static and spherically-symmetric spacetime.Comment: 23 pages, no figure

    Numerical non-LTE 3D radiative transfer using a multigrid method

    Full text link
    3D non-LTE radiative transfer problems are computationally demanding, and this sets limits on the size of the problems that can be solved. So far Multilevel Accelerated Lambda Iteration (MALI) has been to the method of choice to perform high-resolution computations in multidimensional problems. The disadvantage of MALI is that its computing time scales as O(n2)\mathcal{O}(n^2), with nn the number of grid points. When the grid gets finer, the computational cost increases quadratically. We aim to develop a 3D non-LTE radiative transfer code that is more efficient than MALI. We implement a non-linear multigrid, fast approximation storage scheme, into the existing Multi3D radiative transfer code. We verify our multigrid implementation by comparing with MALI computations. We show that multigrid can be employed in realistic problems with snapshots from 3D radiative-MHD simulations as input atmospheres. With multigrid, we obtain a factor 3.3-4.5 speedup compared to MALI. With full-multigrid the speed-up increases to a factor 6. The speedup is expected to increase for input atmospheres with more grid points and finer grid spacing. Solving 3D non-LTE radiative transfer problems using non-linear multigrid methods can be applied to realistic atmospheres with a substantial speed-up.Comment: Accepted for publication by A&
    corecore