1,768 research outputs found
ECHO: an Eulerian Conservative High Order scheme for general relativistic magnetohydrodynamics and magnetodynamics
We present a new numerical code, ECHO, based on an Eulerian Conservative High
Order scheme for time dependent three-dimensional general relativistic
magnetohydrodynamics (GRMHD) and magnetodynamics (GRMD). ECHO is aimed at
providing a shock-capturing conservative method able to work at an arbitrary
level of formal accuracy (for smooth flows), where the other existing GRMHD and
GRMD schemes yield an overall second order at most. Moreover, our goal is to
present a general framework, based on the 3+1 Eulerian formalism, allowing for
different sets of equations, different algorithms, and working in a generic
space-time metric, so that ECHO may be easily coupled to any solver for
Einstein's equations. Various high order reconstruction methods are implemented
and a two-wave approximate Riemann solver is used. The induction equation is
treated by adopting the Upwind Constrained Transport (UCT) procedures,
appropriate to preserve the divergence-free condition of the magnetic field in
shock-capturing methods. The limiting case of magnetodynamics (also known as
force-free degenerate electrodynamics) is implemented by simply replacing the
fluid velocity with the electromagnetic drift velocity and by neglecting the
matter contribution to the stress tensor. ECHO is particularly accurate,
efficient, versatile, and robust. It has been tested against several
astrophysical applications, including a novel test on the propagation of large
amplitude circularly polarized Alfven waves. In particular, we show that
reconstruction based on a Monotonicity Preserving filter applied to a fixed
5-point stencil gives highly accurate results for smooth solutions, both in
flat and curved metric (up to the nominal fifth order), while at the same time
providing sharp profiles in tests involving discontinuities.Comment: 20 pages, revised version submitted to A&
Lagrangian ADER-WENO Finite Volume Schemes on Unstructured Triangular Meshes Based On Genuinely Multidimensional HLL Riemann Solvers
In this paper we use the genuinely multidimensional HLL Riemann solvers
recently developed by Balsara et al. to construct a new class of
computationally efficient high order Lagrangian ADER-WENO one-step ALE finite
volume schemes on unstructured triangular meshes. A nonlinear WENO
reconstruction operator allows the algorithm to achieve high order of accuracy
in space, while high order of accuracy in time is obtained by the use of an
ADER time-stepping technique based on a local space-time Galerkin predictor.
The multidimensional HLL and HLLC Riemann solvers operate at each vertex of the
grid, considering the entire Voronoi neighborhood of each node and allows for
larger time steps than conventional one-dimensional Riemann solvers. The
results produced by the multidimensional Riemann solver are then used twice in
our one-step ALE algorithm: first, as a node solver that assigns a unique
velocity vector to each vertex, in order to preserve the continuity of the
computational mesh; second, as a building block for genuinely multidimensional
numerical flux evaluation that allows the scheme to run with larger time steps
compared to conventional finite volume schemes that use classical
one-dimensional Riemann solvers in normal direction. A rezoning step may be
necessary in order to overcome element overlapping or crossing-over. We apply
the method presented in this article to two systems of hyperbolic conservation
laws, namely the Euler equations of compressible gas dynamics and the equations
of ideal classical magneto-hydrodynamics (MHD). Convergence studies up to
fourth order of accuracy in space and time have been carried out. Several
numerical test problems have been solved to validate the new approach
Adaptive Mesh Refinement for Hyperbolic Systems based on Third-Order Compact WENO Reconstruction
In this paper we generalize to non-uniform grids of quad-tree type the
Compact WENO reconstruction of Levy, Puppo and Russo (SIAM J. Sci. Comput.,
2001), thus obtaining a truly two-dimensional non-oscillatory third order
reconstruction with a very compact stencil and that does not involve
mesh-dependent coefficients. This latter characteristic is quite valuable for
its use in h-adaptive numerical schemes, since in such schemes the coefficients
that depend on the disposition and sizes of the neighboring cells (and that are
present in many existing WENO-like reconstructions) would need to be recomputed
after every mesh adaption.
In the second part of the paper we propose a third order h-adaptive scheme
with the above-mentioned reconstruction, an explicit third order TVD
Runge-Kutta scheme and the entropy production error indicator proposed by Puppo
and Semplice (Commun. Comput. Phys., 2011). After devising some heuristics on
the choice of the parameters controlling the mesh adaption, we demonstrate with
many numerical tests that the scheme can compute numerical solution whose error
decays as , where is the average
number of cells used during the computation, even in the presence of shock
waves, by making a very effective use of h-adaptivity and the proposed third
order reconstruction.Comment: many updates to text and figure
Study of interpolation methods for high-accuracy computations on overlapping grids
Overset strategy can be an efficient way to keep high-accuracy discretization by decomposing a complex geometry in topologically simple subdomains. Apart from the grid assembly algorithm, the key point of overset technique lies in the interpolation processes which ensure the communications between the overlapping grids. The family of explicit Lagrange and optimized interpolation schemes is studied. The a priori interpolation error is analyzed in the Fourier space, and combined with the error of the chosen discretization to highlight the modification of the numerical error. When high-accuracy algorithms are used an optimization of the interpolation coefficients can enhance the resolvality, which can be useful when high-frequency waves or small turbulent scales need to be supported by a grid. For general curvilinear grids in more than one space dimension, a mapping in a computational space followed by a tensorization of 1-D interpolations is preferred to a direct evaluation of the coefficient in the physical domain. A high-order extension of the isoparametric mapping is accurate and robust since it avoids the inversion of a matrix which may be ill-conditioned. A posteriori error analyses indicate that the interpolation stencil size must be tailored to the accuracy of the discretization scheme. For well discretized wavelengthes, the results show that the choice of a stencil smaller than the stencil of the corresponding finite-difference scheme can be acceptable. Besides the gain of optimization to capture high-frequency phenomena is also underlined. Adding order constraints to the optimization allows an interesting trade-off when a large range of scales is considered. Finally, the ability of the present overset strategy to preserve accuracy is illustrated by the diffraction of an acoustic source by two cylinders, and the generation of acoustic tones in a rotor–stator interaction. Some recommandations are formulated in the closing section
Opt: A Domain Specific Language for Non-linear Least Squares Optimization in Graphics and Imaging
Many graphics and vision problems can be expressed as non-linear least
squares optimizations of objective functions over visual data, such as images
and meshes. The mathematical descriptions of these functions are extremely
concise, but their implementation in real code is tedious, especially when
optimized for real-time performance on modern GPUs in interactive applications.
In this work, we propose a new language, Opt (available under
http://optlang.org), for writing these objective functions over image- or
graph-structured unknowns concisely and at a high level. Our compiler
automatically transforms these specifications into state-of-the-art GPU solvers
based on Gauss-Newton or Levenberg-Marquardt methods. Opt can generate
different variations of the solver, so users can easily explore tradeoffs in
numerical precision, matrix-free methods, and solver approaches. In our
results, we implement a variety of real-world graphics and vision applications.
Their energy functions are expressible in tens of lines of code, and produce
highly-optimized GPU solver implementations. These solver have performance
competitive with the best published hand-tuned, application-specific GPU
solvers, and orders of magnitude beyond a general-purpose auto-generated
solver
Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines
Image processing pipelines combine the challenges of stencil computations and stream programs. They are composed of large graphs of different stencil stages, as well as complex reductions, and stages with global or data-dependent access patterns. Because of their complex structure, the performance difference between a naive implementation of a pipeline and an optimized one is often an order of magnitude. Efficient implementations require optimization of both parallelism and locality, but due to the nature of stencils, there is a fundamental tension between parallelism, locality, and introducing redundant recomputation of shared values.
We present a systematic model of the tradeoff space fundamental to stencil pipelines, a schedule representation which describes concrete points in this space for each stage in an image processing pipeline, and an optimizing compiler for the Halide image processing language that synthesizes high performance implementations from a Halide algorithm and a schedule. Combining this compiler with stochastic search over the space of schedules enables terse, composable programs to achieve state-of-the-art performance on a wide range of real image processing pipelines, and across different hardware architectures, including multicores with SIMD, and heterogeneous CPU+GPU execution. From simple Halide programs written in a few hours, we demonstrate performance up to 5x faster than hand-tuned C, intrinsics, and CUDA implementations optimized by experts over weeks or months, for image processing applications beyond the reach of past automatic compilers.United States. Dept. of Energy (Award DE-SC0005288)National Science Foundation (U.S.) (Grant 0964004)Intel CorporationCognex CorporationAdobe System
Halide: a language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines
Image processing pipelines combine the challenges of stencil computations and stream programs. They are composed of large graphs of different stencil stages, as well as complex reductions, and stages with global or data-dependent access patterns. Because of their complex structure, the performance difference between a naive implementation of a pipeline and an optimized one is often an order of magnitude. Efficient implementations require optimization of both parallelism and locality, but due to the nature of stencils, there is a fundamental tension between parallelism, locality, and introducing redundant recomputation of shared values.
We present a systematic model of the tradeoff space fundamental to stencil pipelines, a schedule representation which describes concrete points in this space for each stage in an image processing pipeline, and an optimizing compiler for the Halide image processing language that synthesizes high performance implementations from a Halide algorithm and a schedule. Combining this compiler with stochastic search over the space of schedules enables terse, composable programs to achieve state-of-the-art performance on a wide range of real image processing pipelines, and across different hardware architectures, including multicores with SIMD, and heterogeneous CPU+GPU execution. From simple Halide programs written in a few hours, we demonstrate performance up to 5x faster than hand-tuned C, intrinsics, and CUDA implementations optimized by experts over weeks or months, for image processing applications beyond the reach of past automatic compilers.United States. Dept. of Energy (Award DE-SC0005288)National Science Foundation (U.S.) (Grant 0964004)Intel CorporationCognex CorporationAdobe System
- …