Search CORE

1,159 research outputs found

A high-order semi-explicit discontinuous Galerkin solver for 3D incompressible flow with application to DNS and LES of turbulent channel flow

Author: Fehn Niklas
Krank Benjamin
Kronbichler Martin
Wall Wolfgang A.
Publication venue: 'Elsevier BV'
Publication date: 05/07/2016
Field of study

We present an efficient discontinuous Galerkin scheme for simulation of the incompressible Navier-Stokes equations including laminar and turbulent flow. We consider a semi-explicit high-order velocity-correction method for time integration as well as nodal equal-order discretizations for velocity and pressure. The non-linear convective term is treated explicitly while a linear system is solved for the pressure Poisson equation and the viscous term. The key feature of our solver is a consistent penalty term reducing the local divergence error in order to overcome recently reported instabilities in spatially under-resolved high-Reynolds-number flows as well as small time steps. This penalty method is similar to the grad-div stabilization widely used in continuous finite elements. We further review and compare our method to several other techniques recently proposed in literature to stabilize the method for such flow configurations. The solver is specifically designed for large-scale computations through matrix-free linear solvers including efficient preconditioning strategies and tensor-product elements, which have allowed us to scale this code up to 34.4 billion degrees of freedom and 147,456 CPU cores. We validate our code and demonstrate optimal convergence rates with laminar flows present in a vortex problem and flow past a cylinder and show applicability of our solver to direct numerical simulation as well as implicit large-eddy simulation of turbulent channel flow at

Re_{\tau}=180

as well as

590

.Comment: 28 pages, in preparation for submission to Journal of Computational Physic

arXiv.org e-Print Archive

OPUS Augsburg

Computational fluid dynamics using Graphics Processing Units: Challenges and opportunities

Author: Sahu Kirti Chandra
Shinn A F
Vanka S P
Publication venue
Publication date: 01/01/2011
Field of study

A new paradigm for computing fluid flows is the use of Graphics Processing Units (GPU), which have recently become very powerful and convenient to use. In the past three years, we have implemented five different fluid flow algorithms on GPUs and have obtained significant speed-ups over a single CPU. Typically, it is possible to achieve a factor of 50-100 over a single CPU. In this review paper, we describe our experiences on the various algorithms developed and the speeds achieved

Crossref

Research Archive of Indian Institute of Technology Hyderabad

Analysis of Iterative Methods for the Steady and Unsteady Stokes Problem: Application to Spectral Element Discretizations

Author: Maday Yvon
Meiron Dan
Patera Anthony T.
Rønquist Einar M.
Publication venue: 'The Japan Society for Industrial and Applied Mathematics'
Publication date: 01/01/1993
Field of study

A new and detailed analysis of the basic Uzawa algorithm for decoupling of the pressure and the velocity in the steady and unsteady Stokes operator is presented. The paper focuses on the following new aspects: explicit construction of the Uzawa pressure-operator spectrum for a semiperiodic model problem; general relationship of the convergence rate of the Uzawa procedure to classical inf-sup discretization analysis; and application of the method to high-order variational discretization

CiteSeerX

Caltech Authors

Multi-Level Parallelism for Incompressible Flow Computations on GPU Clusters

Author: Jacobsen Dana A.
Senocak Inanc
Publication venue: 'IUScholarWorks'
Publication date: 01/01/2013
Field of study

We investigate multi-level parallelism on GPU clusters with MPI-CUDA and hybrid MPI-OpenMP-CUDA parallel implementations, in which all computations are done on the GPU using CUDA. We explore efficiency and scalability of incompressible flow computations using up to 256 GPUs on a problem with approximately 17.2 billion cells. Our work addresses some of the unique issues faced when merging fine-grain parallelism on the GPU using CUDA with coarse-grain parallelism that use either MPI or MPI-OpenMP for communications. We present three different strategies to overlap computations with communications, and systematically assess their impact on parallel performance on two different GPU clusters. Our results for strong and weak scaling analysis of incompressible flow computations demonstrate that GPU clusters offer significant benefits for large data sets, and a dual-level MPI-CUDA implementation with maximum overlapping of computation and communication provides substantial benefits in performance. We also find that our tri-level MPI-OpenMP-CUDA parallel implementation does not offer a significant advantage in performance over the dual-level implementation on GPU clusters with two GPUs per node, but on clusters with higher GPU counts per node or with different domain decomposition strategies a tri-level implementation may exhibit higher efficiency than a dual-level implementation and needs to be investigated further

Boise State University - ScholarWorks

GPU-Accelerated Large-Eddy Simulation of Turbulent Channel Flows

Author: Antoniou A.S.
Briggs W. L.
Cheng W.
Chorin A.J.
Chung D.
Deardorff J.W.
Driest E.V.
Geveler M.
Griebel M.
Hoyas S.
Jacobsen D.A.
Jacobsen D.A.
Jacobsen D.A.
Kerr A.
Kogge P. M.
Meneveau C.
Smagorinksy J.
The Portland Group
Thibault J.C.
Publication venue: 'IUScholarWorks'
Publication date: 09/01/2012
Field of study

High performance computing clusters that are augmented with cost and power efficient graphics processing unit (GPU) provide new opportunities to broaden the use of large-eddy simulation technique to study high Reynolds number turbulent flows in fluids engineering applications. In this paper, we extend our earlier work on multi-GPU acceleration of an incompressible Navier-Stokes solver to include a large-eddy simulation (LES) capability. In particular, we implement the Lagrangian dynamic subgrid scale model and compare our results against existing direct numerical simulation (DNS) data of a turbulent channel flow at Reτ = 180. Overall, our LES results match fairly well with the DNS data. Our results show that the Reτ = 180 case can be entirely simulated on a single GPU, whereas higher Reynolds cases can benefit from a GPU cluster

Crossref

Boise State University - ScholarWorks