6 research outputs found
Two-dimensional unsteadiness map of oblique shock wave/boundary layer interaction with sidewalls
The low-frequency unsteadiness of oblique shock wave/boundary layer interactions (SBLIs) has been investigated using large-eddy simulation (LES) and high-frequency pressure measurements from experiments. Particular attention has been paid to off-centreline behaviour: the LES dataset was generated including sidewalls and experimental pressure measurements were acquired across the entire span of the reflected shock foot. The datasets constitute the first maps of low-frequency unsteadiness in both streamwise and spanwise directions. The results reveal that significant low-frequency shock motion (with St ≈ 0.03) occurs away from the centreline, along most of the central separation shock and in the corner regions. The most powerful low frequency unsteadiness occurs offcentre, likely due to the separation shock being strengthened by shocks arising from the swept interactions on the sidewalls. Both simulation and experimental results exhibit asymmetry about the spanwise centre. In simulations, this may be attributed to a lack of statistical convergence; however, the fact that this is also seen in experiments is indicative that some SBLIs may exhibit some inherent asymmetry across the two spanwise halves of the separation bubble. There is also significant low-frequency power in the corner separations. The relation of the unsteadiness in the corner regions to that in the centre is investigated by means of two-point correlations: a key observation is that significant correlation does not extend across the attached flow channel between the central and corner separations
Batch solution of small PDEs with the OPS DSL
In this paper we discuss the challenges and optimisations opportunities when solving a large number of small, equally sized discretised PDEs on regular grids. We present an extension of the OPS (Oxford Parallel library for Structured meshes) embedded Domain Specific Language, and show how support can be added for solving multiple systems, and how OPS makes it easy to deploy a variety of transformations and optimisations. The new capabilities in OPS allow to automatically apply data structure transformations, as well as execution schedule transformations to deliver high performance on a variety of hardware platforms. We evaluate our work on an industrially representative finance simulation on Intel CPUs, as well as NVIDIA GPUs
Boundary conditions and vortex wandering
A direct numerical simulation of a Batchelor vortex has been carried out in the presence of freely-decaying turbulence, using both periodic and symmetric boundary conditions; the latter most closely approximates typical experimental conditions, while the former is often used in computational simulations for the purposes of numerical convenience. The higher-order velocity statistics were shown to be strongly dependent upon the boundary conditions, but the dependence could be mostly eliminated by correcting for the random, Gaussian modulation of the vortex trajectory commonly referred to as 'wandering' using a technique often employed in the analysis of experimental data. Once corrected for this wandering, the strong peaks in the Reynolds stresses normally observed at the vortex centre were replaced by smaller local extrema located within the core region but away from the centre. The distributions of the corrected Reynolds stresses suggested that the formation and organization of secondary structures within the core is the main mechanism in turbulent production during the linear growth phase of vortex development
Boundary conditions and vortex wandering
A direct numerical simulation of a Batchelor vortex has been carried out in the presence of freely-decaying turbulence, using both periodic and symmetric boundary conditions; the latter most closely approximates typical experimental conditions, while the former is often used in computational simulations for the purposes of numerical convenience. The higher-order velocity statistics were shown to be strongly dependent upon the boundary conditions, but the dependence could be mostly eliminated by correcting for the random, Gaussian modulation of the vortex trajectory commonly referred to as 'wandering' using a technique often employed in the analysis of experimental data. Once corrected for this wandering, the strong peaks in the Reynolds stresses normally observed at the vortex centre were replaced by smaller local extrema located within the core region but away from the centre. The distributions of the corrected Reynolds stresses suggested that the formation and organization of secondary structures within the core is the main mechanism in turbulent production during the linear growth phase of vortex development
Block structured compressible Navier-Stokes solution using the OPS high-level abstraction
In this paper we report the development and validation of a compressible solver with shock capturing, using a domain-specific high-level abstraction framework, OPS, that is being developed at the University of Oxford. OPS uses an active library approach for block-structured meshes, capable of generating codes for a variety of parallel implementations with different parallelization strategies. Performance results on various architectures are reported for the 1D Shu-Osher test case
Large-scale performance of a DSL-based multi-block structured-mesh application for direct numerical simulation
SBLI (Shock-wave/Boundary-layer Interaction) is a large-scale Computational Fluid Dynamics(CFD) application, developed over 20 years at the University of Southampton and extensively used within the UK Turbulence Consortium. It is capable of performing Direct Numerical Simulations (DNS) or Large Eddy Simulation (LES) of shock-wave/boundarylayer interaction problems over highly detailed multi-block structured mesh geometries. SBLI presents major challenges in data organization and movement that need to be overcome for continued high performance on emerging massively parallel hardware platforms. In this paper we present research in achieving this goal through the OPS embedded domainspecific language. OPS targets the domain of multi-block structured mesh applications. It provides an API embedded in C/C++ and Fortran and makes use of automatic code generation and compilation to produce executables capable of running on a range of parallel hardware systems. The core functionality of SBLI is captured using a new framework called OpenSBLI which enables a developer to declare the partial differential equations using Einstein notation and then automatically carryout discretization and generation of OPS (C/C++) API code. OPS is then used to automatically generate a wide range of parallel implementations. Using this multi-layered abstractions approach we demonstrate how new opportunities for further optimizations can be gained, such as fine-tuning the computation intensity and reducing data movement and apply them automatically. Performance results demonstrate there is no performance loss due to the high-level development strategy with OPS and OpenSBLI, with performance matching or exceeding the hand-tuned original code on all CPU nodes tested. The data movement optimizations provide over 3× speedups on CPU nodes, while GPUs provide 5× speedups over the best performing CPU node. The OPS generated parallel code also demonstrates excellent scalability on nearly 100K cores on a Cray XC30 (ARCHER at EPCC) and on over 4K GPUs on a CrayXK7 (Titan at ORNL)