Search CORE

98,959 research outputs found

A multiarchitecture parallel-processing development environment

Author: Blech Richard
Cole Gary
Townsend Scott
Publication venue
Publication date
Field of study

A description is given of the hardware and software of a multiprocessor test bed - the second generation Hypercluster system. The Hypercluster architecture consists of a standard hypercube distributed-memory topology, with multiprocessor shared-memory nodes. By using standard, off-the-shelf hardware, the system can be upgraded to use rapidly improving computer technology. The Hypercluster's multiarchitecture nature makes it suitable for researching parallel algorithms in computational field simulation applications (e.g., computational fluid dynamics). The dedicated test-bed environment of the Hypercluster and its custom-built software allows experiments with various parallel-processing concepts such as message passing algorithms, debugging tools, and computational 'steering'. Such research would be difficult, if not impossible, to achieve on shared, commercial systems

NASA Technical Reports Server

Molecular dynamics for fluid mechanics in arbitrary geometries

Author: Macpherson Graham B.
Reese Jason M.
Publication venue: 'ASME International'
Publication date: 23/07/2008
Field of study

Simulations of nanoscale systems where fluid mechanics plays an important role are required to help design and understand nano-devices and biological systems. A simulation method which hybridises molecular dynamics (MD) and continuum computational fluid dynamics (CFD) models is able to accurately represent the relevant physical phenomena and be computationally tractable. An MD code has been written to perform MD simulations in systems where the geometry is described by a mesh of unstructured arbitrary polyhedral cells that have been spatially decomposed into irregular portions for parallel processing. The MD code that has been developed may be used for simulations on its own, or may serve as the MD component of a hybrid method. The code has been implemented using OpenFOAM, an open source C++ CFD toolbox (www.openfoam.org). The requirements for two key enabling components are described. 1) Parallel generation of initial configurations of molecules in arbitrary geometries. 2) Calculation of intermolecular pair forces, including between molecules that lie on mesh portions assigned to different, and possibly non-neighbouring processors. A case study of flow in a realistic nanoscale mixing channel, where the geometry is drawn and meshed in engineering CAD tools is simulated to demonstrate the capabilities of the code

University of Strathclyde Institutional Repository

The TheLMA project: Multi-GPU Implementation of the Lattice Boltzmann Method

Author: Kuznik F.
Obrecht C.
Roux J.-J.
Tourancheau Bernard
Publication venue: 'SAGE Publications'
Publication date: 29/06/2011
Field of study

International audienceIn this paper, we describe the implementation of a multi-graphical processing unit (GPU) fluid flow solver based on the lattice Boltzmann method (LBM). The LBM is a novel approach in computational fluid dynamics, with numerous interesting features from a computational, numerical, and physical standpoint. Our program is based on CUDA and uses POSIX threads to manage multiple computation devices. Using recently released hardware, our solver may therefore run eight GPUs in parallel, which allows us to perform simulations at a rather large scale. Performance and scalability are excellent, the speedup over sequential implementations being at least of two orders of magnitude. In addition, we discuss tiling and communication issues for present and forthcoming implementations

HAL-ENS-LYON

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Hal-Diderot

Instrumentation, performance visualization, and debugging tools for multiprocessors

Author: Fineman Charles E.
Hontalas Philip J.
Yan Jerry C.
Publication venue
Publication date
Field of study

The need for computing power has forced a migration from serial computation on a single processor to parallel processing on multiprocessor architectures. However, without effective means to monitor (and visualize) program execution, debugging, and tuning parallel programs becomes intractably difficult as program complexity increases with the number of processors. Research on performance evaluation tools for multiprocessors is being carried out at ARC. Besides investigating new techniques for instrumenting, monitoring, and presenting the state of parallel program execution in a coherent and user-friendly manner, prototypes of software tools are being incorporated into the run-time environments of various hardware testbeds to evaluate their impact on user productivity. Our current tool set, the Ames Instrumentation Systems (AIMS), incorporates features from various software systems developed in academia and industry. The execution of FORTRAN programs on the Intel iPSC/860 can be automatically instrumented and monitored. Performance data collected in this manner can be displayed graphically on workstations supporting X-Windows. We have successfully compared various parallel algorithms for computational fluid dynamics (CFD) applications in collaboration with scientists from the Numerical Aerodynamic Simulation Systems Division. By performing these comparisons, we show that performance monitors and debuggers such as AIMS are practical and can illuminate the complex dynamics that occur within parallel programs

NASA Technical Reports Server

Design and Implementation of Particle Systems for Meshfree Methods with High Performance

Author: Bilotta Giuseppe
Hérault Alexis
Zago Vito
Publication venue: 'IntechOpen'
Publication date: 31/12/2018
Field of study

Particle systems, commonly associated with computer graphics, animation, and video games, are an essential component in the implementation of numerical methods ranging from the meshfree methods for computational fluid dynamics and related applications (e.g., smoothed particle hydrodynamics, SPH) to minimization methods for arbitrary problems (e.g., particle swarm optimization, PSO). These methods are frequently embarrassingly parallel in nature, making them a natural fit for implementation on massively parallel computational hardware such as modern graphics processing units (GPUs). However, naive implementations fail to fully exploit the capabilities of this hardware. We present practical solutions to the challenges faced in the efficient parallel implementation of these particle systems, with a focus on performance, robustness, and flexibility. The techniques are illustrated through GPUSPH, the first implementation of SPH to run completely on GPU, and currently supporting multi-GPU clusters, uniform precision independent of domain size, and multiple SPH formulations

IntechOpen

Crossref

Predicting Flows of Rarefied Gases

Author: LeBeau Gerald J.
Wilmoth Richard G.
Publication venue
Publication date
Field of study

DSMC Analysis Code (DAC) is a flexible, highly automated, easy-to-use computer program for predicting flows of rarefied gases -- especially flows of upper-atmospheric, propulsion, and vented gases impinging on spacecraft surfaces. DAC implements the direct simulation Monte Carlo (DSMC) method, which is widely recognized as standard for simulating flows at densities so low that the continuum-based equations of computational fluid dynamics are invalid. DAC enables users to model complex surface shapes and boundary conditions quickly and easily. The discretization of a flow field into computational grids is automated, thereby relieving the user of a traditionally time-consuming task while ensuring (1) appropriate refinement of grids throughout the computational domain, (2) determination of optimal settings for temporal discretization and other simulation parameters, and (3) satisfaction of the fundamental constraints of the method. In so doing, DAC ensures an accurate and efficient simulation. In addition, DAC can utilize parallel processing to reduce computation time. The domain decomposition needed for parallel processing is completely automated, and the software employs a dynamic load-balancing mechanism to ensure optimal parallel efficiency throughout the simulation

NASA Technical Reports Server

Survey Report on Multi GPU Implementation in Open Source CFD Solver

Author: Suraj Salvi
Publication venue: 'Auricle Technologies, Pvt., Ltd.'
Publication date: 30/04/2016
Field of study

A CFD (Computational Fluid Dynamics) solver is used for simulation of fluid flow and heat transfer. It helps in analyzing the various parameters in fluid flow of scientific applications. OpenFOAM (Open Field Operation and Manipulation) [14] is an open source Software which can be used to solve Computational Fluid Dynamics (CFD) problems. The OpenFOAM solves various equations e.g. Mass Conservation, Conjugate Heat Transformer. OpenFOAM solves CFD problems serially as well as in parallel. The motivation of the project is to reduce time for solving CFD problems by implementing it on GPU. The OpenFOAM a software contribution which (Open Field Operation and Manipulation) is open source Tool. OpenFOAM has a many of features to get answer to anything from complex liquid (or gas) moves getting mixed in trouble chemical reactions, turbulence and heat give property in law, to solid driving power and electromagnetic. Existing OpenFOAM executes serially, by introducing parallelism, time required for execution of OpenFOAM can be reduced. For introducing parallelism, System requires CUDA (Compute Unified Device Architecture) enabled CPU (Central Processing Unit). To achieve parallelism, openFOAM requires CUDA support and support is achieved by of gpu library. Library is used to introduce parallelism for single GPU and multigpu can improve performance of OpenFOAM

International Journal on Recent and Innovation Trends in Computing and Communication

CFD code evaluation for internal flow modeling

Author: Chung T. J.
Publication venue
Publication date
Field of study

Research on the computational fluid dynamics (CFD) code evaluation with emphasis on supercomputing in reacting flows is discussed. Advantages of unstructured grids, multigrids, adaptive methods, improved flow solvers, vector processing, parallel processing, and reduction of memory requirements are discussed. As examples, researchers include applications of supercomputing to reacting flow Navier-Stokes equations including shock waves and turbulence and combustion instability problems associated with solid and liquid propellants. Evaluation of codes developed by other organizations are not included. Instead, the basic criteria for accuracy and efficiency have been established, and some applications on rocket combustion have been made. Research toward an ultimate goal, the most accurate and efficient CFD code, is in progress and will continue for years to come

NASA Technical Reports Server

Visualization and Tracking of Parallel CFD Simulations

Author: Kremenetsky Mark
Vaziri Arsi
Publication venue
Publication date
Field of study

We describe a system for interactive visualization and tracking of a 3-D unsteady computational fluid dynamics (CFD) simulation on a parallel computer. CM/AVS, a distributed, parallel implementation of a visualization environment (AVS) runs on the CM-5 parallel supercomputer. A CFD solver is run as a CM/AVS module on the CM-5. Data communication between the solver, other parallel visualization modules, and a graphics workstation, which is running AVS, are handled by CM/AVS. Partitioning of the visualization task, between CM-5 and the workstation, can be done interactively in the visual programming environment provided by AVS. Flow solver parameters can also be altered by programmable interactive widgets. This system partially removes the requirement of storing large solution files at frequent time steps, a characteristic of the traditional 'simulate (yields) store (yields) visualize' post-processing approach

NASA Technical Reports Server