Search CORE

8,451 research outputs found

Highly accelerated simulations of glassy dynamics using GPUs: caveats on limited floating-point precision

Author: Anderson
Block
Dekker
Felix Höfling
Flenner
Frenkel
Götze
Hansen
Harvey
Knuth
Knuth
Kob
Kob
Kob
Lippert
Liu
Mosayebi
Peter H. Colberg
Plimpton
Preis
Rapaport
Sagan
Stone
van Meel
Voelz
Weeks
Xu
Yang
Zagha
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

Modern graphics processing units (GPUs) provide impressive computing resources, which can be accessed conveniently through the CUDA programming interface. We describe how GPUs can be used to considerably speed up molecular dynamics (MD) simulations for system sizes ranging up to about 1 million particles. Particular emphasis is put on the numerical long-time stability in terms of energy and momentum conservation, and caveats on limited floating-point precision are issued. Strict energy conservation over 10^8 MD steps is obtained by double-single emulation of the floating-point arithmetic in accuracy-critical parts of the algorithm. For the slow dynamics of a supercooled binary Lennard-Jones mixture, we demonstrate that the use of single-floating point precision may result in quantitatively and even physically wrong results. For simulations of a Lennard-Jones fluid, the described implementation shows speedup factors of up to 80 compared to a serial implementation for the CPU, and a single GPU was found to compare with a parallelised MD simulation using 64 distributed cores.Comment: 12 pages, 7 figures, to appear in Comp. Phys. Comm., HALMD package licensed under the GPL, see http://research.colberg.org/projects/halm

arXiv.org e-Print Archive

Institute of Transport Research:Publications

Crossref

A GPU-accelerated package for simulation of flow in nanoporous source rocks with many-body dissipative particle dynamics

Author: Andrew Matthew
Blumers Ansel
Deo Milind
Goral Jan
Huang Hai
Kane Joshua
Li Zhen
Luo Lixiang
Tang Yu-Hang
Xia Yidong
Publication venue: 'Elsevier BV'
Publication date: 25/03/2019
Field of study

Mesoscopic simulations of hydrocarbon flow in source shales are challenging, in part due to the heterogeneous shale pores with sizes ranging from a few nanometers to a few micrometers. Additionally, the sub-continuum fluid-fluid and fluid-solid interactions in nano- to micro-scale shale pores, which are physically and chemically sophisticated, must be captured. To address those challenges, we present a GPU-accelerated package for simulation of flow in nano- to micro-pore networks with a many-body dissipative particle dynamics (mDPD) mesoscale model. Based on a fully distributed parallel paradigm, the code offloads all intensive workloads on GPUs. Other advancements, such as smart particle packing and no-slip boundary condition in complex pore geometries, are also implemented for the construction and the simulation of the realistic shale pores from 3D nanometer-resolution stack images. Our code is validated for accuracy and compared against the CPU counterpart for speedup. In our benchmark tests, the code delivers nearly perfect strong scaling and weak scaling (with up to 512 million particles) on up to 512 K20X GPUs on Oak Ridge National Laboratory's (ORNL) Titan supercomputer. Moreover, a single-GPU benchmark on ORNL's SummitDev and IBM's AC922 suggests that the host-to-device NVLink can boost performance over PCIe by a remarkable 40\%. Lastly, we demonstrate, through a flow simulation in realistic shale pores, that the CPU counterpart requires 840 Power9 cores to rival the performance delivered by our package with four V100 GPUs on ORNL's Summit architecture. This simulation package enables quick-turnaround and high-throughput mesoscopic numerical simulations for investigating complex flow phenomena in nano- to micro-porous rocks with realistic pore geometries

arXiv.org e-Print Archive

Clemson University: TigerPrints

Accelerated Modeling of Near and Far-Field Diffraction for Coronagraphic Optical Systems

Author: Abdellah
Akeret
Cooke
Cooley
Douglas
Douglas
Fangohr
Frigo
Greenfield
Greenfield
Hirst
Jones
Kluyver
Lawrence
Lumbres
Macintosh
Marois
Mendillo
Morgan
Noecker
Pavlyk
Perrin
Shimobaba
Soummer
Steinbach
Stone
Yamamoto
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 17/06/2018
Field of study

Accurately predicting the performance of coronagraphs and tolerancing optical surfaces for high-contrast imaging requires a detailed accounting of diffraction effects. Unlike simple Fraunhofer diffraction modeling, near and far-field diffraction effects, such as the Talbot effect, are captured by plane-to-plane propagation using Fresnel and angular spectrum propagation. This approach requires a sequence of computationally intensive Fourier transforms and quadratic phase functions, which limit the design and aberration sensitivity parameter space which can be explored at high-fidelity in the course of coronagraph design. This study presents the results of optimizing the multi-surface propagation module of the open source Physical Optics Propagation in PYthon (POPPY) package. This optimization was performed by implementing and benchmarking Fourier transforms and array operations on graphics processing units, as well as optimizing multithreaded numerical calculations using the NumExpr python library where appropriate, to speed the end-to-end simulation of observatory and coronagraph optical systems. Using realistic systems, this study demonstrates a greater than five-fold decrease in wall-clock runtime over POPPY's previous implementation and describes opportunities for further improvements in diffraction modeling performance.Comment: Presented at SPIE ASTI 2018, Austin Texas. 11 pages, 6 figure

arXiv.org e-Print Archive

Crossref

Fast computing of scattering maps of nanostructures using graphical processing units

Author: Butler
Diaz
Favre-Nicolin
Gelisio
Grosse-Kunstleve
Gutmann
Henke
Hubert Renevier
Immirzi
Johann Coraux
Katcho
Keating
Kegel
Langs
Marie-Ingrid Richard
Minkevich
Nield
Niquet
Plimpton
Proffen
Richard
Robinson
Schmeisser
Schmidbauer
Stangl
Stillinger
Takagi
Tardif
Ten Eyck
Ten Eyck
Tersoff
Vincent Favre-Nicolin
Welberry
Wintersberger
Publication venue: 'International Union of Crystallography (IUCr)'
Publication date: 02/04/2011
Field of study

Scattering maps from strained or disordered nano-structures around a Bragg reflection can either be computed quickly using approximations and a (Fast) Fourier transform, or using individual atomic positions. In this article we show that it is possible to compute up to 4.10^10 $reflections.atoms/s using a single graphic card, and we evaluate how this speed depends on number of atoms and points in reciprocal space. An open-source software library (PyNX) allowing easy scattering computations (including grazing incidence conditions) in the Python language is described, with examples of scattering from non-ideal nanostructures.Comment: 7 pages, 4 figure

arXiv.org e-Print Archive

Crossref

Hal - Université Grenoble Alpes

HAL-CEA

NLSEmagic: Nonlinear Schr\"odinger Equation Multidimensional Matlab-based GPU-accelerated Integrators using Compact High-order Schemes

Author: Butcher
Caplan
Carretero-González
Castillo
Dai
Davis
Debnath
Diethelm
Donoho
Jang
Kelley
Kevrekidis
Kivshar
Michéa
Muruganandam
Post
R.M. Caplan
Roberts
Stone
Stratton
Vetter
Vudragovic
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

We present a simple to use, yet powerful code package called NLSEmagic to numerically integrate the nonlinear Schr\"odinger equation in one, two, and three dimensions. NLSEmagic is a high-order finite-difference code package which utilizes graphic processing unit (GPU) parallel architectures. The codes running on the GPU are many times faster than their serial counterparts, and are much cheaper to run than on standard parallel clusters. The codes are developed with usability and portability in mind, and therefore are written to interface with MATLAB utilizing custom GPU-enabled C codes with the MEX-compiler interface. The packages are freely distributed, including user manuals and set-up files.Comment: 37 pages, 13 figure

arXiv.org e-Print Archive

CiteSeerX

Crossref