
1,687 research outputs found

Enhanced molecular dynamics performance with a programmable graphics processor

Author: Anderson
D.C. Rapaport
Harris
Nyland
Owens
Rahman
Rapaport
Stone
van Meel
Verlet
Publication venue: 'Elsevier BV'
Publication date: 26/01/2011

Design considerations for molecular dynamics algorithms capable of taking advantage of the computational power of a graphics processing unit (GPU) are described. Accommodating the constraints of scalable streaming-multiprocessor hardware necessitates a reformulation of the underlying algorithm. Performance measurements demonstrate the considerable benefit and cost-effectiveness of such an approach, which produces a factor of 2.5 speed improvement over previous work for the case of the soft-sphere potential. Comment: 20 pages (v2: minor additions and changes; v3: corrected typos
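The abstract does not define the soft-sphere potential. As a rough illustration only, the sketch below assumes the purely repulsive form common in the molecular dynamics literature: a Lennard-Jones potential truncated at its minimum, r_c = 2^(1/6) sigma, and shifted up by epsilon so that it vanishes at the cutoff. The function names and the reduced units (epsilon = sigma = 1) are hypothetical choices, not taken from the paper.

```python
R_CUT = 2.0 ** (1.0 / 6.0)  # assumed cutoff: the Lennard-Jones minimum, in units of sigma


def soft_sphere_energy(r, epsilon=1.0, sigma=1.0):
    """Pair energy of the assumed truncated, shifted Lennard-Jones form."""
    if r >= R_CUT * sigma:
        return 0.0
    sr6 = (sigma / r) ** 6
    return 4.0 * epsilon * (sr6 * sr6 - sr6) + epsilon


def soft_sphere_force(r, epsilon=1.0, sigma=1.0):
    """Magnitude of the repulsive pair force, -du/dr, for the same form."""
    if r >= R_CUT * sigma:
        return 0.0
    sr6 = (sigma / r) ** 6
    return 48.0 * epsilon * (sr6 * sr6 - 0.5 * sr6) / r
```

Because the interaction is short-ranged, each particle only interacts with near neighbours, which is one reason potentials of this kind are often used in performance comparisons.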

arXiv.org e-Print Archive

High Performance Direct Gravitational N-body Simulations on Graphics Processing Units

Author: Belleman Robert
Geldof Peter
Zwart Simon Portegies
Publication venue: 'Elsevier BV'
Publication date: 01/01/2007

We present the results of gravitational direct N-body simulations using the commercial graphics processing units (GPU) NVIDIA Quadro FX1400 and GeForce 8800GTX, and compare the results with GRAPE-6Af special purpose hardware. The force evaluation of the N-body problem was implemented in Cg, using the GPU directly to speed up the calculations. The integration of the equations of motion, running on the host computer, was implemented in C using the 4th order predictor-corrector Hermite integrator with block time steps. We find that for a large number of particles (N ≳ 10^4) modern graphics processing units offer an attractive low cost alternative to GRAPE special purpose hardware. A modern GPU continues to give a relatively flat scaling with the number of particles, comparable to that of the GRAPE. Using the same time step criterion the total energy of the N-body system was conserved to better than one part in 10^6 on the GPU, which is only about an order of magnitude worse than obtained with GRAPE. For N ≳ 10^6 the GeForce 8800GTX was about 20 times faster than the host computer. Though still about an order of magnitude slower than GRAPE, modern GPUs outperform GRAPE in their low cost, long mean time between failure and the much larger onboard memory; the GRAPE-6Af holds at most 256k particles whereas the GeForce 8800GTX can hold 9 million particles in memory. Comment: Submitted to New Astronom
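The force evaluation described in this abstract is a direct pairwise summation, which costs O(N^2) operations per step. The following is a minimal serial sketch of that summation and of the total-energy check used to judge accuracy. It is not the authors' Cg code and does not include the Hermite integrator; the function names, the softening parameter `eps`, and the units with G = 1 are assumptions made for illustration.

```python
import math


def accelerations(pos, mass, G=1.0, eps=0.0):
    """Direct-summation gravitational acceleration on every particle.

    The outer loop handles one particle at a time, which is the part a
    data-parallel implementation would typically assign to one thread.
    """
    n = len(pos)
    acc = [[0.0, 0.0, 0.0] for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                continue
            d = [pos[j][k] - pos[i][k] for k in range(3)]
            r2 = d[0] ** 2 + d[1] ** 2 + d[2] ** 2 + eps * eps
            inv_r3 = 1.0 / (r2 * math.sqrt(r2))
            for k in range(3):
                acc[i][k] += G * mass[j] * d[k] * inv_r3
    return acc


def total_energy(pos, vel, mass, G=1.0, eps=0.0):
    """Kinetic plus potential energy, the quantity monitored for conservation."""
    n = len(pos)
    kinetic = sum(0.5 * m * (v[0] ** 2 + v[1] ** 2 + v[2] ** 2)
                  for m, v in zip(mass, vel))
    potential = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            r2 = sum((pos[j][k] - pos[i][k]) ** 2 for k in range(3)) + eps * eps
            potential -= G * mass[i] * mass[j] / math.sqrt(r2)
    return kinetic + potential
```

Comparing `total_energy` before and after a run gives the relative energy error that the abstract quotes as a measure of integration quality.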

arXiv.org e-Print Archive

CiteSeerX

Quantum ESPRESSO: a modular and open-source software project for quantum simulations of materials

Author: Ahlrichs R Furche F Hättig C Klopper W M Sierka M Weigend F
Alexander Smogunov
Alfredo Pasquarello
Allen M P
Anderson E
Andrea Dal Corso
Anton Kokalj
Ari P Seitsonen
Baerends E J
Blackford L S
Bussi G
Bylaska E J
Carlo Cavazzoni
Carlo Sbraccia
Christos Gougoussis
Dabo I Cances E Li Y Marzari N
Dal Corso A
Davide Ceresoli
di Meo R Dal Corso A Giannozzi P Cozzini S
Dovesi R
Dreizler R M
Fletcher R
Francesco Mauri
Frisch M J
Gabriele Sclauzero
Giannozzi P
Guido Fratesi
Guido L Chiarotti
Gygi F
Hellmann H
Isaev E
Ismaila Dabo
Koelling D D
Kokalj A
Kresse G
Layla Martin-Samos
Lorenzo Paulatto
MacDonald A H
Martin R M
Marx D
Marzari N
Matteo Calandra
Matteo Cococcioni
Michele Lazzeri
Mosca Conte A
Nguyen H-V
Nicola Bonini
Nicola Marzari
Paolo Giannozzi
Paolo Umari
Parr R G
Ralph Gebauer
Renata M Wentzcovitch
Riccardo Mazzarello
Roberto Car
Sandro Scandolo
Stefano Baroni
Stefano de Gironcoli
Stefano Fabris
Stefano Paolini
Szabo A
Thonhauser T Ceresoli D Mostofi A Marzari N Resta R Vanderbilt D
Uwe Gerstmann
Publication venue: 'IOP Publishing'
Publication date: 01/01/2009

Quantum ESPRESSO is an integrated suite of computer codes for electronic-structure calculations and materials modeling, based on density-functional theory, plane waves, and pseudopotentials (norm-conserving, ultrasoft, and projector-augmented wave). Quantum ESPRESSO stands for "opEn Source Package for Research in Electronic Structure, Simulation, and Optimization". It is freely available to researchers around the world under the terms of the GNU General Public License. Quantum ESPRESSO builds upon newly-restructured electronic-structure codes that have been developed and tested by some of the original authors of novel electronic-structure algorithms and applied in the last twenty years by some of the leading materials modeling groups worldwide. Innovation and efficiency are still its main focus, with special attention paid to massively-parallel architectures, and a great effort being devoted to user friendliness. Quantum ESPRESSO is evolving towards a distribution of independent and inter-operable codes in the spirit of an open-source project, where researchers active in the field of electronic-structure calculations are encouraged to participate in the project by contributing their own codes or by implementing their own ideas into existing codes. Comment: 36 pages, 5 figures, resubmitted to J.Phys.: Condens. Matte

Hal-Diderot

HAL-Ecole des Ponts ParisTech

Archivio istituzionale della ricerca - Università di Padova

arXiv.org e-Print Archive

INRIA a CCSD electronic archive server

King's Research Portal

Archivio della ricerca- Università di Roma La Sapienza

Simulation of 1+1 dimensional surface growth and lattice gases using GPUs

Author: Amir
Barabási
Barma
Bernaschi
Castro
Chowdhury
Edwards
Facsko
Family
Forster
Gergely Ódor
Gross
Géza Ódor
Halpin-Healy
Harris
Henrik Schulz
Hinrichsen
Hwa
Janssen
Juhász
Kardar
Katz
Klumpp
Krug
Ligget
Meakin
Máté Ferenc Nagy
Plischke
Preis
Prähofer
Sasamoto
Schittmann
Schliwa
van Beijeren
van Meel
Weigel
Ódor
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011

Restricted solid-on-solid surface growth models can be mapped onto binary lattice gases. We show that efficient simulation algorithms can be realized on GPUs either by CUDA or by OpenCL programming. We consider a deposition/evaporation model following Kardar-Parisi-Zhang growth in 1+1 dimensions related to the Asymmetric Simple Exclusion Process and show that, for sizes that fit into the shared memory of GPUs, one can achieve the maximum parallelization speedup (~100x for a Quadro FX 5800 graphics card with respect to a single 2.67 GHz CPU). This permits us to study the effect of quenched columnar disorder, requiring extremely long simulation times. We compare the CUDA realization with an OpenCL implementation designed for processor clusters via MPI. A two-lane traffic model with randomized turning points is also realized and the dynamical behavior has been investigated. Comment: 20 pages, 12 figures, 1 table, to appear in Comp. Phys. Com
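The mapping mentioned at the start of this abstract can be illustrated with a short serial sketch. It assumes a periodic 1+1 dimensional profile whose neighbouring heights differ by exactly 1, and it adopts the convention that a down-step is a particle and an up-step is a hole. Under that convention, deposition at a local minimum is a particle hopping right, and evaporation at a local maximum is a particle hopping left. The function names, the probabilities `p` and `q`, and the random sequential update order are hypothetical; the paper's GPU update scheme is not reproduced here.

```python
import random


def heights_to_gas(heights):
    """Map a periodic unit-step height profile to lattice-gas occupancies."""
    n = len(heights)
    gas = []
    for i in range(n):
        step = heights[(i + 1) % n] - heights[i]
        if step not in (-1, 1):
            raise ValueError("neighbouring heights must differ by exactly 1")
        gas.append(1 if step == -1 else 0)  # down-step = particle
    return gas


def update_bond(gas, i, p, q, u):
    """Attempt one move on bond (i, i+1) with a uniform random number u in [0, 1)."""
    j = (i + 1) % len(gas)
    if gas[i] == 1 and gas[j] == 0 and u < p:
        gas[i], gas[j] = 0, 1  # local minimum filled: deposition
    elif gas[i] == 0 and gas[j] == 1 and u < q:
        gas[i], gas[j] = 1, 0  # local maximum removed: evaporation


def sweep(gas, p, q, rng):
    """One Monte Carlo sweep: len(gas) randomly chosen bond updates."""
    for _ in range(len(gas)):
        update_bond(gas, rng.randrange(len(gas)), p, q, rng.random())
```

Each move only swaps the two occupancies on a bond, so the particle number is conserved, which matches the exclusion-process picture the abstract refers to.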

arXiv.org e-Print Archive

ELTE Digital Institutional Repository (EDIT)