Search CORE

134 research outputs found

Enhanced molecular dynamics performance with a programmable graphics processor

Author: Anderson
D.C. Rapaport
Harris
Nyland
Owens
Rahman
Rapaport
Rapaport
Stone
van Meel
Verlet
Publication venue: 'Elsevier BV'
Publication date: 26/01/2011
Field of study

Design considerations for molecular dynamics algorithms capable of taking advantage of the computational power of a graphics processing unit (GPU) are described. Accommodating the constraints of scalable streaming-multiprocessor hardware necessitates a reformulation of the underlying algorithm. Performance measurements demonstrate the considerable benefit and cost-effectiveness of such an approach, which produces a factor of 2.5 speed improvement over previous work for the case of the soft-sphere potential.Comment: 20 pages (v2: minor additions and changes; v3: corrected typos

arXiv.org e-Print Archive

Crossref

Cortical Surface Area Differentiates Familial High Risk Individuals Who Go on to Develop Schizophrenia

Author: Bois C.
Fletcher P.C.
Giles S.
Johnstone E.C.
Lawrie S.M.
Levita L.
McIntosh A.M.
Owens D.C.
Ronan L.
Whalley H.C.
Publication venue: 'Elsevier BV'
Publication date: 01/09/2015
Field of study

BACKGROUND: Schizophrenia is associated with structural brain abnormalities that may be present before disease onset. It remains unclear whether these represent general vulnerability indicators or are associated with the clinical state itself. METHODS: To investigate this, structural brain scans were acquired at two time points (mean scan interval 1.87 years) in a cohort of individuals at high familial risk of schizophrenia (n 5 142) and control subjects (n 5 36). Cortical reconstructions were generated using FreeSurfer. The high-risk cohort was subdivided into individuals that remained well during the study, individuals that had transient psychotic symptoms, and individuals that subsequently became ill. Baseline measures and longitudinal change in global estimates of thickness and surface area and lobar values were compared, focusing on overall differences between high-risk individuals and control subjects and then on group differences within the high-risk cohort. RESULTS: Longitudinally, control subjects showed a significantly greater reduction in cortical surface area compared with the high-risk group. Within the high-risk group, differences in surface area at baseline predicted clinical course, with individuals that subsequently became ill having significantly larger surface area than individuals that remained well during the study. For thickness, longitudinal reductions were most prominent in the frontal, cingulate, and occipital lobes in all high-risk individuals compared with control subjects. CONCLUSIONS: Our results suggest that larger surface areas at baseline may be associated with mechanisms that go above and beyond a general familial disposition. A relative preservation over time of surface area, coupled with a thinning of the cortex compared with control subjects, may serve as vulnerability markers of schizophrenia

Crossref

Edinburgh Research Archive

White Rose Research Online

An MPI-CUDA Implementation for Massively Parallel Incompressible Flow Computations on Multi-GPU Clusters

Author: Bolz J.
Brandvik T.
Buck I.
Elsen E.
Fan Z.
Goodnight N.
Griebel M.
Gropp W.
Göddeke D.
Göddeke D.
Göddeke D.
Harris M.J.
Hempel R.
Intel
Kindratenko V.
Krüger J.
Liu Y.
Owens J.D.
Schive H.
Showerman M.
Simek V.
Tölke J.
Wan D.C.
Zhao Y.
Publication venue: 'IUScholarWorks'
Publication date: 01/01/2010
Field of study

Modern graphics processing units (GPUs) with many-core architectures have emerged as general-purpose parallel computing platforms that can accelerate simulation science applications tremendously. While multi-GPU workstations with several TeraFLOPS of peak computing power are available to accelerate computational problems, larger problems require even more resources. Conventional clusters of central processing units (CPU) are now being augmented with multiple GPUs in each compute-node to tackle large problems. The heterogeneous architecture of a multi-GPU cluster with a deep memory hierarchy creates unique challenges in developing scalable and efficient simulation codes. In this study, we pursue mixed MPI-CUDA implementations and investigate three strategies to probe the efficiency and scalability of incompressible flow computations on the Lincoln Tesla cluster at the National Center for Supercomputing Applications (NCSA). We exploit some of the advanced features of MPI and CUDA programming to overlap both GPU data transfer and MPI communications with computations on the GPU. We sustain approximately 2.4 TeraFLOPS on the 64 nodes of the NCSA Lincoln Tesla cluster using 128 GPUs with a total of 30,720 processing elements. Our results demonstrate that multi-GPU clusters can substantially accelerate computational fluid dynamics (CFD) simulations

Crossref

Boise State University - ScholarWorks

A Full-Depth Amalgamated Parallel 3D Geometric Multigrid Solver for GPU Clusters

Author: Brandt A.
Brandvik T.
Corrigan A.
Cwire
Cwire
Elsen E.
Fan Z.
Goodnight N.
Griebel M.
Gropp W. D.
Göddeke D.
Hempel R.
Kindratenko V.
Matsuoka S.
McBryan O. A.
Micikevicius P.
Owens J.D.
Press W. H.
Schive H.
Showerman M.
Thibault J. C.
Tokyo Institute
Wan D.C.
Publication venue: 'IUScholarWorks'
Publication date: 04/01/2011
Field of study

Numerical computations of incompressible flow equations with pressure-based algorithms necessitate the solution of an elliptic Poisson equation, for which multigrid methods are known to be very efficient. In our previous work we presented a dual-level (MPI-CUDA) parallel implementation of the Navier-Stokes equations to simulate buoyancy-driven incompressible fluid flows on GPU clusters with simple iterative methods while focusing on the scalability of the overall solver. In the present study we describe the implementation and performance of a multigrid method to solve the pressure Poisson equation within our MPI-CUDA parallel incompressible flow solver. Various design decisions and algorithmic choices for multigrid methods are explored in light of NVIDIA’s recent Fermi architecture. We discuss how unique aspects of an MPI-CUDA implementation for GPU clusters is related to the software choices made to implement the multigrid method. We propose a new coarse grid solution method of embedded multigrid with amalgamation and show that the parallel implementation retains the numerical efficiency of the multigrid method. Performance measurements on the NCSA Lincoln and TACC Longhorn clusters are presented for up to 64 GPUs

Crossref

Boise State University - ScholarWorks

Concentration of synchrotron beams by means of monolithic polycapillary X-ray optics

Author: Compton
D.C. Aloisi
F.A. Hofmann
Huang
I.L. Klotzko
J.B. Ullrich
K.G. Huang
N. Gao
Ramanathan
S.M. Owens
Thiel
Ullrich
Ullrich
Ullrich
W.M. Gibson
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Scalability of Incompressible Flow Computations on Multi-GPU Clusters Using Dual-Level and Tri-Level Parallelism

Author: Balaji P.
Bova S. W.
Cappello F.
Cappello F.
Cappello F.
Cwire
Cwire
Cwire
Dong S.
Elsen E.
Goglin B.
Griebel M.
Gropp W.
Guermond J.L.L.
Göddeke D.
Hager G.
Hempel R.
Henty D. S.
Kindratenko V.
Luong P.
Lusk E.
Nakajima K.
Nakajima K.
Owens J.D.
Rabenseifner R.
Schive H.
Showerman M.
Simon H.
Thibault J. C.
Wan D.C.
Publication venue: 'IUScholarWorks'
Publication date: 04/01/2011
Field of study

High performance computing using graphics processing units (GPUs) is gaining popularity in the scientific computing field, with many large compute clusters being augmented with multiple GPUs in each node. We investigate hybrid tri-level (MPI-OpenMP-CUDA) parallel implementations to explore the efficiency and scalability of incompressible flow computations on GPU clusters up to 128 GPUS. This work details some of the unique issues faced when merging fine-grain parallelism on the GPU using CUDA with coarse-grain parallelism using OpenMP for intra-node and MPI for inter-node communication. Comparisons between the tri-level MPI-OpenMP-CUDA and dual-level MPI-CUDA implementations are shown using computationally large computational fluid dynamics (CFD) simulations. Our results demonstrate that a tri-level parallel implementation does not provide a significant advantage in performance over the dual-level implementation, however further research is needed to justify our conclusion for a cluster with a high GPU per node density or when using software that can utilize OpenMP’s fine-grain parallelism more effectively

Crossref

Boise State University - ScholarWorks

Measurement of the Isolated Photon Cross Section in p-pbar Collisions at sqrt{s}=1.96 TeV

Author: A. Alton
A. Askew
A. Baden
A. Bean
A. Bellavance
A. Boehnlein
A. Brandt
A. Bross
A. Chandra
A. Das
A. Duperrin
A. Dyshkant
A. Evdokimov
A. Garcia-Bellido
A. Gay
A. Goussiou
A. Haas
A. Harel
A. Jenkins
A. Jonckheere
A. Juste
A. Khanov
A. Kharchilava
A. Koubarovsky
A. Kryemadhi
A. Kumar
A. Kupco
A. Leflat
A. Lobodenko
A. Lounis
A. Magerkurth
A. Melnitchouk
A. Mendes
A. Meyer
A. Nomerotski
A. Patwa
A. Pompoš
A. Quadt
A. Santoro
A. Schwartzman
A. Sopczak
A. Stone
A. Sznajder
A. Sánchez-Hernández
A. Vartapetian
A. White
A. Yurkewicz
A. Zabi
A. Zatserklyaniy
A. Zieminski
A.-C. Le Bihan
A.-M. Magnan
A.A. Shchukin
A.C.S. Assis Jesus
A.K.A. Maciel
A.L. Lyon
A.M. Kalinin
A.P. Heinson
A.S. Ito
A.S. Turcot
A.V. Ferapontov
A.V. Kozelov
Abachi
Abazov
Abazov
Abazov
Abbott
Abe
Acosta
Acosta
Albajar
Alekhin
Alitti
Apanasevich
Aurenche
B. Abbott
B. Andrieu
B. Baldin
B. Choudhary
B. Clément
B. Cox
B. Davies
B. Gómez
B. Hoeneisen
B. Klima
B. Kothari
B. Quinn
B. Spurlock
B. Tiller
B. Tuchming
B. Vachon
B. Zhou
B. Åsman
B.C.K. Casey
B.G. Pope
B.S. Acharya
Barlow
Baur
Berger
Binoth
Binoth
Bourhis
C. Autermann
C. Avila
C. Barnes
C. Biscarat
C. Clément
C. De Oliveira Martins
C. Garcia
C. Han
C. Hensel
C. Jarvis
C. Johnson
C. Leonidopoulos
C. Magass
C. Noeding
C. Royon
C. Schmitt
C. Schwanenberger
C. Tully
C. Zeitnitz
C.E. Gerber
C.F. Galea
C.P. Buszello
Catani
D. Bauer
D. Bloch
D. Brown
D. Buchholz
D. Chakraborty
D. Chapin
D. Claes
D. Coppage
D. Cutts
D. Denisov
D. Edmunds
D. Gelé
D. Gillberg
D. Hedin
D. Karmanov
D. Kau
D. Khatidze
D. Käfer
D. Lincoln
D. Meder
D. Schaile
D. Shpakov
D. Strom
D. Tsybychev
D. Wicke
D. Zhang
D. Zieminska
D.A. Stoyanova
D.A. Wijngaarden
D.C. O'Neil
D.K. Cho
D.R. Wood
D.V. Bandurin
E. Barberis
E. Busato
E. Cheu
E. De La Cruz-Burelo
E. Gallas
E. Galyaev
E. Kajfasz
E. Nagy
E. Nurse
E. Perez
E. Shabalina
E. Von Toerne
E.G. Zverev
E.M. Gregores
E.W. Varnes
F. Badaud
F. Blekman
F. Borcherding
F. Charles
F. Déliot
F. Fiedler
F. Filthaut
F. Lehner
F. Rizatdinova
F. Villeneuve-Seguier
G. Alkhazov
G. Alverson
G. Bernardi
G. Blazey
G. Borissov
G. Brooijmans
G. Davies
G. Ginther
G. Grenier
G. Gutierrez
G. Hesketh
G. Landsberg
G. Obrant
G. Pawloski
G. Sajot
G. Savage
G. Watts
G.A. Alves
G.A. Davis
G.D. Alexeev
G.J. Otero y Garzón
G.R. Snow
G.S. Muanza
G.W. Wilson
Gluck
Gluck
Gordon
Gordon
H. Castilla-Valdez
H. da Motta
H. Dong
H. Evans
H. Fox
H. Greenlee
H. Kim
H. Miettinen
H. Schellman
H. Severini
H. Weerts
H.A. Neal
H.B. Malbouisson
H.B. Prosper
H.D. Wahl
H.D. Yoo
H.E. Fisk
H.J. Lubatti
H.S. Mao
H.T. Diehl
I. Bertram
I. Blackler
I. Fleck
I. Hall
I. Iashvili
I. Katsanos
I. Ripp-Baudot
I. Torchiani
I.A. Vasilyev
J. Barreto
J. Cammin
J. Dyer
J. Ellison
J. Elmsheuser
J. Estrada
J. Fast
J. Gardner
J. Haley
J. Hays
J. Huang
J. Kasper
J. Kotcher
J. Kozminski
J. Kvita
J. Lazoflores
J. Leveque
J. Li
J. Linnemann
J. Meyer
J. Mitrevski
J. Molina
J. Monk
J. Parsons
J. Qian
J. Snow
J. Stark
J. Steele
J. Strandberg
J. Temple
J. Warchol
J. Womersley
J. Yu
J. Zhu
J.-F. Grivaz
J.-L. Agram
J.-P. Konrath
J.-R. Vlimant
J.D. Degenhardt
J.D. Hobbs
J.F. Bartlett
J.G. Hegeman
J.G.R. Lima
J.M. Butler
J.M. Hauptman
J.M. Heinmiller
J.M. Kalk
J.M. Kohli
J.P. Negret
J.R. Kalk
K. Bos
K. De
K. Gounder
K. Hanagaki
K. Harder
K. Jakobs
K. Johns
K. Ranjan
K. Smolek
K. Soustruznik
K. Stevenson
K. Yip
K.J. Rani
K.M. Black
K.M. Chan
K.W. Merritt
L. Bagby
L. Berntzon
L. Christofek
L. Duflot
L. Feligioni
L. Han
L. Lobo
L. Lueking
L. Mendoza
L. Mundim
L. Sawyer
L. Sonnenschein
L. Stutte
L. Uvarov
L. Wang
L.S. Vertogradov
L.V. Dudko
Laenen
Lafferty
Lai
Lipatov
M. Abolins
M. Adams
M. Agelou
M. Ahsan
M. Anastasoaie
M. Arov
M. Begalli
M. Begel
M. Besançon
M. Binder
M. Buehler
M. Cooke
M. Corcoran
M. Das
M. Demarteau
M. Diesburg
M. Doidge
M. Eads
M. Fortner
M. Hohlfeld
M. Jaffré
M. Johnson
M. Kopal
M. Lokajicek
M. Lynker
M. Martens
M. Merkin
M. Michaut
M. Mulders
M. Naimuddin
M. Narain
M. Petteni
M. Rijssenbeek
M. Shamim
M. Sosebee
M. Souza
M. Strauss
M. Strovink
M. Talby
M. Titov
M. Tomoto
M. Vaupel
M. Verzocchi
M. Voutilainen
M. Vreeswijk
M. Wayne
M. Weber
M. Wetstein
M. Wobisch
M. Yan
M. Zielinski
M.-A. Pleier
M.-C. Cousinou
M.-E. Pol
M.A. Strang
M.D. Hildreth
M.P. Sanders
M.W. Grünewald
Martin
Martin
N. Gollub
N. Makovec
N. Oliveira
N. Oshima
N. Parashar
N. Parua
N. Varelas
N. Wermes
N. Xuan
N.A. Naumann
N.J. Buchanan
N.J. Hadley
N.K. Mondal
N.M. Cason
O. Atramentov
O. Boeriu
Owens
P. Banerjee
P. Bargassa
P. Baringer
P. de Jong
P. Demine
P. Ermolov
P. Gay
P. Gutierrez
P. Houben
P. Jonsson
P. Lebrun
P. Lewis
P. Love
P. Mättig
P. Neustroev
P. Padley
P. Pétroff
P. Rubinov
P. Schieferdecker
P. Skubic
P. Slattery
P. Tamburello
P. Telford
P. Verdier
P.A. Rapidis
P.C. Bhat
P.D. Grannis
P.J. van den Berg
P.K. Mal
P.L.M. Podesta-Lerma
P.M. Perea
P.M. Tuts
P.N. Ratoff
P.W. Balm
Peterson
Ph. Gris
Q. Xu
Q.Z. Li
R. Bernhard
R. Beuselinck
R. Brock
R. Demina
R. Gelhaus
R. Harrington
R. Hauser
R. Hirosky
R. Hooper
R. Illingworth
R. Jesik
R. Kaur
R. Kehoe
R. Lipton
R. McCarthy
R. McCroskey
R. Partridge
R. Piegaia
R. Ruchti
R. Schwienhorst
R. Ströhmer
R. Van Kooten
R. Yamada
R.A. Sidwell
R.D. Schamberger
R.E. Hall
R.F. Rodrigues
R.J. Madaras
R.K. Shivpuri
R.P. Smith
R.W. Moore
S. Anderson
S. Banerjee
S. Beauceron
S. Blessing
S. Burdin
S. Burke
S. Calvet
S. Caron
S. Chakrabarti
S. Choi
S. Crépé-Renaudin
S. Dean
S. Desai
S. Doulas
S. Eno
S. Fu
S. Fuess
S. Greder
S. Grünendahl
S. Hagopian
S. Jabeen
S. Jain
S. Kahn
S. Kermiche
S. Kesisoglou
S. Krzywdzinski
S. Kunori
S. Lager
S. Lammers
S. Malik
S. Nelson
S. Protopopescu
S. Reucroft
S. Robinson
S. Sengupta
S. Snyder
S. Sumowidagdo
S. Söldner-Rembold
S. Towers
S. Trincaz-Duvoid
S. Uvarov
S. Uzunyan
S. Yacoob
S.B. Beri
S.E.K. Mattingly
S.F. Novaes
S.H. Ahn
S.J. de Jong
S.J. Hong
S.J. Wimpenny
S.K. Park
S.L. Linn
S.N. Fatakia
S.P. Denisov
S.R. Dugad
S.W. Youn
Sjöstrand
Stump
T. Adams
T. Andeen
T. Bose
T. Christiansen
T. Edwards
T. Ferbel
T. Gadfort
T. Golling
T. Hebbeker
T. Kurča
T. Moulik
T. Nunnemann
T. Scanlon
T. Toole
T. Trefzger
T. Vu Anh
T. Yasuda
T. Zhao
T.A. Bolton
T.H. Burnett
T.J. Kim
T.R. Wyatt
U. Bassler
U. Blumenschein
U. Heintz
V. Bhatnagar
V. Buescher
V. Gavrilov
V. Hynek
V. Jain
V. Lesne
V. O'Dell
V. Oguri
V. Shary
V. Simak
V. Sirotenko
V. Stolin
V. White
V. Zutshi
V.A. Bezzubov
V.D. Elvira
V.I. Rud
V.L. Malyshev
V.M. Abazov
V.M. Korablev
V.M. Podstavkov
V.N. Evdokimov
V.V. Lipaev
W. Carvalho
W. Fisher
W. Taylor
W.D. Shephard
W.E. Cooper
W.L. Prado da Silva
W.M. Lee
W.M. van Leeuwen
X. Song
Y. Arnoud
Y. Coadou
Y. Gershtein
Y. Hu
Y. Maravin
Y. Pogorelov
Y. Scheglov
Y. Xie
Y. Yen
Y.A. Yatsunenko
Y.D. Mutaf
Y.M. Kharzheev
Z. Zhao
Z.D. Greenwood
Publication venue: 'Elsevier BV'
Publication date: 25/11/2005
Field of study

The cross section for the inclusive production of isolated photons has been measured in p anti-p collisions at sqrt{s}=1.96 TeV with the D0 detector at the Fermilab Tevatron Collider. The photons span transverse momenta 23 to 300 GeV and have pseudorapidity |eta|<0.9. The cross section is compared with the results from two next-to-leading order perturbative QCD calculations. The theoretical predictions agree with the measurement within uncertainties.Comment: 7 pages, 5 figures, submitted to Phys.Lett.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Hal - Université Grenoble Alpes

HAL AMU

HAL Clermont Université

The University of Manchester - Institutional Repository

Hal-Diderot

arXiv.org e-Print Archive

HAL-IN2P3

Elsevier - Publisher Connector

Crossref

Oxford University Research Archive

Radboud Repository