Search CORE

2,000 research outputs found

Dynamic analysis of structures on multicore computers:Achieving efficiency through object oriented design

Author: Mackie R.I.
Publication venue: 'Elsevier BV'
Publication date: 18/04/2013
Field of study

University of Dundee Online Publications

Computational methods and software systems for dynamics and control of large space structures

Author: Farhat C.
Felippa C. A.
Park K. C.
Pramono E.
Publication venue
Publication date
Field of study

Two key areas of crucial importance to the computer-based simulation of large space structures are discussed. The first area involves multibody dynamics (MBD) of flexible space structures, with applications directed to deployment, construction, and maneuvering. The second area deals with advanced software systems, with emphasis on parallel processing. The latest research thrust in the second area involves massively parallel computers

NASA Technical Reports Server

Block Locally Optimal Preconditioned Eigenvalue Xolvers (BLOPEX) in hypre and PETSc

Author: Argentati M.E.
Argentati M.E.
Knyazev A.V.
Knyazev A.V.
Lashuk I.
Lashuk I.
Ovtchinnikov E.
Ovtchinnikov E.
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 17/05/2007
Field of study

We describe our software package Block Locally Optimal Preconditioned Eigenvalue Xolvers (BLOPEX) publicly released recently. BLOPEX is available as a stand-alone serial library, as an external package to PETSc (``Portable, Extensible Toolkit for Scientific Computation'', a general purpose suite of tools for the scalable solution of partial differential equations and related problems developed by Argonne National Laboratory), and is also built into {\it hypre} (``High Performance Preconditioners'', scalable linear solvers package developed by Lawrence Livermore National Laboratory). The present BLOPEX release includes only one solver--the Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) method for symmetric eigenvalue problems. {\it hypre} provides users with advanced high-quality parallel preconditioners for linear systems, in particular, with domain decomposition and multigrid preconditioners. With BLOPEX, the same preconditioners can now be efficiently used for symmetric eigenvalue problems. PETSc facilitates the integration of independently developed application modules with strict attention to component interoperability, and makes BLOPEX extremely easy to compile and use with preconditioners that are available via PETSc. We present the LOBPCG algorithm in BLOPEX for {\it hypre} and PETSc. We demonstrate numerically the scalability of BLOPEX by testing it on a number of distributed and shared memory parallel systems, including a Beowulf system, SUN Fire 880, an AMD dual-core Opteron workstation, and IBM BlueGene/L supercomputer, using PETSc domain decomposition and {\it hypre} multigrid preconditioning. We test BLOPEX on a model problem, the standard 7-point finite-difference approximation of the 3-D Laplacian, with the problem size in the range

10^5-10^8

.Comment: Submitted to SIAM Journal on Scientific Computin

arXiv.org e-Print Archive

WestminsterResearch

A Survey on Hardware-aware and Heterogeneous Computing on Multicore Processors and Accelerators

Author: Buchty Rainer
Heuveline Vincent
Karl Wolfgang
Weiß Jan-Philipp
Publication venue: Karlsruher Institut für Technologie
Publication date: 01/01/2009
Field of study

KITopen

LFRic: meeting the challenges of scalability and performance portability in weather and climate models

Author: Adams S. V.
Ford R. W.
Hambley M.
Hobson J. M.
Kavcic I.
Maynard C. M.
Melvin T.
Mueller E. H
Mullerworth S.
Porter A. R.
Rezny M.
Shipway B. J.
Wong R.
Publication venue: 'Elsevier BV'
Publication date: 01/01/2019
Field of study

This paper describes LFRic: the new weather and climate modelling system being developed by the UK Met Office to replace the existing Unified Model in preparation for exascale computing in the 2020s. LFRic uses the GungHo dynamical core and runs on a semi-structured cubed-sphere mesh. The design of the supporting infrastructure follows object-oriented principles to facilitate modularity and the use of external libraries where possible. In particular, a `separation of concerns' between the science code and parallel code is imposed to promote performance portability. An application called PSyclone, developed at the STFC Hartree centre, can generate the parallel code enabling deployment of a single source science code onto different machine architectures. This paper provides an overview of the scientific requirement, the design of the software infrastructure, and examples of PSyclone usage. Preliminary performance results show strong scaling and an indication that hybrid MPI/OpenMP performs better than pure MPI

arXiv.org e-Print Archive

Central Archive at the University of Reading

OPUS

ePubs: the open archive for STFC research publications

Applications for Ultrascale Computing

Author: Bongo Lars Ailo
Ciegis Raimondas
Frasheri Neki
Gong Jing
Kimovski Dragi
Kropf Peter
Margenov Svetozar
Mihajlovic Milan
Neytcheva Maya
Rauber Thomas
Runger Gudula
Trobec Roman
Wuyts Roel
Wyrzykowski Roman
Publication venue: 'FSAEIHE South Ural State University (National Research University)'
Publication date: 01/01/2015
Field of study

The University of Manchester - Institutional Repository

Analyzing and Modeling the Performance of the HemeLB Lattice-Boltzmann Simulation Environment

Author: Bernabeu Miguel O.
Carver Hywel B.
Coveney Peter V.
Groen Derek
Hetherington James
Nash Rupert W.
Publication venue: 'Elsevier BV'
Publication date: 18/03/2013
Field of study

We investigate the performance of the HemeLB lattice-Boltzmann simulator for cerebrovascular blood flow, aimed at providing timely and clinically relevant assistance to neurosurgeons. HemeLB is optimised for sparse geometries, supports interactive use, and scales well to 32,768 cores for problems with ~81 million lattice sites. We obtain a maximum performance of 29.5 billion site updates per second, with only an 11% slowdown for highly sparse problems (5% fluid fraction). We present steering and visualisation performance measurements and provide a model which allows users to predict the performance, thereby determining how to run simulations with maximum accuracy within time constraints.Comment: Accepted by the Journal of Computational Science. 33 pages, 16 figures, 7 table

arXiv.org e-Print Archive

Elsevier - Publisher Connector

UCL Discovery