Search CORE

409 research outputs found

Block Locally Optimal Preconditioned Eigenvalue Xolvers (BLOPEX) in hypre and PETSc

Author: Argentati M.E.
Argentati M.E.
Knyazev A.V.
Knyazev A.V.
Lashuk I.
Lashuk I.
Ovtchinnikov E.
Ovtchinnikov E.
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 17/05/2007
Field of study

We describe our software package Block Locally Optimal Preconditioned Eigenvalue Xolvers (BLOPEX) publicly released recently. BLOPEX is available as a stand-alone serial library, as an external package to PETSc (``Portable, Extensible Toolkit for Scientific Computation'', a general purpose suite of tools for the scalable solution of partial differential equations and related problems developed by Argonne National Laboratory), and is also built into {\it hypre} (``High Performance Preconditioners'', scalable linear solvers package developed by Lawrence Livermore National Laboratory). The present BLOPEX release includes only one solver--the Locally Optimal Block Preconditioned Conjugate Gradient (LOBPCG) method for symmetric eigenvalue problems. {\it hypre} provides users with advanced high-quality parallel preconditioners for linear systems, in particular, with domain decomposition and multigrid preconditioners. With BLOPEX, the same preconditioners can now be efficiently used for symmetric eigenvalue problems. PETSc facilitates the integration of independently developed application modules with strict attention to component interoperability, and makes BLOPEX extremely easy to compile and use with preconditioners that are available via PETSc. We present the LOBPCG algorithm in BLOPEX for {\it hypre} and PETSc. We demonstrate numerically the scalability of BLOPEX by testing it on a number of distributed and shared memory parallel systems, including a Beowulf system, SUN Fire 880, an AMD dual-core Opteron workstation, and IBM BlueGene/L supercomputer, using PETSc domain decomposition and {\it hypre} multigrid preconditioning. We test BLOPEX on a model problem, the standard 7-point finite-difference approximation of the 3-D Laplacian, with the problem size in the range

10^5-10^8

.Comment: Submitted to SIAM Journal on Scientific Computin

arXiv.org e-Print Archive

WestminsterResearch

Spectral methods for CFD

Author: Hussaini M. Yousuff
Streett Craig L.
Zang Thomas A.
Publication venue
Publication date
Field of study

One of the objectives of these notes is to provide a basic introduction to spectral methods with a particular emphasis on applications to computational fluid dynamics. Another objective is to summarize some of the most important developments in spectral methods in the last two years. The fundamentals of spectral methods for simple problems will be covered in depth, and the essential elements of several fluid dynamical applications will be sketched

NASA Technical Reports Server

Computational methods and software systems for dynamics and control of large space structures

Author: Farhat C.
Felippa C. A.
Park K. C.
Pramono E.
Publication venue
Publication date
Field of study

Two key areas of crucial importance to the computer-based simulation of large space structures are discussed. The first area involves multibody dynamics (MBD) of flexible space structures, with applications directed to deployment, construction, and maneuvering. The second area deals with advanced software systems, with emphasis on parallel processing. The latest research thrust in the second area involves massively parallel computers

NASA Technical Reports Server

Parallel unstructured solvers for linear partial differential equations

Author: Becker Dulcenéia
Publication venue: Cranfield University
Publication date: 01/05/2006
Field of study

This thesis presents the development of a parallel algorithm to solve symmetric systems of linear equations and the computational implementation of a parallel partial differential equations solver for unstructured meshes. The proposed method, called distributive conjugate gradient - DCG, is based on a single-level domain decomposition method and the conjugate gradient method to obtain a highly scalable parallel algorithm. An overview on methods for the discretization of domains and partial differential equations is given. The partition and refinement of meshes is discussed and the formulation of the weighted residual method for two- and three-dimensions presented. Some of the methods to solve systems of linear equations are introduced, highlighting the conjugate gradient method and domain decomposition methods. A parallel unstructured PDE solver is proposed and its actual implementation presented. Emphasis is given to the data partition adopted and the scheme used for communication among adjacent subdomains is explained. A series of experiments in processor scalability is also reported. The derivation and parallelization of DCG are presented and the method validated throughout numerical experiments. The method capabilities and limitations were investigated by the solution of the Poisson equation with various source terms. The experimental results obtained using the parallel solver developed as part of this work show that the algorithm presented is accurate and highly scalable, achieving roughly linear parallel speed-up in many of the cases tested

Cranfield CERES

Three real-space discretization techniques in electronic structure calculations

Author: Aichinger
Aichinger
Aichinger
Ainsworth
Alatalo
Arias
Atkins
Auer
Auer
Auer
Barbiellini
Barbiellini
Barnett
Baye
Beck
Becke
Becke
Beylkin
Beylkin
Beylkin
Blöchl
Blöchl
Boroński
Bowler
Braess
Brandbyge
Brandt
Brandt
Briggs
Briggs
Briggs
Burdick
Bylaska
Calloni
Calzolari
Car
Casida
Castro
Castro
Chakraborty
Chelikowsky
Cho
Chui
Chulkov
Ciarlet
Collatz
Cools
Costiner
Cowan
Dahlke
Dahmen
Datta
Datta
Daubechies
Davidson
Davidson
Davis
Daykov
Dongarra
Donoho
Duff
Elman
Fattebert
Fiete
Fischer
Froese Fischer
Gisbergen
Goedecker
Goedecker
Golub
Gruenebaum
Gupta
Gygi
Gygi
Gygi
Hackbush
Harju
Havu
Havu
Havu
Havu
He
Heiskanen
Heiskanen
Hernández
Hirose
Hochbruck
Hochbruck
Hochbruck
Hohenberg
Hughes
Höynälänmaa
Jamorski
Jiang
Karypis
Khomyakov
Kleinman
Knyazev
Knyazev
Kobus
Kohn
Kouwenhoven
Kresse
Kresse
Kresse
Krieger
Kronik
Langreth
Levine
Lindemann
Lippert
Liu
Makkonen
Makkonen
Mandel
Manninen
Modine
Mortensen
Nardelli
Nogueira
Ogando
Ono
Oosterkamp
Pask
Patel
Payne
Polizzi
Pulay
Pulay
Reid
Reimann
Rummukainen
Runge
Räsänen
Räsänen
Saad
Saarikoski
Saarikoski
Saarikoski
Saarikoski
Sarria
Schmid
Schmidt
Schwarz
Sivan
Skylaris
Soler
Springer
Stathopoulos
Stevenson
Stratman
Szabo
Taylor
te Velde
te Velde
Thygesen
Torsti
Torsti
Torsti
Torsti
Tsuchida
Tsuchida
Tsuchida
Tymczak
van der Vorst
Vasilyev
Waghmare
Wang
Wang
Wei
Wesseling
Wijesekera
Wood
Wortmann
Yamaguchi
Publication venue: 'Wiley'
Publication date: 24/01/2006
Field of study

A characteristic feature of the state-of-the-art of real-space methods in electronic structure calculations is the diversity of the techniques used in the discretization of the relevant partial differential equations. In this context, the main approaches include finite-difference methods, various types of finite-elements and wavelets. This paper reports on the results of several code development projects that approach problems related to the electronic structure using these three different discretization methods. We review the ideas behind these methods, give examples of their applications, and discuss their similarities and differences.Comment: 39 pages, 10 figures, accepted to a special issue of "physica status solidi (b) - basic solid state physics" devoted to the CECAM workshop "State of the art developments and perspectives of real-space electronic structure techniques in condensed matter and molecular physics". v2: Minor stylistic and typographical changes, partly inspired by referee comment

arXiv.org e-Print Archive

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Using GPU to Accelerate Linear Computations in Power System Applications

Author: Li Xue
Publication venue: TRACE: Tennessee Research and Creative Exchange
Publication date: 01/12/2015
Field of study

With the development of advanced power system controls, the industrial and research community is becoming more interested in simulating larger interconnected power grids. It is always critical to incorporate advanced computing technologies to accelerate these power system computations. Power flow, one of the most fundamental computations in power system analysis, converts the solution of non-linear systems to that of a set of linear systems via the Newton method or one of its variants. An efficient solution to these linear equations is the key to improving the performance of power flow computation, and hence to accelerating other power system applications based on power flow computation, such as optimal power flow, contingency analysis, etc. This dissertation focuses on the exploration of iterative linear solvers and applicable preconditioners, with graphic processing unit (GPU) implementations to achieve performance improvement on the linear computations in power flow computations. An iterative conjugate gradient solver with Chebyshev preconditioner is studied first, and then the preconditioner is extended to a two-step preconditioner. At last, the conjugate gradient solver and the two-step preconditioner are integrated with MATPOWER to solve the practical fast decoupled load flow (FDPF), and an inexact linear solution method is proposed to further save the runtime of FDPF. Performance improvement is reported by applying these methods and GPU-implementation. The final complete GPU-based FDPF with inexact linear solving can achieve nearly 3x performance improvement over the MATPOWER implementation for a test system with 11,624 buses. A supporting study including a quick estimation of the largest eigenvalue of the linear system which is required by the Chebyshev preconditioner is presented as well. This dissertation demonstrates the potential of using GPU with scalable methods in power flow computation

University of Tennessee, Knoxville: Trace

Hypercube matrix computation task

Author: Calalo Ruel H.
Imbriale William A.
Jacobi Nathan
Liewer Paulett C.
Lockhart Thomas G.
Lyons James R.
Lyzenga Gregory A.
Manshadi Farzin
Patterson Jean E.
Publication venue
Publication date
Field of study

A major objective of the Hypercube Matrix Computation effort at the Jet Propulsion Laboratory (JPL) is to investigate the applicability of a parallel computing architecture to the solution of large-scale electromagnetic scattering problems. Three scattering analysis codes are being implemented and assessed on a JPL/California Institute of Technology (Caltech) Mark 3 Hypercube. The codes, which utilize different underlying algorithms, give a means of evaluating the general applicability of this parallel architecture. The three analysis codes being implemented are a frequency domain method of moments code, a time domain finite difference code, and a frequency domain finite elements code. These analysis capabilities are being integrated into an electromagnetics interactive analysis workstation which can serve as a design tool for the construction of antennas and other radiating or scattering structures. The first two years of work on the Hypercube Matrix Computation effort is summarized. It includes both new developments and results as well as work previously reported in the Hypercube Matrix Computation Task: Final Report for 1986 to 1987 (JPL Publication 87-18)

NASA Technical Reports Server