
553 research outputs found

Rectangular Full Packed Format for Cholesky's Algorithm: Factorization, Solution and Inversion

Author: Andersen B. S.
Fred G. Gustavson
Gustavson F. G.
Jack J. Dongarra
Jerzy Waśniewski
Julien Langou
Publication date: 01/01/2009

We describe a new data format for storing triangular, symmetric, and Hermitian matrices called RFPF (Rectangular Full Packed Format). The standard two-dimensional arrays of Fortran and C (also known as full format) that are used to represent triangular and symmetric matrices waste nearly half of the storage space but provide high performance via the use of Level 3 BLAS. Standard packed format arrays fully utilize storage (array space) but provide low performance, as there is no Level 3 packed BLAS. RFPF combines the good features of packed and full storage: because it is a standard full format representation, it obtains high performance through Level 3 BLAS, and it requires exactly the same minimal storage as packed format. Each LAPACK full and/or packed triangular, symmetric, and Hermitian routine becomes a single new RFPF routine based on eight possible data layouts of RFPF. This new RFPF routine usually consists of two calls to the corresponding LAPACK full format routine and two calls to Level 3 BLAS routines. This means no new software is required. As examples, we present LAPACK routines for Cholesky factorization, Cholesky solution and Cholesky inverse computation in RFPF to illustrate this new work and to describe its performance on several commonly used computer platforms. Performance of LAPACK full routines using RFPF versus LAPACK full routines using standard format, for both serial and SMP parallel processing, is about the same while using half the storage. Compared with vendor and/or reference packed routines, vendor LAPACK full routines using RFPF are roughly 1 to 43 times faster for serial processing and 1 to 97 times faster for SMP parallel processing.
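The storage claims in this abstract can be checked with a little arithmetic. The sketch below is illustrative only and is not taken from the paper: the function names `storage_elements` and `packed_lower_index` are hypothetical, and the rectangle shapes assume the non-transposed layout described in LAPACK's RFP routine documentation ((n+1) by n/2 for even n, n by (n+1)/2 for odd n).

```python
def storage_elements(n):
    """Count stored elements for an n-by-n triangular or symmetric matrix.

    Hypothetical helper for illustration; the shapes assume the
    non-transposed RFP layout.
    """
    full = n * n                 # full format: the whole square array
    packed = n * (n + 1) // 2    # packed format: one triangle only
    if n % 2 == 0:
        rfp_shape = (n + 1, n // 2)
    else:
        rfp_shape = (n, (n + 1) // 2)
    rfp = rfp_shape[0] * rfp_shape[1]
    return {"full": full, "packed": packed, "rfp_shape": rfp_shape, "rfp": rfp}


def packed_lower_index(i, j, n):
    """0-based offset of element (i, j), i >= j, in column-major lower packed storage."""
    if not (0 <= j <= i < n):
        raise ValueError("expected 0 <= j <= i < n")
    return i + j * (2 * n - j - 1) // 2


if __name__ == "__main__":
    for n in (4, 5, 1000):
        print(n, storage_elements(n))
```

For every n the rectangle holds exactly n(n+1)/2 elements, the same count as packed format, while being an ordinary two-dimensional array that full format routines can operate on.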

arXiv.org e-Print Archive

CiteSeerX

Crossref

The University of Manchester - Institutional Repository

Online Research Database In Technology

Performance Improvements of Common Sparse Numerical Linear Algebra Computations

Author: Luszczek Piotr Rafal
Publication venue: TRACE: Tennessee Research and Creative Exchange
Publication date: 01/01/2003

Manufacturers of computer hardware are able to continuously sustain an unprecedented pace of progress in the computing speed of their products, partially due to increased clock rates but also because of ever more complicated chip designs. With new processor families appearing every few years, it is increasingly hard to achieve high performance rates in sparse matrix computations. This research proposes new methods for sparse matrix factorizations and applies generalizations of known concepts from related disciplines in an iterative code. The proposed solutions and extensions are implemented in ways that tend to deliver efficiency while retaining the ease of use of existing solutions. The implementations are thoroughly timed and analyzed using a commonly accepted set of test matrices. The tests were conducted on modern processors that seem to have gained an appreciable level of popularity and are fairly representative of a wider range of processor types that are available on the market now or will be in the near future. The new factorization technique formally introduced in the early chapters is later proven to be quite competitive with state-of-the-art software currently available. Although not totally superior in all cases (as probably no single approach could possibly be), the new factorization algorithm exhibits a few promising features. In addition, an all-embracing optimization effort is applied to an iterative algorithm that stands out for its robustness. This also gives satisfactory results on the tested computing platforms in terms of performance improvement. The same set of test matrices is used to enable an easy comparison between both investigated techniques, even though they are customarily treated separately in the literature. Possible extensions of the presented work are discussed. They range from easily conceivable merging with existing solutions to rather more evolved schemes dependent on hard-to-predict progress in theoretical and algorithmic research.

University of Tennessee, Knoxville: Trace

CiteSeerX

A Fully Portable High Performance Minimal Storage Hybrid Format Cholesky Algorithm

Author: Andersen Bjarne S.
Gunnels John A.
Gustavson Fred G.
Reid John K.
Wasniewski Jerzy
Publication date: 01/01/2004

Online Research Database In Technology

Efficient Stand-Alone Generalized Inverse Algorithms and Software for Engineering/Sciences Applications: Research and Education

Author: Kadiam Subhash Chandra Bose S V
Publication venue: ODU Digital Commons
Publication date: 01/07/2012

Efficient numerical procedures for finding the generalized (or pseudo) inverse of a general (square/rectangular, symmetrical/unsymmetrical, non-singular/singular, real/complex) matrix and for solving systems of Simultaneous Linear Equations (SLE) are formulated and explained. The developed procedures and their associated computer software (under the MATLAB environment) are based on special Cholesky factorization schemes (for a singular matrix) and the generalized inverse of the matrix product, and were further enhanced by the Domain Decomposition (DD) formulation. Test matrices from different fields of application have been chosen and tested, and the results compared with other existing algorithms. The results of the numerical tests indicate that the developed procedures are far more efficient than existing algorithms. Furthermore, an educational version of the generalized inverse algorithms and software for solving SLE has also been developed to run any FORTRAN and/or 'C' programs over the web. This developed technology and software is freely available and can run on any device with internet connectivity and browser capability.

Old Dominion University

A parameter estimation subroutine package

Author: Bierman G. J.
Nead M. W.

Linear least squares estimation and regression analyses continue to play a major role in orbit determination and related areas. A library of FORTRAN subroutines was developed to facilitate analyses of a variety of estimation problems. An easy-to-use, multi-purpose set of algorithms that are reasonably efficient and use a minimal amount of computer storage is presented. Subroutine inputs, outputs, usage and listings are given, along with examples of how these routines can be used. The routines are compact and efficient and are far superior to the normal equation and Kalman filter data processing algorithms that are often used for least squares analyses.

NASA Technical Reports Server

Numerical Methods for Model Predictive Control

This thesis mainly deals with the extended linear quadratic control problem, which is a special case of the equality constrained quadratic program. A number of other problems in optimal control and estimation of linear systems can be reduced to this form. Furthermore, it arises as a sub-problem in sequential quadratic programming methods and interior-point methods for the solution of optimal control and estimation problems for non-linear systems and in the presence of inequality constraints. This thesis can be divided into two parts. In the first part, a number of methods for the solution of the extended linear quadratic control problem are presented and analyzed. These methods have been implemented in efficient C code and compared with each other. In the second part, the problem is expanded to take inequality constraints into account. Two interior-point methods are presented and analyzed. Both methods have been implemented in C code and compared with each other.

Padua Thesis and Dissertation Archive

A parameter estimation subroutine package

Author: Bierman G. J.
Nead W. M.

Linear least squares estimation and regression analyses continue to play a major role in orbit determination and related areas. FORTRAN subroutines have been developed to facilitate analyses of a variety of parameter estimation problems. Easy-to-use, multipurpose sets of algorithms are reported that are reasonably efficient and use a minimal amount of computer storage. Subroutine inputs, outputs, usage and listings are given, along with examples of how these routines can be used.

NASA Technical Reports Server