Search CORE

573 research outputs found

Computing the Rank Profile Matrix

Author: Bourbaki N.
Dongarra J. J.
Grigor'ev D. Y.
Malaschonok G. I.
Storjohann A.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 06/07/2015
Field of study

The row (resp. column) rank profile of a matrix describes the staircase shape of its row (resp. column) echelon form. In an ISSAC'13 paper, we proposed a recursive Gaussian elimination that can compute simultaneously the row and column rank profiles of a matrix as well as those of all of its leading sub-matrices, in the same time as state of the art Gaussian elimination algorithms. Here we first study the conditions making a Gaus-sian elimination algorithm reveal this information. Therefore, we propose the definition of a new matrix invariant, the rank profile matrix, summarizing all information on the row and column rank profiles of all the leading sub-matrices. We also explore the conditions for a Gaussian elimination algorithm to compute all or part of this invariant, through the corresponding PLUQ decomposition. As a consequence, we show that the classical iterative CUP decomposition algorithm can actually be adapted to compute the rank profile matrix. Used, in a Crout variant, as a base-case to our ISSAC'13 implementation, it delivers a significant improvement in efficiency. Second, the row (resp. column) echelon form of a matrix are usually computed via different dedicated triangular decompositions. We show here that, from some PLUQ decompositions, it is possible to recover the row and column echelon forms of a matrix and of any of its leading sub-matrices thanks to an elementary post-processing algorithm

arXiv.org e-Print Archive

HAL-ENS-LYON

Crossref

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

Hal-Diderot

A survey of parallel algorithms for fractal image compression

Author: Beaumont J. M.
Bodo Z. P.
Chang H. T.
Davis G. M.
Dongarra J.
Huffman D. A.
Mallat S. G.
Publication venue: 'Multi-Science Publishing Co. Ltd.'
Publication date: 01/06/2007
Field of study

This paper presents a short survey of the key research work that has been undertaken in the application of parallel algorithms for Fractal image compression. The interest in fractal image compression techniques stems from their ability to achieve high compression ratios whilst maintaining a very high quality in the reconstructed image. The main drawback of this compression method is the very high computational cost that is associated with the encoding phase. Consequently, there has been significant interest in exploiting parallel computing architectures in order to speed up this phase, whilst still maintaining the advantageous features of the approach. This paper presents a brief introduction to fractal image compression, including the iterated function system theory upon which it is based, and then reviews the different techniques that have been, and can be, applied in order to parallelize the compression algorithm

CiteSeerX

Crossref

White Rose Research Online

Benchmark based on application signature to analyze and predict their behavior

Author: A Hoisie
A Wong
D Terpstra
DH Bailey
JJ Dongarra
PN Brown
V Marjanović
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/06/2019
Field of study

Currently, there are benchmark sets that measure the performance of HPC systems under specific computing and communication properties. These benchmarks represent the kernels of applications that measure specific hardware components. If the user’s application is not represented by any benchmark, it is not possible to obtain an equivalent performance metric. In this work, we propose a benchmark based on the signature of an MPI application obtained by the PAS2P method. PAS2P creates the application signature in order to predict the execution time, which we believe will be very adjusted in relation to the execution time of the full application. The signature has two performance qualities: the bounded time to execute it (a benchmark property) and the quality of prediction. Therefore, we propose to extend the signature by giving the benchmark capacities such as the efficiency of the application over the HPC system. The performance metrics will be performed by the benchmark proposed. The experimentation validates our proposal with an average error of prediction close to 7%.Instituto de Investigación en Informátic

Crossref

Servicio de Difusión de la Creación Intelectual

Case studies on the development of ScaLAPACK and the NAG Numerical PVM Library

Author: A Geist
CH Koelbel
CL Lawson
DS Dodson
DS Dodson
E Anderson
E Anderson
J Choi
J Choi
J Choi
J Choi
J Choi
J Choi
J Demmel
JJ Dongarra
JJ Dongarra
JJ Dongarra
JJ Dongarra
JJ Dongarra
JJ Dongarra
JJ Dongarra
JJ Dongarra
JJ Dongarra
LS Blackford
M Snir
RC Whaley
S Hammarling
SI Feldman
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1997
Field of study

Crossref

Geometric Random Inner Products: A New Family of Tests for Random Number Generators

Author: A. M. Ferrenberg
A. N. Kolmogorov
A. N. Kolmogorov
B. L. Holian
E. Fischbach
Ephraim Fischbach
G. Fishman
I. Vattulainen
I. Vattulainen
I. Vattulainen
I. Vattulainen
J. Dongarra
J. Gleeson
K. V. Tretiakov
R. Ziff
R. Ziff
S. J. Tu
S. M. Pincus
Shu-Ju Tu
Publication venue: 'American Physical Society (APS)'
Publication date: 06/09/2002
Field of study

We present a new computational scheme, GRIP (Geometric Random Inner Products), for testing the quality of random number generators. The GRIP formalism utilizes geometric probability techniques to calculate the average scalar products of random vectors generated in geometric objects, such as circles and spheres. We show that these average scalar products define a family of geometric constants which can be used to evaluate the quality of random number generators. We explicitly apply the GRIP tests to several random number generators frequently used in Monte Carlo simulations, and demonstrate a new statistical property for good random number generators

arXiv.org e-Print Archive

Crossref

Purdue E-Pubs

CERN Document Server

Learning from the Success of MPI

Author: A. Geist
A. Skjellum
C.H. Koelbel
J. Boyle
J. Cownie
J. Dongarra
J.L. Traeff
K. Krechmer
Message Passing Interface Forum
Message Passing Interface Forum MPI2
N. Carriero
O. Zaki
P.B. Hansen
R. Hempel
R.C. Whaley
R.W. Numrich
W. Gropp
W. Gropp
W.W. Carlson
Publication venue
Publication date: 01/01/2001
Field of study

The Message Passing Interface (MPI) has been extremely successful as a portable way to program high-performance parallel computers. This success has occurred in spite of the view of many that message passing is difficult and that other approaches, including automatic parallelization and directive-based parallelism, are easier to use. This paper argues that MPI has succeeded because it addresses all of the important issues in providing a parallel programming model.Comment: 12 pages, 1 figur

arXiv.org e-Print Archive

CiteSeerX

Crossref

UNT Digital Library

Numerical electrokinetics

Author: B Dünweg
Dongarra J
Dünweg B
Garbow N
Hückel E
Landau L D
Lobaskin V
Lobaskin V
Medebach M
Meister A
Overbeek J T G
Palberg T
Press W H
R Schmitz
Schmitz R Dünweg B
von Smoluchowski M
Publication venue: 'IOP Publishing'
Publication date: 01/01/2012
Field of study

A new lattice method is presented in order to efficiently solve the electrokinetic equations, which describe the structure and dynamics of the charge cloud and the flow field surrounding a single charged colloidal sphere, or a fixed array of such objects. We focus on calculating the electrophoretic mobility in the limit of small driving field, and systematically linearise the equations with respect to the latter. This gives rise to several subproblems, each of which is solved by a specialised numerical algorithm. For the total problem we combine these solvers in an iterative procedure. Applying this method, we study the effect of the screening mechanism (salt screening vs. counterion screening) on the electrophoretic mobility, and find a weak non-trivial dependence, as expected from scaling theory. Furthermore, we find that the orientation of the charge cloud (i. e. its dipole moment) depends on the value of the colloid charge, as a result of a competition between electrostatic and hydrodynamic effects.Comment: accepted for publication in Journal of Physics Condensed Matter (proceedings of the 2012 CODEF conference

arXiv.org e-Print Archive

Crossref

MPG.PuRe

Parallel Programming Models for Dense Linear Algebra on Heterogeneous Systems

Author: Abalenkovs Maksims
Abdelfattah Ahmad
Dongarra Jack
Gates M.
Haidar A
Kurzak Jakub
Luszczek Piotr
Tomov Stanimire
Yamazaki I.
YarKhan A.
Publication venue: 'FSAEIHE South Ural State University (National Research University)'
Publication date: 01/01/2015
Field of study

The University of Manchester - Institutional Repository