Search CORE

6 research outputs found

Recommended from our members

LDRD report : parallel repartitioning for optimal solver performance.

Author: Boman Erik Gunnar
Devine Karen Dragon
Heaphy Robert
Hendrickson Bruce Alan
Heroux Michael Allen
Preis Robert (University of Paderborn, Paderborn, Germany)
Publication venue: Sandia National Laboratories
Publication date: 01/02/2004
Field of study

We have developed infrastructure, utilities and partitioning methods to improve data partitioning in linear solvers and preconditioners. Our efforts included incorporation of data repartitioning capabilities from the Zoltan toolkit into the Trilinos solver framework, (allowing dynamic repartitioning of Trilinos matrices); implementation of efficient distributed data directories and unstructured communication utilities in Zoltan and Trilinos; development of a new multi-constraint geometric partitioning algorithm (which can generate one decomposition that is good with respect to multiple criteria); and research into hypergraph partitioning algorithms (which provide up to 56% reduction of communication volume compared to graph partitioning for a number of emerging applications). This report includes descriptions of the infrastructure and algorithms developed, along with results demonstrating the effectiveness of our approaches

UNT Digital Library

How Are We Doing? A Self-Assessment of the Quality of Services andSystems at NERSC, 2005-2006

Author
Publication venue: 'Office of Scientific and Technical Information (OSTI)'
Publication date
Field of study

Crossref

How are we doing? A self-assessment of the quality of services and systems at NERSC (2001)

Author
Publication venue: 'Office of Scientific and Technical Information (OSTI)'
Publication date
Field of study

Crossref

LDRD report : parallel repartitioning for optimal solver performance.

Author
Publication venue: 'Office of Scientific and Technical Information (OSTI)'
Publication date
Field of study

Crossref

Communication Support for Adaptive Computation\Lambda

Author
Publication venue
Publication date
Field of study

Ali Pinary and Bruce Hendricksonz 1 Introduction In this work we address two problems associated with redistributing data amongst processors. The first problem is that of determining the inter-processor communication pattern necessary to perform a calculation like matrix-vector multiplication. Consider the situation when a calculation is first described or when it is repartitioned after dynamic load balancing. Processors do not know what communication operations to perform to enable the matrix-vector multiplication to proceed. Assuming the matrix is partitioned by rows, looking at its own domain allows each processor can determine what it wants to receive, but it does not know which processor owns these desired data. We propose a distributed directory algorithm to efficiently determine the communication pattern (i.e., what a processor needs to receive from and send to every other processor). Our experiments show that the proposed algorithm performs efficiently on large numbers of processors

CiteSeerX