Search CORE

11 research outputs found

Distributed memory compiler design for sparse problems

Author: Berryman Harry
Hiranandani Seema
Saltz Joel
Wu Janet
Publication venue
Publication date
Field of study

A compiler and runtime support mechanism is described and demonstrated. The methods presented are capable of solving a wide range of sparse and unstructured problems in scientific computing. The compiler takes as input a FORTRAN 77 program enhanced with specifications for distributing data, and the compiler outputs a message passing program that runs on a distributed memory computer. The runtime support for this compiler is a library of primitives designed to efficiently support irregular patterns of distributed array accesses and irregular distributed array partitions. A variety of Intel iPSC/860 performance results obtained through the use of this compiler are presented

NASA Technical Reports Server

Distributed memory compiler methods for irregular problems: Data copy reuse and runtime partitioning

Author: Das Raja
Mavriplis Dimitri
Ponnusamy Ravi
Saltz Joel
Publication venue
Publication date: 01/10/1991
Field of study

Outlined here are two methods which we believe will play an important role in any distributed memory compiler able to handle sparse and unstructured problems. We describe how to link runtime partitioners to distributed memory compilers. In our scheme, programmers can implicitly specify how data and loop iterations are to be distributed between processors. This insulates users from having to deal explicitly with potentially complex algorithms that carry out work and data partitioning. We also describe a viable mechanism for tracking and reusing copies of off-processor data. In many programs, several loops access the same off-processor memory locations. As long as it can be verified that the values assigned to off-processor memory locations remain unmodified, we show that we can effectively reuse stored off-processor data. We present experimental data from a 3-D unstructured Euler solver run on iPSC/860 to demonstrate the usefulness of our methods

NASA Technical Reports Server

Syracuse University Research Facility and Collaborative Environment

Distributed Memory Compiler Methods for Irregular Problems -- Data Copy Reuse and Runtime Partitioning

Author: Das Raja
Mavriplis Dimitri
Ponnusamy Ravi
Saltz Joel
Publication venue: SURFACE at Syracuse University
Publication date: 01/10/1991
Field of study

This paper outlines two methods which we believe will play an important role in any distributed memory compiler able to handle sparse and unstructured problems. We describe how to link runtime partitioners to distributed memory compilers. In our scheme, programmers can implicitly specify how data and loop iterations are to be distributed between processors. This insulates users from having to deal explicitly with potentially complex algorithms that carry out work and data partitioning. We also describe a viable mechanism for tracking and reusing copies of off-processor data. In many programs, several loops access the same off-processor memory locations. As long as it can be verified that the values assigned to off-processor memory locations remain unmodified, we show that we can effectively reuse stored off-processor data. We present experimental data from a 3-D unstructured Euler solver run on an iPSC/860 to demonstrate the usefulness of our methods

Syracuse University Research Facility and Collaborative Environment

PARTI primitives for unstructured and block structured problems

Author: A. Sussman
André
Ashcraft
Baden
Baxter
Berger
Berryman
Brezany
Chase
Chen
Cheung
Choudhary
Chrisochoides
D. Mavriplis
Das
Foster
Fox
Fox
Gerndt
Hatcher
Havlak
Hiranandani
Hiranandani
J. Saltz
K. Crowley
Kennedy
Koelbel
Lemke
Li
Li
Li
Lonsdale
Lu
Lui
Mavriplis
Mirchandaney
Pothen
R. Das
R. Ponnusamy
Rogers
Rosing
S. Gupta
Saltz
Saltz
Saltz
Simon
Tseng
Vatsa
Williams
Williams
Zima
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Run-time and Compile-time Support for Adaptive Irregular Problems

Author: Das Raja
Hwang Yuan-Shin
Moon Bongki
Ponnusamy Ravi
Saltz Joel
Sharma Shamik D.
Publication venue
Publication date: 15/10/1998
Field of study

In adaptive irregular problems the data arrays are accessed via indirection arrays, and data access patterns change during computation. Implementing such problems on distributed memory machines requires support for dynamic data partitioning, efficient preprocessing and fast data migration. This research presents efficient runtime primitives for such problems. This new set of primitives is part of the CHAOS library. It subsumes the previous PARTI library which targeted only static irregular problems. To demonstrate the efficacy of the runtime support, two real adaptive irregular applications have been parallelized using CHAOS primitives: a molecular dynamics code (CHARMM) and a particle-in-cell code (DSMC). The paper also proposes extensions to Fortran D which can allow compilers to generate more efficient code for adaptive problems. These language extensions have been implemented in the Syracuse Fortran 90D/HPF prototype compiler. The performance of the compiler parallelized codes is compared with the hand parallelized versions. (Also cross-referenced as UMIACS-TR-94-55

Digital Repository at the University of Maryland

Run-time and compile-time support for adaptive irregular problems

Author: Hwang Yuan-Shin
Moon Bongki
Ponnusamy Ravi
Sharma Shamik D.
Publication venue: SURFACE at Syracuse University
Publication date: 01/01/1994
Field of study

Syracuse University Research Facility and Collaborative Environment

Runtime and language support for compiling adaptive irregular programs on distributed-memory machines

Author: Baden
Berger
Bird
Bodin
Bokhari
Bozkus
Brooks
Brooks
Chakrabarti
Chapman
Das
Das
Fox
Hiranandani
Koelbel
Lam
Leland
Lu
Mansour
Mavriplis
Mirchandaney
Nicol
Nour-Omid
Pathon
Ponnusamy
Ponnusamy
Rault
Rosing
Saltz
Saltz
V. Hanxleden
V. Hanxleden
Van Gunsteren
Venkatakrishnan
Venkatkrishnan
Vidwans
Weiner
Williams
Williams
Wilmoth
Wu
Publication venue: 'Wiley'
Publication date
Field of study

Crossref

A design methodology for portable software on parallel computers

Author: Chrisman Dan A.
Miller Keith W.
Nicol David M.
Publication venue
Publication date
Field of study

This final report for research that was supported by grant number NAG-1-995 documents our progress in addressing two difficulties in parallel programming. The first difficulty is developing software that will execute quickly on a parallel computer. The second difficulty is transporting software between dissimilar parallel computers. In general, we expect that more hardware-specific information will be included in software designs for parallel computers than in designs for sequential computers. This inclusion is an instance of portability being sacrificed for high performance. New parallel computers are being introduced frequently. Trying to keep one's software on the current high performance hardware, a software developer almost continually faces yet another expensive software transportation. The problem of the proposed research is to create a design methodology that helps designers to more precisely control both portability and hardware-specific programming details. The proposed research emphasizes programming for scientific applications. We completed our study of the parallelizability of a subsystem of the NASA Earth Radiation Budget Experiment (ERBE) data processing system. This work is summarized in section two. A more detailed description is provided in Appendix A ('Programming Practices to Support Eventual Parallelism'). Mr. Chrisman, a graduate student, wrote and successfully defended a Ph.D. dissertation proposal which describes our research associated with the issues of software portability and high performance. The list of research tasks are specified in the proposal. The proposal 'A Design Methodology for Portable Software on Parallel Computers' is summarized in section three and is provided in its entirety in Appendix B. We are currently studying a proposed subsystem of the NASA Clouds and the Earth's Radiant Energy System (CERES) data processing system. This software is the proof-of-concept for the Ph.D. dissertation. We have implemented and measured the performance of a portion of this subsystem on the Intel iPSC/2 parallel computer. These results are provided in section four. Our future work is summarized in section five, our acknowledgements are stated in section six, and references for published papers associated with NAG-1-995 are provided in section seven

NASA Technical Reports Server

Limits to parallelism in scientific computing

Author: Chrisman Dan Alvin, Jr.
Publication venue: W&M ScholarWorks
Publication date: 01/01/1999
Field of study

The goal of our research is to decrease the execution time of scientific computing applications. We exploit the application\u27s inherent parallelism to achieve this goal. This exploitation is expensive as we analyze sequential applications and port them to parallel computers. Many scientifically computational problems appear to have considerable exploitable parallelism; however, upon implementing a parallel solution on a parallel computer, limits to the parallelism are encountered. Unfortunately, many of these limits are characteristic of a specific parallel computer. This thesis explores these limits.;We study the feasibility of exploiting the inherent parallelism of four NASA scientific computing applications. We use simple models to predict each application\u27s degree of parallelism at several levels of granularity. From this analysis, we conclude that it is infeasible to exploit the inherent parallelism of two of the four applications. The interprocessor communication of one application is too expensive relative to its computation cost. The input and output costs of the other application are too expensive relative to its computation cost. We exploit the parallelism of the remaining two applications and measure their performance on an Intel iPSC/2 parallel computer. We parallelize an Optimal Control Boundary Value Problem. This guidance control problem determines an optimal trajectory of a boat in a river. We parallelize the Carbon Dioxide Slicing technique which is a macrophysical cloud property retrieval algorithm. This technique computes the height at the top of a cloud using cloud imager measurements. We consider the feasibility of exploiting its massive parallelism on a MasPar MP-2 parallel computer. We conclude that many limits to parallelism are surmountable while other limits are inescapable.;From these limits, we elucidate some fundamental issues that must be considered when porting similar problems to yet-to-be designed computers. We conclude that the technological improvements to reduce the isolation of computational units frees a programmer from many of the programmer\u27s current concerns about the granularity of the work. We also conclude that the technological improvements to relax the regimented guidance of the computational units allows a programmer to exploit the inherent heterogeneous parallelism of many applications

College of William & Mary: W&M Publish

Annual Review of Progress in Applied Computational Electromagnetics

Author
Publication venue: Monterey, California. Naval Postgraduate School
Publication date: 16/03/1998
Field of study

Approved for public release; distribution is unlimited

Calhoun, Institutional Archive of the Naval Postgraduate School