Search CORE

100 research outputs found

Compiling global name-space programs for distributed execution

Author: Koelbel Charles
Mehrotra Piyush
Publication venue
Publication date
Field of study

Distributed memory machines do not provide hardware support for a global address space. Thus programmers are forced to partition the data across the memories of the architecture and use explicit message passing to communicate data between processors. The compiler support required to allow programmers to express their algorithms using a global name-space is examined. A general method is presented for analysis of a high level source program and along with its translation to a set of independently executing tasks communicating via messages. If the compiler has enough information, this translation can be carried out at compile-time. Otherwise run-time code is generated to implement the required data movement. The analysis required in both situations is described and the performance of the generated code on the Intel iPSC/2 is presented

NASA Technical Reports Server

Parallel language constructs for tensor product computations on loosely coupled architectures

Author: Mehrotra Piyush
Vanrosendale John
Publication venue
Publication date
Field of study

Distributed memory architectures offer high levels of performance and flexibility, but have proven awkard to program. Current languages for nonshared memory architectures provide a relatively low level programming environment, and are poorly suited to modular programming, and to the construction of libraries. A set of language primitives designed to allow the specification of parallel numerical algorithms at a higher level is described. Tensor product array computations are focused on along with a simple but important class of numerical algorithms. The problem of programming 1-D kernal routines is focused on first, such as parallel tridiagonal solvers, and then how such parallel kernels can be combined to form parallel tensor product algorithms is examined

NASA Technical Reports Server

Programming distributed memory architectures using Kali

Author: Mehrotra Piyush
Vanrosendale John
Publication venue
Publication date
Field of study

Programming nonshared memory systems is more difficult than programming shared memory systems, in part because of the relatively low level of current programming environments for such machines. A new programming environment is presented, Kali, which provides a global name space and allows direct access to remote data values. In order to retain efficiency, Kali provides a system on annotations, allowing the user to control those aspects of the program critical to performance, such as data distribution and load balancing. The primitives and constructs provided by the language is described, and some of the issues raised in translating a Kali program for execution on distributed memory systems are also discussed

NASA Technical Reports Server

High Performance FORTRAN

Author: Mehrotra Piyush
Publication venue
Publication date
Field of study

High performance FORTRAN is a set of extensions for FORTRAN 90 designed to allow specification of data parallel algorithms. The programmer annotates the program with distribution directives to specify the desired layout of data. The underlying programming model provides a global name space and a single thread of control. Explicitly parallel constructs allow the expression of fairly controlled forms of parallelism in particular data parallelism. Thus the code is specified in a high level portable manner with no explicit tasking or communication statements. The goal is to allow architecture specific compilers to generate efficient code for a wide variety of architectures including SIMD, MIMD shared and distributed memory machines

NASA Technical Reports Server

HPC Matters! How Supercomputing Supports NASA's Mission

Author: Mehrotra Piyush
Publication venue
Publication date
Field of study

No abstract availabl

NASA Technical Reports Server

Semi-automatic process partitioning for parallel computation

Author: Koelbel Charles
Mehrotra Piyush
Vanrosendale John
Publication venue
Publication date
Field of study

On current multiprocessor architectures one must carefully distribute data in memory in order to achieve high performance. Process partitioning is the operation of rewriting an algorithm as a collection of tasks, each operating primarily on its own portion of the data, to carry out the computation in parallel. A semi-automatic approach to process partitioning is considered in which the compiler, guided by advice from the user, automatically transforms programs into such an interacting task system. This approach is illustrated with a picture processing example written in BLAZE, which is transformed into a task system maximizing locality of memory reference

NASA Technical Reports Server

Vienna FORTRAN: A FORTRAN language extension for distributed memory multiprocessors

Author: Chapman Barbara
Mehrotra Piyush
Zima Hans
Publication venue
Publication date
Field of study

Exploiting the performance potential of distributed memory machines requires a careful distribution of data across the processors. Vienna FORTRAN is a language extension of FORTRAN which provides the user with a wide range of facilities for such mapping of data structures. However, programs in Vienna FORTRAN are written using global data references. Thus, the user has the advantage of a shared memory programming paradigm while explicitly controlling the placement of data. The basic features of Vienna FORTRAN are presented along with a set of examples illustrating the use of these features

NASA Technical Reports Server

Supporting shared data structures on distributed memory architectures

Author: Koelbel Charles
Mehrotra Piyush
Vanrosendale John
Publication venue
Publication date
Field of study

Programming nonshared memory systems is more difficult than programming shared memory systems, since there is no support for shared data structures. Current programming languages for distributed memory architectures force the user to decompose all data structures into separate pieces, with each piece owned by one of the processors in the machine, and with all communication explicitly specified by low-level message-passing primitives. A new programming environment is presented for distributed memory architectures, providing a global name space and allowing direct access to remote parts of data values. The analysis and program transformations required to implement this environment are described, and the efficiency of the resulting code on the NCUBE/7 and IPSC/2 hypercubes are described

NASA Technical Reports Server

High performance FORTRAN without templates: An alternative model for distribution and alignment

Author: Chapman Barbara
Mehrotra Piyush
Zima Hans
Publication venue
Publication date
Field of study

Language extensions of FORTRAN are being developed which permit the user to map data structures to the individual processors of distributed memory machines. These languages allow a programming style in which global data references are used. Current efforts are focussed on designing a common basis for such languages, the result of which is known as High Performance Fortran (HPF). One of the central debates in the HPF effort revolves around the concept of templates, introduced as an abstract index space to which data could be aligned. A model for the mapping of data which provides the functionality of High Performance Fortran distributions without the use of templates is presented

NASA Technical Reports Server

Semi-automatic Process Decomposition for Non-shared Memory Machines

Author: Koelbel Charles
Mehrotra Piyush
Publication venue: 'Purdue University (bepress)'
Publication date: 16/08/1988
Field of study

Purdue E-Pubs