Search CORE

5 research outputs found

Run-time Support for Distributed Object Sharing in Safe Programming Languages

Author: Cox A.L.
Hu Y.C.
Wallach D.S.
Yu W.
Zwaenepoel W.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 16/08/2005
Field of study

We present a new run-time system that supports object sharing in a distributed system. The key insight in this system is that a handle-based implementation of such a system enables effcient and transparent sharing of data with both fine-grained and coarse-grained access patterns. In addition, it supports effcient execution of garbage-collected programs. In contrast, conventional distributed shared memory (DSM) systems are limited to providing only one granularity with good performance, and have experienced diffculty in effciently supporting garbage collection. A safe language, in which no pointer arithmetic is allowed, can transparently be compiled into a handle-based system and constitutes its preferred mode of use. A programmer can also directly use a handle-based programming model that avoids pointer arithmetic on the handles, and achieve the same performance but without the programming benefits of a safe programming language. This new run-time system, DOSA (Distributed Object Sharing Architecture), provides a shared object space abstraction rather than a shared address space abstraction. The key to its effciency is the observation that a handle-based distributed implementation permits VM-based access and modification detection without suffering false sharing for fine-grained access patterns. We compare DOSA to TreadMarks, a conventional DSM system that is effcient at handling coarse-grained sharing. The performance of fine-grained applications and garbage-collected applications is considerably better than in TreadMarks. The performance of coarse-grained applications is nearly as good as in TreadMarks. Since the performance of such applications is already good in TreadMarks, we consider this an acceptable performance penalty

Infoscience - École polytechnique fédérale de Lausanne

An Efficient Garbage Collection Scheme for Parallel Computer Architectures

Author: Ian Watson
Paul Watson
Publication venue: Springer-Verlag
Publication date: 01/01/1987
Field of study

this paper describes a modified form of Reference Count garbage collection which removes the need for synchronisation, and gives greater locality of store accessing. This makes it attractive for parallel machines, but does not overcome the problem of reclaiming circular structures. The solution adopted for the Flagship machine is to implement the modified Reference Count scheme, and also implement a secondary Mark-Scan collector to remove circular structures (which are rarely created in this machine). Because it is not the main method of garbage collection, the efficiency of the Mark-Scan collector is not so critical, and its overheads, for example synchronisation costs, can be more readily tolerate

CiteSeerX

The University of Manchester - Institutional Repository

An Efficient Garbage Collection Scheme for Parallel Computer Architectures

Author: Watson I
Watson P
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date
Field of study

Newcastle University E-Prints

Parallel functional programming for message-passing multiprocessors

Author: Ostheimer Gerald
Publication venue: The University of St Andrews
Publication date: 21/05/2018
Field of study

We propose a framework for the evaluation of implicitly parallel functional programs on message passing multiprocessors with special emphasis on the issue of load bounding. The model is based on a new encoding of the lambda-calculus in Milner's pi-calculus and combines lazy evaluation and eager (parallel) evaluation in the same framework. The pi-calculus encoding serves as the specification of a more concrete compilation scheme mapping a simple functional language into a message passing, parallel program. We show how and under which conditions we can guarantee successful load bounding based on this compilation scheme. Finally we discuss the architectural requirements for a machine to support our model efficiently and we present a simple RISC-style processor architecture which meets those criteria

St Andrews Research Repository

Sparsely Faceted Arrays: A Mechanism Supporting Parallel Allocation, Communication, and Garbage Collection

Author: Jeremy Brown
Jeremy Hanford Brown
Jeremy Hanford Brown
Jeremy Hanford Brown
Publication venue
Publication date: 01/01/2002
Field of study

Conventional parallel computer architectures do not provide support for non-uniformly distributed objects. In this thesis, I introduce sparsely faceted arrays (SFAs), a new low-level mechanism for naming regions of memory, or facets, on different processors in a distributed, shared memory parallel processing system. Sparsely faceted arrays address the disconnect between the global distributed arrays provided by conventional architectures (e.g. the Cray T3 series), and the requirements of high-level parallel programming methods that wish to use objects that are distributed over only a subset of processing elements. A sparsely faceted array names a virtual globally-distributed array, but actual facets are lazily allocated. By providing simple semantics and making efficient use of memory, SFAs enable efficient implementation of a variety of non-uniformly distributed data structures and related algorithms. I present example applications which use SFAs, and describe and evaluate simple hardware mechanisms for implementing SFAs. Keeping track of which nodes have allocated facets for a particular SFA is an important task that suggests the need for automatic memory management, including garbage collection. To address this need, I first argue that conventional tracing techniques such as mark/sweep and copying GC are inherently unscalable in parallel systems. I then present a parallel memory-management strategy, based on reference-counting, that is capable of garbage collecting sparsely faceted arrays. I also discuss opportunities for hardware support of this garbage collection strategy. I have implemented a high-level hardware/OS simulator featuring hardware support for sparsely faceted arrays and automatic garbage collection. I describe the simulator and outline a few of the numerous details associated with a "real" implementation of SFAs and SFA-aware garbage collection. Simulation results are used throughout this thesis in the evaluation of hardware support mechanisms

CiteSeerX

DSpace@MIT