
3,916 research outputs found

A support architecture for reliable distributed computing systems

Author: McKendry Martin S.

The Clouds kernel design has gone through several design phases and is nearly complete. The object manager, the process manager, the storage manager, the communications manager, and the actions manager are examined.

NASA Technical Reports Server

Report on the Second European SIGOPS Workshop "making distributed systems work"

Author: Mullender Sape
Publication venue: ACM
Publication date: 01/01/1987

University of Twente Research Information

Fault tolerance distributed computing

Author: LeBlanc Richard Joseph
Publication venue: Georgia Institute of Technology
Publication date: 01/01/1986

Issued as Funds expenditure reports [nos. 1-4], Quarterly progress reports [nos. 1-3], and Final report, Project no. G-36-63

Scholarly Materials And Research @ Georgia Tech

Distributed transactions for reliable systems

Author: Alfred Z. Spector
Banatre J.P.
Dahl O.J.
Daniel Duchamp
Daniels Dean S.
Date C.J.
Dean Daniels
Gray James N.
Jeffrey L. Eppinger
Jensen E.D.
Joy William
Lampson Butler W.
Liskov B.
Liskov Barbara
Moss J. Eliot B.
Nelson Bruce Jay
Randy Pausch
Reed David P.
Schwarz Peter M.
Spector Alfred Z.
Watson R.W.
Publication venue: 'Association for Computing Machinery (ACM)'

Crossref

Acquisition of computer research equipment

Author: Miller Raymond Edward
Publication venue: Georgia Institute of Technology
Publication date: 01/01/1987

Issued as Final report, Project no. G-36-61

Scholarly Materials And Research @ Georgia Tech

Maintaining consistency in distributed systems

Author: Birman Kenneth P.
Publication date: 01/01/1991

In systems designed as assemblies of independently developed components, concurrent access to data or data structures normally arises within individual programs, and is controlled using mutual exclusion constructs, such as semaphores and monitors. Where data is persistent and/or sets of operations are related to one another, transactions or linearizability may be more appropriate. Systems that incorporate cooperative styles of distributed execution often replicate or distribute data within groups of components. In these cases, group-oriented consistency properties must be maintained, and tools based on the virtual synchrony execution model greatly simplify the task confronting an application developer. All three styles of distributed computing are likely to be seen in future systems, often within the same application. This leads us to propose an integrated approach that permits applications that use virtual synchrony to interact with concurrent objects that respect a linearizability constraint, and vice versa. Transactional subsystems are treated as a special case of linearizability.

CiteSeerX

NASA Technical Reports Server

eCommons@Cornell

Recovering Shared Objects Without Stable Storage

Author: Michael Ellis
Ports Dan R. K.
Sharma Naveen Kr.
Szekeres Adriana
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 31st International Symposium on Distributed Computing (DISC 2017)
Publication date: 01/01/2017

This paper considers the problem of building fault-tolerant shared objects when processes can crash and recover but lose their persistent state on recovery. This Diskless Crash-Recovery (DCR) model matches the way many long-lived systems are built. We show that it presents new challenges, as operations that are recorded at a quorum may not persist after some of the processes in that quorum crash and then recover. To address this problem, we introduce the notion of crash-consistent quorums, where no recoveries happen during the quorum responses. We show that relying on crash-consistent quorums enables a recovery procedure that can recover all operations that successfully finished. Crash-consistent quorums can be easily identified using a mechanism we term the crash vector, which tracks the causal relationship between crashes, recoveries, and other operations. We apply crash-consistent quorums and crash vectors to build two storage primitives. We give a new algorithm for multi-writer, multi-reader atomic registers in the DCR model that guarantees safety under all conditions and termination under a natural condition. It improves on the best prior protocol for this problem by requiring fewer rounds, fewer nodes to participate in the quorum, and a less restrictive liveness condition. We also present a more efficient single-writer, single-reader atomic set - a virtual stable storage abstraction. It can be used to lift any existing algorithm from the traditional Crash-Recovery model to the DCR model. We examine a specific application, state machine replication, and show that existing diskless protocols can violate their correctness guarantees, while ours offers a general and correct solution.
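
The crash-vector idea in the abstract above can be illustrated with a minimal sketch. It is not the paper's protocol: it assumes a crash vector is a map from process id to a recovery counter, that each reply carries the sender's current vector, and that a set of replies counts as crash-consistent when no reply reports a later recovery of a responder than that responder reported for itself. The names `CrashVector`, `merge`, and `is_crash_consistent` are hypothetical.

```python
from typing import Dict, List, Tuple

# Assumption: process id -> number of recoveries of that process known so far.
CrashVector = Dict[str, int]


def merge(a: CrashVector, b: CrashVector) -> CrashVector:
    """Element-wise maximum of two crash vectors."""
    return {p: max(a.get(p, 0), b.get(p, 0)) for p in a.keys() | b.keys()}


def is_crash_consistent(responses: List[Tuple[str, CrashVector]]) -> bool:
    """Check a set of replies, each given as (sender, sender's crash vector).

    The set is accepted only if, for every responder, the highest recovery
    count any reply reports for it equals the count it reported for itself.
    A mismatch means the responder recovered after sending its reply, so
    its reply may describe state that has since been lost.
    """
    own = {sender: vec.get(sender, 0) for sender, vec in responses}
    merged: CrashVector = {}
    for _, vec in responses:
        merged = merge(merged, vec)
    return all(merged.get(sender, 0) == count for sender, count in own.items())


# Usage: "c" has learned that "a" recovered once, but the reply from "a"
# predates that recovery, so the pair is rejected.
stale = [("a", {"a": 0, "c": 0}), ("c", {"a": 1, "c": 0})]
fresh = [("a", {"a": 1, "c": 0}), ("c", {"a": 1, "c": 0})]
```

Under these assumptions a coordinator would discard the `stale` set and keep waiting for replies, while the `fresh` set could be used as a quorum.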

Dagstuhl Research Online Publication Server

Containers: A Sound Basis For a True Single System Image

Author: Lottiaux Renaud
Morin Christine
Publication venue: HAL CCSD
Publication date: 01/01/2000

Clusters of SMPs are attractive for executing shared memory parallel applications, but reconciling high performance and ease of programming remains an open issue. A possible approach is to provide an efficient Single System Image (SSI) operating system giving the illusion of an SMP machine. In this paper, we introduce the concept of a container as a mechanism to unify global resource management at the lowest operating system level. Higher-level operating system services such as the virtual memory system and file cache can be easily implemented based on containers and transparently benefit from the whole memory resource available in the cluster.

HAL-CentraleSupelec

CiteSeerX

INRIA a CCSD electronic archive server

HAL-Rennes 1