A Formal Model of Crash Recovery in a Distributed Systems

Abstract

Abstract-A formal model for atomic commit protocols for a distributed database system is introduced. The model is used to prove existence results about resilient protocols for site failures that do not partition the network and then for partitioned networks. For site failures, a pessimistic recovery technique, called independent recovery, is introduced and the class of failures for which resilient protocols exist is identified. For partitioned networks, two cases are studied: the pessimistic case in which messages are lost, and the optimistic case in which no messages are lost. In all cases, fundamental limitations on the resiliency of protocols are derived. Index Tenns-Commit protocols, crash recovery, distributed database systems, distributed systems, fault tolerance, transaction management. I

    Similar works

    Full text

    thumbnail-image

    Available Versions