
    Independent Global Snapshots in Large Distributed Systems

    Distributed systems depend on consistent global snapshots for process recovery and garbage collection. We provide exact conditions for an arbitrary checkpoint based on independent dependency tracking within clusters of nodes. The method lets nodes within a cluster compute dependency information independently, using only locally available information. Existing models of global snapshot computation provide necessary and sufficient conditions, but they require expensive global computations. With the proposed computations, a node can identify existing global checkpoints. Nodes can also compute the conditions for taking a checkpoint, or the conditions under which a collection of checkpoints can belong to a global snapshot.

    1 Introduction

    Distributed systems use process recovery mechanisms to recover from transient failures. These mechanisms rely on periodically creating and saving globally consistent snapshots [14]. Many applications including p..
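As a concrete illustration of what "consistent" means for a collection of checkpoints, the following minimal sketch uses the standard vector-clock test. It is not the cluster-local method the abstract proposes; it is the kind of global check that method aims to avoid. The function name, the dictionary layout, and the sample clocks are hypothetical and chosen only for this example.

```python
from typing import Dict, List

# Assumption: each checkpoint is represented by the vector clock its process
# held when the checkpoint was taken. Process ids are 0..n-1.
VectorClock = List[int]


def is_consistent_snapshot(checkpoints: Dict[int, VectorClock]) -> bool:
    """Return True if the checkpoints (one per process) are pairwise consistent."""
    for i, clock_i in checkpoints.items():
        for clock_j in checkpoints.values():
            # No process may have observed more events of process i than
            # process i itself had executed at its own checkpoint; otherwise
            # the set records a message as received but never sent.
            if clock_j[i] > clock_i[i]:
                return False
    return True


# Hypothetical data: three processes, one checkpoint each.
consistent = {0: [2, 0, 0], 1: [1, 3, 0], 2: [0, 2, 1]}
# Process 1 has seen two events of process 0, but process 0 recorded only one.
orphan = {0: [1, 0, 0], 1: [2, 1, 0], 2: [0, 0, 1]}

print(is_consistent_snapshot(consistent))
print(is_consistent_snapshot(orphan))
```

The check compares every pair of checkpoints, so it needs the clocks of all processes in one place, which is the global cost the abstract refers to.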