3 research outputs found
Cache Based Fault Recovery for Distributed Systems
No cache based techniques for roll-forward fault recovery exist at present. A split-cache approach is proposed that provides efficient support for checkpointing and roll-forward fault recovery in distributed systems. This approach obviates the use of discrete stable storage or explicit synchronization among the processors. Stability of the checkpoint intervals is used as a driver for real time operations. 1. Introduction Recovery from transient faults to sustain a system 's functional and temporal requirements constitutes a much researched area for dependable distributed systems. Various hardware and software checkpointing schemes are proposed to constrain the recovery times by rolling back to a stable system state and restarting. However, roll-back recovery, by its very nature of rolling-back to a prior consistent operational state and re-trying the operation, involves a time penalty of lack of forward progress while the retry operation is performed. Thus, for systems with real-time..
Cache based fault recovery for distributed systems
No cache based techniques for roll-forward fault recovery exist at present. A split-cache approach is proposed that provides e cient support for checkpointing and roll-forward fault recovery in distributed systems. This approach obviates the use of discrete stable storage or explicit synchronization among the processors. Stability of the checkpoint intervals is used as a driver for real time operations. 1