Intelligent Fault-Tolerant Mechanism for Data Centers of Cloud Infrastructure

Gupta, Punit; H S, Madhusudhan; Kumar T, Satish; Mustapha, S. M. F. D. Syed; Tripathi, Rajan Prasad

Intelligent Fault-Tolerant Mechanism for Data Centers of Cloud Infrastructure

Authors: Punit Gupta
Madhusudhan H S
Satish Kumar T
S. M. F. D. Syed Mustapha
Rajan Prasad Tripathi
Publication date: 8 February 2022
Publisher: ZU Scholars

Abstract

Fault tolerance in cloud computing is considered as one of the most vital issues to deliver reliable services. Checkpoint/restart is one of the methods used to enhance the reliability of the cloud services. However, many existing methods do not focus on virtual machine (VM) failure that occurs due to the higher response time of a node, byzantine fault, and performance fault, and existing methods also ignore the optimization during the recovery phase. This paper proposes a checkpoint/restart mechanism to enhance reliability of cloud services. Our work is threefold: (1) we design an algorithm to identify virtual machine failure due to several faults; (2) an algorithm to optimize the checkpoint interval time is designed; (3) lastly, the asynchronous checkpoint/restart with log-based recovery mechanism is used to restart the failed tasks. The valuation results obtained using a real-time dataset shows that the proposed model reduces power consumption and improves the performance with a better fault tolerance solution compared to the nonoptimization method

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

ZU Scholars (Zayed University)

oai:zuscholars.zu.ac.ae:works-...

Last time updated on 05/05/2022