    Resilient Virtual Clusters

    Abstract: Clusters of computers can provide, in aggregate, reliable services despite the failure of individual computers. System-level virtualization is widely used to consolidate the workload of multiple physical systems as multiple virtual machines (VMs) on a single physical computer. A single physical computer thus forms a virtual cluster of VMs. A key difficulty with virtualization is that the failure of the virtualization infrastructure (VI) often leads to the failure of multiple VMs. This is likely to overload "cluster computing" resiliency mechanisms, typically designed to tolerate the failure of only a single node at a time. By supporting recovery from failure of key VI components, we have enhanced the resiliency of a VI (Xen), thus enabling the use of existing "cluster computing" techniques to provide resilient virtual clusters. In the overwhelming majority of cases, these enhancements allow recovery from errors in the VI to be accomplished without the failure of more than a single VM. The resulting resiliency of the virtual cluster is demonstrated by running two existing "cluster computing" systems while subjecting the VI to injected faults.