2 research outputs found

    Impact FD: An Unreliable Failure Detector Based on Process Relevance and Confidence in the System

    Get PDF
    International audienceThis paper presents a new unreliable failure detector, called the Impact failure detector (FD) that, contrarily to the majority of traditional FDs, outputs a trust level value which expresses the degree of confidence in the system. An impact factor is assigned to each process and the trust level is equal to the sum of the impact factors of the processes not suspected of failure. Moreover, a threshold parameter defines a lower bound value for the trust level, over which the confidence in the system is ensured. In particular, we defined a f l exi bi l i t y property that denotes the capacity of the Impact FD to tolerate a certain margin of failures or false suspicions, i.e., its capacity of considering different sets of responses that lead the system to trusted states. The Impact FD is suitable for systems that present node redundancy, heterogeneity of nodes, clustering feature, and allow a margin of failures which does not degrade the confidence in the system. The paper also includes a timer-based distributed algorithm which implements an Impact FD, as well as its proof of correctness, for systems whose links are lossy asynchronous or for those whose all (or some) links are eventually timely. Performance evaluation results, based on PlanetLab [1] traces, confirm the degree of flexible applicability of our failure detector and that, due to the accepted margin of failure, both failures and false suspicions are more tolerated when compared to traditional unreliable failure detectors

    A Failure Detector That Gives Information on the Degree of Confidence in the System

    No full text
    International audienceThis work proposes a new and flexible unreliablefailure detector, denoted Impact Failure Detector (FD), whoseoutput gives the trust level of a set of processes. By expressingthe relevance of each node by an impact factor value as well asan acceptable margin of failure in the system, the Impact FDenables the user to tune the failure detection configuration inaccordance with the requirements of the application: in somescenarios, the failure of low impact or redundant nodes does notjeopardize the confidence in the system, while the crash resultingfrom a high impact factor may seriously affect it. Either a softeror stricter monitoring is thus possible. Performance evaluationresults using real PlanetLab [1] traces confirm the degree offlexibility of our failure detector and that, due to the margin offailure, the number of false responses may be reduced when it iscompared with traditional unreliable failure detectors
    corecore