Search CORE

11,281 research outputs found

Tolerating Correlated Failures in Massively Parallel Stream Processing Engines

Author: Su Li
Zhou Yongluan
Publication venue
Publication date: 04/02/2016
Field of study

Fault-tolerance techniques for stream processing engines can be categorized into passive and active approaches. A typical passive approach periodically checkpoints a processing task's runtime states and can recover a failed task by restoring its runtime state using its latest checkpoint. On the other hand, an active approach usually employs backup nodes to run replicated tasks. Upon failure, the active replica can take over the processing of the failed task with minimal latency. However, both approaches have their own inadequacies in Massively Parallel Stream Processing Engines (MPSPE). The passive approach incurs a long recovery latency especially when a number of correlated nodes fail simultaneously, while the active approach requires extra replication resources. In this paper, we propose a new fault-tolerance framework, which is Passive and Partially Active (PPA). In a PPA scheme, the passive approach is applied to all tasks while only a selected set of tasks will be actively replicated. The number of actively replicated tasks depends on the available resources. If tasks without active replicas fail, tentative outputs will be generated before the completion of the recovery process. We also propose effective and efficient algorithms to optimize a partially active replication plan to maximize the quality of tentative outputs. We implemented PPA on top of Storm, an open-source MPSPE and conducted extensive experiments using both real and synthetic datasets to verify the effectiveness of our approach

arXiv.org e-Print Archive

University of Southern Denmark Research Output

Recommended from our members

Analysis of operating system diversity for intrusion tolerance

Author: Abd-El-Malek
Alhazmi
Anbalagan
Bessani
Castro
Castro
Chou
Clement
Correia
Deswarte
Garcia
Gashi
Gorbenko
Hatton
Hofmeyr
Joseph
Kapitza
Kim
Knight
Koopman
Kotla
Lamport
Littlewood
Littlewood
Miller
Moniz
Nikora
Ozment
Randell
Rescorla
Roeder
Sousa
Veríssimo
Wood
Yin
Publication venue: 'Wiley'
Publication date: 15/01/2013
Field of study

One of the key benefits of using intrusion-tolerant systems is the possibility of ensuring correct behavior in the presence of attacks and intrusions. These security gains are directly dependent on the components exhibiting failure diversity. To what extent failure diversity is observed in practical deployment depends on how diverse are the components that constitute the system. In this paper, we present a study with operating system's (OS's) vulnerability data from the NIST National Vulnerability Database (NVD). We have analyzed the vulnerabilities of 11 different OSs over a period of 18 years, to check how many of these vulnerabilities occur in more than one OS. We found this number to be low for several combinations of OSs. Hence, although there are a few caveats on the use of NVD data to support definitive conclusions, our analysis shows that by selecting appropriate OSs, one can preclude (or reduce substantially) common vulnerabilities from occurring in the replicas of the intrusion-tolerant system

City Research Online

Crossref

Recommended from our members

A survey on online monitoring approaches of computer-based systems

Author: Stankovic V.
Strigini L.
Publication venue: Centre for Software Reliability, City University London
Publication date
Field of study

This report surveys forms of online data collection that are in current use (as well as being the subject of research to adapt them to changing technology and demands), and can be used as inputs to assessment of dependability and resilience, although they are not primarily meant for this use

City Research Online