5 research outputs found

OS-Assisted Task Preemption for Hadoop

Author: Dell'Amico Matteo
Michiardi Pietro
Pastorelli Mario
Publication date: 10/02/2014

This work introduces a new task preemption primitive for Hadoop that allows tasks to be suspended and resumed by exploiting existing memory management mechanisms readily available in modern operating systems. Our technique fills the gap between the two extreme cases of killing tasks (which wastes work) and waiting for their completion (which introduces latency): experimental results indicate superior performance and very small overheads when compared to existing alternatives.

arXiv.org e-Print Archive

Crossref

Enabling Fast Failure Recovery in Shared Hadoop Clusters: Towards Failure-Aware Scheduling

Author: Antoniu Gabriel
Ibrahim Shadi
Yildiz Orcun
Publication venue: 'Elsevier BV'
Publication date: 10/03/2016

Hadoop emerged as the de facto state-of-the-art system for MapReduce-based data analytics. The reliability of Hadoop systems depends in part on how well they handle failures. Currently, Hadoop handles machine failures by re-executing all the tasks of the failed machines (i.e., executing recovery tasks). Unfortunately, this elegant solution is entirely entrusted to the core of Hadoop and hidden from Hadoop schedulers. This unawareness of failures may therefore prevent Hadoop schedulers from operating correctly towards meeting their objectives (e.g., fairness, job priority) and can significantly impact the performance of MapReduce applications. This paper presents Chronos, a failure-aware scheduling strategy that enables an early yet smart action for fast failure recovery while still operating within a specific scheduler objective. Upon failure detection, rather than waiting an uncertain amount of time to get resources for recovery tasks, Chronos leverages a lightweight preemption technique to carefully allocate these resources. In addition, Chronos considers data locality when scheduling recovery tasks to further improve performance. We demonstrate the utility of Chronos by combining it with the FIFO and Fair schedulers. The experimental results show that Chronos recovers to a correct scheduling behavior within only a couple of seconds and reduces job completion times by up to 55% compared to state-of-the-art schedulers.

HAL-CentraleSupelec

Crossref

INRIA a CCSD electronic archive server

HAL-Inserm

Hal-Diderot

HAL-Rennes 1

OS-assisted task preemption for Hadoop

Author: Pastorelli Mario
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 10/02/2014

EURECOM Repository

OS-assisted task preemption for Hadoop

Author: Pastorelli Mario
Dell'Amico M.
Michiardi P.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 25/02/2022

This work introduces a new task preemption primitive for Hadoop that allows tasks to be suspended and resumed by exploiting existing memory management mechanisms readily available in modern operating systems. Our technique fills the gap between the two extreme cases of killing tasks (which wastes work) and waiting for their completion (which introduces latency): experimental results indicate superior performance and very small overheads when compared to existing alternatives.

Archivio istituzionale della ricerca - Università di Genova