Obtaining Dynamic Scheduling Policies with Simulation and Machine Learning

Carastan-Santos, Danilo; de Camargo, Raphael Y.

Obtaining Dynamic Scheduling Policies with Simulation and Machine Learning

Authors: Danilo Carastan-Santos
Raphael Y. de Camargo
Publication date: 12 November 2017
Publisher: HAL CCSD

Abstract

International audienceDynamic scheduling of tasks in large-scale HPC platforms is normally accomplished using ad-hoc heuristics, based on task characteristics, combined with some backfilling strategy. Defining heuristics that work efficiently in different scenarios is a difficult task, specially when considering the large variety of task types and platform architectures. In this work, we present a methodology based on simulation and machine learning to obtain dynamic scheduling policies. Using simulations and a workload generation model, we can determine the characteristics of tasks that lead to a reduction in the mean slowdown of tasks in an execution queue. Modeling these characteristics using a nonlinear function and applying this function to select the next task to execute in a queue dramatically improved the mean task slowdown in synthetic workloads. When applied to real workload traces from highly different machines, these functions still resulted in important performance improvements, attesting the generalization capability of the obtained heuristics

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

INRIA a CCSD electronic archive server

oai:HAL:hal-01618940v1

Last time updated on 21/11/2017

Hal - Université Grenoble Alpes

oai:HAL:hal-01618940v1

Last time updated on 04/12/2017