Intelligent Resource Scheduling at Scale: a Machine Learning Perspective

Chen, Y; Ouyang, X; Townend, P; Xu, J; Yang, R

research

Intelligent Resource Scheduling at Scale: a Machine Learning Perspective

Authors: Y Chen
X Ouyang
P Townend
J Xu
R Yang
Publication date: 17 May 2018
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'
Doi

Abstract

Resource scheduling in a computing system addresses the problem of packing tasks with multi-dimensional resource requirements and non-functional constraints. The exhibited heterogeneity of workload and server characteristics in Cloud-scale or Internet-scale systems is adding further complexity and new challenges to the problem. Compared with,,,, existing solutions based on ad-hoc heuristics, Machine Learning (ML) has the potential to improve further the efficiency of resource management in large-scale systems. In this paper we,,,, will describe and discuss how ML could be used to understand automatically both workloads and environments, and to help to cope with scheduling-related challenges such as consolidating co-located workloads, handling resource requests, guaranteeing application's QoSs, and mitigating tailed stragglers. We will introduce a generalized ML-based solution to large-scale resource scheduling and demonstrate its effectiveness through a case study that deals with performance-centric node classification and straggler mitigation. We believe that an MLbased method will help to achieve architectural optimization and efficiency improvement

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Crossref

info:doi/10.1109%2Fsose.2018.0...

Last time updated on 10/08/2021

White Rose Research Online

oai:eprints.whiterose.ac.uk:13...

Last time updated on 15/05/2018