Search CORE

11,327 research outputs found

Scheduling Parallel Jobs on a Network of Heterogeneous Platforms

Author: Jansen Klaus
Trystram Denis
Publication venue
Publication date: 01/01/2015
Field of study

We consider the problem of scheduling parallel jobs on a network of heterogeneous platforms. Given a set J of n jobs where each job j 2 J is described by a pair (pj ; qj) with a processing time pj and number qj of processors required and a set of N heterogeneous platforms Pi with mi processors, the goal is to and a schedule for all jobs on the platforms minimizing the maximum completion time. Unless P = NP there is no approximation algorithm with absolute ratio better than 2 for the problem. We propose an approximation algorithm with absolute ratio 2 improving the previously best known algorithms. This closes the gap between the lower bound of 2 and the best approximation ratio

MACAU: Open Access Repository of Kiel University

Predictability of Fixed-Job Priority Schedulers on Heterogeneous Multiprocessor Real-Time Systems

Author: Cucu-Grosjean Liliana
Goossens Joël
Publication venue
Publication date: 01/01/2009
Field of study

The multiprocessor Fixed-Job Priority (FJP) scheduling of real-time systems is studied. An important property for the schedulability analysis, the predictability (regardless to the execution times), is studied for heterogeneous multiprocessor platforms. Our main contribution is to show that any FJP schedulers are predictable on unrelated platforms. A convenient consequence is the fact that any FJP schedulers are predictable on uniform multiprocessors

arXiv.org e-Print Archive

CiteSeerX

INRIA a CCSD electronic archive server

DI-fusion

HAL-Rennes 1

Libra: An Economy driven Job Scheduling System for Clusters

Author: Ali Nosheen
Buyya Rajkumar
Hayat Zahra
Lotia Nausheen
Sherwani Jahanzeb
Publication venue
Publication date: 01/01/2002
Field of study

Clusters of computers have emerged as mainstream parallel and distributed platforms for high-performance, high-throughput and high-availability computing. To enable effective resource management on clusters, numerous cluster managements systems and schedulers have been designed. However, their focus has essentially been on maximizing CPU performance, but not on improving the value of utility delivered to the user and quality of services. This paper presents a new computational economy driven scheduling system called Libra, which has been designed to support allocation of resources based on the users? quality of service (QoS) requirements. It is intended to work as an add-on to the existing queuing and resource management system. The first version has been implemented as a plugin scheduler to the PBS (Portable Batch System) system. The scheduler offers market-based economy driven service for managing batch jobs on clusters by scheduling CPU time according to user utility as determined by their budget and deadline rather than system performance considerations. The Libra scheduler ensures that both these constraints are met within an O(n) run-time. The Libra scheduler has been simulated using the GridSim toolkit to carry out a detailed performance analysis. Results show that the deadline and budget based proportional resource allocation strategy improves the utility of the system and user satisfaction as compared to system-centric scheduling strategies.Comment: 13 page

arXiv.org e-Print Archive

CiteSeerX

Using Pilot Systems to Execute Many Task Workloads on Supercomputers

Author: Andre Merzky
E Hwang
J Preto
M Wilde
R Pordes
RH Castain
T Maeno
TE Cheatham III
Y Sugita
Publication venue
Publication date: 30/07/2018
Field of study

High performance computing systems have historically been designed to support applications comprised of mostly monolithic, single-job workloads. Pilot systems decouple workload specification, resource selection, and task execution via job placeholders and late-binding. Pilot systems help to satisfy the resource requirements of workloads comprised of multiple tasks. RADICAL-Pilot (RP) is a modular and extensible Python-based pilot system. In this paper we describe RP's design, architecture and implementation, and characterize its performance. RP is capable of spawning more than 100 tasks/second and supports the steady-state execution of up to 16K concurrent tasks. RP can be used stand-alone, as well as integrated with other application-level tools as a runtime system

arXiv.org e-Print Archive

Crossref