1,184 research outputs found
A generalized parallel task model for recurrent real-time processes
A model is considered for representing recurrent precedence-constrained tasks that are to execute on multiprocessor platforms. A recurrent task is specified as a directed acyclic graph (DAG), a period, and a relative deadline. Each vertex of the DAG represents a sequential job, while the edges of the DAG represent precedence constraints between these jobs. All the jobs of the DAG are released simultaneously and need to complete execution within the specified relative deadline of their release. Each task may release j
Scheduling on Hybrid Platforms: Improved Approximability Window
Modern platforms are using accelerators in conjunction with standard
processing units in order to reduce the running time of specific operations,
such as matrix operations, and improve their performance. Scheduling on such
hybrid platforms is a challenging problem since the algorithms used for the
case of homogeneous resources do not adapt well. In this paper we consider the
problem of scheduling a set of tasks subject to precedence constraints on
hybrid platforms, composed of two types of processing units. We propose a
-approximation algorithm and a conditional lower bound of 3 on
the approximation ratio. These results improve upon the 6-approximation
algorithm proposed by Kedad-Sidhoum et al. as well as the lower bound of 2 due
to Svensson for identical machines. Our algorithm is inspired by the former one
and distinguishes the allocation and the scheduling phases. However, we propose
a different allocation procedure which, although is less efficient for the
allocation sub-problem, leads to an improved approximation ratio for the whole
scheduling problem. This approximation ratio actually decreases when the number
of processing units of each type is close and matches the conditional lower
bound when they are equal
Energy Efficient Scheduling via Partial Shutdown
Motivated by issues of saving energy in data centers we define a collection
of new problems referred to as "machine activation" problems. The central
framework we introduce considers a collection of machines (unrelated or
related) with each machine having an {\em activation cost} of . There
is also a collection of jobs that need to be performed, and is
the processing time of job on machine . We assume that there is an
activation cost budget of -- we would like to {\em select} a subset of
the machines to activate with total cost and {\em find} a schedule
for the jobs on the machines in minimizing the makespan (or any other
metric).
For the general unrelated machine activation problem, our main results are
that if there is a schedule with makespan and activation cost then we
can obtain a schedule with makespan \makespanconstant T and activation cost
\costconstant A, for any . We also consider assignment costs for
jobs as in the generalized assignment problem, and using our framework, provide
algorithms that minimize the machine activation and the assignment cost
simultaneously. In addition, we present a greedy algorithm which only works for
the basic version and yields a makespan of and an activation cost .
For the uniformly related parallel machine scheduling problem, we develop a
polynomial time approximation scheme that outputs a schedule with the property
that the activation cost of the subset of machines is at most and the
makespan is at most for any
Scheduling techniques to improve the worst-case execution time of real-time parallel applications on heterogeneous platforms
The key to providing high performance and energy-efficient execution for hard real-time applications is the time predictable and efficient usage of heterogeneous multiprocessors. However, schedulability analysis of parallel applications executed on unrelated heterogeneous multiprocessors is challenging and has not been investigated adequately by earlier works. The unrelated model is suitable to represent many of the multiprocessor platforms available today because a task (i.e., sequential code) may exhibit a different work-case-execution-time (WCET) on each type of processor on an unrelated heterogeneous multiprocessors platform. A parallel application can be realistically modeled as a directed acyclic graph (DAG), where the nodes are sequential tasks and the edges are dependencies among the tasks. This thesis considers a sporadic DAG model which is used broadly to analyze and verify the real-time requirements of parallel applications. A global work-conserving scheduler can efficiently utilize an unrelated platform by executing the tasks of a DAG on different processor types. However, it is challenging to compute an upper bound on the worst-case schedule length of the DAG, called makespan, which is used to verify whether the deadline of a DAG is met or not. There are two main challenges. First, because of the heterogeneity of the processors, the WCET for each task of the DAG depends on which processor the task is executing on during actual runtime. Second, timing anomalies are the main obstacle to compute the makespan even for the simpler case when all the processors are of the same type, i.e., homogeneous multiprocessors. To that end, this thesis addresses the following problem: How we can schedule multiple sporadic DAGs on unrelated multiprocessors such that all the DAGs meet their deadlines. Initially, the thesis focuses on homogeneous multiprocessors that is a special case of unrelated multiprocessors to understand and tackle the main challenge of timing anomalies. A novel timing-anomaly-free scheduler is proposed which can be used to compute the makespan of a DAG just by simulating the execution of the tasks based on this proposed scheduler. A set of representative task-based parallel OpenMP applications from the BOTS benchmark suite are modeled as DAGs to investigate the timing behavior of real-world applications. A simulation framework is developed to evaluate the proposed method. Furthermore, the thesis targets unrelated multiprocessors and proposes a global scheduler to execute the tasks of a single DAG to an unrelated multiprocessors platform. Based on the proposed scheduler, methods to compute the makespan of a single DAG are introduced. A set of representative parallel applications from the BOTS benchmark suite are modeled as DAGs that execute on unrelated multiprocessors. Furthermore, synthetic DAGs are generated to examine additional structures of parallel applications and various platform capabilities. A simulation framework that simulates the execution of the tasks of a DAG on an unrelated multiprocessor platform is introduced to assess the effectiveness of the proposed makespan computations. Finally, based on the makespan computation of a single DAG this thesis presents the design and schedulability analysis of global and federated scheduling of sporadic DAGs that execute on unrelated multiprocessors
Stochastic scheduling on unrelated machines
Two important characteristics encountered in many real-world scheduling problems are heterogeneous machines/processors and a certain degree of uncertainty about the actual sizes of jobs. The first characteristic entails machine dependent processing times of jobs and is captured by the classical unrelated machine scheduling model.The second characteristic is adequately addressed by stochastic processing times of jobs as they are studied in classical stochastic scheduling models. While there is an extensive but separate literature for the two scheduling models, we study for the first time a combined model that takes both characteristics into account simultaneously. Here, the processing time of job on machine is governed by random variable , and its actual realization becomes known only upon job completion. With being the given weight of job , we study the classical objective to minimize the expected total weighted completion time , where is the completion time of job . By means of a novel time-indexed linear programming relaxation, we compute in polynomial time a scheduling policy with performance guarantee . Here, is arbitrarily small, and is an upper bound on the squared coefficient of variation of the processing times. We show that the dependence of the performance guarantee on is tight, as we obtain a lower bound for the type of policies that we use. When jobs also have individual release dates , our bound is . Via , currently best known bounds for deterministic scheduling are contained as a special case
Bounding the execution time of parallel applications on unrelated multiprocessors
Heterogeneous multiprocessors, that consist of processor types with different execution capabilities, are critical today, and in future, to offer high performance and high energy efficiency. In order to use them in hard real-time systems to support parallel processing, a tight estimation of the upper bound on the completion time (WCET) of parallel applications is needed. This paper presents, for the first time, a closed-form solution for the calculation of the WCET for task-based parallel applications modeled as directed acyclic-graphs (DAG) using the general unrelated multiprocessor model that is capable of modeling a wide range of heterogeneous multiprocessor platforms. The paper contributes with a polynomial time algorithm to calculate the WCET (i.e., makespan) for the unrelated model. In addition, it presents simulation results that are based on modeling a set of representative OpenMP task-based parallel applications from the BOTS benchmark suite
Optimal Dataflow Scheduling on a Heterogeneous Multiprocessor With Reduced Response Time Bounds
Heterogeneous computing platforms with multiple types of computing resources have been widely used in many industrial systems to process dataflow tasks with pre-defined affinity of tasks to subgroups of resources. For many dataflow workloads with soft real-time requirements, guaranteeing fast and bounded response times is often the objective. This paper presents a new set of analysis techniques showing that a classical real-time scheduler, namely earliest-deadline first (EDF), is able to support dataflow tasks scheduled on such heterogeneous platforms with provably bounded response times while incurring no resource capacity loss, thus proving EDF to be an optimal solution for this scheduling problem. Experiments using synthetic workloads with widely varied parameters also demonstrate that the magnitude of the response time bounds yielded under the proposed analysis is reasonably small under all scenarios. Compared to the state-of-the-art soft real-time analysis techniques, our test yields a 68% reduction on response time bounds on average. This work demonstrates the potential of applying EDF into practical industrial systems containing dataflow-based workloads that desire guaranteed bounded response times
- …