A Novel Workload Allocation Strategy for Batch Jobs
The distribution of computational tasks across a diverse set of geographically distributed heterogeneous resources is a critical issue in the realisation of true computational grids. Conventionally, workload allocation algorithms are divided into static and dynamic approaches. Whilst dynamic approaches frequently outperform static schemes, they usually require the collection and processing of detailed system information at frequent intervals - a task that can be both time-consuming and unreliable in the real world. This paper introduces a novel workload allocation algorithm for optimally distributing the workload produced by the arrival of batches of jobs. Results show that, for the arrival of batches of jobs, this workload allocation algorithm outperforms other commonly used algorithms in the static case. A hybrid scheduling approach (using this workload allocation algorithm), in which information about the speed of computational resources is inferred from previously completed jobs, is then introduced, and its efficiency is demonstrated on a real-world computational grid. These results are compared with the same workload allocation algorithm used in the static case, and the hybrid approach is shown to comprehensively outperform the static approach.
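The core idea of the hybrid approach is to split a batch across resources in proportion to their inferred speeds. The sketch below is a rough illustration of that idea only, not the paper's algorithm; the number of resources, the speed estimates and the rounding rule are assumptions.

```c
#include <stdio.h>

#define NUM_RESOURCES 4

/* Rough sketch: split a batch of identical jobs across resources in
 * proportion to their estimated speeds (jobs completed per unit time,
 * inferred here from a hypothetical history of finished jobs).
 * Illustrative only; not the paper's workload allocation algorithm. */
int main(void) {
    /* Hypothetical speed estimates: completed_jobs / elapsed_time */
    double speed[NUM_RESOURCES] = { 4.0, 2.5, 1.0, 0.5 };
    int batch_size = 100;
    int alloc[NUM_RESOURCES];

    double total_speed = 0.0;
    for (int i = 0; i < NUM_RESOURCES; i++)
        total_speed += speed[i];

    /* Proportional allocation with simple floor rounding */
    int assigned = 0;
    for (int i = 0; i < NUM_RESOURCES; i++) {
        alloc[i] = (int)(batch_size * speed[i] / total_speed);
        assigned += alloc[i];
    }

    /* Hand any rounding remainder to the fastest resource */
    int fastest = 0;
    for (int i = 1; i < NUM_RESOURCES; i++)
        if (speed[i] > speed[fastest]) fastest = i;
    alloc[fastest] += batch_size - assigned;

    for (int i = 0; i < NUM_RESOURCES; i++)
        printf("resource %d: %d jobs\n", i, alloc[i]);
    return 0;
}
```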
Flexible provisioning of Web service workflows
Web services promise to revolutionise the way computational resources and business processes are offered and invoked in open, distributed systems, such as the Internet. These services are described using machine-readable meta-data, which enables consumer applications to automatically discover and provision suitable services for their workflows at run-time. However, current approaches have typically assumed that service descriptions are accurate and deterministic, and so have neglected to account for the fact that services in these open systems are inherently unreliable and uncertain. Specifically, network failures, software bugs and competition for services may regularly lead to execution delays or even service failures. To address this problem, the process of provisioning services needs to be performed in a more flexible manner than has so far been considered, in order to deal proactively with failures and to recover workflows that have partially failed. To this end, we devise and present a heuristic strategy that varies the provisioning of services according to their predicted performance. Using simulation, we then benchmark our algorithm and show that it leads to a 700% improvement in average utility, while successfully completing up to eight times as many workflows as approaches that do not consider service failures.
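One generic ingredient of provisioning against predicted failures is redundancy: invoking several independent instances of an unreliable service so that at least one succeeds. The sketch below shows that standard calculation only; it is not the paper's heuristic, and the failure probabilities and target are made up.

```c
#include <math.h>
#include <stdio.h>

/* How many independent replicas of a service are needed so that at least
 * one succeeds with probability >= target?  If each replica fails
 * independently with probability f, the smallest n with 1 - f^n >= target
 * is n = ceil(ln(1 - target) / ln(f)).  Generic redundancy calculation,
 * not the paper's provisioning strategy. */
static int replicas_needed(double fail_prob, double target) {
    if (fail_prob <= 0.0) return 1;                    /* never fails      */
    if (fail_prob >= 1.0 || target >= 1.0) return -1;  /* unreachable goal */
    return (int)ceil(log(1.0 - target) / log(fail_prob));
}

int main(void) {
    /* Hypothetical predicted failure probabilities for three services */
    double fail[3] = { 0.05, 0.30, 0.60 };
    double target = 0.99;
    for (int i = 0; i < 3; i++)
        printf("service %d (f=%.2f): provision %d replicas\n",
               i, fail[i], replicas_needed(fail[i], target));
    return 0;
}
```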
A Three-Level Parallelisation Scheme and Application to the Nelder-Mead Algorithm
We consider a three-level parallelisation scheme. The second and third levels define a classical two-level parallelisation scheme, and some load balancing algorithm is used to distribute tasks among processes. It is well known that, for many applications, the efficiency of parallel algorithms of the second and third level starts to drop after some critical parallelisation degree is reached. This weakness of the two-level template is addressed by the introduction of one additional parallelisation level. As an alternative to the basic solver, some new or modified algorithms are considered on this level. The idea of the proposed methodology is to increase the parallelisation degree by using algorithms that are less efficient than the basic solver. As an example we investigate two modified Nelder-Mead methods. For the selected application, a few partial differential equations are solved numerically on the second level, and on the third level the parallel Wang's algorithm is used to solve systems of linear equations with tridiagonal matrices. A greedy workload balancing heuristic is proposed, oriented to the case of a large number of available processors. The complexity estimates of the computational tasks are model-based, i.e. they use empirical computational data.
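A common form of greedy balancing with model-based cost estimates is to place the largest remaining task on the currently least-loaded processor group. The sketch below shows that generic pattern under assumed task costs and group counts; the paper's heuristic may differ in detail.

```c
#include <stdio.h>
#include <stdlib.h>

#define NUM_TASKS 8
#define NUM_GROUPS 3

/* Greedy balancing sketch: sort tasks by a model-based cost estimate
 * (largest first) and assign each task to the least-loaded processor
 * group.  The costs below are made up. */
static int cmp_desc(const void *a, const void *b) {
    double d = *(const double *)b - *(const double *)a;
    return (d > 0) - (d < 0);
}

int main(void) {
    double cost[NUM_TASKS] = { 7.0, 5.5, 5.0, 4.0, 3.0, 2.5, 2.0, 1.0 };
    double load[NUM_GROUPS] = { 0 };

    qsort(cost, NUM_TASKS, sizeof cost[0], cmp_desc);

    for (int t = 0; t < NUM_TASKS; t++) {
        int g = 0;                              /* least-loaded group */
        for (int i = 1; i < NUM_GROUPS; i++)
            if (load[i] < load[g]) g = i;
        load[g] += cost[t];
        printf("task with cost %.1f -> group %d\n", cost[t], g);
    }
    for (int i = 0; i < NUM_GROUPS; i++)
        printf("group %d total load: %.1f\n", i, load[i]);
    return 0;
}
```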
Dominant Resource Fairness in Cloud Computing Systems with Heterogeneous Servers
We study the multi-resource allocation problem in cloud computing systems where the resource pool is constructed from a large number of heterogeneous servers, representing different points in the configuration space of resources such as processing, memory, and storage. We design a multi-resource allocation mechanism, called DRFH, that generalizes the notion of Dominant Resource Fairness (DRF) from a single server to multiple heterogeneous servers. DRFH provides a number of highly desirable properties. With DRFH, no user prefers the allocation of another user; no one can improve its allocation without decreasing that of the others; and, more importantly, no user has an incentive to lie about its resource demand. As a direct application, we design a simple heuristic that implements DRFH in real-world systems. Large-scale simulations driven by Google cluster traces show that DRFH significantly outperforms the traditional slot-based scheduler, leading to much higher resource utilization with substantially shorter job completion times.
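For context, classic single-server DRF equalises users' dominant shares by progressive filling: repeatedly grant one task to the user whose dominant share is currently smallest. The sketch below shows that single-server baseline with two users and made-up demands and capacities; it is not DRFH itself, which generalises the idea to many heterogeneous servers.

```c
#include <stdio.h>

#define NUM_USERS 2
#define NUM_RES 2

/* Minimal sketch of classic single-server Dominant Resource Fairness
 * (progressive filling): repeatedly give one task's worth of resources to
 * the user with the smallest dominant share, until that user's next task
 * no longer fits (simplification).  Demands and capacities are made up. */
int main(void) {
    double capacity[NUM_RES] = { 9.0, 18.0 };             /* CPUs, GB RAM */
    double demand[NUM_USERS][NUM_RES] = { { 1.0, 4.0 },   /* user A task  */
                                          { 3.0, 1.0 } }; /* user B task  */
    double used[NUM_RES] = { 0 };
    double dom_share[NUM_USERS] = { 0 };
    int tasks[NUM_USERS] = { 0 };

    for (;;) {
        /* pick the user with the smallest dominant share */
        int u = (dom_share[0] <= dom_share[1]) ? 0 : 1;

        /* stop if that user's next task does not fit */
        int fits = 1;
        for (int r = 0; r < NUM_RES; r++)
            if (used[r] + demand[u][r] > capacity[r]) fits = 0;
        if (!fits) break;

        /* allocate one task and update the dominant share */
        tasks[u]++;
        double dmax = 0.0;
        for (int r = 0; r < NUM_RES; r++) {
            used[r] += demand[u][r];
            double share = tasks[u] * demand[u][r] / capacity[r];
            if (share > dmax) dmax = share;
        }
        dom_share[u] = dmax;
    }
    for (int u = 0; u < NUM_USERS; u++)
        printf("user %d: %d tasks, dominant share %.2f\n",
               u, tasks[u], dom_share[u]);
    return 0;
}
```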
Multidimensional integration in a heterogeneous network environment
We consider several issues related to multidimensional integration using a network of heterogeneous computers. Based on these considerations, we develop a new general-purpose scheme which can significantly reduce the time needed for the evaluation of integrals with CPU-intensive integrands. This scheme is a parallel version of the well-known adaptive Monte Carlo method (the VEGAS algorithm), and is incorporated into a new integration package which uses the standard set of message-passing routines in the PVM software system.
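The underlying structure of such a scheme is that the total sample budget is split into independent chunks whose partial sums are combined at the end. The sketch below illustrates only that structure, serially and with plain (non-adaptive) Monte Carlo; the integrand, dimension and sample counts are assumptions, and neither the VEGAS grid adaptation nor the PVM messaging layer is shown.

```c
#include <math.h>
#include <stdio.h>
#include <stdlib.h>

#define DIM 4

/* Hypothetical CPU-intensive integrand over the unit hypercube */
static double f(const double x[DIM]) {
    double s = 0.0;
    for (int d = 0; d < DIM; d++) s += x[d] * x[d];
    return exp(-s);
}

/* Serial sketch of the idea behind parallel Monte Carlo integration:
 * each notional worker draws its own samples and contributes a partial
 * sum.  The real package is adaptive (VEGAS) and uses PVM message
 * passing, neither of which appears here. */
int main(void) {
    const int workers = 4;
    const long samples_per_worker = 250000;
    double total_sum = 0.0;

    for (int w = 0; w < workers; w++) {
        srand(12345u + (unsigned int)w);   /* per-worker seed */
        double partial = 0.0;
        for (long i = 0; i < samples_per_worker; i++) {
            double x[DIM];
            for (int d = 0; d < DIM; d++)
                x[d] = (double)rand() / RAND_MAX;
            partial += f(x);
        }
        total_sum += partial;   /* in the parallel version: a message/reduce */
    }

    long n = (long)workers * samples_per_worker;
    printf("estimated integral: %f\n", total_sum / n);
    return 0;
}
```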
Hardware acceleration of reaction-diffusion systems: a guide to optimisation of pattern formation algorithms using OpenACC
Reaction Diffusion Systems (RDS) have widespread applications in computational ecology, biology, computer graphics and the visual arts. For the former applications, a major barrier to the development of effective simulation models is their computational complexity - it takes a great deal of processing power to simulate enough replicates such that reliable conclusions can be drawn. Optimising the computation is thus highly desirable in order to obtain more results with fewer resources. Existing optimisations of RDS tend to be low-level and GPGPU-based. Here we apply the higher-level OpenACC framework to two case studies: a simple RDS to learn the ‘workings’ of OpenACC, and a more realistic and complex example. Our results show that simple parallelisation directives and minimal data transfer can produce a useful performance improvement. The relative simplicity of porting OpenACC code between heterogeneous hardware is a key benefit to the scientific computing community in terms of speed-up and portability.
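OpenACC expresses parallelism through compiler directives rather than explicit GPU code. The sketch below annotates a simple explicit diffusion-style update (one half of a reaction-diffusion system) with `#pragma acc` directives and keeps the arrays on the device to minimise data transfer; the grid size, coefficient and scheme are illustrative assumptions, not the paper's model. With a compiler that lacks OpenACC support, the pragmas are ignored and the code runs serially.

```c
#include <stdio.h>

#define NX 256
#define NY 256
#define STEPS 100

/* Diffusion-type update accelerated with OpenACC directives.
 * Grid size, diffusion constant and the explicit scheme are illustrative
 * assumptions, not the paper's reaction-diffusion model. */
int main(void) {
    static double u[NX][NY], unew[NX][NY];
    const double D = 0.1;                 /* diffusion coefficient */

    /* initial condition: a single high-concentration spot in the middle */
    u[NX / 2][NY / 2] = 1.0;

    /* keep both arrays on the device for the whole time loop to minimise
       host<->device data transfer */
    #pragma acc data copy(u) create(unew)
    for (int s = 0; s < STEPS; s++) {
        #pragma acc parallel loop collapse(2)
        for (int i = 1; i < NX - 1; i++)
            for (int j = 1; j < NY - 1; j++)
                unew[i][j] = u[i][j] + D * (u[i - 1][j] + u[i + 1][j]
                                          + u[i][j - 1] + u[i][j + 1]
                                          - 4.0 * u[i][j]);
        #pragma acc parallel loop collapse(2)
        for (int i = 1; i < NX - 1; i++)
            for (int j = 1; j < NY - 1; j++)
                u[i][j] = unew[i][j];
    }

    printf("centre value after %d steps: %f\n", STEPS, u[NX / 2][NY / 2]);
    return 0;
}
```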