
35 research outputs found

Weighted Scheduling of Time-Sensitive Coflows

Author: Brun Olivier
De Pellegrini Francesco
El-Azouzi Rachid
Luu Quang-Trung
Prabhu Balakrishna J.
Richier Cédric
Publication date: 22/10/2023

Datacenter networks commonly facilitate the transmission of data in distributed computing frameworks through coflows, which are collections of parallel flows associated with a common task. Most existing research has concentrated on scheduling coflows to minimize the time required for their completion, i.e., to optimize the average dispatch rate of coflows in the network fabric. Nevertheless, modern applications often produce coflows that are specifically intended for online services and mission-crucial computational tasks, necessitating adherence to specific deadlines for their completion. In this paper, we introduce WDCoflow, a new algorithm to maximize the weighted number of coflows that complete before their deadline. By combining a dynamic programming algorithm with parallel inequalities, our heuristic solution performs coflow admission control and coflow prioritization at once, imposing a $\sigma$-order on the set of coflows. With extensive simulation, we demonstrate the effectiveness of our algorithm: up to $3\times$ more coflows meet their deadline in comparison with the best state-of-the-art (SoA) solution, namely $\mathtt{CS\text{-}MHA}$. Furthermore, when weights are used to differentiate coflow classes, WDCoflow is able to improve the admission per class by up to $4\times$, while increasing the average weighted coflow admission rate.

Comment: Submitted to IEEE Transactions on Cloud Computing. Parts of this work have been presented at IFIP Networking 202
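The objective named in this abstract, the weighted number of coflows that complete before their deadline, can be illustrated with a small sketch. This is not the paper's algorithm: the `Coflow` record, its field names, and the sample values are hypothetical, and completion times are assumed to come from some scheduler that is not modeled here.

```python
from dataclasses import dataclass


@dataclass
class Coflow:
    # Hypothetical record; field names are illustrative, not from the paper.
    weight: float           # importance of the coflow (its class weight)
    deadline: float         # time by which the coflow must finish
    completion_time: float  # finish time produced by some scheduler


def weighted_admission_rate(coflows):
    """Fraction of total weight carried by coflows that meet their deadline."""
    total = sum(c.weight for c in coflows)
    if total == 0:
        return 0.0
    met = sum(c.weight for c in coflows if c.completion_time <= c.deadline)
    return met / total


# Three sample coflows: the second one misses its deadline.
sample = [
    Coflow(weight=2.0, deadline=10.0, completion_time=8.0),
    Coflow(weight=1.0, deadline=5.0, completion_time=6.0),
    Coflow(weight=1.0, deadline=4.0, completion_time=4.0),
]
rate = weighted_admission_rate(sample)
```

With all weights equal, this reduces to the plain fraction of coflows that meet their deadline.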

arXiv.org e-Print Archive

Efficient Approximation Algorithms for Scheduling Coflows with Total Weighted Completion Time in Identical Parallel Networks

Author: Chen Chi-Yeh
Publication date: 04/06/2023

This paper addresses the scheduling problem of coflows in identical parallel networks, which is a well-known $NP$-hard problem. Coflow is a relatively new network abstraction used to characterize communication patterns in data centers. We consider both flow-level scheduling and coflow-level scheduling problems. In the flow-level scheduling problem, flows within a coflow can be transmitted through different network cores. However, in the coflow-level scheduling problem, flows within a coflow must be transmitted through the same network core. The key difference between these two problems lies in their scheduling granularity. Previous approaches relied on linear programming to solve the scheduling order. In this paper, we enhance the efficiency of solving by utilizing the primal-dual method. For the flow-level scheduling problem, we propose a $(6-\frac{2}{m})$-approximation algorithm with arbitrary release times and a $(5-\frac{2}{m})$-approximation algorithm without release times, where $m$ represents the number of network cores. Additionally, for the coflow-level scheduling problem, we introduce a $(4m+1)$-approximation algorithm with arbitrary release times and a $(4m)$-approximation algorithm without release times. To validate the effectiveness of our proposed algorithms, we conduct simulations using both synthetic and real traffic traces. The results demonstrate the superior performance of our algorithms compared to the previous approach, emphasizing their practical utility.
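To make the stated guarantees concrete, the following sketch simply evaluates the four approximation ratios from this abstract for a given number of network cores $m$. The function name and dictionary keys are hypothetical labels chosen for illustration. It only does the arithmetic and does not implement any of the scheduling algorithms.

```python
from fractions import Fraction


def approximation_ratios(m):
    """Evaluate the ratios quoted in the abstract for m network cores."""
    if m < 1:
        raise ValueError("m must be a positive number of network cores")
    return {
        # Flow-level scheduling: flows of a coflow may use different cores.
        "flow_level_release": 6 - Fraction(2, m),
        "flow_level_no_release": 5 - Fraction(2, m),
        # Coflow-level scheduling: all flows of a coflow share one core.
        "coflow_level_release": Fraction(4 * m + 1),
        "coflow_level_no_release": Fraction(4 * m),
    }


ratios = approximation_ratios(4)
```

For $m = 4$ the flow-level ratios evaluate to $11/2$ and $9/2$, and the coflow-level ratios to $17$ and $16$. The flow-level bounds stay below 6 and 5 for every $m$, whereas the coflow-level bounds grow linearly with $m$.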

arXiv.org e-Print Archive

Graviton: Twisting Space and Time to Speed Up CoFlows

Author: Akshay Jajoo
Rohan Gandhi
Y Charlie Hu
Publication date: 06/03/2020

CiteSeerX

When Network Matters: Data Center Scheduling with Network Tasks

Author: Giroire Frédéric
Huin Nicolas
Pérennes Stéphane
Tomassilli Andrea
Publication venue: HAL CCSD
Publication date: 29/04/2019

We consider the placement of jobs inside a data center. Traditionally, this is done by a task orchestrator without taking into account network constraints. According to recent studies, network transfers represent up to 50% of the completion time of classical jobs. Thus, network resources must be considered when placing jobs in a data center. In this paper, we propose a new scheduling framework, introducing network tasks that need to be executed on network machines alongside traditional (CPU) tasks. The model takes into account the competition between communications for the network resources, which is not considered in the formerly proposed scheduling models with communication. Network transfers inside a data center can be easily modeled in our framework. As we show, classical algorithms do not efficiently handle a limited amount of network bandwidth. We thus propose new provably efficient algorithms with the goal of minimizing the makespan in this framework. We show their efficiency and the importance of taking into consideration network capacity through extensive simulations on workflows built from Google data center traces.

Crossref

INRIA a CCSD electronic archive server