Journal of Xidian University ›› 2024, Vol. 51 ›› Issue (2): 13-27.doi: 10.19665/j.issn1001-2400.20231206

• Information and Communications Engineering • Previous Articles     Next Articles

Workflow deployment method based on graph segmentation with communication and computation jointly optimized

MA Yinghong1(), LIN Liwan1(), JIAO Yi2(), LI Qinyao3()   

  1. 1. School of Telecommunications Engineering,Xidian University,Xi’an 710071,China
    2. Runjian Stock,Xi’an 710075,China
    3. Center of Innovation and Application,China Academy of Aerospace Aerodynamics,Beijing 100074,China
  • Received:2022-11-29 Online:2024-04-20 Published:2024-01-04

Abstract:

For the purpose of improving computing efficiency,it becomes an important way for cloud data centers to deal with the continuous growth of computing and network tasks by decomposes complex large-scale tasks into simple tasks and modeling them into workflows,which are then completed by parallel distributed computing clusters.However,the communication bandwidth consumption caused by inter-task transmission can easily cause network congestion in data center.It is of great significance to deploy workflow scientifically,taking into account both computing efficiency and communication overhead.There are two typical types of workflow deployment algorithms:list-based workflow deployment algorithm and cluster-based workflow deployment algorithm.However,the former focuses on improving the computing efficiency while does not pay attention to the inter-task communication cost,so the deployment of large-scale workflow is easy to bring heavy network load.The latter focuses on minimizing the communication cost,but sacrifices the parallel computing efficiency of the tasks in the workflow,which results in a long workflow completion time.This work fully explores the dependency and parallelism between tasks in workflow,from the perspective of graph theory.By improving the classic graph segmentation algorithm,community discovery algorithm,the balance between minimizing communication cost and maximizing computation parallelism was achieved in the process of workflow task partitioning.Simulation results show that,under different workflow scales,the proposed algorithm reduces the communication cost by 35%~50%,compared with the typical list-based deployment algorithm,and the workflow completion time by 50%~65%,compared with the typical cluster-based deployment algorithm.Moreover,its performance has good stability for workflows with different communication-calculation ratios.

Key words: cloud computing, data center, workflow, task deployment, graph theory

CLC Number: 

  • TP393