TY - GEN
T1 - Cloud-native workflow scheduling using a hybrid priority rule and dynamic task parallelism
AU - Shin, Jungeun
AU - Arroyo, Diana
AU - Tantawi, Asser
AU - Wang, Chen
AU - Youssef, Alaa
AU - Nagi, Rakesh
N1 - Publisher Copyright:
© 2022 Owner/Author.
PY - 2022/11/7
Y1 - 2022/11/7
N2 - Demand for efficient cloud-native workflow scheduling is growing as many data science workloads are composed of several tasks with dependencies. As container technology becomes more prevalent in cloud communities, containerized workflow orchestration tools are introduced and become standard for scheduling workflows. However, current schedulers use simple heuristics and rely on the user's choice on priority and parallelism level of tasks without accounting for workflow-specific information. We introduce a workflow-aware scheduling algorithm that uses workflow information for scheduling tasks, without user input, with an objective of improving resource utilization and minimizing weighted workflow completion time, duration multiplied by user specific workflow priority. Our scheduler comprises of two strategies, a hybrid priority rule inspired by production planning ideas, and a task splitting rule based on a convex task processing time curve for the parallelism level. Using simulation, we demonstrate that our algorithm (1) produces an efficient balance of weighted workflow completion time and resource utilization and (2) outperforms deterministic parallelism.
AB - Demand for efficient cloud-native workflow scheduling is growing as many data science workloads are composed of several tasks with dependencies. As container technology becomes more prevalent in cloud communities, containerized workflow orchestration tools are introduced and become standard for scheduling workflows. However, current schedulers use simple heuristics and rely on the user's choice on priority and parallelism level of tasks without accounting for workflow-specific information. We introduce a workflow-aware scheduling algorithm that uses workflow information for scheduling tasks, without user input, with an objective of improving resource utilization and minimizing weighted workflow completion time, duration multiplied by user specific workflow priority. Our scheduler comprises of two strategies, a hybrid priority rule inspired by production planning ideas, and a task splitting rule based on a convex task processing time curve for the parallelism level. Using simulation, we demonstrate that our algorithm (1) produces an efficient balance of weighted workflow completion time and resource utilization and (2) outperforms deterministic parallelism.
UR - http://www.scopus.com/inward/record.url?scp=85143256946&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85143256946&partnerID=8YFLogxK
U2 - 10.1145/3542929.3563495
DO - 10.1145/3542929.3563495
M3 - Conference contribution
AN - SCOPUS:85143256946
T3 - SoCC 2022 - Proceedings of the 13th Symposium on Cloud Computing
SP - 72
EP - 77
BT - SoCC 2022 - Proceedings of the 13th Symposium on Cloud Computing
PB - Association for Computing Machinery
T2 - 13th Annual ACM Symposium on Cloud Computing, SoCC 2022
Y2 - 7 November 2022 through 11 November 2022
ER -