2024-10-16
2024-08-20
2024-07-22
Abstract—Cloud computing is a paradigm shift in service delivery that promises a leap in efficiency and flexibility in using computing resources. As cloud infrastructures are widely deployed around the globe, many data- and computeintensive scientific workflows have been moved from traditional high-performance computing platforms and grids to clouds. With the rapidly increasing number of cloud users in various science domains, it has become a critical task for the cloud service provider to perform efficient job scheduling while still guaranteeing the workflow completion time as specified in the Service Level Agreement (SLA). Based on practical models for cloud utilization, we formulate a delay-constrained workflow optimization problem to maximize resource utilization for high system throughput and propose a two-step scheduling algorithm to minimize the cloud overhead under a user-specified execution time bound. Extensive simulation results illustrate that the proposed algorithm achieves lower computing overhead or higher resource utilization than existing methods under the execution time bound, and also significantly reduces the total workflow execution time by strategically selecting appropriate mapping nodes for prioritized modules. Index Terms—Scientific workflow, workflow scheduling, cloud computing Cite: Michelle M. Zhu, Fei Cao, and Chase Q. Wu, "High-Throughput Scientific Workflow Scheduling under Deadline Constraint in Clouds," Journal of Communications, vol. 9, no. 4, pp. 312-321, 2014. Doi: 10.12720/jcm.9.4.312-321