WfBench: Automated Generation of Scientific Workflow Benchmarks

Coleman, Tainã; Casanova, Henri; Maheshwari, Ketan; Pottier, Loïc; Wilkinson, Sean R.; Wozniak, Justin; Suter, Frédéric; Shankar, Mallikarjun; da Silva, Rafael Ferreira

doi:10.1109/PMBS56514.2022.00014

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:2210.03170 (cs)

[Submitted on 6 Oct 2022]

Title:WfBench: Automated Generation of Scientific Workflow Benchmarks

Authors:Tainã Coleman, Henri Casanova, Ketan Maheshwari, Loïc Pottier, Sean R. Wilkinson, Justin Wozniak, Frédéric Suter, Mallikarjun Shankar, Rafael Ferreira da Silva

View PDF

Abstract:The prevalence of scientific workflows with high computational demands calls for their execution on various distributed computing platforms, including large-scale leadership-class high-performance computing (HPC) clusters. To handle the deployment, monitoring, and optimization of workflow executions, many workflow systems have been developed over the past decade. There is a need for workflow benchmarks that can be used to evaluate the performance of workflow systems on current and future software stacks and hardware platforms.
We present a generator of realistic workflow benchmark specifications that can be translated into benchmark code to be executed with current workflow systems. Our approach generates workflow tasks with arbitrary performance characteristics (CPU, memory, and I/O usage) and with realistic task dependency structures based on those seen in production workflows. We present experimental results that show that our approach generates benchmarks that are representative of production workflows, and conduct a case study to demonstrate the use and usefulness of our generated benchmarks to evaluate the performance of workflow systems under different configuration scenarios.

Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC)
Cite as:	arXiv:2210.03170 [cs.DC]
	(or arXiv:2210.03170v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.2210.03170
Related DOI:	https://doi.org/10.1109/PMBS56514.2022.00014

Submission history

From: Rafael Ferreira Da Silva [view email]
[v1] Thu, 6 Oct 2022 19:22:06 UTC (1,233 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:WfBench: Automated Generation of Scientific Workflow Benchmarks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:WfBench: Automated Generation of Scientific Workflow Benchmarks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators