Scientific workflows are critical to scientific data analysis and often involve computationally intensive processing of large datasets on compute clusters. As such, their execution tends to be long-running and resource-intensive, resulting in significant energy consumption and carbon emissions. While carbon-aware computing methods have received considerable attention in general cloud contexts, their application to scientific data analysis workflows remains a critical research gap. Our study addresses this oversight by showing how the delay tolerance, interruptibility, and scalability of scientific workflows can be leveraged for a significantly more sustainable execution model. In this study, we first quantify the problem of carbon emissions associated with running scientific workflows, and then demonstrate the transformative potential for carbon-aware workflow execution. We estimate the carbon footprint of seven real-world Nextflow workflows executed on diverse dedicated cluster and public cloud resources using high-resolution average and marginal grid carbon intensity data from open and commercial data providers. Furthermore, we conduct a systematic evaluation of the impact of carbon-aware temporal shifting, and the dynamic pausing and resuming of the workflow. Moreover, we investigate the impact of resource scaling at both workflow and workflow task levels. Finally, we report substantial potential reductions in overall carbon emissions, with temporal shifting capable of decreasing emissions by over 80%, and resource scaling by 67%.
翻译:科学工作流对科学数据分析至关重要,通常涉及在计算集群上对大规模数据集进行计算密集型处理。因此,其执行过程往往耗时较长且资源密集,导致显著的能源消耗和碳排放。尽管碳感知计算方法在通用云环境中已受到相当多的关注,但其在科学数据分析工作流中的应用仍是一个关键的研究空白。我们的研究通过展示如何利用科学工作流的延迟容忍性、可中断性和可扩展性来实现显著更可持续的执行模型,从而弥补了这一疏忽。在本研究中,我们首先量化了运行科学工作流相关的碳排放问题,然后论证了碳感知工作流执行的变革潜力。我们使用来自公开和商业数据提供商的高分辨率平均及边际电网碳强度数据,估算了七个真实世界Nextflow工作流在不同专用集群和公共云资源上执行时的碳足迹。此外,我们对碳感知时间迁移、工作流的动态暂停与恢复的影响进行了系统评估。同时,我们研究了在工作流层面和工作流任务层面进行资源扩展的影响。最后,我们报告了在总体碳排放方面存在的巨大减排潜力:时间迁移能够减少超过80%的排放,而资源扩展可减少67%的排放。