The rising popularity of computational workflows is driven by the need for repetitive and scalable data processing, sharing of processing know-how, and transparent methods. As both combined records of analysis and descriptions of processing steps, workflows should be reproducible, reusable, adaptable, and available. Workflow sharing presents opportunities to reduce unnecessary reinvention, promote reuse, increase access to best practice analyses for non-experts, and increase productivity. In reality, workflows are scattered and difficult to find, in part due to the diversity of available workflow engines and ecosystems, and because workflow sharing is not yet part of research practice. WorkflowHub provides a unified registry for all computational workflows that links to community repositories, and supports both the workflow lifecycle and making workflows findable, accessible, interoperable, and reusable (FAIR). By interoperating with diverse platforms, services, and external registries, WorkflowHub adds value by supporting workflow sharing, explicitly assigning credit, enhancing FAIRness, and promoting workflows as scholarly artefacts. The registry has a global reach, with hundreds of research organisations involved, and more than 700 workflows registered.
翻译:计算工作流的日益普及,源于对可重复且可扩展的数据处理、处理专业知识共享以及方法透明化的需求。作为分析过程的综合记录与处理步骤的描述,工作流应具备可重现性、可重用性、可适应性和可获取性。工作流共享为减少不必要的重复开发、促进重用、为非专家提供最佳实践分析方法的访问途径以及提升科研效率创造了机遇。然而现实中,工作流往往分散各处且难以发现,部分原因在于现有工作流引擎与生态系统的多样性,以及工作流共享尚未成为常规研究实践。WorkflowHub为所有计算工作流提供了一个统一的注册中心,该中心与社区存储库相连接,既支持工作流的全生命周期管理,也致力于实现工作流的可发现、可访问、可互操作和可重用(FAIR原则)。通过与多样化平台、服务及外部注册中心实现互操作,WorkflowHub在支持工作流共享、明确分配学术贡献、提升FAIR化程度以及推动工作流作为学术成果认可等方面创造重要价值。该注册中心已覆盖全球范围,吸引了数百家研究机构参与,并收录了超过700个注册工作流。