How do users design scientific workflows? The Case of Snakemake
Sebastian Pohl, Nourhan Elfaramawy, Kedi Cao, Birte Kehr, Matthias Weidlich
arXiv - CS - Other Computer Science, Pub Date: 2023-09-25, DOI: arxiv-2309.14097
Scientific workflows automate the analysis of large-scale scientific data,
fostering the reuse of data processing operators as well as the reproducibility
and traceability of analysis results. In exploratory research, however,
workflows are continuously adapted, utilizing a wide range of tools and
software libraries, to test scientific hypotheses. Script-based workflow
engines cater to the required flexibility through direct integration of
programming primitives but lack abstractions for interactive exploration of the
workflow design by a user during workflow execution. To derive requirements for
such interactive workflows, we conduct an empirical study on the use of
Snakemake, a popular Python-based workflow engine. Based on workflows collected
from 1602 GitHub repositories, we present insights on common structures of
Snakemake workflows, as well as the language features typically adopted in
their specification.
Updated: 2023-09-26