Pegasus in the cloud: Science automation through workflow technologies

E Deelman, K Vahi, M Rynge, G Juve… - IEEE Internet …, 2016 - ieeexplore.ieee.org
The Pegasus Workflow Management System maps abstract, resource-independent workflow
descriptions onto distributed computing resources. As a result of this planning process …

Towards Reproducibility in Scientific Workflows: An Infrastructure‐Based Approach

I Santana-Perez… - Scientific …, 2015 - Wiley Online Library
It is commonly agreed that in silico scientific experiments should be executable and
repeatable processes. Most of the current approaches for computational experiment …

Reproducibility of execution environments in computational science using semantics and clouds

I Santana-Perez, RF da Silva, M Rynge… - Future Generation …, 2017 - Elsevier
In the past decades, one of the most common forms of addressing reproducibility in scientific
workflow-based computational science has consisted of tracking the provenance of the …

Transparent deployment of scientific workflows across clouds-kubernetes approach

M Orzechowski, B Balis, K Pawlik… - 2018 IEEE/ACM …, 2018 - ieeexplore.ieee.org
We present an end-to-end solution for automation of scientific workflow deployment and
execution on distributed computing infrastructures. The solution integrates de-facto standard …

Asterism: Pegasus and dispel4py hybrid workflows for data-intensive science

R Filgueira, RF Da Silva, A Krause… - … Workshop on Data …, 2016 - ieeexplore.ieee.org
We present Asterism, an open source data-intensive framework, which combines the
strengths of traditional workflow management systems with new parallel stream-based …

Reproducible and portable big data analytics in the cloud

X Wang, P Guo, X Li, A Gangopadhyay… - … on Cloud Computing, 2023 - ieeexplore.ieee.org
Cloud computing has become a major approach to help reproduce computational
experiments. Yet there are still two main difficulties in reproducing batch based Big Data …

Reproducibility of computational experiments on kubernetes-managed container clouds with hyperflow

M Orzechowski, B Baliś, RG Słota, J Kitowski - … Science–ICCS 2020: 20th …, 2020 - Springer
We propose a comprehensive solution for reproducibility of scientific workflows. We focus
particularly on Kubernetes-managed container clouds, increasingly important in scientific …

A semantic-based approach to attain reproducibility of computational environments in scientific workflows: A case study

I Santana-Perez, R Ferreira da Silva, M Rynge… - Euro-Par 2014: Parallel …, 2014 - Springer
Reproducible research in scientific workflows is often addressed by tracking the provenance
of the produced results. While this approach allows inspecting intermediate and final results …

Cloud infrastructure provenance collection and management to reproduce scientific workflows execution

K Hasham, K Munir, R McClatchey - Future Generation Computer Systems, 2018 - Elsevier
The emergence of Cloud computing provides a new computing paradigm for scientific
workflow execution. It provides dynamic, on-demand and scalable resources that enable the …

The challenge of reproducible ml: an empirical study on the impact of bugs

E Rivera-Landos, F Khomh… - 2021 IEEE 21st …, 2021 - ieeexplore.ieee.org
Reproducibility is a crucial requirement in scientific research. When results of research
studies and scientific papers have been found difficult or impossible to reproduce, we face a …