A survey on provenance: What for? What form? What from?

M Herschel, R Diestelkämper, H Ben Lahmar - The VLDB Journal, 2017 - Springer
Provenance refers to any information describing the production process of an end product,
which can be anything from a piece of digital data to a physical object. While this survey …

A survey on collecting, managing, and analyzing provenance from scripts

JF Pimentel, J Freire, L Murta… - ACM Computing Surveys …, 2019 - dl.acm.org
Scripts are widely used to design and run scientific experiments. Scripting languages are
easy to learn and use, and they allow complex tasks to be specified and executed in fewer …

Practical whole-system provenance capture

T Pasquier, X Han, M Goldstein, T Moyer… - Proceedings of the …, 2017 - dl.acm.org
Data provenance describes how data came to be in its present form. It includes data sources
and the transformations that have been applied to them. Data provenance has many uses …

ReproZip: Computational reproducibility with ease

F Chirigati, R Rampin, D Shasha, J Freire - Proceedings of the 2016 …, 2016 - dl.acm.org
We present ReproZip, the recommended packaging tool for the SIGMOD Reproducibility
Review. ReproZip was designed to simplify the process of making an existing computational …

YesWorkflow: a user-oriented, language-independent tool for recovering workflow information from scripts

T McPhillips, T Song, T Kolisnik, S Aulenbach… - arXiv preprint arXiv …, 2015 - arxiv.org
Scientific workflow management systems offer features for composing complex
computational pipelines from modular building blocks, for executing the resulting automated …

Improving reproducibility of data science pipelines through transparent provenance capture

L Rupprecht, JC Davis, C Arnold, Y Gur… - Proceedings of the …, 2020 - dl.acm.org
Data science has become prevalent in a large variety of domains. Inherent in its practice is
an exploratory, probing, and fact finding journey, which consists of the assembly, adaptation …

Understanding experiments and research practices for reproducibility: an exploratory study

S Samuel, B König-Ries - PeerJ, 2021 - peerj.com
Scientific experiments and research practices vary across disciplines. The research
practices followed by scientists in each domain play an essential role in the …

Sustainable computational science: the ReScience initiative

NP Rougier, K Hinsen, F Alexandre, T Arildsen… - PeerJ Computer …, 2017 - peerj.com
Licence This is an open access article distributed under the terms of the Creative Commons
Attribution License, which permits unrestricted use, distribution, reproduction and adaptation …

Provenance in collaborative in silico scientific research: A survey

E Jandre, B Diirr, V Braganholo - ACM SIGMOD Record, 2020 - dl.acm.org
Science is a collaborative activity by definition. Research is usually conducted by several
scientists working together, and this behavior has been intensified in recent years …

ProvDB: Lifecycle management of collaborative analysis workflows

H Miao, A Chavan, A Deshpande - Proceedings of the 2nd Workshop on …, 2017 - dl.acm.org
As data-driven methods are becoming pervasive in a wide variety of disciplines, there is an
urgent need to develop scalable and sustainable tools to simplify the process of data …