Provenance and scientific workflows: challenges and opportunities

SB Davidson, J Freire - Proceedings of the 2008 ACM SIGMOD …, 2008 - dl.acm.org
Provenance in the context of workflows, both for the data they derive and for their
specification, is an essential component to allow for result reproducibility, sharing, and …

The foundations for provenance on the web

L Moreau - Foundations and Trends® in Web Science, 2010 - nowpublishers.com
Provenance, ie, the origin or source of something, is becoming an important concern, since it
offers the means to verify data products, to infer their quality, to analyse the processes that …

Curated databases

P Buneman, J Cheney, WC Tan… - Proceedings of the twenty …, 2008 - dl.acm.org
Curated databases are databases that are populated and updated with a great deal of
human effort. Most reference works that one traditionally found on the reference shelves of …

Approximate lineage for probabilistic databases

C Ré, D Suciu - Proceedings of the VLDB Endowment, 2008 - dl.acm.org
In probabilistic databases, lineage is fundamental to both query processing and
understanding the data. Current systems sa Trio or Mystiq use a complete approach in …

Prime: A methodology for develo** provenance-aware applications

S Miles, P Groth, S Munroe, L Moreau - ACM Transactions on Software …, 2011 - dl.acm.org
Provenance refers to the past processes that brought about a given (version of an) object,
item or entity. By knowing the provenance of data, users can often better understand, trust …

Fine-grained and efficient lineage querying of collection-based workflow provenance

P Missier, NW Paton, K Belhajjame - Proceedings of the 13th …, 2010 - dl.acm.org
The management and querying of workflow provenance data underpins a collection of
activities, including the analysis of workflow results, and the debugging of workflows or …

Data lineage model for Taverna workflows with lightweight annotation requirements

P Missier, K Belhajjame, J Zhao, M Roos… - … and Annotation of Data …, 2008 - Springer
The provenance, or lineage, of a workflow data product can be reconstructed by kee** a
complete trace of workflow execution. This lineage information, however, is likely to be both …

Linking multiple workflow provenance traces for interoperable collaborative science

P Missier, B Ludäscher, S Bowers… - The 5th workshop on …, 2010 - ieeexplore.ieee.org
Scientific collaboration increasingly involves data sharing between separate groups. We
consider a scenario where data products of scientific workflows are published and then used …

InfraPhenoGrid: a scientific workflow infrastructure for plant phenomics on the grid

C Pradal, S Artzet, J Chopard, D Dupuis… - Future Generation …, 2017 - Elsevier
Plant phenoty** consists in the observation of physical and biochemical traits of plant
genotypes in response to environmental conditions. Challenges, in particular in context of …

LIVE: a lineage-supported versioned DBMS

A Das Sarma, M Theobald, J Widom - International Conference on …, 2010 - Springer
This paper presents LIVE, a complete DBMS designed for applications with many stored
derived relations, and with a need for simple versioning capabilities when base data is …