Reproducibility in scientific computing

P Ivie, D Thain - ACM Computing Surveys (CSUR), 2018 - dl.acm.org
Reproducibility is widely considered to be an essential requirement of the scientific process.
However, a number of serious concerns have been raised recently, questioning whether …

Scientific workflows: moving across paradigms

CS Liew, MP Atkinson, M Galea, TF Ang… - ACM Computing …, 2016 - dl.acm.org
Modern scientific collaborations have opened up the opportunity to solve complex problems
that require both multidisciplinary expertise and large-scale computational experiments …

Clowder: Open source data management for long tail data

L Marini, I Gutierrez-Polo, R Kooper… - Proceedings of the …, 2018 - dl.acm.org
Clowder is an open source data management system to support data curation of long tail
data and metadata across multiple research domains and diverse data types. Institutions …

Social science data repositories in data deluge: A case study of ICPSR's workflow and practices

W Jeng, D He, Y Chi - The Electronic Library, 2017 - emerald.com
Purpose Owing to the recent surge of interest in the age of the data deluge, the importance
of researching data infrastructures is increasing. The open archival information system …

Brown dog: Leveraging everything towards autocuration

S Padhy, G Jansen, J Alameda, E Black… - … Conference on Big …, 2015 - ieeexplore.ieee.org
We present Brown Dog, two highly extensible services that aim to leverage any existing
pieces of code, libraries, services, or standalone software (past or present) towards …

Big provenance stream processing for data intensive computations

I Suriarachchi, S Withana, B Plale - 2018 IEEE 14th …, 2018 - ieeexplore.ieee.org
In the business and research landscape of today, data analysis consumes public and
proprietary data from numerous sources, and utilizes any one or more of popular data …

Automatic glomerulus extraction in whole slide images towards computer aided diagnosis

Y Zhao, EF Black, L Marini, K McHenry… - 2016 IEEE 12th …, 2016 - ieeexplore.ieee.org
Renal biopsies form the gold standard of diagnostic and prognostic assessments of renal
transplants. With the addition of new quantitative strategies to supplement renal biopsy …

Advancing distributed data management for the HydroShare hydrologic information system

H Yi, R Idaszak, M Stealey, C Calloway… - … Modelling & Software, 2018 - Elsevier
Abstract HydroShare (https://www. hydroshare. org) is an online collaborative system to
support the open sharing of hydrologic data, analytical tools, and computer models …

Prune: A preserving run environment for reproducible scientific computing

P Ivie, D Thain - 2016 IEEE 12th International Conference on e …, 2016 - ieeexplore.ieee.org
Computing as a whole suffers from a crisis of reproducibility. Programs executed in one
context are astonishingly hard to reproduce in another context, resulting in wasted effort by …

Server‐side workflow execution using data grid technology for reproducible analyses of data‐intensive hydrologic systems

BT Essawy, JL Goodall, H Xu… - Earth and Space …, 2016 - Wiley Online Library
Many geoscience disciplines utilize complex computational models for advancing
understanding and sustainable management of Earth systems. Executing such models and …