Provenance data in the machine learning lifecycle in computational science and engineering

R Souza, L Azevedo, V Lourenço… - 2019 IEEE/ACM …, 2019 - ieeexplore.ieee.org
Machine Learning (ML) has become essential in several industries. In Computational
Science and Engineering (CSE), the complexity of the ML lifecycle comes from the large …

Efficient runtime capture of multiworkflow data using provenance

R Souza, L Azevedo, R Thiago… - 2019 15th …, 2019 - ieeexplore.ieee.org
Computational Science and Engineering (CSE) projects are typically developed by
multidisciplinary teams. Despite being part of the same project, each team manages its own …

[HTML][HTML] Dfanalyzer: runtime dataflow analysis tool for computational science and engineering applications

V Silva, V Campos, T Guedes, J Camata, D de Oliveira… - SoftwareX, 2020 - Elsevier
DfAnalyzer is a tool for monitoring, debugging, and analyzing dataflows generated by
Computational Science and Engineering (CSE) applications. It collects strategic raw data …

Towards optimizing the execution of spark scientific workflows using machine learning‐based parameter tuning

D de Oliveira, F Porto, C Boeres… - Concurrency and …, 2021 - Wiley Online Library
In the last few years, Apache Spark has become a de facto the standard framework for big
data systems on both industry and academy projects. Spark is used to execute compute‐and …

Kee** track of user steering actions in dynamic workflows

R Souza, V Silva, JJ Camata, ALGA Coutinho… - Future Generation …, 2019 - Elsevier
In long-lasting scientific workflow executions in HPC machines, computational scientists (the
users in this work) often need to fine-tune several workflow parameters. These tunings are …

Integrating provenance capture and UML with UML2PROV: Principles and experience

C Sáenz-Adán, B Pérez… - IEEE Transactions …, 2020 - ieeexplore.ieee.org
In response to the increasing calls for algorithmic accountability, UML2PROV is a novel
approach to address the existing gap between application design, where models are …

Machine Learning-assisted Computational Steering of Large-scale Scientific Simulations

W Liu, Q Ye, CQ Wu, Y Liu, X Zhou… - 2021 IEEE Intl Conf on …, 2021 - ieeexplore.ieee.org
Next-generation scientific applications in various fields are experiencing a rapid transition
from traditional experiment-based methodologies to large-scale computation-intensive …

[PDF][PDF] Captura de dados de proveniência para apoiar a análise de hiperparâmetros em redes de aprendizado profundo

D Pina - 2020 - cos.ufrj.br
Algoritmos de aprendizado de máquina pertencem a uma área em Inteligência Artificial que,
com base em análises sobre grandes conjuntos de dados,“aprendem” seu comportamento …

[PDF][PDF] Supporting User Steering In Large-Scale Workflows With Provenance Data

R Souza - 2019 - researchgate.net
How to enable computational scientists and engineers to monitor and understand their
experiments when they are steering them on large-scale computers? This is the central …

User Steering Support in Large-scale Workflows

R Souza, M Mattoso, P Valduriez - Anais Estendidos do XXXVI …, 2021 - sol.sbc.org.br
Large-scale workflows that execute on High-Performance Computing machines need to be
dynamically steered by users. This means that users analyze big data files, assess key …