Towards defining a trustworthy artificial intelligence system development maturity model

SD Das, PK Bala, AN Mishra - Journal of Computer Information …, 2024 - Taylor & Francis
The trustworthiness of artificial intelligence (AI) has been challenged for quite some time.
The AI results will be trustworthy and reliable if trust-related principles are built into the AI …

Deep learning provenance data integration: a practical approach

D Pina, A Chapman, D De Oliveira… - … Proceedings of the ACM …, 2023 - dl.acm.org
A Deep Learning (DL) life cycle involves several data transformations, such as performing
data pre-processing, defining datasets to train and test a deep neural network (DNN), and …

Hyppo: using equivalences to optimize pipelines in exploratory machine learning

A Kontaxakis, D Sacharidis, A Simitsis… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
We present HYPPO, a novel system to optimize pipelines encountered in exploratory
machine learning. HYPPO exploits alternative computational paths of artifacts from past …

Metadata representations for queryable repositories of machine learning models

Z Li, H Kant, R Hai, A Katsifodimos, M Brambilla… - IEEE …, 2023 - ieeexplore.ieee.org
Machine learning (ML) practitioners and organizations are building model repositories of pre-
trained models, referred to as model zoos. These model zoos contain metadata describing …

MLflow2PROV: extracting provenance from machine learning experiments

M Schlegel, KU Sattler - Proceedings of the Seventh Workshop on Data …, 2023 - dl.acm.org
Supporting iterative and explorative workflows for develo** machine learning (ML)
models, ML experiment management systems (ML EMSs), such as MLflow, are increasingly …

Efficient Multi-Task Large Model Training via Data Heterogeneity-aware Model Management

Y Wang, S Zhu, F Fu, X Miao, J Zhang, J Zhu… - arxiv preprint arxiv …, 2024 - arxiv.org
Recent foundation models are capable of handling multiple machine learning (ML) tasks
and multiple data modalities with the unified base model structure and several specialized …

Don't Push the Button! Exploring Data Leakage Risks in Machine Learning and Transfer Learning

A Apicella, F Isgrò, R Prevete - arxiv preprint arxiv:2401.13796, 2024 - arxiv.org
Machine Learning (ML) has revolutionized various domains, offering predictive capabilities
in several areas. However, with the increasing accessibility of ML tools, many practitioners …

Extracting Provenance of Machine Learning Experiment Pipeline Artifacts

M Schlegel, KU Sattler - European Conference on Advances in Databases …, 2023 - Springer
Experiment management systems (EMSs), such as MLflow, are increasingly used to
streamline the collection and management of machine learning (ML) artifacts in iterative and …

An empirical study of challenges in machine learning asset management

Z Zhao, Y Chen, AA Bangash, B Adams… - Empirical Software …, 2024 - Springer
Context: In machine learning (ML) applications, assets include not only the ML models
themselves, but also the datasets, algorithms, and deployment tools that are essential in the …

Machine learning experiment management tools: a mixed-methods empirical study

S Idowu, O Osman, D Strüber, T Berger - Empirical Software Engineering, 2024 - Springer
Abstract Machine Learning (ML) experiment management tools support ML practitioners and
software engineers when building intelligent software systems. By managing large numbers …