[HTML][HTML] The pipeline for the continuous development of artificial intelligence models—Current state of research and practice

M Steidl, M Felderer, R Ramler - Journal of Systems and Software, 2023 - Elsevier
Companies struggle to continuously develop and deploy Artificial Intelligence (AI) models to
complex production systems due to AI characteristics while assuring quality. To ease the …

Management of machine learning lifecycle artifacts: A survey

M Schlegel, KU Sattler - ACM SIGMOD Record, 2023 - dl.acm.org
The explorative and iterative nature of develo** and operating ML applications leads to a
variety of artifacts, such as datasets, features, models, hyperparameters, metrics, software …

Distributed deep learning on data systems: a comparative analysis of approaches

Y Zhang, F Mcquillan, N Jayaram, N Kak… - Proceedings of the …, 2021 - par.nsf.gov
Deep learning (DL) is growing in popularity for many data analytics applications, including
among enterprises. Large business-critical datasets in such settings typically reside in …

Spade: Synthesizing data quality assertions for large language model pipelines

S Shankar, H Li, P Asawa, M Hulsebos, Y Lin… - Proceedings of the …, 2024 - dl.acm.org
Large language models (LLMs) are being increasingly deployed as part of pipelines that
repeatedly process or generate data of some sort. However, a common barrier to …

An empirical study on ML DevOps adoption trends, efforts, and benefits analysis

DE Rzig, F Hassan, M Kessentini - Information and Software Technology, 2022 - Elsevier
Abstract Context: Machine Learning (ML), including Deep Learning (DL), based systems,
have become ubiquitous in today's solutions to many real-world problems. ML-based …

Building continuous integration services for machine learning

B Karlaš, M Interlandi, C Renggli, W Wu… - Proceedings of the 26th …, 2020 - dl.acm.org
Continuous integration (CI) has been a de facto standard for building industrial-strength
software. Yet, there is little attention towards applying CI to the development of machine …

Ease. ml: A lifecycle management system for machine learning

L Aguilar Melgar, D Dao, S Gan… - … of the Annual …, 2021 - research-collection.ethz.ch
We present Ease. ML, a lifecycle management system for machine learning (ML). Unlike
many existing works, which focus on improving individual steps during the lifecycle of ML …

[HTML][HTML] JENGA: a framework to study the impact of data errors on the predictions of machine learning models

S Schelter, T Rukat, F Biessmann - 2021 - amazon.science
Machine learning (ML) is increasingly used to automate decision making in various
domains. Almost all common ML models are susceptible to data errors in the serving data …

Spade: Synthesizing assertions for large language model pipelines

S Shankar, H Li, P Asawa, M Hulsebos, Y Lin… - arxiv preprint arxiv …, 2024 - arxiv.org
Operationalizing large language models (LLMs) for custom, repetitive data pipelines is
challenging, particularly due to their unpredictable and potentially catastrophic failures …

[PDF][PDF] Ease. ML: A lifecycle management system for MLDev and MLOps

L Aguilar, D Dao, S Gan, NM Gurel… - … on Innovative Data …, 2021 - vldb.org
We present Ease. ML, a lifecycle management system for machine learning (ML). Unlike
many existing works, which focus on improving individual steps during the lifecycle of ML …