Accelerating human-in-the-loop machine learning: Challenges and opportunities

D **n, L Ma, J Liu, S Macke, S Song… - Proceedings of the …, 2018 - dl.acm.org
Development of machine learning (ML) workflows is a tedious process of iterative
experimentation: developers repeatedly make changes to workflows until the desired …

Data management in machine learning: Challenges, techniques, and systems

A Kumar, M Boehm, J Yang - Proceedings of the 2017 ACM International …, 2017 - dl.acm.org
Large-scale data analytics using statistical machine learning (ML), popularly called
advanced analytics, underpins many modern data-driven applications. The data …

Data validation for machine learning

N Polyzotis, M Zinkevich, S Roy… - … of machine learning …, 2019 - proceedings.mlsys.org
Abstract Machine learning is a powerful tool for gleaning knowledge from massive amounts
of data. While a great deal of machine learning research has focused on improving the …

Towards demystifying serverless machine learning training

J Jiang, S Gan, Y Liu, F Wang, G Alonso… - Proceedings of the …, 2021 - dl.acm.org
The appeal of serverless (FaaS) has triggered a growing interest on how to use it in data-
intensive applications such as ETL, query processing, or machine learning (ML). Several …

Tfx: A tensorflow-based production-scale machine learning platform

D Baylor, E Breck, HT Cheng, N Fiedel… - Proceedings of the 23rd …, 2017 - dl.acm.org
Creating and maintaining a platform for reliably producing and deploying machine learning
models requires careful orchestration of many components---a learner for generating …

Automating large-scale data quality verification

S Schelter, D Lange, P Schmidt, M Celikel… - Proceedings of the …, 2018 - dl.acm.org
Modern companies and institutions rely on data to guide every single business process and
decision. Missing or incorrect information seriously compromises any decision process …

Data lifecycle challenges in production machine learning: a survey

N Polyzotis, S Roy, SE Whang, M Zinkevich - ACM Sigmod Record, 2018 - dl.acm.org
Machine learning has become an essential tool for gleaning knowledge from data and
tackling a diverse set of computationally hard tasks. However, the accuracy of a machine …

The art and practice of data science pipelines: A comprehensive study of data science pipelines in theory, in-the-small, and in-the-large

S Biswas, M Wardat, H Rajan - … of the 44th International Conference on …, 2022 - dl.acm.org
Increasingly larger number of software systems today are including data science
components for descriptive, predictive, and prescriptive analytics. The collection of data …

[HTML][HTML] On challenges in machine learning model management

S Schelter, F Biessmann, T Januschowski, D Salinas… - 2015 - amazon.science
The training, maintenance, deployment, monitoring, organization and documentation of
machine learning (ML) models–in short model management–is a critical task in virtually all …

Probabilistic demand forecasting at scale

JH Böse, V Flunkert, J Gasthaus… - Proceedings of the …, 2017 - dl.acm.org
We present a platform built on large-scale, data-centric machine learning (ML) approaches,
whose particular focus is demand forecasting in retail. At its core, this platform enables the …