[HTML][HTML] On challenges in machine learning model management

S Schelter, F Biessmann, T Januschowski, D Salinas… - 2015 - amazon.science
The training, maintenance, deployment, monitoring, organization and documentation of
machine learning (ML) models–in short model management–is a critical task in virtually all …

Towards linear algebra over normalized data

L Chen, A Kumar, J Naughton, JM Patel - arxiv preprint arxiv:1612.07448, 2016 - arxiv.org
Providing machine learning (ML) over relational data is a mainstream requirement for data
analytics systems. While almost all the ML tools require the input data to be presented as a …

An intermediate representation for optimizing machine learning pipelines

A Kunft, A Katsifodimos, S Schelter, S Breß… - Proceedings of the …, 2019 - dl.acm.org
Machine learning (ML) pipelines for model training and validation typically include
preprocessing, such as data cleaning and feature engineering, prior to training an ML …

LaraDB: A minimalist kernel for linear and relational algebra computation

D Hutchison, B Howe, D Suciu - Proceedings of the 4th ACM SIGMOD …, 2017 - dl.acm.org
Analytics tasks manipulate structured data with variants of relational algebra (RA) and
quantitative data with variants of linear algebra (LA). The two computational models have …

Automating and optimizing data-centric what-if analyses on native machine learning pipelines

S Grafberger, P Groth, S Schelter - … of the ACM on Management of Data, 2023 - dl.acm.org
Software systems that learn from data with machine learning (ML) are used in critical
decision-making processes. Unfortunately, real-world experience shows that the pipelines …

[PDF][PDF] SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning.

T Elgamal, S Luo, M Boehm, AV Evfimievski… - CIDR, 2017 - cidrdb.org
SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning
Page 1 © 2017 IBM Corporation SPOOF: Sum-Product Optimization and Operator Fusion for …

Architecting intermediate layers for efficient composition of data management and machine learning systems

S Abeysinghe, F Wang, G Essertel, T Rompf - arxiv preprint arxiv …, 2023 - arxiv.org
Modern data analytics workloads combine relational data processing with machine learning
(ML). Most DBMS handle these workloads by offloading these ML operations to external …

On the expressive power of linear algebra on graphs

F Geerts - Theory of Computing Systems, 2021 - Springer
There is a long tradition in understanding graphs by investigating their adjacency matrices
by means of linear algebra. Similarly, logic-based graph query languages are commonly …

On the expressive power of query languages for matrices

R Brijder, F Geerts, JVD Bussche… - ACM Transactions on …, 2019 - dl.acm.org
We investigate the expressive power of MATLANG, a formal language for matrix
manipulation based on common matrix operations and linear algebra. The language can be …

[PDF][PDF] Samsara: Declarative machine learning on distributed dataflow systems

S Schelter, A Palumbo, S Quinn, S Marthi… - NIPS Workshop …, 2016 - deem.berlin
We present Samsara, a domain-specific language for declarative machine learning in
cluster environments. Samsara allows its users to specify programs using a set of common …