Robust query driven cardinality estimation under changing workloads

P Negi, Z Wu, A Kipf, N Tatbul, R Marcus… - Proceedings of the …, 2023 - dl.acm.org
Query driven cardinality estimation models learn from a historical log of queries. They are
lightweight, having low storage requirements, fast inference and training, and are easily …

Automatic database knob tuning: a survey

X Zhao, X Zhou, G Li - IEEE Transactions on Knowledge and …, 2023 - ieeexplore.ieee.org
Knob tuning plays an important role in database optimization, which tunes knob settings to
optimize the database performance or improve resource utilization. However, there are …

FactorJoin: a new cardinality estimation framework for join queries

Z Wu, P Negi, M Alizadeh, T Kraska… - Proceedings of the ACM …, 2023 - dl.acm.org
Cardinality estimation is one of the most fundamental and challenging problems in query
optimization. Neither classical nor learning-based methods yield satisfactory performance …

FLASH: Fast model adaptation in ML-centric cloud platforms

H Qiu, W Mao, A Patke, S Cui, C Wang… - Proceedings of …, 2024 - proceedings.mlsys.org
The emergence of ML in various cloud system management tasks (eg, workload autoscaling
and job scheduling) has become a core driver of ML-centric cloud platforms. However, there …

A Comparative Study and Component Analysis of Query Plan Representation Techniques in ML4DB Studies

Y Zhao, Z Li, G Cong - Proceedings of the VLDB Endowment, 2023 - dl.acm.org
Query plan is widely used as input in machine learning for databases (ML4DB) research,
with query plan representation as a critical step. However, existing studies typically focus on …

[PDF][PDF] ZeroTune: Learned Zero-Shot Cost Models for Parallelism Tuning in Stream Processing

P Agnihotri, B Koldehofe, P Stiegele, R Heinrich… - ICDE …, 2024 - kom.tu-darmstadt.de
This paper introduces ZeroTune, a novel cost model for parallel and distributed stream
processing that can be used to effectively set initial parallelism degrees of streaming …

Detect, distill and update: Learned DB systems facing out of distribution data

M Kurmanji, P Triantafillou - Proceedings of the ACM on Management of …, 2023 - dl.acm.org
Machine Learning (ML) is changing DBs as many DB components are being replaced by ML
models. One open problem in this setting is how to update such ML models in the presence …

Tuning machine learning to address process mining requirements

P Ceravolo, SB Junior, E Damiani… - IEEE Access, 2024 - ieeexplore.ieee.org
Machine learning models are routinely integrated into process mining pipelines to carry out
tasks like data transformation, noise reduction, anomaly detection, classification, and …

Zero-shot cost models for parallel stream processing

P Agnihotri, B Koldehofe, C Binnig… - Proceedings of the Sixth …, 2023 - dl.acm.org
This paper addresses the challenge of predicting the level of parallelism in distributed
stream processing (DSP) systems, which are essential to deal with different high workload …

Robust and budget-constrained encoding configurations for in-memory database systems

M Boissier - Proceedings of the VLDB Endowment, 2021 - dl.acm.org
Data encoding has been applied to database systems for decades as it mitigates bandwidth
bottlenecks and reduces storage requirements. But even in the presence of these …