Robust query driven cardinality estimation under changing workloads
Query driven cardinality estimation models learn from a historical log of queries. They are
lightweight, having low storage requirements, fast inference and training, and are easily …
lightweight, having low storage requirements, fast inference and training, and are easily …
Automatic database knob tuning: a survey
Knob tuning plays an important role in database optimization, which tunes knob settings to
optimize the database performance or improve resource utilization. However, there are …
optimize the database performance or improve resource utilization. However, there are …
FactorJoin: a new cardinality estimation framework for join queries
Cardinality estimation is one of the most fundamental and challenging problems in query
optimization. Neither classical nor learning-based methods yield satisfactory performance …
optimization. Neither classical nor learning-based methods yield satisfactory performance …
FLASH: Fast model adaptation in ML-centric cloud platforms
The emergence of ML in various cloud system management tasks (eg, workload autoscaling
and job scheduling) has become a core driver of ML-centric cloud platforms. However, there …
and job scheduling) has become a core driver of ML-centric cloud platforms. However, there …
A Comparative Study and Component Analysis of Query Plan Representation Techniques in ML4DB Studies
Query plan is widely used as input in machine learning for databases (ML4DB) research,
with query plan representation as a critical step. However, existing studies typically focus on …
with query plan representation as a critical step. However, existing studies typically focus on …
[PDF][PDF] ZeroTune: Learned Zero-Shot Cost Models for Parallelism Tuning in Stream Processing
This paper introduces ZeroTune, a novel cost model for parallel and distributed stream
processing that can be used to effectively set initial parallelism degrees of streaming …
processing that can be used to effectively set initial parallelism degrees of streaming …
Detect, distill and update: Learned DB systems facing out of distribution data
M Kurmanji, P Triantafillou - Proceedings of the ACM on Management of …, 2023 - dl.acm.org
Machine Learning (ML) is changing DBs as many DB components are being replaced by ML
models. One open problem in this setting is how to update such ML models in the presence …
models. One open problem in this setting is how to update such ML models in the presence …
Tuning machine learning to address process mining requirements
Machine learning models are routinely integrated into process mining pipelines to carry out
tasks like data transformation, noise reduction, anomaly detection, classification, and …
tasks like data transformation, noise reduction, anomaly detection, classification, and …
Zero-shot cost models for parallel stream processing
This paper addresses the challenge of predicting the level of parallelism in distributed
stream processing (DSP) systems, which are essential to deal with different high workload …
stream processing (DSP) systems, which are essential to deal with different high workload …
Robust and budget-constrained encoding configurations for in-memory database systems
M Boissier - Proceedings of the VLDB Endowment, 2021 - dl.acm.org
Data encoding has been applied to database systems for decades as it mitigates bandwidth
bottlenecks and reduces storage requirements. But even in the presence of these …
bottlenecks and reduces storage requirements. But even in the presence of these …