Learnedsqlgen: Constraint-aware sql generation using reinforcement learning

L Zhang, C Chai, X Zhou, G Li - … of the 2022 International Conference on …, 2022 - dl.acm.org
Many database optimization problems, eg, slow SQL diagnosis, database testing, optimizer
tuning, require a large volume of SQL queries. Due to privacy issues, it is hard to obtain real …

Comprehensive and efficient workload compression

S Deep, A Gruenheid, P Koutris, J Naughton… - arxiv preprint arxiv …, 2020 - arxiv.org
This work studies the problem of constructing a representative workload from a given input
analytical query workload where the former serves as an approximation with guarantees of …

Optimal resource allocation for serverless queries

A Pimpley, S Li, A Srivastava, V Rohra, Y Zhu… - arxiv preprint arxiv …, 2021 - arxiv.org
Optimizing resource allocation for analytical workloads is vital for reducing costs of cloud-
data services. At the same time, it is incredibly hard for users to allocate resources per query …

[PDF][PDF] Towards Optimal Resource Allocation for Big Data Analytics.

A Pimpley, S Li, R Sen, S Srinivasan, A **dal - EDBT, 2022 - openproceedings.org
Optimizing resource allocation for analytical workloads is vital for reducing operational costs
in modern cloud-oriented query processing services. At the same time, it is incredibly hard …

Doppler: automated SKU recommendation in migrating SQL workloads to the cloud

J Cahoon, W Wang, Y Zhu, K Lin, S Liu… - arxiv preprint arxiv …, 2022 - arxiv.org
Selecting the optimal cloud target to migrate SQL estates from on-premises to the cloud
remains a challenge. Current solutions are not only time-consuming and error-prone …

Lauca: A workload duplicator for benchmarking transactional database performance

S Weng, Q Wang, L Qu, R Zhang, P Cai… - … on Knowledge and …, 2024 - ieeexplore.ieee.org
Generating synthetic workloads is essential and critical to the performance evaluation of
database systems. When benchmarking database performance for a specific application, the …

Sparkcruise: Workload optimization in managed spark clusters at microsoft

A Roy, A **dal, P Gomatam, X Ouyang… - Proceedings of the …, 2021 - dl.acm.org
Today cloud companies offer fully managed Spark services. This has made it easy to
onboard new customers but has also increased the volume of users and their workload …

[PDF][PDF] LST-Bench: Benchmarking Log-Structured Tables in the Cloud

J Camacho-Rodríguez, A Agrawal… - Proceedings of the …, 2024 - dl.acm.org
Data processing engines increasingly leverage distributed file systems for scalable, cost-
effective storage. While the Apache Parquet columnar format has become a popular choice …

DoppelGanger++: Towards Fast Dependency Graph Generation for Database Replay

W Lee, J Ha, WS Han, C Park, M Park, J Han… - Proceedings of the ACM …, 2024 - dl.acm.org
A database replay system (DRS) captures workloads on a production system and then
replays them in a test system to test various system changes, avoiding any risk before …

Towards a benchmark for learned systems

L Bindschaedler, A Kipf, T Kraska… - 2021 IEEE 37th …, 2021 - ieeexplore.ieee.org
This paper aims to initiate a discussion around benchmarking data management systems
with machine-learned components. Traditional benchmarks such as TPC or YCSB are …