Query optimization through the looking glass, and what we found running the join order benchmark

V Leis, B Radke, A Gubichev, A Mirchev, P Boncz… - The VLDB Journal, 2018 - Springer
Finding a good join order is crucial for query performance. In this paper, we introduce the
Join Order Benchmark that works on real-life data riddled with correlations and introduces …

Ai meets ai: Leveraging query executions to improve index recommendations

B Ding, S Das, R Marcus, W Wu, S Chaudhuri… - Proceedings of the …, 2019 - dl.acm.org
State-of-the-art index tuners rely on query optimizer's cost estimates to search for the index
configuration with the largest estimated execution cost improvement. Due to well-known …

Magic mirror in my hand, which is the best in the land? an experimental evaluation of index selection algorithms

J Kossmann, S Halfpap, M Jankrift… - Proceedings of the VLDB …, 2020 - dl.acm.org
Indexes are essential for the efficient processing of database workloads. Proposed solutions
for the relevant and challenging index selection problem range from metadata-based simple …

Quicksel: Quick selectivity learning with mixture models

Y Park, S Zhong, B Mozafari - Proceedings of the 2020 ACM SIGMOD …, 2020 - dl.acm.org
Estimating the selectivity of a query is a key step in almost any cost-based query optimizer.
Most of today's databases rely on histograms or samples that are periodically refreshed by …

The data calculator: Data structure design and cost synthesis from first principles and learned cost models

S Idreos, K Zoumpatianos, B Hentschel… - Proceedings of the …, 2018 - dl.acm.org
Data structures are critical in any data-driven scenario, but they are notoriously hard to
design due to a massive design space and the dependence of performance on workload …

Smoke: Fine-grained lineage at interactive speed

F Psallidas, E Wu - arxiv preprint arxiv:1801.07237, 2018 - arxiv.org
Data lineage describes the relationship between individual input and output data items of a
workflow, and has served as an integral ingredient for both traditional (eg, debugging …

The case for learned spatial indexes

V Pandey, A van Renen, A Kipf, I Sabek, J Ding… - arxiv preprint arxiv …, 2020 - arxiv.org
Spatial data is ubiquitous. Massive amounts of data are generated every day from billions of
GPS-enabled devices such as cell phones, cars, sensors, and various consumer-based …

Cosine: a cloud-cost optimized self-designing key-value storage engine

S Chatterjee, M Jagadeesan, W Qin… - Proceedings of the VLDB …, 2021 - dl.acm.org
We present a self-designing key-value storage engine, Cosine, which can always take the
shape of the close to" perfect" engine architecture given an input workload, a cloud budget …

Analytical Queries: A Comprehensive Survey

P Kurapov, A Melik-Adamyan - arxiv preprint arxiv:2311.15730, 2023 - arxiv.org
Modern hardware heterogeneity brings efficiency and performance opportunities for
analytical query processing. In the presence of continuous data volume and complexity …

Efficient scalable multi-attribute index selection using recursive strategies

R Schlosser, J Kossmann… - 2019 IEEE 35th …, 2019 - ieeexplore.ieee.org
An efficient selection of indexes is indispensable for database performance. For large
problem instances with hundreds of tables, existing approaches are not suitable: They either …