Query optimization through the looking glass, and what we found running the join order benchmark
Finding a good join order is crucial for query performance. In this paper, we introduce the
Join Order Benchmark that works on real-life data riddled with correlations and introduces …
AI meets AI: Leveraging query executions to improve index recommendations
State-of-the-art index tuners rely on query optimizer's cost estimates to search for the index
configuration with the largest estimated execution cost improvement. Due to well-known …
Magic mirror in my hand, which is the best in the land? An experimental evaluation of index selection algorithms
Indexes are essential for the efficient processing of database workloads. Proposed solutions
for the relevant and challenging index selection problem range from metadata-based simple …
QuickSel: Quick selectivity learning with mixture models
Estimating the selectivity of a query is a key step in almost any cost-based query optimizer.
Most of today's databases rely on histograms or samples that are periodically refreshed by …
The data calculator: Data structure design and cost synthesis from first principles and learned cost models
Data structures are critical in any data-driven scenario, but they are notoriously hard to
design due to a massive design space and the dependence of performance on workload …
Smoke: Fine-grained lineage at interactive speed
Data lineage describes the relationship between individual input and output data items of a
workflow, and has served as an integral ingredient for both traditional (e.g., debugging …
The case for learned spatial indexes
Spatial data is ubiquitous. Massive amounts of data are generated every day from billions of
GPS-enabled devices such as cell phones, cars, sensors, and various consumer-based …
Cosine: A cloud-cost optimized self-designing key-value storage engine
We present a self-designing key-value storage engine, Cosine, which can always take the
shape of the close to "perfect" engine architecture given an input workload, a cloud budget …
Analytical Queries: A Comprehensive Survey
Modern hardware heterogeneity brings efficiency and performance opportunities for
analytical query processing. In the presence of continuous data volume and complexity …
Efficient scalable multi-attribute index selection using recursive strategies
An efficient selection of indexes is indispensable for database performance. For large
problem instances with hundreds of tables, existing approaches are not suitable: They either …