Quicksel: Quick selectivity learning with mixture models
Estimating the selectivity of a query is a key step in almost any cost-based query optimizer.
Most of today's databases rely on histograms or samples that are periodically refreshed by …
Most of today's databases rely on histograms or samples that are periodically refreshed by …
Main memory database systems
F Faerber, A Kemper, PÅ Larson… - … and Trends® in …, 2017 - nowpublishers.com
This article provides an overview of recent developments in mainmemory database systems.
With growing memory sizes and memory prices drop** by a factor of 10 every 5 years …
With growing memory sizes and memory prices drop** by a factor of 10 every 5 years …
Efficiently approximating selectivity functions using low overhead regression models
Today's query optimizers use fast selectivity estimation techniques but are known to be
susceptible to large estimation errors. Recent work on supervised learned models for …
susceptible to large estimation errors. Recent work on supervised learned models for …
Improved selectivity estimation by combining knowledge from sampling and synopses
Estimating selectivities remains a critical task in query processing. Optimizers rely on the
accuracy of selectivities when generating execution plans and, in approximate query …
accuracy of selectivities when generating execution plans and, in approximate query …
[PDF][PDF] Enhanced Featurization of Queries with Mixed Combinations of Predicates for ML-based Cardinality Estimation.
Background. For some years now, Machine Learning (ML) has been applied to the
cardinality estimation problem [8, 12, 32, 33]. In general, ML means arbitrary function …
cardinality estimation problem [8, 12, 32, 33]. In general, ML means arbitrary function …
Methods and systems for one dimensional heterogeneous histograms
BACKGROUND Query optimization depends in large part on efficient and accurate
cardinality estimates of intermediate results and the plan for the order of a sequence or set of …
cardinality estimates of intermediate results and the plan for the order of a sequence or set of …
Statisticum: data statistics management in SAP HANA
We introduce a new concept of leveraging traditional data statistics as dynamic data integrity
constraints. These data statistics produce transient database constraints, which are valid as …
constraints. These data statistics produce transient database constraints, which are valid as …
Cardinality estimation for database query planning
(57) ABSTRACT A system for cardinality estimation for database query planning is provided.
In some implementations, the system performs operations comprising selecting a subset of …
In some implementations, the system performs operations comprising selecting a subset of …
Boosting data filtering on columnar encoding with SIMD
In columnar databases, data is generally stored in an encoded format to save storage space
and reduce I/O. Popular encoding schemes include dictionary encoding, delta encoding, run …
and reduce I/O. Popular encoding schemes include dictionary encoding, delta encoding, run …
Small selectivities matter: Lifting the burden of empty samples
Every year more and more advanced approaches to cardinality estimation are published,
using learned models or other data and workload specific synopses. In contrast, the majority …
using learned models or other data and workload specific synopses. In contrast, the majority …