[HTML][HTML] Approaching sales forecasting using recurrent neural networks and transformers

I Vallés-Pérez, E Soria-Olivas, M Martínez-Sober… - Expert Systems with …, 2022 - Elsevier
Accurate and fast demand forecast is one of the hot topics in supply chain for enabling the
precise execution of the corresponding downstream processes (inbound and outbound …

Architecting intermediate layers for efficient composition of data management and machine learning systems

S Abeysinghe, F Wang, G Essertel, T Rompf - arxiv preprint arxiv …, 2023 - arxiv.org
Modern data analytics workloads combine relational data processing with machine learning
(ML). Most DBMS handle these workloads by offloading these ML operations to external …

Optimizing tensor programs on flexible storage

M Schleich, A Shaikhha, D Suciu - … of the ACM on Management of Data, 2023 - dl.acm.org
Tensor programs often need to process large tensors (vectors, matrices, or higher order
tensors) that require a specialized storage format for their memory layout. Several such …

Functional collection programming with semi-ring dictionaries

A Shaikhha, M Huot, J Smith, D Olteanu - Proceedings of the ACM on …, 2022 - dl.acm.org
This paper introduces semi-ring dictionaries, a powerful class of compositional and purely
functional collections that subsume other collection types such as sets, multisets, arrays …

Building a compiled query engine in python

H Shahrokhi, A Shaikhha - Proceedings of the 32nd ACM SIGPLAN …, 2023 - dl.acm.org
The simplicity of Python and its rich set of libraries has made it the most popular language
for data science. Moreover, the interpreted nature of Python offers an easy debugging …

The relational data borg is learning

D Olteanu - arxiv preprint arxiv:2008.07864, 2020 - arxiv.org
This paper overviews an approach that addresses machine learning over relational data as
a database problem. This is justified by two observations. First, the input to the learning task …

Pytond: Efficient python data science on the shoulders of databases

H Shahrokhi, A Kaboli, M Ghorbani… - 2024 IEEE 40th …, 2024 - ieeexplore.ieee.org
Python data science libraries such as Pandas and NumPy have recently gained immense
popularity. Although these libraries are feature-rich and easy to use, their scalability …

An intermediate representation for hybrid database and machine learning workloads

A Shaikhha, M Schleich, D Olteanu - Proceedings of the VLDB …, 2021 - dl.acm.org
IFAQ is an intermediate representation and compilation framework for hybrid database and
machine learning workloads expressible using iterative programs with functional aggregate …

Fine-tuning data structures for query processing

A Shaikhha, M Kelepeshis, M Ghorbani - Proceedings of the 21st ACM …, 2023 - dl.acm.org
We introduce a framework for automatically choosing data structures for efficient query
processing. Our contributions are twofold. First, we introduce a novel low-level intermediate …

Calibration: A Simple Trick for Wide-table Delta Analytics

Z Huang, E Wu - arxiv preprint arxiv:2210.03851, 2022 - arxiv.org
Data analytics over normalized databases typically requires computing and materializing
expensive joins (wide-tables). Factorized query execution models execution as message …