[HTML][HTML] Approaching sales forecasting using recurrent neural networks and transformers
Accurate and fast demand forecast is one of the hot topics in supply chain for enabling the
precise execution of the corresponding downstream processes (inbound and outbound …
precise execution of the corresponding downstream processes (inbound and outbound …
Architecting intermediate layers for efficient composition of data management and machine learning systems
Modern data analytics workloads combine relational data processing with machine learning
(ML). Most DBMS handle these workloads by offloading these ML operations to external …
(ML). Most DBMS handle these workloads by offloading these ML operations to external …
Optimizing tensor programs on flexible storage
Tensor programs often need to process large tensors (vectors, matrices, or higher order
tensors) that require a specialized storage format for their memory layout. Several such …
tensors) that require a specialized storage format for their memory layout. Several such …
Functional collection programming with semi-ring dictionaries
This paper introduces semi-ring dictionaries, a powerful class of compositional and purely
functional collections that subsume other collection types such as sets, multisets, arrays …
functional collections that subsume other collection types such as sets, multisets, arrays …
Building a compiled query engine in python
The simplicity of Python and its rich set of libraries has made it the most popular language
for data science. Moreover, the interpreted nature of Python offers an easy debugging …
for data science. Moreover, the interpreted nature of Python offers an easy debugging …
The relational data borg is learning
D Olteanu - arxiv preprint arxiv:2008.07864, 2020 - arxiv.org
This paper overviews an approach that addresses machine learning over relational data as
a database problem. This is justified by two observations. First, the input to the learning task …
a database problem. This is justified by two observations. First, the input to the learning task …
Pytond: Efficient python data science on the shoulders of databases
Python data science libraries such as Pandas and NumPy have recently gained immense
popularity. Although these libraries are feature-rich and easy to use, their scalability …
popularity. Although these libraries are feature-rich and easy to use, their scalability …
An intermediate representation for hybrid database and machine learning workloads
IFAQ is an intermediate representation and compilation framework for hybrid database and
machine learning workloads expressible using iterative programs with functional aggregate …
machine learning workloads expressible using iterative programs with functional aggregate …
Fine-tuning data structures for query processing
We introduce a framework for automatically choosing data structures for efficient query
processing. Our contributions are twofold. First, we introduce a novel low-level intermediate …
processing. Our contributions are twofold. First, we introduce a novel low-level intermediate …
Calibration: A Simple Trick for Wide-table Delta Analytics
Data analytics over normalized databases typically requires computing and materializing
expensive joins (wide-tables). Factorized query execution models execution as message …
expensive joins (wide-tables). Factorized query execution models execution as message …