Geco: Quality counterfactual explanations in real time
Machine learning is increasingly applied in high-stakes decision making that directly affect
people's lives, and this leads to an increased demand for systems to explain their decisions …
people's lives, and this leads to an increased demand for systems to explain their decisions …
Data management for machine learning: A survey
Machine learning (ML) has widespread applications and has revolutionized many
industries, but suffers from several challenges. First, sufficient high-quality training data is …
industries, but suffers from several challenges. First, sufficient high-quality training data is …
[HTML][HTML] Approaching sales forecasting using recurrent neural networks and transformers
Accurate and fast demand forecast is one of the hot topics in supply chain for enabling the
precise execution of the corresponding downstream processes (inbound and outbound …
precise execution of the corresponding downstream processes (inbound and outbound …
Amalur: Data integration meets machine learning
Machine learning (ML) training data is often scattered across disparate collections of
datasets, called data silos. This fragmentation poses a major challenge for data-intensive …
datasets, called data silos. This fragmentation poses a major challenge for data-intensive …
In-database machine learning with SQL on GPUs
In machine learning, continuously retraining a model guarantees accurate predictions based
on the latest data as training input. But to retrieve the latest data from a database, time …
on the latest data as training input. But to retrieve the latest data from a database, time …
Architecting intermediate layers for efficient composition of data management and machine learning systems
Modern data analytics workloads combine relational data processing with machine learning
(ML). Most DBMS handle these workloads by offloading these ML operations to external …
(ML). Most DBMS handle these workloads by offloading these ML operations to external …
Optimizing tensor programs on flexible storage
Tensor programs often need to process large tensors (vectors, matrices, or higher order
tensors) that require a specialized storage format for their memory layout. Several such …
tensors) that require a specialized storage format for their memory layout. Several such …
[PDF][PDF] Identifying insufficient data coverage in databases with multiple relations
In today's data-driven world, it is critical that we use appropriate datasets for analysis and
decision-making. Datasets could be biased because they reflect existing inequalities in the …
decision-making. Datasets could be biased because they reflect existing inequalities in the …
Functional collection programming with semi-ring dictionaries
This paper introduces semi-ring dictionaries, a powerful class of compositional and purely
functional collections that subsume other collection types such as sets, multisets, arrays …
functional collections that subsume other collection types such as sets, multisets, arrays …
Indexed Streams: A Formal Intermediate Representation for Fused Contraction Programs
We introduce indexed streams, a formal operational model and intermediate representation
that describes the fused execution of a contraction language that encompasses both sparse …
that describes the fused execution of a contraction language that encompasses both sparse …