Auto-differentiation of relational computations for very large scale machine learning

Y Tang, Z Ding, D Jankov, B Yuan… - International …, 2023 - proceedings.mlr.press
The relational data model was designed to facilitate large-scale data management and
analytics. We consider the problem of how to differentiate computations expressed …

Serving deep learning models with deduplication from relational databases

L Zhou, J Chen, A Das, H Min, L Yu, M Zhao… - arxiv preprint arxiv …, 2022 - arxiv.org
There are significant benefits to serve deep learning models from relational databases. First,
features extracted from databases do not need to be transferred to any decoupled deep …

[PDF][PDF] Evolving exact decompilation

E Schulte, J Ruchti, M Noonan, D Ciarletta… - Workshop on Binary …, 2018 - cs.unm.edu
We introduce a novel technique for C decompilation that provides the correctness
guarantees and readability properties essential for accurate and efficient binary analysis …

Towards automating microservices orchestration through data-driven evolutionary architectures

G Bergami - Service Oriented Computing and Applications, 2024 - Springer
Towards automating microservices orchestration through data-driven evolutionary
architectures | Service Oriented Computing and Applications Skip to main content …

A Comparison of End-to-End Decision Forest Inference Pipelines

H Guan, S Masood, M Dwarampudi, V Gunda… - Proceedings of the …, 2023 - dl.acm.org
Decision forest, including RandomForest, XGBoost, and LightGBM, dominates the machine
learning tasks over tabular data. Recently, several frameworks were developed for decision …

Automatic optimization of matrix implementations for distributed machine learning and linear algebra

S Luo, D Jankov, B Yuan, C Jermaine - Proceedings of the 2021 …, 2021 - dl.acm.org
Machine learning (ML) computations are often expressed using vectors, matrices, or higher-
dimensional tensors. Such data structures can have many different implementations …

Optimizing tensor computations: From applications to compilation and runtime techniques

M Boehm, M Interlandi, C Jermaine - Companion of the 2023 …, 2023 - dl.acm.org
Machine learning (ML) training and scoring fundamentally relies on linear algebra programs
and more general tensor computations. Most ML systems utilize distributed parameter …

A comparison of decision forest inference platforms from a database perspective

H Guan, MR Dwarampudi, V Gunda, H Min… - arxiv preprint arxiv …, 2023 - arxiv.org
Decision forest, including RandomForest, XGBoost, and LightGBM, is one of the most
popular machine learning techniques used in many industrial scenarios, such as credit card …

Monsoon: Multi-step optimization and execution of queries with partially obscured predicates

S Sikdar, C Jermaine - Proceedings of the 2020 ACM SIGMOD …, 2020 - dl.acm.org
User-defined functions (UDFs) in modern SQL database systems and Big Data processing
systems such as Spark---that offer API bindings in high-level languages such as Python or …

Distributed numerical and machine learning computations via two-phase execution of aggregated join trees

D Jankov, B Yuan, S Luo, C Jermaine - Proceedings of the VLDB …, 2021 - par.nsf.gov
When numerical and machine learning (ML) computations are expressed relationally,
classical query execution strategies (hash-based joins and aggregations) can do a poor job …