Data management in machine learning: Challenges, techniques, and systems

A Kumar, M Boehm, J Yang - Proceedings of the 2017 ACM International …, 2017 - dl.acm.org
Large-scale data analytics using statistical machine learning (ML), popularly called
advanced analytics, underpins many modern data-driven applications. The data …

Powerlyra: Differentiated graph computation and partitioning on skewed graphs

R Chen, J Shi, Y Chen, B Zang, H Guan… - ACM Transactions on …, 2019 - dl.acm.org
Natural graphs with skewed distributions raise unique challenges to distributed graph
computation and partitioning. Existing graph-parallel systems usually use a “one-size-fits-all” …

Supporting very large models using automatic dataflow graph partitioning

M Wang, C Huang, J Li - … of the Fourteenth EuroSys Conference 2019, 2019 - dl.acm.org
This paper presents Tofu, a system that partitions very large DNN models across multiple
GPU devices to reduce per-GPU memory footprint. Tofu is designed to partition a dataflow …

On optimizing operator fusion plans for large-scale machine learning in systemml

M Boehm, B Reinwald, D Hutchison… - arxiv preprint arxiv …, 2018 - arxiv.org
Many large-scale machine learning (ML) systems allow specifying custom ML algorithms by
means of linear algebra programs, and then automatically generate efficient execution …

Exploring the hidden dimension in graph processing

M Zhang, Y Wu, K Chen, X Qian, X Li… - 12th USENIX Symposium …, 2016 - usenix.org
Task partitioning of a graph-parallel system is traditionally considered equivalent to the
graph partition problem. Such equivalence exists because the properties associated with …

Bladedisc: Optimizing dynamic shape machine learning workloads via compiler approach

Z Zheng, Z Pan, D Wang, K Zhu, W Zhao… - Proceedings of the …, 2023 - dl.acm.org
Compiler optimization plays an increasingly important role to boost the performance of
machine learning models for data processing and management. With increasingly complex …

[PDF][PDF] SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning.

T Elgamal, S Luo, M Boehm, AV Evfimievski… - CIDR, 2017 - cidrdb.org
SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning
Page 1 © 2017 IBM Corporation SPOOF: Sum-Product Optimization and Operator Fusion for …

Unifying data, model and hybrid parallelism in deep learning via tensor tiling

M Wang, C Huang, J Li - arxiv preprint arxiv:1805.04170, 2018 - arxiv.org
Deep learning systems have become vital tools across many fields, but the increasing model
sizes mean that training must be accelerated to maintain such systems' utility. Current …

Optimizing Tensor Computations: From Applications to Compilation and Runtime Techniques

M Boehm, M Interlandi, C Jermaine - Companion of the 2023 …, 2023 - dl.acm.org
Machine learning (ML) training and scoring fundamentally relies on linear algebra programs
and more general tensor computations. Most ML systems utilize distributed parameter …

Towards an efficient maintenance of address space overflow for array based storage system

MT Omar, KMA Hasan - 2016 17th International Conference on …, 2016 - ieeexplore.ieee.org
Array based storage and retrieval systems are demanded in many high dimensional
systems like Big data for their easy maintenance. However, the lack of scalability of the …