Data management in machine learning: Challenges, techniques, and systems
Large-scale data analytics using statistical machine learning (ML), popularly called
advanced analytics, underpins many modern data-driven applications. The data …
advanced analytics, underpins many modern data-driven applications. The data …
Powerlyra: Differentiated graph computation and partitioning on skewed graphs
R Chen, J Shi, Y Chen, B Zang, H Guan… - ACM Transactions on …, 2019 - dl.acm.org
Natural graphs with skewed distributions raise unique challenges to distributed graph
computation and partitioning. Existing graph-parallel systems usually use a “one-size-fits-all” …
computation and partitioning. Existing graph-parallel systems usually use a “one-size-fits-all” …
Supporting very large models using automatic dataflow graph partitioning
This paper presents Tofu, a system that partitions very large DNN models across multiple
GPU devices to reduce per-GPU memory footprint. Tofu is designed to partition a dataflow …
GPU devices to reduce per-GPU memory footprint. Tofu is designed to partition a dataflow …
On optimizing operator fusion plans for large-scale machine learning in systemml
Many large-scale machine learning (ML) systems allow specifying custom ML algorithms by
means of linear algebra programs, and then automatically generate efficient execution …
means of linear algebra programs, and then automatically generate efficient execution …
Exploring the hidden dimension in graph processing
Task partitioning of a graph-parallel system is traditionally considered equivalent to the
graph partition problem. Such equivalence exists because the properties associated with …
graph partition problem. Such equivalence exists because the properties associated with …
Bladedisc: Optimizing dynamic shape machine learning workloads via compiler approach
Compiler optimization plays an increasingly important role to boost the performance of
machine learning models for data processing and management. With increasingly complex …
machine learning models for data processing and management. With increasingly complex …
[PDF][PDF] SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning.
SPOOF: Sum-Product Optimization and Operator Fusion for Large-Scale Machine Learning
Page 1 © 2017 IBM Corporation SPOOF: Sum-Product Optimization and Operator Fusion for …
Page 1 © 2017 IBM Corporation SPOOF: Sum-Product Optimization and Operator Fusion for …
Unifying data, model and hybrid parallelism in deep learning via tensor tiling
Deep learning systems have become vital tools across many fields, but the increasing model
sizes mean that training must be accelerated to maintain such systems' utility. Current …
sizes mean that training must be accelerated to maintain such systems' utility. Current …
Optimizing Tensor Computations: From Applications to Compilation and Runtime Techniques
Machine learning (ML) training and scoring fundamentally relies on linear algebra programs
and more general tensor computations. Most ML systems utilize distributed parameter …
and more general tensor computations. Most ML systems utilize distributed parameter …
Towards an efficient maintenance of address space overflow for array based storage system
Array based storage and retrieval systems are demanded in many high dimensional
systems like Big data for their easy maintenance. However, the lack of scalability of the …
systems like Big data for their easy maintenance. However, the lack of scalability of the …