InferLine: latency-aware provisioning and scaling for prediction serving pipelines

D Crankshaw, GE Sela, X Mo, C Zumar… - Proceedings of the 11th …, 2020 - dl.acm.org
Serving ML prediction pipelines spanning multiple models and hardware accelerators is a
key challenge in production machine learning. Optimally configuring these pipelines to meet …

Joint task scheduling and containerizing for efficient edge computing

J Zhang, X Zhou, T Ge, X Wang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Container-based operation system (OS) level virtualization has been adopted by many edge-
computing platforms. However, for an edge server, inter-container communications, and …

Mashup: making serverless computing useful for hpc workflows via hybrid execution

RB Roy, T Patel, V Gadepally, D Tiwari - Proceedings of the 27th ACM …, 2022 - dl.acm.org
This work introduces Mashup, a novel strategy to leverage serverless computing model for
executing scientific workflows in a hybrid fashion by taking advantage of both the traditional …

Daydream: Executing dynamic scientific workflows on serverless platforms with hot starts

RB Roy, T Patel, D Tiwari - SC22: International Conference for …, 2022 - ieeexplore.ieee.org
HPC applications are increasingly being designed as dynamic workflows for the ease of
development and scaling. This work demonstrates how the serverless computing model can …

How workflow engines should talk to resource managers: A proposal for a common workflow scheduling interface

F Lehmann, J Bader, F Tschirpke… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
Scientific workflow management systems (SWMSs) and resource managers together ensure
that tasks are scheduled on provisioned resources so that all dependencies are obeyed …

[PDF][PDF] Inferline: Ml inference pipeline composition framework

D Crankshaw, GE Sela, C Zumar, X Mo… - arxiv preprint arxiv …, 2018 - academia.edu
The dominant cost in production machine learning workloads is not training individual
models but serving predictions from increasingly complex prediction pipelines spanning …

Towards advanced monitoring for scientific workflows

J Bader, J Witzke, S Becker, A Lößer… - … Conference on Big …, 2022 - ieeexplore.ieee.org
Scientific workflows consist of thousands of highly parallelized tasks executed in a
distributed environment involving many components. Automatic tracing and investigation of …

Scsf: A scheduling simulation framework

GP Rodrigo, E Elmroth, PO Östberg… - … Strategies for Parallel …, 2018 - Springer
High-throughput and data-intensive applications are increasingly present, often composed
as workflows, in the workloads of current HPC systems. At the same time, trends for future …

E-hpc: a library for elastic resource management in hpc environments

W Fox, D Ghoshal, A Souza, GP Rodrigo… - Proceedings of the 12th …, 2017 - dl.acm.org
Next-generation data-intensive scientific workflows need to support streaming and real-time
applications with dynamic resource needs on high performance computing (HPC) platforms …

Starship: Mitigating i/o bottlenecks in serverless computing for scientific workflows

R Basu Roy, D Tiwari - Proceedings of the ACM on Measurement and …, 2024 - dl.acm.org
This work highlights the significance of I/O bottlenecks that data-intensive HPC workflows
face in serverless environments-an issue that has been largely overlooked by prior works …