Mashup: making serverless computing useful for HPC workflows via hybrid execution

RB Roy, T Patel, V Gadepally, D Tiwari - Proceedings of the 27th ACM …, 2022 - dl.acm.org
This work introduces Mashup, a novel strategy to leverage serverless computing model for
executing scientific workflows in a hybrid fashion by taking advantage of both the traditional …

Daydream: Executing dynamic scientific workflows on serverless platforms with hot starts

RB Roy, T Patel, D Tiwari - SC22: International Conference for …, 2022 - ieeexplore.ieee.org
HPC applications are increasingly being designed as dynamic workflows for the ease of
development and scaling. This work demonstrates how the serverless computing model can …

DFMan: A graph-based optimization of dataflow scheduling on high-performance computing systems

F Chowdhury, F Di Natale, A Moody… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Scientific research and development campaigns are materialized by workflows of
applications executing on high-performance computing (HPC) systems. These applications …

HDF5 Cache VOL: Efficient and scalable parallel I/O through caching data on node-local storage

H Zheng, V Vishwanath, Q Koziol… - 2022 22nd IEEE …, 2022 - ieeexplore.ieee.org
Modern-era high performance computing (HPC) systems are providing multiple levels of
memory and storage layers to bridge the performance gap between fast memory and slow …

Hcompress: Hierarchical data compression for multi-tiered storage environments

H Devarajan, A Kougkas, L Logan… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
Modern scientific applications read and write massive amounts of data through simulations,
observations, and analysis. These applications spend the majority of their runtime in …

Extracting and characterizing I/O behavior of HPC workloads

H Devarajan, K Mohror - 2022 IEEE International Conference …, 2022 - ieeexplore.ieee.org
System administrators set default storage-system configuration parameters with the goal of
providing high performance for their system's I/O workloads. However, this generalized …

I/O acceleration via multi-tiered data buffering and prefetching

A Kougkas, H Devarajan, XH Sun - Journal of Computer Science and …, 2020 - Springer
Modern High-Performance Computing (HPC) systems are adding extra layers to the
memory and storage hierarchy, named deep memory and storage hierarchy (DMSH), to …

Hfetch: Hierarchical data prefetching for scientific workflows in multi-tiered storage environments

H Devarajan, A Kougkas, XH Sun - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
In the era of data-intensive computing, accessing data with a high-throughput and
low-latency is more imperative than ever. Data prefetching is a well-known technique for hiding …

Storage-heterogeneity aware task-based programming models to optimize I/O intensive applications

H Elshazly, J Ejarque, RM Badia - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Task-based programming models have enabled the optimized execution of the computation
workloads of applications. These programming models can take advantage of large-scale …

DaYu: Optimizing Distributed Scientific Workflows by Decoding Dataflow Semantics and Dynamics

M Tang, J Cernuda, J Ye, L Guo… - 2024 IEEE …, 2024 - ieeexplore.ieee.org
The combination of ever-growing scientific datasets and distributed workflow complexity
creates I/O performance bottlenecks due to data volume, velocity, and variety. Although the …