I/o access patterns in hpc applications: A 360-degree survey

JL Bez, S Byna, S Ibrahim - ACM Computing Surveys, 2023 - dl.acm.org
The high-performance computing I/O stack has been complex due to multiple software
layers, the inter-dependencies among these layers, and the different performance tuning …

Access patterns and performance behaviors of multi-layer supercomputer i/o subsystems under production load

JL Bez, AM Karimi, AK Paul, B **e, S Byna… - Proceedings of the 31st …, 2022 - dl.acm.org
Scientific computing workloads at HPC facilities have been shifting from traditional
numerical simulations to AI/ML applications for training and inference while processing and …

Machine learning assisted HPC workload trace generation for leadership scale storage systems

AK Paul, JY Choi, AM Karimi, F Wang - Proceedings of the 31st …, 2022 - dl.acm.org
Monitoring and analyzing a wide range of I/O activities in an HPC cluster is important in
maintaining mission-critical performance in a large-scale, multi-user, parallel storage …

Starship: Mitigating i/o bottlenecks in serverless computing for scientific workflows

R Basu Roy, D Tiwari - Proceedings of the ACM on Measurement and …, 2024 - dl.acm.org
This work highlights the significance of I/O bottlenecks that data-intensive HPC workflows
face in serverless environments-an issue that has been largely overlooked by prior works …

Graph3PO: A Temporal Graph Data Processing Method for Latency QoS Guarantee in Object Cloud Storage System

W Zhang, Z Shi, Z Liao, Y Li, Y Du, Y Wu… - Proceedings of the …, 2023 - dl.acm.org
Object cloud storage systems are deployed with diverse applications that have varying
latency service level objectives (SLOs), posting challenges for supporting quality of service …

AIIO: Using Artificial Intelligence for Job-Level and Automatic I/O Performance Bottleneck Diagnosis

B Dong, JL Bez, S Byna - … of the 32nd International Symposium on High …, 2023 - dl.acm.org
Manually diagnosing the I/O performance bottleneck for a single application (hereinafter
referred to as the" job level'') is a tedious and error-prone procedure requiring domain …

User-based I/O Profiling for Leadership Scale HPC Workloads

AH Yazdani, AK Paul, AM Karimi, F Wang… - Proceedings of the 26th …, 2025 - dl.acm.org
I/O constitutes a significant portion of most of the application run-time. Spawning many such
applications concurrently on an HPC system leads to severe I/O contention. Thus …

[PDF][PDF] Ionet: Towards an open machine learning training ground for i/o performance prediction

DH Kurniawan, L Toksoz, A Badam… - Technical …, 2021 - daniarherikurniawan.github.io
Low and stable latency is a critical key to the success of many services, but variable load
and resource sharing in a modern cloud environment introduces resource contention that in …

Ftio: Detecting i/o periodicity using frequency techniques

A Tarraf, A Bandet, F Boito, G Pallez, F Wolf - arxiv preprint arxiv …, 2023 - arxiv.org
Characterizing the temporal I/O behavior of an HPC application is a challenging task, but
informing the system about it can be valuable for techniques such as I/O scheduling, burst …

I/O Throughput Prediction for HPC Applications Using Darshan Logs

DJ Gabriel Jr - 2022 - search.proquest.com
Abstract As most High Performance Computing (HPC) applications deal with large volumes
of data, I/O performance is of critical importance to optimize application performance …