I/o access patterns in hpc applications: A 360-degree survey

JL Bez, S Byna, S Ibrahim - ACM Computing Surveys, 2023 - dl.acm.org
The high-performance computing I/O stack has been complex due to multiple software
layers, the inter-dependencies among these layers, and the different performance tuning …

Parallel i/o evaluation techniques and emerging hpc workloads: A perspective

S Neuwirth, AK Paul - 2021 IEEE International Conference on …, 2021 - ieeexplore.ieee.org
Emerging workloads such as artificial intelligence, big data analytics and complex multi-step
workflows alongside future exascale applications are anticipated future HPC workloads …

Revisiting I/O behavior in large-scale storage systems: The expected and the unexpected

T Patel, S Byna, GK Lockwood, D Tiwari - Proceedings of the …, 2019 - dl.acm.org
Large-scale applications typically spend a large fraction of their execution time performing
I/O to a parallel storage system. However, with rapid progress in compute and storage …

End-to-end I/O monitoring on leading supercomputers

B Yang, W Xue, T Zhang, S Liu, X Ma, X Wang… - ACM Transactions on …, 2023 - dl.acm.org
This paper offers a solution to overcome the complexities of production system I/O
performance monitoring. We present Beacon, an end-to-end I/O resource monitoring and …

DFTracer: An Analysis-Friendly Data Flow Tracer for AI-Driven Workflows

H Devarajan, L Pottier, K Velusamy… - … Conference for High …, 2024 - ieeexplore.ieee.org
Modern HPC workflows involve intricate coupling of simulation, data analytics, and artificial
intelligence (AI) applications to improve time to scientific insight. These workflows require a …

File system semantics requirements of HPC applications

C Wang, K Mohror, M Snir - … of the 30th International Symposium on High …, 2021 - dl.acm.org
Most widely-deployed parallel file systems (PFSs) implement POSIX semantics, which
implies sequential consistency for reads and writes. Strict adherence to POSIX semantics is …

Uncovering access, reuse, and sharing characteristics of {I/O-Intensive} files on {Large-Scale} production {HPC} systems

T Patel, S Byna, GK Lockwood, NJ Wright… - … USENIX Conference on …, 2020 - usenix.org
Large-scale high-performance computing (HPC) applications running on supercomputers
produce large amounts of data routinely and store it in files on multi-PB shared parallel …

Capturing periodic I/O using frequency techniques

A Tarraf, A Bandet, F Boito, G Pallez… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Many HPC applications perform their I/O in bursts that follow a periodic pattern. This allows
for making predictions as to when a burst occurs. System providers can take advantage of …

Access patterns and performance behaviors of multi-layer supercomputer i/o subsystems under production load

JL Bez, AM Karimi, AK Paul, B **e, S Byna… - Proceedings of the 31st …, 2022 - dl.acm.org
Scientific computing workloads at HPC facilities have been shifting from traditional
numerical simulations to AI/ML applications for training and inference while processing and …

Towards hpc i/o performance prediction through large-scale log analysis

S Kim, A Sim, K Wu, S Byna, Y Son, H Eom - Proceedings of the 29th …, 2020 - dl.acm.org
Large-scale high performance computing (HPC) systems typically consist of many
thousands of CPUs and storage units, while used by hundreds to thousands of users at the …