I/o access patterns in hpc applications: A 360-degree survey
The high-performance computing I/O stack has been complex due to multiple software
layers, the inter-dependencies among these layers, and the different performance tuning …
layers, the inter-dependencies among these layers, and the different performance tuning …
Parallel i/o evaluation techniques and emerging hpc workloads: A perspective
Emerging workloads such as artificial intelligence, big data analytics and complex multi-step
workflows alongside future exascale applications are anticipated future HPC workloads …
workflows alongside future exascale applications are anticipated future HPC workloads …
Revisiting I/O behavior in large-scale storage systems: The expected and the unexpected
Large-scale applications typically spend a large fraction of their execution time performing
I/O to a parallel storage system. However, with rapid progress in compute and storage …
I/O to a parallel storage system. However, with rapid progress in compute and storage …
End-to-end I/O monitoring on leading supercomputers
This paper offers a solution to overcome the complexities of production system I/O
performance monitoring. We present Beacon, an end-to-end I/O resource monitoring and …
performance monitoring. We present Beacon, an end-to-end I/O resource monitoring and …
DFTracer: An Analysis-Friendly Data Flow Tracer for AI-Driven Workflows
Modern HPC workflows involve intricate coupling of simulation, data analytics, and artificial
intelligence (AI) applications to improve time to scientific insight. These workflows require a …
intelligence (AI) applications to improve time to scientific insight. These workflows require a …
File system semantics requirements of HPC applications
Most widely-deployed parallel file systems (PFSs) implement POSIX semantics, which
implies sequential consistency for reads and writes. Strict adherence to POSIX semantics is …
implies sequential consistency for reads and writes. Strict adherence to POSIX semantics is …
Uncovering access, reuse, and sharing characteristics of {I/O-Intensive} files on {Large-Scale} production {HPC} systems
Large-scale high-performance computing (HPC) applications running on supercomputers
produce large amounts of data routinely and store it in files on multi-PB shared parallel …
produce large amounts of data routinely and store it in files on multi-PB shared parallel …
Capturing periodic I/O using frequency techniques
Many HPC applications perform their I/O in bursts that follow a periodic pattern. This allows
for making predictions as to when a burst occurs. System providers can take advantage of …
for making predictions as to when a burst occurs. System providers can take advantage of …
Access patterns and performance behaviors of multi-layer supercomputer i/o subsystems under production load
Scientific computing workloads at HPC facilities have been shifting from traditional
numerical simulations to AI/ML applications for training and inference while processing and …
numerical simulations to AI/ML applications for training and inference while processing and …
Towards hpc i/o performance prediction through large-scale log analysis
Large-scale high performance computing (HPC) systems typically consist of many
thousands of CPUs and storage units, while used by hundreds to thousands of users at the …
thousands of CPUs and storage units, while used by hundreds to thousands of users at the …