I/O access patterns in HPC applications: A 360-degree survey

JL Bez, S Byna, S Ibrahim - ACM Computing Surveys, 2023 - dl.acm.org
The high-performance computing I/O stack has been complex due to multiple software
layers, the inter-dependencies among these layers, and the different performance tuning …

PoseTrack: A benchmark for human pose estimation and tracking

M Andriluka, U Iqbal, E Insafutdinov… - Proceedings of the …, 2018 - openaccess.thecvf.com
Existing systems for video-based pose estimation and tracking struggle to perform well on
realistic videos with multiple people and often fail to output body-pose trajectories consistent …

Scalable I/O aggregation for asynchronous multi-level checkpointing

MJ Gossman, B Nicolae, JC Calhoun - Future Generation Computer …, 2024 - Elsevier
Checkpointing distributed HPC applications is a common I/O pattern with many use cases:
resilience, job management, reproducibility, revisiting previous intermediate results, etc. This …

Improving I/O performance for exascale applications through online data layout reorganization

L Wan, A Huebl, J Gu, F Poeschel… - … on Parallel and …, 2021 - ieeexplore.ieee.org
The applications being developed within the US Exascale Computing Project (ECP) to run
on imminent Exascale computers will generate scientific results with unprecedented fidelity …

Toward scalable and asynchronous object-centric data management for HPC

H Tang, S Byna, F Tessier, T Wang… - 2018 18th IEEE/ACM …, 2018 - ieeexplore.ieee.org
Emerging high performance computing (HPC) systems are expected to be deployed with an
unprecedented level of complexity due to a deep system memory and storage hierarchy …

IO-aware job-scheduling: Exploiting the impacts of workload characterizations to select the mapping strategy

E Jeannot, G Pallez, N Vidal - The International Journal of …, 2023 - journals.sagepub.com
In high-performance computing, concurrent applications share the same file system.
However, the bandwidth which provides access to the storage is limited. Therefore, too …

Improving MPI collective I/O for high volume non-contiguous requests with intra-node aggregation

Q Kang, S Lee, K Hou, R Ross… - … on Parallel and …, 2020 - ieeexplore.ieee.org
Two-phase I/O is a well-known strategy for implementing collective MPI-IO functions. It
redistributes I/O requests among the calling processes into a form that minimizes the file …

Detecting I/O access patterns of HPC workloads at runtime

JL Bez, FZ Boito, R Nou, A Miranda… - 2019 31st …, 2019 - ieeexplore.ieee.org
In this paper, we seek to guide optimization and tuning strategies by identifying the
application's I/O access pattern. We evaluate three machine learning techniques to …

Spatially bursty I/O on supercomputers: Causes, impacts and solutions

J Yu, W Yang, F Wang, D Dong… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Understanding the I/O characteristics of supercomputers is crucial for grasping accurate I/O
workloads and uncovering potential I/O inefficiency. We collect and analyze I/O traces from …

Adding topology and memory awareness in data aggregation algorithms

F Tessier, V Vishwanath, E Jeannot - Future Generation Computer Systems, 2024 - Elsevier
With the growing gap between computing power and the ability of large-scale systems to
ingest data, I/O is becoming the bottleneck for many scientific applications. Improving read …