I/o access patterns in hpc applications: A 360-degree survey

JL Bez, S Byna, S Ibrahim - ACM Computing Surveys, 2023 - dl.acm.org
The high-performance computing I/O stack has been complex due to multiple software
layers, the inter-dependencies among these layers, and the different performance tuning …

A checkpoint of research on parallel i/o for high-performance computing

FZ Boito, EC Inacio, JL Bez, POA Navaux… - ACM Computing …, 2018 - dl.acm.org
We present a comprehensive survey on parallel I/O in the high-performance computing
(HPC) context. This is an important field for HPC because of the historic gap between …

Hello ADIOS: the challenges and lessons of develo** leadership class I/O frameworks

Q Liu, J Logan, Y Tian, H Abbasi… - Concurrency and …, 2014 - Wiley Online Library
Applications running on leadership platforms are more and more bottlenecked by storage
input/output (I/O). In an effort to combat the increasing disparity between I/O throughput and …

An ephemeral burst-buffer file system for scientific applications

T Wang, K Mohror, A Moody, K Sato… - SC'16: Proceedings of …, 2016 - ieeexplore.ieee.org
Burst buffers are becoming an indispensable hardware resource on large-scale
supercomputers to buffer the bursty I/O from scientific applications. However, there is a lack …

A study on data deduplication in HPC storage systems

D Meister, J Kaiser, A Brinkmann… - SC'12: Proceedings …, 2012 - ieeexplore.ieee.org
Deduplication is a storage saving technique that is highly successful in enterprise backup
environments. On a file system, a single data block might be stored multiple times across …

Stacker: an autonomic data movement engine for extreme-scale data staging-based in-situ workflows

P Subedi, P Davis, S Duan, S Klasky… - … Conference for High …, 2018 - ieeexplore.ieee.org
Data staging and in-situ workflows are being explored extensively as an approach to
address data-related costs at very large scales. However, the impact of emerging storage …

Scaling embedded in-situ indexing with deltaFS

Q Zheng, CD Cranor, D Guo, GR Ganger… - … Conference for High …, 2018 - ieeexplore.ieee.org
Analysis of large-scale simulation output is a core element of scientific inquiry, but analysis
queries may experience significant I/O overhead when the data is not structured for efficient …

Leveraging data deduplication to improve the performance of primary storage systems in the cloud

B Mao, H Jiang, S Wu, L Tian - … of the 4th annual Symposium on Cloud …, 2013 - dl.acm.org
Recent studies have shown that moderate to high data redundancy exists in primary storage
systems, such as VM-based, enterprise and HPC storage systems, which indicates that the …

Exploring data staging across deep memory hierarchies for coupled data intensive simulation workflows

T **, F Zhang, Q Sun, H Bui… - 2015 IEEE …, 2015 - ieeexplore.ieee.org
As applications target extreme scales, data staging and in-situ/in-transit data processing
have been proposed to address the data challenges and improve scientific discovery …

Improving storage availability in cloud-of-clouds with hybrid redundant data distribution

B Mao, S Wu, H Jiang - 2015 IEEE International Parallel and …, 2015 - ieeexplore.ieee.org
With the increasing utilization and popularity of the cloud infrastructure, more and more data
are moved to the cloud storage systems. This makes the availability of cloud storage …