I/O access patterns in HPC applications: A 360-degree survey
The high-performance computing I/O stack has been complex due to multiple software
layers, the inter-dependencies among these layers, and the different performance tuning …
A checkpoint of research on parallel I/O for high-performance computing
We present a comprehensive survey on parallel I/O in the high-performance computing
(HPC) context. This is an important field for HPC because of the historic gap between …
Hello ADIOS: the challenges and lessons of developing leadership class I/O frameworks
Applications running on leadership platforms are more and more bottlenecked by storage
input/output (I/O). In an effort to combat the increasing disparity between I/O throughput and …
An ephemeral burst-buffer file system for scientific applications
Burst buffers are becoming an indispensable hardware resource on large-scale
supercomputers to buffer the bursty I/O from scientific applications. However, there is a lack …
A study on data deduplication in HPC storage systems
Deduplication is a storage saving technique that is highly successful in enterprise backup
environments. On a file system, a single data block might be stored multiple times across …
Stacker: an autonomic data movement engine for extreme-scale data staging-based in-situ workflows
Data staging and in-situ workflows are being explored extensively as an approach to
address data-related costs at very large scales. However, the impact of emerging storage …
Scaling embedded in-situ indexing with deltaFS
Analysis of large-scale simulation output is a core element of scientific inquiry, but analysis
queries may experience significant I/O overhead when the data is not structured for efficient …
Leveraging data deduplication to improve the performance of primary storage systems in the cloud
Recent studies have shown that moderate to high data redundancy exists in primary storage
systems, such as VM-based, enterprise and HPC storage systems, which indicates that the …
Exploring data staging across deep memory hierarchies for coupled data intensive simulation workflows
As applications target extreme scales, data staging and in-situ/in-transit data processing
have been proposed to address the data challenges and improve scientific discovery …
Improving storage availability in cloud-of-clouds with hybrid redundant data distribution
With the increasing utilization and popularity of the cloud infrastructure, more and more data
are moved to the cloud storage systems. This makes the availability of cloud storage …