cuSZp2: A GPU Lossy Compressor with Extreme Throughput and Optimized Compression Ratio
Existing GPU lossy compressors suffer from expensive data movement overheads,
inefficient memory access patterns, and high synchronization latency, resulting in limited …
inefficient memory access patterns, and high synchronization latency, resulting in limited …
CUSZ-i: High-Ratio Scientific Lossy Compression on GPUs with Optimized Multi-Level Interpolation
Error-bounded lossy compression is a critical technique for significantly reducing scientific
data volumes. Compared to CPU-based compressors, GPU-based compressors exhibit …
data volumes. Compared to CPU-based compressors, GPU-based compressors exhibit …
gzccl: Compression-accelerated collective communication framework for gpu clusters
GPU-aware collective communication has become a major bottleneck for modern computing
platforms as GPU computing power rapidly rises. A traditional approach is to directly …
platforms as GPU computing power rapidly rises. A traditional approach is to directly …
Ceresz: Enabling and scaling error-bounded lossy compression on cerebras cs-2
Today's scientific applications running on supercomputers produce large volumes of data,
leading to critical data storage and communication challenges. To tackle the challenges …
leading to critical data storage and communication challenges. To tackle the challenges …
Hoszp: An efficient homomorphic error-bounded lossy compressor for scientific data
Error-bounded lossy compression has been a critical technique to significantly reduce the
sheer amounts of simulation datasets for high-performance computing (HPC) scientific …
sheer amounts of simulation datasets for high-performance computing (HPC) scientific …
Overcoming memory constraints in quantum circuit simulation with a high-fidelity compression framework
Full-state quantum circuit simulation requires exponentially increased memory size to store
the state vector as the number of qubits scales, presenting significant limitations in classical …
the state vector as the number of qubits scales, presenting significant limitations in classical …
hZCCL: Accelerating Collective Communication with Co-Designed Homomorphic Compression
As network bandwidth struggles to keep up with rapidly growing computing capabilities, the
efficiency of collective communication has become a critical challenge for exa-scale …
efficiency of collective communication has become a critical challenge for exa-scale …
POSTER: Optimizing Collective Communications with Error-bounded Lossy Compression for GPU Clusters
GPU-aware collective communication has become a major bottleneck for modern computing
platforms as GPU computing power rapidly rises. To address this issue, traditional …
platforms as GPU computing power rapidly rises. To address this issue, traditional …
A Survey on Error-Bounded Lossy Compression for Scientific Datasets
Error-bounded lossy compression has been effective in significantly reducing the data
storage/transfer burden while preserving the reconstructed data fidelity very well. Many error …
storage/transfer burden while preserving the reconstructed data fidelity very well. Many error …
A Portable, Fast, DCT-based Compressor for AI Accelerators
Lossy compression can be an effective tool in AI training and inference to reduce memory
requirements, storage footprint, and in some cases, execution time. With the rise of novel …
requirements, storage footprint, and in some cases, execution time. With the rise of novel …