High-performance effective scientific error-bounded lossy compression with auto-tuned multi-component interpolation J Liu, S Di, K Zhao, X Liang, S Jin, Z Jian, J Huang, S Wu, Z Chen, ... Proceedings of the ACM on Management of Data 2 (1), 1-27, 2024 | 16 | 2024 |
Anatomy of high-performance gemm with online fault tolerance on gpus S Wu, Y Zhai, J Liu, J Huang, Z Jian, B Wong, Z Chen Proceedings of the 37th International Conference on Supercomputing, 360-372, 2023 | 11 | 2023 |
A survey on error-bounded lossy compression for scientific datasets S Di, J Liu, K Zhao, X Liang, R Underwood, Z Zhang, M Shah, Y Huang, ... arXiv preprint arXiv:2404.02840, 2024 | 10 | 2024 |
gzccl: Compression-accelerated collective communication framework for gpu clusters J Huang, S Di, X Yu, Y Zhai, J Liu, Y Huang, K Raffenetti, H Zhou, K Zhao, ... Proceedings of the 38th ACM International Conference on Supercomputing, 437-448, 2024 | 8 | 2024 |
C-Coll: Introducing error-bounded lossy compression into MPI collectives J Huang, S Di, X Yu, Y Zhai, J Liu, K Raffenetti, H Zhou, K Zhao, Z Chen, ... arXiv preprint arXiv:2304.03890, 2023 | 8 | 2023 |
An optimized error-controlled mpi collective framework integrated with lossy compression J Huang, S Di, X Yu, Y Zhai, Z Zhang, J Liu, X Lu, K Raffenetti, H Zhou, ... 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2024 | 7 | 2024 |
Cliz: Optimizing lossy compression for climate datasets with adaptive fine-tuned data prediction Z Jian, S Di, J Liu, K Zhao, X Liang, H Xu, R Underwood, S Wu, J Huang, ... 2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2024 | 6 | 2024 |
Exploring wavelet transform usages for error-bounded scientific data compression J Huang, J Liu, S Di, Y Zhai, Z Jian, S Wu, K Zhao, Z Chen, Y Guo, ... 2023 IEEE International Conference on Big Data (BigData), 4233-4239, 2023 | 6 | 2023 |
Ft-gemm: A fault tolerant high performance gemm implementation on x86 cpus S Wu, Y Zhai, J Huang, Z Jian, Z Chen Proceedings of the 32nd International Symposium on High-Performance Parallel …, 2023 | 5 | 2023 |
Poster: Optimizing collective communications with error-bounded lossy compression for gpu clusters J Huang, S Di, X Yu, Y Zhai, J Liu, Y Huang, K Raffenetti, H Zhou, K Zhao, ... Proceedings of the 29th ACM SIGPLAN Annual Symposium on Principles and …, 2024 | 3 | 2024 |
FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance S Wu, Y Ding, Y Zhai, J Liu, J Huang, Z Jian, H Dai, S Di, BM Wong, ... 2024 IEEE International Conference on Cluster Computing (CLUSTER), 322-334, 2024 | 2 | 2024 |
Turbofft: A high-performance fast fourier transform with fault tolerance on gpu S Wu, Y Zhai, J Liu, J Huang, Z Jian, H Dai, S Di, Z Chen, F Cappello arXiv preprint arXiv:2405.02520, 2024 | 2 | 2024 |
CUSZ-i: High-Ratio Scientific Lossy Compression on GPUs with Optimized Multi-Level Interpolation J Liu, J Tian, S Wu, S Di, B Zhang, R Underwood, Y Huang, J Huang, ... SC24: International Conference for High Performance Computing, Networking …, 2024 | 1 | 2024 |
Hoszp: An efficient homomorphic error-bounded lossy compressor for scientific data T Agarwal, S Di, J Huang, Y Huang, G Gopalakrishnan, R Underwood, ... arXiv preprint arXiv:2408.11971, 2024 | 1 | 2024 |
Ft-blas: A fault tolerant high performance blas implementation on x86 cpus Y Zhai, E Giem, K Zhao, J Liu, J Huang, BM Wong, CR Shelton, Z Chen IEEE Transactions on Parallel and Distributed Systems 34 (12), 3207-3223, 2023 | 1 | 2023 |
Accelerating mpi collectives with process-in-process-based multi-object techniques J Huang, K Ouyang, Y Zhai, J Liu, M Si, K Raffenetti, H Zhou, A Hori, ... Proceedings of the 32nd International Symposium on High-Performance Parallel …, 2023 | 1 | 2023 |
Accelerating fault-tolerant blas on x86 cpus Y Zhai, E Giem, K Zhao, J Liu, J Huang, B Wong, C Shelton, Z Chen July, 2022 | 1 | 2022 |
LCP: Enhancing Scientific Data Management with Lossy Compression for Particles L Zhang, R Li, C Ren, S Di, J Liu, J Huang, R Underwood, P Grosset, ... Proceedings of the ACM on Management of Data 3 (1), 1-27, 2025 | | 2025 |
PSZ: Enhancing the SZ Scientific Lossy Compressor With Progressive Data Retrieval Z Yang, S Di, R Li, X Li, L Zhang, J Huang, J Liu, F Cappello, K Zhao arXiv preprint arXiv:2502.04093, 2025 | | 2025 |
TurboFFT: Co-Designed High-Performance and Fault-Tolerant Fast Fourier Transform on GPUs S Wu, Y Zhai, J Liu, J Huang, Z Jian, H Dai, S Di, F Cappello, Z Chen arXiv preprint arXiv:2412.05824, 2024 | | 2024 |